  1. avsp2017.loria.fr

    mostly based on PCA, image transforms such as DCT, wavelets, and scattering [6], or image descriptors like LBPs [7] and HOGs [8]. Although such features have recently been employed in conjunction with deep learning methods for visual ... G. Potamianos wishes to acknowledge support for this work by the EU Horizon 2020 project BabyRobot, under grant ...
  2. isca-archive.org

    However, little or no attention has been paid to the effects of ROI physical coverage and resolution on the resulting recognition performance within the deep learning framework. In this paper, we investigate such choices for a visual-only speech recognition system based on CNNs and long short-term memory models that we present in detail.
  3. semanticscholar.org

    However, little or no attention has been paid to the effects of ROI physical coverage and resolution on the resulting recognition performance within the deep learning framework. In this paper, we investigate such choices for a visual-only speech recognition system based on CNNs and long short-term memory models that we present in detail.
  4. semanticscholar.org

    DOI: 10.21437/AVSP.2017-13 Corpus ID: 3531386; Exploring ROI size in deep learning based lipreading @inproceedings{Koumparoulis2017ExploringRS, title={Exploring ROI size in deep learning based lipreading}, author={Alexandros Koumparoulis and Gerasimos Potamianos and Youssef Mroueh and Steven J. Rennie}, booktitle={AVSP ..}, year={2017} }
  5. semanticscholar.org

    Figure 4: A schematic of the CNN employed for visual feature extraction in Section 3.1 (see also Table 2). - "Exploring ROI size in deep learning based lipreading"
  6. Mentioning: 12 - Automatic speechreading systems have increasingly exploited deep learning advances, resulting in dramatic gains over traditional methods. State-of-the-art systems typically employ convolutional neural networks (CNNs), operating on a video region-of-interest (ROI) that contains the speaker's mouth. However, little or no attention has been paid to the effects of ROI physical ...
  7. researchr.org

    Alexandros Koumparoulis, Gerasimos Potamianos, Youssef Mroueh, Steven J. Rennie. Exploring ROI size in deep learning based lipreading. In Slim Ouni, Chris Davis, Alexandra Jesse, Jonas Beskow, editors, Auditory-Visual Speech Processing, AVSP 2017, Stockholm, Sweden, 25-26 August 2017.