Always private
DuckDuckGo never tracks your searches.
Learn More
You can hide this reminder in Search Settings
All regions
Argentina
Australia
Austria
Belgium (fr)
Belgium (nl)
Brazil
Bulgaria
Canada (en)
Canada (fr)
Catalonia
Chile
China
Colombia
Croatia
Czech Republic
Denmark
Estonia
Finland
France
Germany
Greece
Hong Kong
Hungary
Iceland
India (en)
Indonesia (en)
Ireland
Israel (en)
Italy
Japan
Korea
Latvia
Lithuania
Malaysia (en)
Mexico
Netherlands
New Zealand
Norway
Pakistan (en)
Peru
Philippines (en)
Poland
Portugal
Romania
Russia
Saudi Arabia
Singapore
Slovakia
Slovenia
South Africa
Spain (ca)
Spain (es)
Sweden
Switzerland (de)
Switzerland (fr)
Taiwan
Thailand (en)
Turkey
Ukraine
United Kingdom
US (English)
US (Spanish)
Vietnam (en)
Safe search: moderate
Strict
Moderate
Off
Any time
Any time
Past day
Past week
Past month
Past year
Showing results excluding:
  • typeset.io

All Results

  1. avsp2017.loria.fr

    employing the entire lower face as the ROI. Index Terms: lipreading, speechreading, visual speech recog-nition, region-of-interest, CNN, LSTM, deep learning. 1. Introduction Lately, there has been renewed research interest in automatic speechreading (or lipreading) systems, harvesting recent ad-vances in the computer vision and automatic speech ...
  2. isca-archive.org

    However, little or no attention has been paid to the effects of ROI physical coverage and resolution on the resulting recognition performance within the deep learning framework. In this paper, we investigate such choices for a visual-only speech recognition system based on CNNs and long short-term memory models that we present in detail.
  3. semanticscholar.org

    However, little or no attention has been paid to the effects of ROI physical coverage and resolution on the resulting recognition performance within the deep learning framework. In this paper, we investigate such choices for a visual-only speech recognition system based on CNNs and long short-term memory models that we present in detail.
  4. semanticscholar.org

    DOI: 10.21437/AVSP.2017-13 Corpus ID: 3531386; Exploring ROI size in deep learning based lipreading @inproceedings{Koumparoulis2017ExploringRS, title={Exploring ROI size in deep learning based lipreading}, author={Alexandros Koumparoulis and Gerasimos Potamianos and Youssef Mroueh and Steven J. Rennie}, booktitle={AVSP ..}, year={2017} }
  5. Mentioning: 12 - Automatic speechreading systems have increasingly exploited deep learning advances, resulting in dramatic gains over traditional methods. State-of-the-art systems typically employ convolutional neural networks (CNNs), operating on a video region-of-interest (ROI) that contains the speaker's mouth. However, little or no attention has been paid to the effects of ROI physical ...
  6. Exploring ROI size in deep learning based lipreading. ... Exploring ROI size in deep learning based lipreading. In Slim Ouni, Chris Davis 0001, Alexandra Jesse, Jonas Beskow, editors, Auditory-Visual Speech Processing, AVSP 2017, Stockholm, Sweden, 25-26 August 2017. pages 64-69, ISCA, 2017.
  7. Can’t find what you’re looking for?

    Help us improve DuckDuckGo searches with your feedback

Custom date rangeX