Some observations on computer lip-reading: moving from the dream to the reality
Article
Bear, Y., Owen, Gari, Harvey, Richard and Theobald, Barry-John 2014. Some observations on computer lip-reading: moving from the dream to the reality. Proceedings of SPIE. 9253. https://doi.org/10.1117/12.2067464
Authors | Bear, Y., Owen, Gari, Harvey, Richard and Theobald, Barry-John |
---|---|
Abstract | In the quest for greater computer lip-reading performance there are a number of tacit assumptions which are either present in the datasets (high resolution for example) or in the methods (recognition of spoken visual units called "visemes" for example). Here we review these and other assumptions and show the surprising result that computer lip-reading is not heavily constrained by video resolution, pose, lighting and other practical factors. However, the working assumption that visemes, which are the visual equivalent of phonemes, are the best unit for recognition does need further examination. We conclude that visemes, which were defined over a century ago, are unlikely to be optimal for a modern computer lip-reading system. © (2014) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only. |
Keywords | Lip-reading; speech recognition; pattern recognition |
Journal | Proceedings of SPIE |
Journal citation | 9253 |
ISSN | 0277-786X |
1996-756X | |
Year | 2014 |
Publisher | SPIE (Society of Photo-optical Instrumentation Engineers) |
Accepted author manuscript | License CC BY-NC-ND |
Digital Object Identifier (DOI) | https://doi.org/10.1117/12.2067464 |
Web address (URL) | http://proceedings.spiedigitallibrary.org/volume.aspx?conferenceid=3393&volumeid=16739 |
Publication dates | |
07 Oct 2014 | |
Publication process dates | |
Deposited | 28 Feb 2017 |
Copyright information | © SPIE. Proc. SPIE 9253, Optics and Photonics for Counterterrorism, Crime Fighting, and Defence X; and Optical Materials and Biomaterials in Security and Defence Systems Technology XI, 92530G (October 7, 2014) |
Place of publication | United States of America |
ISBN | 9781628413168 |
Book title | Volume 9253 Optics and Photonics for Counterterrorism, Crime Fighting, and Defence X; and Optical Materials and Biomaterials in Security and Defence Systems Technology XI |
Editors | Burges, Douglas, Owen, Gari, Rana, Harbinder, Zamboni, Roberto, Kajzar, François and Szep, Attila A. |
https://repository.uel.ac.uk/item/858q8
Download files
92
total views262
total downloads1
views this month1
downloads this month