A Survey on Lip-Reading with Deep Learning

dc.contributor.authorErbey, Ali
dc.contributor.authorBarışçı, Necaattin
dc.date.accessioned2025-01-21T14:20:41Z
dc.date.available2025-01-21T14:20:41Z
dc.date.issued2022
dc.description.abstractVery successful results have been obtained in areas such as computer vision and voice recognition when applying deep learning methods. Technologies that facilitate the lives of people have been developed as a result of the successes of deep learning within these areas. One of these technologies is voice recognition devices. Research has shown that these devices do not give good results in noisy environments; although, they do give good results in silent environments. With deep learning methods, voice recognition in noisy environments can be achieved using visual signals. Thanks to computerized vision, the success of voice recognition devices can be increased with the analysis of human lips in order to determine what the speaker is saying. In this study, lip-reading studies using deep learning methods published between 2017 and 2020 were examined and data sets were introduced. As a result of the study, it is seen that CNN and LSTM architectures are used more intensively in lip-reading studies, hybrid models are preferred more and the success rates are increasing day by day. In this context, it is seen that technologies that can be used in line with the need can be developed by conducting more academic studies on lip reading.
dc.identifier.dergipark1038899
dc.identifier.doi10.29137/umagd.1038899
dc.identifier.issn1308-5514
dc.identifier.issue2-844
dc.identifier.startpage860
dc.identifier.urihttps://dergipark.org.tr/tr/download/article-file/2141481
dc.identifier.urihttps://dergipark.org.tr/tr/pub/umagd/issue/71387/1038899
dc.identifier.urihttps://doi.org/10.29137/umagd.1038899
dc.identifier.urihttps://hdl.handle.net/20.500.12587/19255
dc.identifier.volume1
dc.language.isoen
dc.publisherKırıkkale Üniversitesi
dc.relation.ispartofUluslararası Mühendislik Araştırma ve Geliştirme Dergisi
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_20241229
dc.subjectLipreading
dc.subjectDeep Learning
dc.subjectConvolutional Neural Networks
dc.subjectArtificial Neural Networks
dc.subjectEngineering
dc.subjectMühendislik
dc.titleA Survey on Lip-Reading with Deep Learning
dc.typeArticle

Dosyalar