Automated Sign to Speech Conversion Model using Deep Learning
##plugins.themes.academic_pro.article.main##
Abstract
Unable to communicate verbally is a disability. In order to exchange thoughts and interact, there exist several
ways. The most predominant method involves use of hand-gestures. The prime motive of the proposed research
work is to bridge the research gap in Sign Language Recognition with maximum efficiency. The goal is to replace
the human mediator with a machine to minimize human interference. This paper focuses on the recognition of ASL
in real-time. In automatic sign language translator design the challenging part lies in selecting a good classifier
to classify the static input gestures with high accuracy. CNN architecture is used to design a classifier for sign
language recognition in the proposed system. The model and the pipeline architecture is developed by keras based
convolutional neural network to classify 27 alphabets that is 26 English language alphabets and a unique character,
space. With different parameter configurations, the system has trained the classifier with different parameters and
tabulated the results. The proposed study achieved an efficiency of 99.88% on the test set. The result shows that
the model accuracy improves as more data is fetched from various subjects for training.
##plugins.themes.academic_pro.article.details##
This work is licensed under a Creative Commons Attribution 4.0 International License.
References
- Abraham, Abey amp; .V, R. 2018. Real-time conversion of sign language to speech and prediction of gestures using artificial neural network. Procedia Computer Science. Vol.10, No.435. DOI: https://doi.org/10.1016/j.procs.2018.10.435
- Chen, B., Deng, W., and Du, J. 2017. Noisy softmax: Improving the generalization ability of dcnn via postponing the early softmax saturation. DOI: https://doi.org/10.1109/CVPR.2017.428
- Dumitrescu, D. and Boiangiu, C.-A. 2019. A study of image upsampling and downsampling filters. Computers 8, 2, 30. DOI: https://doi.org/10.3390/computers8020030
- Gedraite, E. S. and Hadad, M. 2011. Investigation on the effect of a gaussian blur in image filtering and segmentation. In Proceedings ELMAR-2011. IEEE, 393–396.
- Kang, L., Ye, P., Li, Y., and Doermann, D. 2014. Convolutional neural networks for noreference image quality assessment. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1733–1740. DOI: https://doi.org/10.1109/CVPR.2014.224
- Liu, W., Islamaj Dogan, R. ˘ , Kim, S., Comeau, D. C., Kim, W., Yeganova, L., Lu, Z., and Wilbur, W. J. 2014. Author name disambiguation for p ub m ed. Journal of the Association for Information Science and Technology 65, 4, 765–781. DOI: https://doi.org/10.1002/asi.23063
- Manikandan, K., Patidar, A., Walia, P., and Roy, A. B. 2018. Hand gesture detection and conversion to speech and text. arXiv preprint arXiv:1811.11997 .
- Newby, P. R. 2011. Accuracy, precision, extraction, citation and valediction. Pan, Z., Yu, W., Yi, X., Khan, A., Yuan, F., and Zheng, Y. 2019. Recent progress on generative adversarial networks (gans): A survey. IEEE Access 7, 36322–36333. DOI: https://doi.org/10.1109/ACCESS.2019.2905015
- Pramada, S., Saylee, D., Pranita, N., Samiksha, N., and Vaidya, M. 2013. Intelligent sign language recognition using image processing. IOSR Journal of Engineering (IOSRJEN) 3, 2, 45–51. DOI: https://doi.org/10.9790/3021-03224551