Sign Language Detection and Recognition using Image Processing for Improved Communication
Nishtha Bhagyawant1, Gauri Tamondkar2, Sneha Yadav3, Shwethashree Kenche4, Sunny Sall5

1Nishtha Bhagyawant, Department of Computer Engineering, St. John College of Engineering and Management, Palghar (Maharashtra), India.

2Gauri Tamondkar, Department of Computer Engineering, St. John College of Engineering and Management, Palghar (Maharashtra), India.

3Sneha Yadav, Department of Computer Engineering, St. John College of Engineering and Management, Palghar (Maharashtra), India.

4Shwethashree Kenche, Department of Computer Engineering, St. John College of Engineering and Management, Palghar (Maharashtra), India.

5Sunny Sall, Department of Computer Engineering, St. John College of Engineering and Management, Palghar (Maharashtra), India. 

Manuscript Received on 23 April 2025 | First Revised Manuscript Received on 27 April 2025 | Second Revised Manuscript Received on 04 May 2025 | Manuscript Accepted on 15 May 2025 | Manuscript published on 30 May 2025 | PP: 16-23 | Volume-15 Issue-2, May 2025 | Retrieval Number: 100.1/ijsce.B366815020525 | DOI: 10.35940/ijsce.B3668.15020525

© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: This study presents an advanced deep learning framework for the real-time recognition and translation of Indian Sign Language (ISL). Our approach integrates Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks to effectively capture both the spatial and temporal features of ISL gestures. The CNN component extracts rich visual features from the input sign language videos, while the LSTM component models the dynamic temporal patterns inherent in the gesture sequences. We evaluated our system using a comprehensive ISL dataset consisting of 700 fully annotated videos representing 100 spoken language sentences. To assess the effectiveness of our approach, we compared two different model architectures: CNN-LSTM and SVM-LSTM. The CNN-LSTM model achieved a training accuracy of 84%, demonstrating superior performance in capturing both visual and sequential information. In contrast, the SVM-LSTM model achieved a training accuracy of 66%, indicating comparatively lower effectiveness in this context. One of the key challenges faced during the development of the system was overfitting, primarily due to computational constraints and the limited size of the dataset. Nevertheless, through careful tuning of hyperparameters and the use of various optimization strategies, the model exhibited promising results, suggesting its potential for real-world applications. This paper also discusses the data preprocessing techniques employed, including video frame extraction, normalization, and data augmentation, which played a critical role in enhancing model performance. By addressing the complexities of sign language recognition, our work contributes to advancing communication accessibility for individuals relying on ISL, promoting greater inclusivity through technology.
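To illustrate the CNN-LSTM pipeline summarized above, the following is a minimal sketch of a video-classification model in Keras, in which a per-frame CNN extracts spatial features and an LSTM models the temporal gesture dynamics. This is not the authors' implementation; the frame count, frame resolution, layer widths, and dropout rate are illustrative assumptions, with only the 100 sentence classes taken from the dataset description.

```python
# Minimal sketch of a CNN-LSTM video classifier (TensorFlow/Keras).
# Hyperparameters below are assumptions for illustration, not the paper's settings.
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_FRAMES = 30        # frames sampled per ISL video (assumed)
FRAME_SIZE = (64, 64)  # resized frame resolution (assumed)
NUM_CLASSES = 100      # 100 spoken-language sentences, per the dataset description

# Per-frame CNN feature extractor, applied to each frame via TimeDistributed.
frame_cnn = models.Sequential([
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
])

model = models.Sequential([
    layers.TimeDistributed(frame_cnn, input_shape=(NUM_FRAMES, *FRAME_SIZE, 3)),
    layers.LSTM(128),                              # models temporal gesture dynamics
    layers.Dropout(0.5),                           # helps curb overfitting on a small dataset
    layers.Dense(NUM_CLASSES, activation="softmax"),
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```

In practice, each training sample would be a tensor of shape (NUM_FRAMES, 64, 64, 3) produced by the frame extraction and normalization steps mentioned in the abstract, with data augmentation applied before batching.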

Keywords: Sign Language (SL), OpenCV, CNN, LSTM, Hand Gesture, Real-Time, Deep Learning (DL)
Scope of the Article: Image Processing and Recognition