New! Sign up for our free email newsletter.
Science News
from research organizations

Engineers bring sign language to 'life' using AI to translate in real-time

Breakthrough technology translates American Sign Language into text

Date:
April 9, 2025
Source:
Florida Atlantic University
Summary:
American Sign Language (ASL) recognition systems often struggle with accuracy due to similar gestures, poor image quality and inconsistent lighting. To address this, researchers developed a system that translates gestures into text with 98.2% accuracy, operating in real time under varying conditions. Using a standard webcam and advanced tracking, it offers a scalable solution for real-world use, with MediaPipe tracking 21 keypoints on each hand and YOLOv11 classifying ASL letters precisely.
Share:
FULL STORY

For millions of deaf and hard-of-hearing individuals around the world, communication barriers can make everyday interactions challenging. Traditional solutions, like sign language interpreters, are often scarce, expensive and dependent on human availability. In an increasingly digital world, the demand for smart, assistive technologies that offer real-time, accurate and accessible communication solutions is growing, aiming to bridge this critical gap.

American Sign Language (ASL) is one of the most widely used sign languages, consisting of distinct hand gestures that represent letters, words and phrases. Existing ASL recognition systems often struggle with real-time performance, accuracy and robustness across diverse environments.

A major challenge in ASL systems lies in distinguishing visually similar gestures such as "A" and "T" or "M" and "N," which often leads to misclassifications. Additionally, the dataset quality presents significant obstacles, including poor image resolution, motion blur, inconsistent lighting, and variations in hand sizes, skin tones and backgrounds. These factors introduce bias and reduce the model's ability to generalize across different users and environments.

To tackle these challenges, researchers from the College of Engineering and Computer Science at Florida Atlantic University have developed an innovative real-time ASL interpretation system. Combining the object detection power of YOLOv11 with MediaPipe's precise hand tracking, the system can accurately recognize ASL alphabet letters in real time. Using advanced deep learning and key hand point tracking, it translates ASL gestures into text, enabling users to interactively spell names, locations and more with remarkable accuracy.

At its core, a built-in webcam serves as a contact-free sensor, capturing live visual data that is converted into digital frames for gesture analysis. MediaPipe identifies 21 keypoints on each hand to create a skeletal map, while YOLOv11 uses these points to detect and classify ASL letters with high precision.

"What makes this system especially notable is that the entire recognition pipeline -- from capturing the gesture to classifying it -- operates seamlessly in real time, regardless of varying lighting conditions or backgrounds," said Bader Alsharif, the first author and a Ph.D. candidate in the FAU Department of Electrical Engineering and Computer Science. "And all of this is achieved using standard, off-the-shelf hardware. This underscores the system's practical potential as a highly accessible and scalable assistive technology, making it a viable solution for real-world applications."

Results of the study, published in the journal Sensors, confirm the system's effectiveness, which achieved a 98.2% accuracy (mean Average Precision, [email protected]) with minimal latency. This finding highlights the system's ability to deliver high precision in real-time, making it an ideal solution for applications that require fast and reliable performance, such as live video processing and interactive technologies.

With 130,000 images, the ASL Alphabet Hand Gesture Dataset includes a wide variety of hand gestures captured under different conditions to help models generalize better. These conditions cover diverse lighting environments (bright, dim and shadowed), a range of backgrounds (both outdoor and indoor scenes), and various hand angles and orientations to ensure robustness.

Each image is carefully annotated with 21 keypoints, which highlight essential hand structures such as fingertips, knuckles and the wrist. These annotations provide a skeletal map of the hand, allowing models to distinguish between similar gestures with exceptional accuracy.

"This project is a great example of how cutting-edge AI can be applied to serve humanity," said Imad Mahgoub, Ph.D., co-author and Tecore Professor in the FAU Department of Electrical Engineering and Computer Science. "By fusing deep learning with hand landmark detection, our team created a system that not only achieves high accuracy but also remains accessible and practical for everyday use. It's a strong step toward inclusive communication technologies."

The deaf population in the U.S. is approximately 11 million, or 3.6% of the population, and about 15% of American adults (37.5 million) experience hearing difficulties.

"The significance of this research lies in its potential to transform communication for the deaf community by providing an AI-driven tool that translates American Sign Language gestures into text, enabling smoother interactions across education, workplaces, health care and social settings," said Mohammad Ilyas, Ph.D., co-author and a professor in the FAU Department of Electrical Engineering and Computer Science. "By developing a robust and accessible ASL interpretation system, our study contributes to the advancement of assistive technologies to break down barriers for the deaf and hard of hearing population."

Future work will focus on expanding the system's capabilities from recognizing individual ASL letters to interpreting full ASL sentences. This would enable more natural and fluid communication, allowing users to convey entire thoughts and phrases seamlessly.

"This research highlights the transformative power of AI-driven assistive technologies in empowering the deaf community," said Stella Batalama, Ph.D., dean of the College of Engineering and Computer Science. "By bridging the communication gap through real-time ASL recognition, this system plays a key role in fostering a more inclusive society. It allows individuals with hearing impairments to interact more seamlessly with the world around them, whether they are introducing themselves, navigating their environment, or simply engaging in everyday conversations. This technology not only enhances accessibility but also supports greater social integration, helping create a more connected and empathetic community for everyone."

Study co-authors are Easa Alalwany, Ph.D., a recent Ph.D. graduate of the FAU College of Engineering and Computer Science and an assistant professor at Taibah University in Saudi Arabia; Ali Ibrahim, Ph.D., a Ph.D. graduate of the FAU College of Engineering and Computer Science.


Story Source:

Materials provided by Florida Atlantic University. Original written by Gisele Galoustian. Note: Content may be edited for style and length.


Journal Reference:

  1. Bader Alsharif, Easa Alalwany, Ali Ibrahim, Imad Mahgoub, Mohammad Ilyas. Real-Time American Sign Language Interpretation Using Deep Learning and Keypoint Tracking. Sensors, 2025; 25 (7): 2138 DOI: 10.3390/s25072138

Cite This Page:

Florida Atlantic University. "Engineers bring sign language to 'life' using AI to translate in real-time." ScienceDaily. ScienceDaily, 9 April 2025. <www.sciencedaily.com/releases/2025/04/250409114945.htm>.
Florida Atlantic University. (2025, April 9). Engineers bring sign language to 'life' using AI to translate in real-time. ScienceDaily. Retrieved October 26, 2025 from www.sciencedaily.com/releases/2025/04/250409114945.htm
Florida Atlantic University. "Engineers bring sign language to 'life' using AI to translate in real-time." ScienceDaily. www.sciencedaily.com/releases/2025/04/250409114945.htm (accessed October 26, 2025).

Explore More

from ScienceDaily

RELATED STORIES