US20250104689
2025-03-27
Physics
G10L13/02
The system provides speech assistance during virtual meetings by detecting speech impediments in real-time and generating an avatar synchronized with the participant's speech. This avatar offers visual feedback, helping participants with speech disorders communicate more effectively. The system uses a speech impediment detection engine to analyze speech data, ensuring seamless integration into communication sessions.
With the growing reliance on virtual meetings for work, education, and social interactions, individuals with speech disorders face unique challenges. The audio-visual nature of these meetings can exacerbate anxiety and hinder communication for those with conditions like stuttering. Existing solutions do not adequately address the needs of users with speech disorders, highlighting a need for improved accessibility in virtual meeting platforms.
The invention employs a data processing system with a processor and memory to execute several key functions. These include extracting audio features from speech data and using a machine-learning model to detect speech impediments. Upon detection, an avatar is generated and synchronized with the participant's speech, providing real-time visual feedback. This approach leverages lip synchronization technology to enhance user experience during communication sessions.
The method involves receiving requests for speech assistance and extracting audio features from the virtual meeting's speech data. A machine-learning model detects any speech impediments, triggering the automatic generation of an avatar. This avatar is displayed on the participant's interface, providing real-time feedback through synchronized lip movements and visual cues, such as color changes to indicate the severity of impediments.
This technical solution addresses the lack of real-time speech assistance in existing platforms by offering a mechanism that detects and responds to speech disorders dynamically. Benefits include improved accessibility for users with speech disorders, reduced communication barriers in virtual meetings, and enhanced user confidence. The system provides a user-friendly way to visualize and manage speech impediments, fostering more inclusive communication environments.