US20250095633
2025-03-20
Physics
G10L13/086
The proposed system integrates smart glasses, a smart mobile terminal, and a cloud server to facilitate real-time translation using artificial intelligence. The smart glasses capture speech in a designated language and transmit it to the mobile terminal. This speech is then converted into text and sent to a cloud server equipped with large language models (LLMs) for translation. The translated text is converted back into speech on the mobile terminal and played back through the smart glasses, offering users an interactive and practical translation experience.
The system comprises three main components: smart glasses, a smart mobile terminal, and a cloud server. The smart glasses feature a speech pickup device and an output device connected to a processor and memory. The mobile terminal processes speech-to-text conversion and text-to-speech playback, while the cloud server handles text translation using LLMs. These components work in unison to deliver accurate translations, enhancing the functionality of smart glasses beyond basic tasks like calls or music playback.
The translation process begins with the smart glasses capturing speech in a source language. This audio is transmitted via Bluetooth to the mobile terminal, where it is converted into text. The text is sent to the cloud server, translated into the target language using LLMs, and returned as translated text. Finally, the mobile terminal converts this text back into speech for playback through the smart glasses, ensuring seamless communication in different languages.
The system's AI-driven translation capabilities aim to make smart glasses more practical and engaging for users by expanding their functionality. By incorporating advanced LLMs such as generative AI models or multimodal models, the system not only improves translation accuracy but also enhances user experience through increased interactivity and intelligence. This approach addresses existing limitations of smart glasses by providing a more comprehensive set of features.
This translation technology leverages the scalability and creativity of LLMs to deliver precise translations, thereby increasing user engagement with smart glasses. The integration of these components allows for innovative applications in various scenarios, such as real-time communication across language barriers. By enhancing user interaction and expanding functionality, this system represents a significant advancement in wearable technology.