Invention Title:

VIRTUAL CONVERSATIONAL COMPANION

Publication number:

US20250157463

Publication date:
Section:

Physics

Class:

G10L15/1815

Inventors:

Applicant:

Smart overview of the Invention

The patent application describes a system that enhances human-computer interaction through the use of a virtual conversational companion. This system utilizes advanced natural language processing (NLP) techniques to interpret and respond to spoken user inputs. By leveraging automatic speech recognition (ASR) and natural language understanding (NLU), the system can generate natural language data that forms the basis for creating visual and auditory content, including synthesized speech, images, and avatars.

Functionality

Upon receiving user utterances, the system processes these inputs to determine the desired output content. It uses natural language generation (NLG) techniques to modify the natural language data according to user-defined parameters. The system then generates corresponding visual content, such as images and video data of an avatar, which is synchronized with synthesized speech. This synchronization extends to subtitles, where specific words are emphasized in tandem with their spoken counterparts.

Components and Processes

The system comprises various components that facilitate its operation, including dialog management and response management components. These components manage the flow of conversation and content generation based on user interactions. The system's architecture supports runtime processing to dynamically respond to user inputs, generating appropriate content such as stories or answering questions through a question-and-answer component.

User Interaction

The virtual conversational companion enhances user experience by providing a synchronized output that combines visual and auditory elements. For instance, when a user requests a story featuring specific characters or settings, the system adapts the narrative accordingly. It generates images and avatars that visually represent the story's elements while synchronizing these visuals with the narrative's synthesized speech output.

Compliance and Adaptability

The system is designed with user privacy and legal compliance in mind, ensuring that all processing activities adhere to relevant laws and regulations. It can be configured geographically to comply with jurisdiction-specific requirements. The architecture allows for flexibility in processing orders, enabling the addition or removal of certain processes without deviating from the core functionality.