Invention Title:

SYSTEM AND METHOD FOR LANGUAGE-DRIVEN AVATAR EDITING

Publication number:

US20240153225

Publication date:
Section:

Physics

Class:

G06T19/20

Inventors:

Assignee:

Applicant:

Drawings (4 of 12)

Smart overview of the Invention

A novel method enables the editing of 3D avatar models using language descriptions as inputs. The process starts by receiving a language-based description, which is then transformed into a latent vector. This latent vector serves as the foundation for updating an initial avatar model, resulting in a new three-dimensional avatar that can be displayed to users. This innovative approach streamlines the customization of avatars, making it more accessible and efficient.

Technical Applications and Benefits

The system is designed for use across various digital platforms, including mobile devices and virtual reality (VR) hardware. By leveraging machine learning techniques, this method automates the avatar editing process, eliminating the need for manual 3D modeling skills. Users can create unique avatars based on descriptive language inputs, enhancing their digital interactions without the traditional time-consuming processes associated with avatar customization.

Input Mechanisms and User Interaction

Users can provide input through multiple modalities, including audio, video, text, and images. This flexibility allows for a rich user experience where the desired avatar characteristics can be articulated in various forms. The system utilizes a combination of these inputs to generate a precise language description that drives the avatar creation process.

System Architecture

The architecture includes key components such as processors, memory devices, and user interface hardware that facilitate interaction and processing. Specifically, the system comprises an avatar generation application with modules for language-driven editing and graphical user interfaces. These components work together to encode user inputs into actionable data that modifies the avatar model accordingly.

Advanced Editing Capabilities

The editing process incorporates advanced algorithms to ensure that avatars are not only accurately represented but also visually appealing. The language encoder translates descriptions into latent vectors that inform changes in the avatar's appearance, while a neural 3D editor adjusts the model’s vertices to reflect these changes. This sophisticated approach allows for high levels of customization and realism in digital avatars.