Invention Title:

SYSTEM AND METHOD FOR LANGUAGE-DRIVEN AVATAR EDITING

Publication number:

US20240153225

Publication date:
Section:

Physics

Class:

G06T19/20

Inventors:

Assignee:

Applicant:

Drawings (4 of 12)

Smart overview of the Invention

The system and method for language-driven avatar editing focus on creating and customizing 3D avatars using language inputs. The process involves receiving a language description, generating a latent vector from this input, and updating an initial avatar model to produce a three-dimensional avatar. This approach simplifies avatar creation by allowing users to describe desired changes in natural language rather than manually adjusting features.

Technical Field

This technology is applied in the realm of avatar editing and customization, particularly for mobile devices and virtual reality hardware. It leverages machine learning to automate the process, which traditionally requires significant manual effort and expertise in 3D modeling. The system is designed to streamline avatar creation, making it accessible to users without specialized skills.

System Components

The system includes several key components:

  • Processor: Executes instructions stored in memory.
  • Memory Device: Stores software components like the operating system and avatar editing application.
  • User Interface Hardware: Includes devices such as touch screens, VR hardware, cameras, microphones, and speakers for input and output.
  • Avatar Generation Application: Manages data storage, language-driven editing modules, and graphical interfaces.

Editing Process

The language-driven editing module consists of subcomponents that handle different tasks. A language encoder converts text descriptions into latent vectors, while an image encoder processes 2D images similarly. These vectors are used to compute similarity scores, which guide the neural 3D editor in adjusting the avatar's features. The updated model is rendered into 2D images for user review.

Applications and Advantages

This method enhances customizability by allowing users to create unique avatars without extensive manual input. It offers a more efficient alternative to traditional methods that rely on predefined models and manual adjustments. The integration of language and image processing enables seamless avatar creation across various platforms, including mobile devices and VR headsets.