US20250063142
2025-02-20
Electricity
H04N7/157
The patent application describes a system for immersive virtual reality (VR) communication, enabling users to interact in a virtual environment with real-time 3D representations of each other. The system employs capture devices to record image streams of users, which are then transmitted over networks to VR devices. These devices render the virtual environment and produce user renditions based on the captured data, allowing for a realistic and immersive communication experience.
Advancements in virtual and mixed reality technologies have made it feasible to use headsets or head-mounted displays (HMDs) for virtual meetings. This need has grown due to scenarios like pandemics, where in-person gatherings are restricted. However, inconsistencies in user images due to variations in capture angles, locations, and lighting can hinder the immersive experience in virtual conferences.
The system consists of capture devices for each user to record image streams, networks to transmit these streams, and VR devices for rendering the virtual environment. The VR devices display renditions of users based on the captured data. The virtual environment can be common or individually tailored for each user's perspective, enhancing the immersive experience by adjusting user positions relative to capture devices.
Embodiments include features such as using graphics processing units for data generation and transmission. The system can direct users to optimize their positions for better rendering. User renditions in the virtual environment can undergo lighting adjustments to match the VR content, enhancing realism. The system supports various configurations, including scenarios where only one user has a capture device.
The process begins with a user initiating an immersive call through an application on their VR device or another device. Notifications are sent to both users, allowing them to accept or reject the call. If accepted, users are prompted to wear their VR devices, and video streams are initiated and processed. Users receive cues to position themselves correctly for an effective immersive call experience.