US20250384627
2025-12-18
Physics
G06T17/10
The described system and method enable real-time, three-dimensional reconstruction of dynamic scenes, particularly sports events and concerts. Using a two-level parallel computation approach, the system processes multi-view video streams to reconstruct multiple frames and dynamic elements simultaneously. The method leverages distributed processing nodes optimized for parallel execution, improving efficiency in creating interactive 3D experiences.
The system employs a dual-level parallelization strategy. The first level involves frame-level parallelization, where consecutive multi-view frames are processed simultaneously across distributed GPUs. The second level focuses on element-level parallelization, allowing for the independent reconstruction of dynamic elements like humans or objects within the scene. These reconstructed elements are then combined into an aggregated representation, followed by refinement to form a per-frame point cloud.
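The two-level scheme described above can be sketched as nested loops: an outer, frame-level parallel dispatch and an inner, per-element reconstruction whose outputs are merged into a per-frame point cloud. The following is a minimal illustration only; all function and field names are hypothetical, and a thread pool stands in for the distributed GPU nodes the patent describes.

```python
from concurrent.futures import ThreadPoolExecutor

def reconstruct_element(element):
    # Placeholder per-element reconstruction: returns a tiny "point cloud"
    # (list of tagged 3D-ish points) for one dynamic element.
    name, n_points = element
    return [(name, i, i * 0.1) for i in range(n_points)]

def reconstruct_frame(frame):
    # Element-level stage: reconstruct each element of one multi-view frame,
    # then combine the results into an aggregated per-frame point cloud.
    clouds = [reconstruct_element(e) for e in frame["elements"]]
    merged = [pt for cloud in clouds for pt in cloud]
    return frame["index"], merged

def reconstruct_sequence(frames, workers=2):
    # Frame-level stage: consecutive frames are dispatched to separate
    # workers and processed simultaneously.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(reconstruct_frame, frames))
```

In a real deployment the inner stage would also be parallel (one GPU per dynamic element), with a refinement pass over the merged cloud before output.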
Different reconstruction methods are applied to dynamic and static elements within the scene. Dynamic elements, such as human subjects, are initialized using 3D primitives from a fitted parametric model or a dual-branch renderer. The system refines these primitives through processes like pose estimation and skeleton optimization. Static elements, typically background components, are processed using a splatting-based method tailored to their characteristics.
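The split between dynamic and static pipelines amounts to a per-element dispatch. The sketch below is an assumption about how such routing might look, with stubbed-out routines standing in for the parametric-model initialization and the static splatting method; it is not the patented implementation.

```python
def init_dynamic(element):
    # Dynamic elements (e.g. human subjects): initialize 3D primitives from
    # a fitted parametric body model, then refine pose/skeleton (stubbed:
    # a hypothetical 10 primitives per skeleton joint).
    return {"id": element["id"], "method": "parametric+refine",
            "primitives": element.get("joints", 0) * 10}

def init_static(element):
    # Static background: splatting-based reconstruction tuned for
    # non-deforming geometry (stubbed: one primitive per observed point).
    return {"id": element["id"], "method": "static_splatting",
            "primitives": element.get("points", 0)}

def route_element(element):
    # Choose the reconstruction pipeline based on the element's type.
    return init_dynamic(element) if element["dynamic"] else init_static(element)
```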
The method involves several optimization steps for dynamic scenes. For human elements, the system gathers multi-view frames, estimates a 3D pose model, and applies splatting-based reconstruction, fitting a parametric mesh model to refine the skeleton and appearance. For static elements, the method fits a model of 3D primitives to the environment, optionally increases primitive density in regions of interest, and then optimizes appearance parameters using spherical harmonics.
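Spherical-harmonics appearance modeling typically stores a small set of SH coefficients per primitive and evaluates them along the view direction to produce a view-dependent color. A minimal degree-1 evaluation is shown below, using the real-SH constants common in splatting renderers; the coefficient layout is a hypothetical choice, not one specified in the source.

```python
SH_C0 = 0.28209479177387814   # real SH constant for Y_0^0 (DC term)
SH_C1 = 0.4886025119029199    # real SH constant scale for the Y_1^m band

def eval_sh_deg1(coeffs, direction):
    # coeffs: 4 SH coefficients per primitive, each an [r, g, b] triple,
    #         ordered [DC, m=-1, m=0, m=+1].
    # direction: unit-length view direction (x, y, z).
    x, y, z = direction
    # Real SH basis values for degrees 0 and 1, in the sign convention
    # used by common Gaussian-splatting implementations.
    basis = [SH_C0, -SH_C1 * y, SH_C1 * z, -SH_C1 * x]
    return [sum(b * c[ch] for b, c in zip(basis, coeffs)) for ch in range(3)]
```

With only the DC coefficient set, the color is view-independent; the degree-1 terms add a smooth directional tint, which is why the patent pairs SH with static background appearance.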
The invention can be implemented as a computer-implemented method or stored on a non-transitory computer-readable medium. It involves identifying elements in an environment, segmenting frames, optimizing models, and refining them to create a unified 3D model for each time frame. Enhancements include caching changes to avoid recomputation, capturing rendering operations as a static computational graph, and redistributing Gaussians to balance computational loads.
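One plausible reading of "redistributing Gaussians to balance computational loads" is a greedy assignment of per-element Gaussian counts across workers (longest-processing-time bin packing). The sketch below is an assumption for illustration; the patent does not specify this mechanism.

```python
def balance_gaussians(element_counts, n_workers):
    # element_counts: {element_name: number_of_gaussians}
    # Greedy LPT heuristic: take elements in descending Gaussian count and
    # assign each to the currently least-loaded worker.
    bins = [[] for _ in range(n_workers)]
    loads = [0] * n_workers
    for name, count in sorted(element_counts.items(), key=lambda kv: -kv[1]):
        i = loads.index(min(loads))  # least-loaded worker so far
        bins[i].append(name)
        loads[i] += count
    return bins, loads
```

For example, splitting a 100-Gaussian background and three smaller dynamic elements across two workers yields near-equal loads, which keeps the frame-level parallel stage from stalling on one overloaded node.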