US20250247606
2025-07-31
Electricity
H04N23/632
An electronic device is designed to enhance image generation and editing by leveraging a camera, display, processor, and memory. The device displays a first image and a preview image simultaneously, with the latter captured by the camera. By identifying objects in these images, it uses a generative AI model to transform parts of the first image based on objects in the preview image, resulting in a second image that is displayed on the device.
The device's functionality is driven by instructions stored in memory and executed by its processor. These instructions enable the display of images, identification of objects within them, and transformation of these objects using metadata like focal length and shutter speed. The process includes displaying a preview image over the first image without overlap and generating an edited second image in real-time.
Beyond basic transformations, the device can perform complex edits such as in-painting using AI models. It can also generate text prompts related to object characteristics for further refinement. The second image can be generated locally or via an external server hosting the AI model, enabling collaborative processing for enhanced results.
User input plays a significant role in the device's operation. Users can trigger various functions such as face detection and recognition to identify specific objects. Additionally, the device supports interactive editing through user inputs on graphic objects displayed alongside images, enhancing user engagement and control over the editing process.
The technology is applicable across various electronic devices such as smartphones, tablets, and computers. It supports seamless integration with external devices for obtaining preview images, broadening its usability. This approach not only enriches user experience but also aligns with modern demands for high-quality image processing and editing capabilities.