US20240143543
2024-05-02
Physics
G06F16/113
 
                 
                 
                 
                Overview: The patent application discusses methods, systems, and computer-readable media designed for the automatic archiving of data store log data. It focuses on selecting operation records from logs for archival before deletion, replicating them to an archive, and marking them as archived, thus allowing their deletion from the log. This process aims to facilitate long-term storage of change data efficiently.
Background: With the migration of business applications to cloud environments, data centers have become integral for providing scalable and cost-effective computing resources. These centers utilize virtualization to serve multiple clients, maintaining high service quality and efficient resource utilization. Although database services offer APIs for accessing data objects, historical change data is often inaccessible but valuable for applications like offline data mining and checkpointing.
Embodiments: The described embodiments capture changes to data objects in a log until they expire. A fleet of workers replicates these log entries to an archive, marking them for deletion in the log. Clients can access archived data directly using mapping metadata, benefiting from low-cost storage resources while having access to a longer history of changes.
System Environment: The system includes a data store archiving system with components like change logging, operation record selection, and archive replication functionalities. Clients interact with this system through APIs to read, write, or modify data objects. The provider network supports various storage-related services, accommodating different types of databases and storage interfaces.
Implementation: The archiving system's components can be distributed across multiple data centers and geographical locations, operating automatically and programmatically. This setup ensures efficient archiving without user intervention post-initial configuration. The system's modularity allows it to adapt to different client needs and infrastructure configurations.