UNIFIED DATA MANAGEMENT AND ANALYTICS IN CLOUD-BASED DATA LAKE ENVIRONMENTS
Inventors
Johannes Alberti, Arthur De Bortoli, Roberto Raquze Flores, Nagavijay Gamidi, Rudolf Hennecke, Felipe Einsfeld Kersting, Gabriel Alexandro Pohlod Maciel, Florian Maier, Klaus Nagel, Silvio Normey Gomez, Sailesh Radhakrishnan, Jens Rannacher, Rubens Luiz Rech Jr., Peter Schoenau, Michael te Uhle, Dylan Vollrath, Daniel Ritter, Mihnea Andrei, Amit Pathak, Bjoern Friedmann
Abstract
A cloud-based data platform is disclosed that enables storage-agnostic data management and analytics in a unified environment. The system may integrate a data lake implemented as a hyperscaler object store, a cloud-based database management system (DBMS), elastic compute resources, and analytics engines. Data may be stored in open file and table formats, such as Apache Parquet, Delta Lake, and Apache Iceberg, with the table formats supporting ACID transactions, schema evolution, and efficient query processing. Virtual tables may map data stored in the data lake to the DBMS, enabling in-situ query processing via SQL interfaces. The platform may support advanced features, including change data capture (CDC), time travel, and lifecycle management that uses a saga pattern to make multi-step operations atomic. Security may be enforced through X.509 certificates, web tokens, and role-based access controls. Elastic compute resources, such as Apache Spark, may facilitate large-scale data transformations and analytics.
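The saga pattern named in the abstract can be illustrated with a minimal Python sketch. It is not the disclosed implementation. The names `SagaStep` and `run_saga`, and the three lifecycle steps (creating a lake folder, registering a virtual table, granting access), are hypothetical and chosen only for illustration. Each step pairs an action with a compensating action. If any step fails, the compensations of the steps that already completed run in reverse order, so the operation as a whole either takes effect completely or leaves no trace.

```python
from dataclasses import dataclass
from typing import Callable, List, Set


@dataclass
class SagaStep:
    """One step of a saga: a forward action and the action that undoes it."""
    name: str
    action: Callable[[], None]
    compensate: Callable[[], None]


def run_saga(steps: List[SagaStep]) -> bool:
    """Run steps in order; on failure, undo completed steps in reverse order."""
    completed: List[SagaStep] = []
    for step in steps:
        try:
            step.action()
        except Exception:
            # Roll back everything that succeeded before the failing step.
            for done in reversed(completed):
                done.compensate()
            return False
        completed.append(step)
    return True


# Hypothetical in-memory stand-in for resources that a lifecycle operation
# might create across the object store and the DBMS.
resources: Set[str] = set()


def failing_grant() -> None:
    # Simulated failure in the last step of the operation.
    raise RuntimeError("simulated failure while granting access")


failed_run = run_saga([
    SagaStep("create_lake_folder",
             lambda: resources.add("folder"),
             lambda: resources.discard("folder")),
    SagaStep("register_virtual_table",
             lambda: resources.add("virtual_table"),
             lambda: resources.discard("virtual_table")),
    SagaStep("grant_access", failing_grant, lambda: None),
])
resources_after_failure = set(resources)

successful_run = run_saga([
    SagaStep("create_lake_folder",
             lambda: resources.add("folder"),
             lambda: resources.discard("folder")),
    SagaStep("register_virtual_table",
             lambda: resources.add("virtual_table"),
             lambda: resources.discard("virtual_table")),
])
```

In this sketch the first run fails at the third step, so both earlier steps are compensated and no resources remain. The second run completes and leaves both resources in place. A production saga would also need to persist its progress and handle failures of the compensating actions themselves, which this sketch omits.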
CPC Classifications
Filing Date
2025-09-16
Application No.
19330441