Kidong Lee’s Post

View profile for Kidong Lee, graphic

Founder of Cloud Chef Labs | Chango | Unified Data Lakehouse Platform | Iceberg centric Data Lakehouses

With data virtualization, your distributed data sources can be joined and queried without data movement. Nowadays, data virtualization is mandatory to build data lakehouses in your organization. But if you want to build your data lakehouses with the support of iceberg, you need to support theses for your data virtualization. - data lakehouse access control with storage security - iceberg catalog - automated iceberg table maintenance And you may also consider query engines used for your data virtualization with the following feature. - No downtime to run queries. - scale out query engines. The below picture shows how data virtualization works in Chango(https://lnkd.in/gHvp6Eud). Users send queries just to endpoints of Chango data virtualization. The queries will be executed by the query engines of trino and spark which can join your distributed data sources for your data virtualization. Chango Trino Gateway is used to provide no downtime to run trino queries with scaling out small trino clusters rather than one monolithic giant trino cluster. For iceberg support, Chango provides Chango REST Catalog which is an Iceberg REST Catalog and automated iceberg table maintenance. Strong data lakehouse access control is also provided with catalog, schema and table level.

  • diagram

To view or add a comment, sign in

Explore topics