Tag: Data Engineering
All the articles with the tag "Data Engineering".
Maintaining Apache Iceberg Tables: Compaction, Snapshot Expiration, and Orphan File Cleanup
Published: at 10:00 AM
An in-depth guide to orchestrating maintenance operations on Apache Iceberg tables, covering compaction (bin-packing, sort-based, and Z-Order), snapshot expiration, and orphan file removal, with query acceleration details for the Dremio engine.
Apache Iceberg with Spark: Create, MERGE, Upsert, and Evolve Tables End to End
Published: at 10:00 AM
A comprehensive developer guide to configuring Apache Spark with Apache Iceberg, executing transactional writes, and managing schema evolution.
Common Misconceptions About Data Lakehouse and Apache Iceberg
Published: at 09:00 AM
Addressing common search queries and reader confusion about Data Lakehouse architectures, Apache Iceberg catalogs, partitions, and lock-in.