🌊 Git-like Version Control for Data with Nessie, Iceberg, and Spark
-
Updated
Jan 21, 2025 - Jupyter Notebook
🌊 Git-like Version Control for Data with Nessie, Iceberg, and Spark
Apache Iceberg is an open table format for large analytic datasets that provides ACID transactions, schema evolution, hidden partitioning, and time travel. It works with Spark, Flink, Hive, Presto, Trino, DuckDB, ClickHouse, and many more compute engines. Governed by the Apache Software Foundation under the Apache 2.0 license.
preparations for a transfer to SQL. horizontal tables in different sheets in multiple.xlsx to a single long table
Add a description, image, and links to the table-format topic page so that developers can more easily learn about it.
To associate your repository with the table-format topic, visit your repo's landing page and select "manage topics."