Skip to content

Incremental Changelog Scan#782

Draft
chinmay-bhat wants to merge 8 commits intoapache:mainfrom
chinmay-bhat:incremental_changelog_scan
Draft

Incremental Changelog Scan#782
chinmay-bhat wants to merge 8 commits intoapache:mainfrom
chinmay-bhat:incremental_changelog_scan

Conversation

@chinmay-bhat
Copy link
Contributor

@chinmay-bhat chinmay-bhat commented May 30, 2024

Draft PR. Waiting on incremental append scan #533 to merge.

  • Integration test (test output has both Insert and Delete operations)

I created a new class ChangelogScanTask(ScanTask) to store change_ordinal and other parameters.
The plan_files() in the base class BaseIncrementalScan returns a Iterable[FileScanTask].
pyarrow.project_table() also expects tasks as Iterable[FileScanTask].

TODO:

  • in BaseIncrementalScan, change return type from Iterable[FileScanTask] to Iterable[ScanTask] to allow different scan tasks.
  • [needs discussion] update pyarrow.project_table() to be @singledispatch to allow different behaviours of project_table() depending on the ScanTask type.

@github-actions
Copy link

github-actions bot commented Mar 7, 2026

This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that's incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@iceberg.apache.org list. Thank you for your contributions.

@github-actions github-actions bot added the stale label Mar 7, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant