Skip to content

Conversation

@gene-bordegaray
Copy link
Contributor

@gene-bordegaray gene-bordegaray commented Jan 24, 2026

Which issue does this PR close?

add avg byte size in column stats. This is just a first PR so kept scope small. Will follow up with more propagation and adding more support for different types.

this will help estimating cardinality thus planning in CBO and for distributed.

Mentioned in this point: "Output byte size estimation : rows * avg_row_width for network cost awareness (we discussed this offline, avg_row_width can be computed based on the row type and avg_len/avg_size for collections or length varying types)"

@github-actions github-actions bot added physical-expr Changes to the physical-expr crates core Core DataFusion crate sqllogictest SQL Logic Tests (.slt) common Related to common crate proto Related to proto crate datasource Changes to the datasource crate physical-plan Changes to the physical-plan crate labels Jan 24, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

common Related to common crate core Core DataFusion crate datasource Changes to the datasource crate physical-expr Changes to the physical-expr crates physical-plan Changes to the physical-plan crate proto Related to proto crate sqllogictest SQL Logic Tests (.slt)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant