What would you like to happen?
PCollections of Beam Rows are required to have a fixed schema, making it hard to read or write records with varying logical schemas in the same PCollection.
We need a semi-structured type that can contain a nested level of variable columns. Transforms can choose to unwrap this type to reconstruct the variable columns accordingly.
The Variant type is becoming a good standard across different projects (see Parquet, Spark, Flink). It could also be a good candidate for Beam.
Bonus if such a type is made portable so that other SDKs can use it too
Issue Priority
Priority: 2 (default / most feature requests should be filed as P2)
Issue Components
What would you like to happen?
PCollections of Beam Rows are required to have a fixed schema, making it hard to read or write records with varying logical schemas in the same PCollection.
We need a semi-structured type that can contain a nested level of variable columns. Transforms can choose to unwrap this type to reconstruct the variable columns accordingly.
The Variant type is becoming a good standard across different projects (see Parquet, Spark, Flink). It could also be a good candidate for Beam.
Bonus if such a type is made portable so that other SDKs can use it too
Issue Priority
Priority: 2 (default / most feature requests should be filed as P2)
Issue Components