Skip to content

Empty data handling #295

@Gautzilla

Description

@Gautzilla

Right now, we have to manually filter out empty data from core datastes:

ads = AudioDataset(...)
ads.data = [data for data in ads.data if not data.is_empty]

Maybe we should implement a new version of the feature that existed in the legacy OSEkit, with a threshold of emptiness under which the data is rejected, which would be set to 0 by default?

ads = AudioDataset(..., data_emptniess_threshold = .05) # All AudioData that are 95% empty or more are rejected

The old behaviour of keeping empty data could be enforced by passing None to this parameter or whatever.

Also add trim functions that remove the empty items of data that are located on the sides? It seems like it's something that could be used regularly.

Image

What do you think @mathieudpnt?

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions