Skip to content

SaveDatasetSpec should check for existing dataset and apply UpdateExisting policy #477

@leoschwarz

Description

@leoschwarz

Enhancement

When outputs.yml contains a bfabric_dataset spec, register_outputs does not check whether a dataset with the same name already exists on the workunit. If the output registration is re-run (e.g. after a retry or resume), it creates a duplicate dataset instead of updating the existing one.

Expected

SaveDatasetSpec should support an update_existing field (like CopyResourceSpec and SaveLinkSpec already do) and check for an existing dataset before creating a new one, applying the UpdateExisting policy accordingly.

Current behavior

  • CopyResourceSpec — checks for existing resource, respects update_existing
  • SaveLinkSpec — checks for existing link, respects update_existing
  • SaveDatasetSpec — always creates a new dataset, no duplicate check

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions