Description: Improving our format identifications could simplify risk matching. Possible methods:
- If the same format is identified both with and without a PUID (by different tools), keep only the identification with the PUID.
- Handle format names and file extensions that consistently mismatch.
- Narrow which tools are used with which formats, to eliminate tools that are always wrong for a given format.
- Map acronyms for cases where FITS spells out a format name and NARA does not, or vice versa.
- Extract version numbers from FITS identifications where the version is part of the format name.
- Split an identification into multiple IDs if it has more than one version or PUID?
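
Two of the methods above (dropping PUID-less duplicates and pulling a version out of the format name) could be sketched roughly as follows. This is only an illustration: the row keys (`path`, `name`, `version`, `puid`), the function names, and the version pattern are assumptions, not the actual column names or code in our scripts, and the sample rows are made up.

```python
import re


def drop_puid_less_duplicates(rows):
    """Drop rows without a PUID when the same file/format/version also has a row with one.

    Each row is a dict with hypothetical keys: path, name, version, puid.
    """
    def key(row):
        return (row["path"], row["name"].lower(), row["version"])

    # Every (file, format name, version) combination that has at least one PUID.
    with_puid = {key(r) for r in rows if r["puid"]}

    # Keep rows that have a PUID, or that have no PUID-bearing twin.
    return [r for r in rows if r["puid"] or key(r) not in with_puid]


# Assumed pattern: the name ends in a space followed by a dotted number,
# optionally preceded by "version " or "v".
VERSION_IN_NAME = re.compile(
    r"^(?P<name>.+?)\s+(?:version\s+|v)?(?P<version>\d+(?:\.\d+)*)$",
    re.IGNORECASE,
)


def split_version(name):
    """Return (name, version) if the name ends in a version number, else (name, None)."""
    match = VERSION_IN_NAME.match(name.strip())
    if match:
        return match.group("name"), match.group("version")
    return name.strip(), None


if __name__ == "__main__":
    sample = [
        {"path": "a.pdf", "name": "Portable Document Format", "version": "1.4", "puid": "fmt/18"},
        {"path": "a.pdf", "name": "Portable Document Format", "version": "1.4", "puid": None},
        {"path": "b.txt", "name": "Plain text", "version": None, "puid": None},
    ]
    for row in drop_puid_less_duplicates(sample):
        print(row["path"], row["name"], row["puid"])
    print(split_version("JPEG File Interchange Format 1.01"))
```

Matching on the lowercased name plus version is deliberately strict, so it would only remove exact duplicates; names that differ by acronym or spelling would still need the mapping step first.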