Open
Description
Describe the bug
An error is raised when the pipeline tries to save the EMA weights:
ema_model = copy.deepcopy(self.transformer)
ERROR 12-04 14:11:23 [distillation_pipeline.py:490] Failed to save EMA weights: FSDP does not support deepcopy. Please use state dict for serialization.
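For reference, the error message suggests going through the state dict instead of `copy.deepcopy`. Below is a minimal sketch of what that could look like, not the project's actual fix. The helper names `init_ema_state` and `update_ema_state` are hypothetical, and the example uses a plain `nn.Module` as a stand-in. On an FSDP-wrapped module, what `state_dict()` returns depends on the FSDP configuration, and every rank would likely need to call it together; I have not tested this against the distillation pipeline.

```python
import torch
from torch import nn


def init_ema_state(model: nn.Module) -> dict:
    """Hypothetical helper: snapshot the state dict as detached CPU tensors.

    This avoids copy.deepcopy on the module itself.
    """
    return {name: t.detach().clone().cpu() for name, t in model.state_dict().items()}


@torch.no_grad()
def update_ema_state(ema_state: dict, model: nn.Module, decay: float = 0.999) -> None:
    """Hypothetical helper: in-place EMA update from the current state dict."""
    for name, t in model.state_dict().items():
        src = t.detach().cpu()
        if src.is_floating_point():
            # ema = decay * ema + (1 - decay) * src
            ema_state[name].lerp_(src.to(ema_state[name].dtype), 1.0 - decay)
        else:
            # Integer buffers (e.g. step counters) are copied, not averaged.
            ema_state[name].copy_(src)


if __name__ == "__main__":
    model = nn.Linear(4, 4)  # stand-in for self.transformer
    ema_state = init_ema_state(model)
    update_ema_state(ema_state, model, decay=0.999)
    print(sorted(ema_state.keys()))
```

The resulting `ema_state` is an ordinary dict of tensors, so it could be written out with `torch.save(ema_state, path)` rather than by copying the wrapped module.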
Reproduction
Run distill_dmd.sh with my custom preprocessed dataset.
Environment
- CUDA 12.8, PyTorch 2.7.1
- 8x H100