Proposal
Move import wandb from the top level of train.py, where it runs every time the module is imported, into the train() function, guarded by if cfg.wandb.
Reason: wandb is large and slow to import. Many consumers of TransformerLens never train a model and so never need wandb; deferring the import removes that overhead, which is most noticeable on slower machines.
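
A minimal sketch of the proposed change, for illustration only. TrainConfig and its fields other than wandb are hypothetical stand-ins rather than the actual TransformerLens config, and the loop body is a placeholder for real training. The point is that import wandb runs only when cfg.wandb is true:

```python
import sys
from dataclasses import dataclass
from typing import List


@dataclass
class TrainConfig:
    # Hypothetical stand-in for the real training config; only `wandb` matters here.
    wandb: bool = False
    wandb_project_name: str = "example-project"  # assumed field name
    num_steps: int = 3


def train(cfg: TrainConfig) -> List[float]:
    if cfg.wandb:
        # Deferred import: the cost is paid only when logging is requested.
        import wandb

        wandb.init(project=cfg.wandb_project_name, config=vars(cfg))

    losses = []
    for step in range(cfg.num_steps):
        loss = 1.0 / (step + 1)  # placeholder for a real training step
        losses.append(loss)
        if cfg.wandb:
            wandb.log({"loss": loss}, step=step)

    if cfg.wandb:
        wandb.finish()
    return losses


if __name__ == "__main__":
    train(TrainConfig(wandb=False))
    # With logging disabled, train() itself never triggers the wandb import.
    print("wandb imported:", "wandb" in sys.modules)
```

With this layout, importing the module and calling train() with wandb disabled do not touch wandb at all, so it would also not need to be installed for non-training use.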