Conversation
5db0b20 to
c03b739
Compare
|
@lawrence-cj thanks for the PR! Could you also provide some sample outputs? |
| if getattr(self, "vae", None): | ||
| if hasattr(self.vae.config, "scale_factor_temporal"): | ||
| self.vae_scale_factor_temporal = self.vae.config.scale_factor_temporal | ||
| elif hasattr(self.vae.config, "temporal_compression_ratio"): | ||
| # LTX2 VAE uses temporal_compression_ratio | ||
| self.vae_scale_factor_temporal = self.vae.config.temporal_compression_ratio | ||
| else: | ||
| self.vae_scale_factor_temporal = getattr(self.vae, "temporal_compression_ratio", 4) | ||
|
|
||
| if hasattr(self.vae.config, "scale_factor_spatial"): | ||
| self.vae_scale_factor_spatial = self.vae.config.scale_factor_spatial | ||
| elif hasattr(self.vae.config, "spatial_compression_ratio"): | ||
| # LTX2 VAE uses spatial_compression_ratio | ||
| self.vae_scale_factor_spatial = self.vae.config.spatial_compression_ratio | ||
| else: | ||
| self.vae_scale_factor_spatial = getattr(self.vae, "spatial_compression_ratio", 8) | ||
| else: | ||
| self.vae_scale_factor_temporal = 4 | ||
| self.vae_scale_factor_spatial = 8 |
There was a problem hiding this comment.
Hmm, should this be conditioned on the class type of the VAE being used?
sayakpaul
left a comment
There was a problem hiding this comment.
Thanks, I just left one comment. But it looks good to me.
Updated code and result. |
dg845
left a comment
There was a problem hiding this comment.
Thanks for the PR! The code looks good to me. However, running the example script doesn't work for me because I don't have access to the Sana_video/safetensors/sana_ltxvae_sft checkpoint. Would it be possible to provide a checkpoint for testing?
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
This PR adds LTX-VAE support for SANA-Video.
Cc: @dg845 @sayakpaul
GPU memory needed: 47GB for LTX refiner
SANA-Video with LTX2-Refiner:
Result
sana_ltx_refined.mp4