Model/Pipeline/Scheduler description
I would like to request official support in Diffusers for a multi-image input LoRA training pipeline targeting the FLUX.2 Klein model. It seems that existing LoRA training pipelines are designed around single-image conditioning, which limits their applicability for tasks that naturally require multiple reference images. Clear guidance, reference implementations, or examples demonstrating how multi-image conditioning could be handled during training would be highly valuable. Thank you for your continued work on Diffusers, and I would greatly appreciate any insights, recommendations, or future plans related to supporting this capability.
Open source status
Provide useful links for the implementation
No response