Skip to content

Support for Multi-Image Input LoRA Training Pipeline for FLUX.2-Klein (e.g., Style Transfer and Multi-Subject Composition) #13008

@qizhuzhuang

Description

@qizhuzhuang

Model/Pipeline/Scheduler description

I would like to request official support in Diffusers for a multi-image input LoRA training pipeline targeting the FLUX.2 Klein model. It seems that existing LoRA training pipelines are designed around single-image conditioning, which limits their applicability for tasks that naturally require multiple reference images. Clear guidance, reference implementations, or examples demonstrating how multi-image conditioning could be handled during training would be highly valuable. Thank you for your continued work on Diffusers, and I would greatly appreciate any insights, recommendations, or future plans related to supporting this capability.

Open source status

  • The model implementation is available.
  • The model weights are available (Only relevant if addition is not a scheduler).

Provide useful links for the implementation

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions