
Image attached for reference
I tried changing steps, and different inferences, but I never got the "people" to look anything like the source images. I tried different prompts, including the position of the tags, but nothing gave me better results than what I could get from a straight text2image workflow.
I appreciate the opportunity to test and to see the great effort in the AI Image generation space, but it seems like this is better for inpainting-type use cases rather than trying to composite people into photos with accuracy.
I did not try to just put a single person in and then use that generated image to add a second person, something to test later.
Thank You.