Conversation

@dxqb
Contributor

@dxqb dxqb commented Dec 2, 2025

addresses #12776

What does this PR do?

This PR keeps the tuples, but moves the splitting of tensors into tuples of tensors into the transformer blocks, to avoid issues with checkpointing. By passing a tensor directly, torch.utils.checkpoint() identifies it as an input and saves it accordingly, without running backward through it multiple times.
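For illustration, a minimal, hypothetical sketch of the pattern (the block and its layers are made up and are not the actual diffusers modules): the split into per-stream tensors happens inside the block, so the checkpointed call receives the full tensor as its input.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class Block(nn.Module):
    """Toy transformer-block stand-in: the split into text/image streams
    happens *inside* the block, so checkpoint() sees a single tensor input."""

    def __init__(self, dim: int):
        super().__init__()
        self.txt_proj = nn.Linear(dim, dim)
        self.img_proj = nn.Linear(dim, dim)

    def forward(self, hidden_states: torch.Tensor, txt_len: int) -> torch.Tensor:
        # Split inside the block instead of in the outer model's forward().
        txt = hidden_states[:, :txt_len]
        img = hidden_states[:, txt_len:]
        txt = self.txt_proj(txt)
        img = self.img_proj(img)
        return torch.cat([txt, img], dim=1)


dim, txt_len = 8, 3
block = Block(dim)
x = torch.randn(2, 10, dim, requires_grad=True)

# checkpoint() receives the full tensor `x` directly and saves it as an input,
# rather than elements of a tuple produced by a split outside the block.
out = checkpoint(block, x, txt_len, use_reentrant=False)
out.sum().backward()
```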

This is a draft. If you agree with this change I can clean it up. Among other things:

  • the type hints are now incorrect and need to be updated
  • the splitting might not be necessary anymore, since the resulting tensors are used immediately afterwards

Who can review?

@yiyixuxu and @asomoza

@github-actions
Contributor

github-actions bot commented Jan 9, 2026

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot added the "stale" label (Issues that haven't received updates) Jan 9, 2026