Antlera commented Aug 1, 2025

This PR adds a blog post and images for ZenFlow, introducing its design, benefits, and usage. The blog explains how ZenFlow improves GPU utilization by overlapping computation and communication during offloaded training.

See also:
#7391 – core ZenFlow implementation.
deepspeedai/DeepSpeedExamples#982 – benchmarking and fine-tuning example.

Antlera and others added 2 commits July 31, 2025 22:52
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
Signed-off-by: Tingfeng Lan <erc8gx@virginia.edu>
Signed-off-by: Tingfeng Lan <erc8gx@virginia.edu>
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>

Antlera commented Aug 4, 2025

@sfc-gh-truwase Thanks for the great suggestions, all applied now! The blog should read much more clearly. Let me know if you spot anything else.

@sfc-gh-truwase sfc-gh-truwase merged commit cda3f96 into deepspeedai:master Aug 10, 2025
2 checks passed
LYMDLUT pushed a commit to LYMDLUT/DeepSpeed that referenced this pull request Aug 20, 2025
Signed-off-by: Tingfeng Lan <erc8gx@virginia.edu>
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
Co-authored-by: Hongwei Chen <33092912+hwchen2017@users.noreply.github.com>
Signed-off-by: lym <letusgo126@126.com>
mauryaavinash95 pushed a commit to DataStates/DeepSpeed that referenced this pull request Oct 4, 2025
Signed-off-by: Tingfeng Lan <erc8gx@virginia.edu>
Co-authored-by: Olatunji Ruwase <tunji.ruwase@snowflake.com>
Co-authored-by: Hongwei Chen <33092912+hwchen2017@users.noreply.github.com>
3 participants