Skip to content

Commit 44014b2

Browse files
committed
Deploying to gh-pages from @ dstackai/dstack@16ddda8 🚀
1 parent a6d8dab commit 44014b2

File tree

111 files changed

+7406
-136
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

111 files changed

+7406
-136
lines changed

404.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1777,6 +1777,34 @@
17771777

17781778

17791779

1780+
<li class="md-nav__item">
1781+
<a href="/docs/reference/cli/dstack/offer/" class="md-nav__link">
1782+
1783+
1784+
1785+
<span class="md-ellipsis">
1786+
1787+
1788+
dstack offer
1789+
1790+
1791+
1792+
</span>
1793+
1794+
1795+
1796+
</a>
1797+
</li>
1798+
1799+
1800+
1801+
1802+
1803+
1804+
1805+
1806+
1807+
17801808
<li class="md-nav__item">
17811809
<a href="/docs/reference/cli/dstack/volume/" class="md-nav__link">
17821810

29.3 KB
Loading

blog/amd-mi300x-inference-benchmark/index.html

Lines changed: 32 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1807,6 +1807,34 @@
18071807

18081808

18091809

1810+
<li class="md-nav__item">
1811+
<a href="../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1812+
1813+
1814+
1815+
<span class="md-ellipsis">
1816+
1817+
1818+
dstack offer
1819+
1820+
1821+
1822+
</span>
1823+
1824+
1825+
1826+
</a>
1827+
</li>
1828+
1829+
1830+
1831+
1832+
1833+
1834+
1835+
1836+
1837+
18101838
<li class="md-nav__item">
18111839
<a href="../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18121840

@@ -3591,7 +3619,7 @@
35913619
<span class="md-ellipsis">
35923620

35933621
<span class="md-typeset">
3594-
vRAM consumption
3622+
VRAM consumption
35953623
</span>
35963624

35973625
</span>
@@ -3912,8 +3940,8 @@ <h3 id="tokensec-and-ttft-per-rps">Token/sec and TTFT per RPS<a class="headerlin
39123940
performance improved notably when the number of requests was below 900.</p>
39133941
</blockquote>
39143942
<p><img src="https://raw.githubusercontent.com/dstackai/benchmarks/refs/heads/main/amd/inference/charts_rps/mean_ttft_tgi_vllm.png" width="725" style="padding: 0 40px 0 50px"/></p>
3915-
<h3 id="vram-consumption">vRAM consumption<a class="headerlink" href="#vram-consumption" title="Permanent link">&para;</a></h3>
3916-
<p>When considering vRAM consumption right after loading model weights, TGI allocates approximately 28% less vRAM compared
3943+
<h3 id="vram-consumption">VRAM consumption<a class="headerlink" href="#vram-consumption" title="Permanent link">&para;</a></h3>
3944+
<p>When considering VRAM consumption right after loading model weights, TGI allocates approximately 28% less VRAM compared
39173945
to vLLM.</p>
39183946
<p><img src="https://raw.githubusercontent.com/dstackai/benchmarks/refs/heads/main/amd/inference/gpu_vram_tgi_vllm.png" width="750" /></p>
39193947
<p>This difference may be related to how vLLM <a href="https://docs.vllm.ai/en/latest/models/performance.html" target="_blank">pre-allocates GPU cache <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a>.</p>
@@ -3931,7 +3959,7 @@ <h2 id="conclusion">Conclusion<a class="headerlink" href="#conclusion" title="Pe
39313959
<li>With vLLM, we used the default backend configuration. With better tuning, we might have achieved improved performance.</li>
39323960
</ul>
39333961
</div>
3934-
<p>In general, the 8x AMD MI300X is a good fit for larger models and allows us to make the most of its vRAM, especially for
3962+
<p>In general, the 8x AMD MI300X is a good fit for larger models and allows us to make the most of its VRAM, especially for
39353963
larger batches.</p>
39363964
<p>If you’d like to support us in doing more benchmarks, please let us know.</p>
39373965
<h2 id="whats-next">What's next?<a class="headerlink" href="#whats-next" title="Permanent link">&para;</a></h2>

blog/amd-on-runpod/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1805,6 +1805,34 @@
18051805

18061806

18071807

1808+
<li class="md-nav__item">
1809+
<a href="../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1810+
1811+
1812+
1813+
<span class="md-ellipsis">
1814+
1815+
1816+
dstack offer
1817+
1818+
1819+
1820+
</span>
1821+
1822+
1823+
1824+
</a>
1825+
</li>
1826+
1827+
1828+
1829+
1830+
1831+
1832+
1833+
1834+
1835+
18081836
<li class="md-nav__item">
18091837
<a href="../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18101838

blog/amd-on-tensorwave/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1807,6 +1807,34 @@
18071807

18081808

18091809

1810+
<li class="md-nav__item">
1811+
<a href="../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1812+
1813+
1814+
1815+
<span class="md-ellipsis">
1816+
1817+
1818+
dstack offer
1819+
1820+
1821+
1822+
</span>
1823+
1824+
1825+
1826+
</a>
1827+
</li>
1828+
1829+
1830+
1831+
1832+
1833+
1834+
1835+
1836+
1837+
18101838
<li class="md-nav__item">
18111839
<a href="../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18121840

blog/archive/2024/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1801,6 +1801,34 @@
18011801

18021802

18031803

1804+
<li class="md-nav__item">
1805+
<a href="../../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1806+
1807+
1808+
1809+
<span class="md-ellipsis">
1810+
1811+
1812+
dstack offer
1813+
1814+
1815+
1816+
</span>
1817+
1818+
1819+
1820+
</a>
1821+
</li>
1822+
1823+
1824+
1825+
1826+
1827+
1828+
1829+
1830+
1831+
18041832
<li class="md-nav__item">
18051833
<a href="../../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18061834

blog/archive/2024/page/2/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1801,6 +1801,34 @@
18011801

18021802

18031803

1804+
<li class="md-nav__item">
1805+
<a href="../../../../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1806+
1807+
1808+
1809+
<span class="md-ellipsis">
1810+
1811+
1812+
dstack offer
1813+
1814+
1815+
1816+
</span>
1817+
1818+
1819+
1820+
</a>
1821+
</li>
1822+
1823+
1824+
1825+
1826+
1827+
1828+
1829+
1830+
1831+
18041832
<li class="md-nav__item">
18051833
<a href="../../../../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18061834

blog/archive/2025/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1803,6 +1803,34 @@
18031803

18041804

18051805

1806+
<li class="md-nav__item">
1807+
<a href="../../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1808+
1809+
1810+
1811+
<span class="md-ellipsis">
1812+
1813+
1814+
dstack offer
1815+
1816+
1817+
1818+
</span>
1819+
1820+
1821+
1822+
</a>
1823+
</li>
1824+
1825+
1826+
1827+
1828+
1829+
1830+
1831+
1832+
1833+
18061834
<li class="md-nav__item">
18071835
<a href="../../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18081836

blog/archive/2025/page/2/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1803,6 +1803,34 @@
18031803

18041804

18051805

1806+
<li class="md-nav__item">
1807+
<a href="../../../../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1808+
1809+
1810+
1811+
<span class="md-ellipsis">
1812+
1813+
1814+
dstack offer
1815+
1816+
1817+
1818+
</span>
1819+
1820+
1821+
1822+
</a>
1823+
</li>
1824+
1825+
1826+
1827+
1828+
1829+
1830+
1831+
1832+
1833+
18061834
<li class="md-nav__item">
18071835
<a href="../../../../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18081836

blog/archive/ambassador-program/index.html

Lines changed: 28 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1799,6 +1799,34 @@
17991799

18001800

18011801

1802+
<li class="md-nav__item">
1803+
<a href="../../../docs/reference/cli/dstack/offer/" class="md-nav__link">
1804+
1805+
1806+
1807+
<span class="md-ellipsis">
1808+
1809+
1810+
dstack offer
1811+
1812+
1813+
1814+
</span>
1815+
1816+
1817+
1818+
</a>
1819+
</li>
1820+
1821+
1822+
1823+
1824+
1825+
1826+
1827+
1828+
1829+
18021830
<li class="md-nav__item">
18031831
<a href="../../../docs/reference/cli/dstack/volume/" class="md-nav__link">
18041832

0 commit comments

Comments
 (0)