Skip to content

docs: add dynamic model mount examples#29

Draft
imeoer wants to merge 1 commit intomodelpack:mainfrom
imeoer:docs
Draft

docs: add dynamic model mount examples#29
imeoer wants to merge 1 commit intomodelpack:mainfrom
imeoer:docs

Conversation

@imeoer
Copy link
Collaborator

@imeoer imeoer commented Mar 19, 2026

This pull request updates the Model CSI Driver documentation and Helm chart configuration to clarify deployment, add new configuration options, and provide comprehensive usage instructions, especially around dynamic model mounting. The most important changes are grouped below:

Documentation improvements and expanded usage examples:

  • The docs/getting-started.md file is rewritten for clarity, now covering both static and dynamic inline mount flows, with detailed instructions for using the Unix Domain Socket (UDS) HTTP API for dynamic model mounting. The doc now explains how to discover the socket, interact with the REST API, filter files, and handle error responses. It also adds troubleshooting guidance and clarifies supported volume attributes. [1] [2] [3]

Helm chart configuration enhancements:

  • Adds two new configuration fields to charts/model-csi-driver/values.yaml: dynamicCsiEndpoint (for legacy dynamic CSI socket compatibility) and metricsAddr (to control the Prometheus metrics listener address). These options are now also rendered into the generated configmap. [1] [2]

Configuration template updates:

  • Updates charts/model-csi-driver/templates/configmap.yaml to include the new dynamic_csi_endpoint and metrics_addr fields in the rendered configuration file, ensuring the driver can be configured for dynamic mounting and metrics.

These changes improve user onboarding, enable advanced dynamic mounting scenarios, and make the Helm chart more configurable for real-world deployments.

Signed-off-by: imeoer <yansong.ys@antgroup.com>
@gemini-code-assist
Copy link

Summary of Changes

Hello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly improves the user experience for deploying and managing models with the Model CSI Driver by clarifying and expanding its documentation, particularly for dynamic model mounting scenarios. It also enhances the Helm chart's configurability, enabling more flexible and robust deployments by introducing new options for dynamic CSI endpoint compatibility and Prometheus metrics exposure.

Highlights

  • Enhanced Documentation for Model Mounting: The docs/getting-started.md file has been significantly rewritten to provide clearer and more comprehensive instructions for both static and dynamic inline model mounting. It now includes detailed guidance on using the Unix Domain Socket (UDS) HTTP API for dynamic mounting, covering socket discovery, API interaction, file filtering, error handling, and troubleshooting.
  • Helm Chart Configuration Updates: Two new configuration fields, dynamicCsiEndpoint and metricsAddr, have been added to charts/model-csi-driver/values.yaml. These fields allow for configuring a legacy dynamic CSI socket and the Prometheus metrics listener address, respectively. These new options are also now rendered into the generated configmap.yaml.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request significantly improves the documentation for the Model CSI Driver by adding detailed examples for dynamic model mounting via the UDS HTTP API. The Helm chart is also updated to support new configuration options for this feature. The changes are well-structured and the new documentation is comprehensive. I have a couple of minor suggestions to improve the clarity of the examples in the documentation.

Comment on lines +182 to +185
"exclude_file_patterns": [
"model.safetensors.index.json",
"!tiktoken.model"
]

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The example for exclude_file_patterns could be confusing without more context on the pattern syntax. The use of !tiktoken.model suggests an exception to a rule, but no such rule is present in the example. To improve clarity, please consider adding a brief explanation of the pattern syntax, especially the ! prefix for inclusions/exceptions. For example:

"Patterns follow .gitignore conventions. Prefacing a pattern with ! will negate it. This is useful for re-including a file if it was excluded by a broader pattern like * or *.model."

Comment on lines +229 to +237
"started_at": "2025-06-10T20:19:12.797873473+08:00",
"finished_at": "2025-06-10T20:19:15.046158731+08:00"
},
{
"digest": "sha256:70c80fe937f84ce03629c7b397038a1566cac5aeabad92b5344384aa8f13f44c",
"path": "/configuration.json",
"size": 2048,
"started_at": "2025-06-10T20:19:12.79806982+08:00"
}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The example timestamps use the year 2025, which is in the future. This can be slightly confusing for readers. Using a past or the current year would make the example more grounded and realistic.

Suggested change
"started_at": "2025-06-10T20:19:12.797873473+08:00",
"finished_at": "2025-06-10T20:19:15.046158731+08:00"
},
{
"digest": "sha256:70c80fe937f84ce03629c7b397038a1566cac5aeabad92b5344384aa8f13f44c",
"path": "/configuration.json",
"size": 2048,
"started_at": "2025-06-10T20:19:12.79806982+08:00"
}
"started_at": "2024-06-10T20:19:12.797873473+08:00",
"finished_at": "2024-06-10T20:19:15.046158731+08:00"
},
{
"digest": "sha256:70c80fe937f84ce03629c7b397038a1566cac5aeabad92b5344384aa8f13f44c",
"path": "/configuration.json",
"size": 2048,
"started_at": "2024-06-10T20:19:12.79806982+08:00"
}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant