Skip to content

[Feature]: AutoDeploy : Automatically apply weight preloading #11925

@taylor-yb-lee

Description

@taylor-yb-lee

🚀 The feature, motivation and pitch

refer to the previous issue : #11819
For some models like in the above isuse , weight preloading increases the loading time dramatically.
For the above issue, we turned off the preloading in the model specific config file.
However, need better approach which decides the preloading strategy automatically.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.

Metadata

Metadata

Assignees

Labels

AutoDeploy<NV> AutoDeploy Backendfeature requestNew feature or request. This includes new model, dtype, functionality support

Type

No type

Projects

Status

Backlog

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions