Skip to content

[Feature] Support asynchronous dynamic lora loading/unloading #8162

@Fridge003

Description

@Fridge003

Checklist

Motivation

Dynamic loading/unloading of lora adaptors has been supported in #7446.

However, the server cannot handle requests during loading/unloading, until this process has been finished.

In production environments, we expect that loading/unloading of lora can be executed asynchronously, so the server can keep handling requests.

cc @lifuhuang @slin1237 @Ying1123

Related resources

#8213

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions