Closed
Labels
feature request (New feature or request)
Description
🚀 The feature, motivation and pitch
When we use a KV cache connector such as LMCache, the needed cache is loaded from disk, a remote store, or other servers. With LMCache, for example, the scheduler first looks up the cache pool and then starts loading. However, there is no try/catch mechanism around the loading process. Loading can fail unexpectedly (power outage, bugs), and there is no fallback plan. Currently such a failure is silent: if a disk load fails, vLLM simply continues its computation with random data.
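A minimal sketch of the kind of fallback being requested, in Python. Everything in it is hypothetical: `load_or_recompute`, `KVLoadError`, `load_fn`, and `compute_fn` are illustrative names, not vLLM or LMCache APIs. The idea is to wrap the load in a try/except and, if it fails or returns fewer entries than the lookup promised, recompute those tokens instead of continuing with invalid data.

```python
import logging

logger = logging.getLogger("kv_load_fallback")


class KVLoadError(Exception):
    """Hypothetical error for a load that did not return the expected data."""


def load_or_recompute(token_ids, num_hit, load_fn, compute_fn):
    """Load KV entries for the first `num_hit` tokens, falling back to recompute.

    load_fn(tokens)    -> list of KV entries; may raise (assumed interface).
    compute_fn(tokens) -> list of KV entries computed from scratch (assumed).
    Returns (kv_entries, num_loaded).
    """
    try:
        loaded = load_fn(token_ids[:num_hit])
        # Treat a short read as a failure rather than trusting partial data.
        if len(loaded) != num_hit:
            raise KVLoadError(f"expected {num_hit} entries, got {len(loaded)}")
    except (KVLoadError, OSError) as exc:
        # Surface the failure instead of failing silently, then discard
        # whatever was loaded so that every token is recomputed.
        logger.warning("KV cache load failed, recomputing: %s", exc)
        loaded = []

    num_loaded = len(loaded)
    kv_entries = loaded + compute_fn(token_ids[num_loaded:])
    return kv_entries, num_loaded
```

With this shape, a failed load costs extra compute but never produces output based on unloaded data. A real implementation would likely need to report the failure back to the scheduler so the request can be rescheduled for recomputation; that part is not shown here.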
Alternatives
No response
Additional context
No response
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.