Open
Labels
enhancement (New feature or request)
Description
I'm working on a voice-controlled application and I want to run small .wav files through whisper fairly often.
I noticed that loading the model takes almost 50% of the total run time, and this cost is paid every single time I run ./main -m ... "my-short-spoken-command.wav"
I think it would be nice if this project included a server example, like llama.cpp does, so that the model is loaded only once and stays in memory afterwards.