Server example?

I'm working on a voice-controlled application and I want to run small .wav files through whisper fairly often.

What I noticed is that it takes almost 50% of the total time just to load the model every single time I run `./main -m ... "my-short-spoken-command.wav"`

I think it'd be nice if like in llama.cpp this project includes a server example so the model only has to be loaded once and stays in memory after loading.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Server example? #1369

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Server example? #1369

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions