Skip to content

Hallucinations and repeats of previous transcriptions when running without reloading model #2445

@nchudleigh

Description

@nchudleigh

I'm running into an issue where subsequent runs of the model are bleeding over results from a previous recording when the model is staying loaded in memory.

I've checked all the inputs to the full transcribe call and there is no difference between the two calls, but it seems that something internal to whisper.cpp is not being reset.

Are there anything that I need to call within whisper.cpp to reset the state of the model?

Example results:

1st transcription (good result):
Hello my name is Mark and this is a boat it floats on water and it's very slow.

2nd transcription (completely different audio does not have any of this transcribed text in it)
Hello my name is Mark. Hello my name is Mark. Hello my name is Mark. Hello my name is Mark.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions