Do sample_to_timestamp calculation with 64 bit precision to avoid overflow #388

boolemancer · 2023-01-08T12:53:43Z

As written, the sample_to_timestamp function does its multiplication in 32-bit arithmetic, so it overflows a little past 22 minutes of 16 kHz audio and starts returning negative timestamps.

22m23s = 1343s
1343s * 16000 samples/s = 21488000 samples
Passing 21488000 to sample_to_timestamp computes 100 * 21488000 = 2148800000, which exceeds the signed 32-bit maximum of 2147483647 and wraps to -2146167296 before the result is divided by the sample rate.

By doing the multiplication with 64 bit precision, you avoid the overflow and you can now process audio clips longer than 22 minutes.
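
A minimal sketch of the arithmetic above, not the actual diff from this PR. The names `kSampleRate`, `sample_to_timestamp_32`, and `sample_to_timestamp_64` are hypothetical and chosen for illustration. The sample rate is assumed to be 16000, matching the figures in the description. Because signed overflow is undefined behaviour in C++, the 32-bit variant models the wraparound explicitly instead of actually overflowing an `int`.

```cpp
#include <cstdint>

// Assumed sample rate (16 kHz), taken from the "16000 samples/s" figure above.
constexpr int kSampleRate = 16000;

// Hypothetical model of the old behaviour: the product 100 * i_sample is
// reduced modulo 2^32 and reinterpreted as a signed 32-bit value, which is
// what typically happens on two's-complement hardware when an int overflows.
constexpr int64_t sample_to_timestamp_32(int32_t i_sample) {
    const int64_t exact = 100LL * i_sample;
    const uint32_t bits = static_cast<uint32_t>(exact);  // modulo 2^32, well-defined
    const int64_t wrapped =
        bits <= 0x7FFFFFFFu ? int64_t{bits} : int64_t{bits} - 4294967296LL;
    return wrapped / kSampleRate;
}

// Sketch of the fix: the literal 100LL promotes the multiplication to 64 bits,
// so the intermediate product can no longer overflow for any int32_t input.
constexpr int64_t sample_to_timestamp_64(int32_t i_sample) {
    return (100LL * i_sample) / kSampleRate;
}

// 22m23s of audio: the 64-bit version yields 134300 hundredths of a second
// (1343 s), while the 32-bit model goes negative.
static_assert(sample_to_timestamp_64(21488000) == 134300, "64-bit result");
static_assert(sample_to_timestamp_32(21488000) < 0, "32-bit model wraps negative");

// 21474836 is the largest sample index whose product still fits in 32 bits,
// so both versions agree there.
static_assert(sample_to_timestamp_32(21474836) == sample_to_timestamp_64(21474836),
              "no overflow at the threshold");
```

The checks run at compile time, so the file only needs to compile (C++14 or later) to confirm the numbers in the description.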

whisper.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

Do calculation with 64 bit precision to avoid overflow

4134ebd

ggerganov reviewed Jan 8, 2023

Update whisper.cpp

6bd8ec9

Co-authored-by: Georgi Gerganov <[email protected]>

ggerganov merged commit 08dc705 into ggml-org:master Jan 8, 2023
