
Encoder is broken when CUBLAS is ON #1688

@bobqianic

Description


This occurs with the tiny, small, base, medium, and large models; none of the models are quantized.

    // Dump the encoder's conv embedding tensor (wctx.state->embd_conv) to JSON
    // so the CPU and CUDA outputs can be compared. Needs <fstream> and <vector>.
    ggml_tensor * tensor = wctx.state->embd_conv;
    std::vector<float> tensor_data(ggml_nelements(tensor));
    // For an F32 tensor, ggml_nbytes(tensor) == ggml_nelements(tensor) * sizeof(float),
    // so this copies the whole tensor out of the active backend (CPU or CUDA).
    ggml_backend_tensor_get(tensor, tensor_data.data(), 0, ggml_nbytes(tensor));
    std::ofstream outFile("encoder_embedding_conv.json");
    outFile << "[";
    for (uint64_t i = 0; i + 1 < tensor_data.size(); i++) { // i + 1 avoids underflow on empty data
        outFile << tensor_data[i] << ", ";
    }
    outFile << tensor_data.back() << "]";
    outFile.close();
    return 0;

CUDA:

[screenshots: dumped encoder embedding values from the CUDA backend]

CPU:

[screenshots: dumped encoder embedding values from the CPU backend]

encoder_embedding_conv.zip

Metadata

Assignees: none

Labels: bug (Something isn't working), help wanted (Extra attention is needed), high priority (Very important issue)
