-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Closed
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is neededhigh priorityVery important issueVery important issue
Description
This occurs when using the tiny
, small
, base
, medium
, and large
models.
All models used are not quantized.
ggml_tensor * tensor = wctx.state->embd_conv;
std::vector<float> tensor_data(ggml_nelements(tensor));
ggml_backend_tensor_get(tensor, tensor_data.data(), 0, ggml_nbytes(tensor));
std::ofstream outFile("encoder_embedding_conv.json");
outFile << "[";
for (uint64_t i = 0; i < tensor_data.size() - 1; i++) {
outFile << tensor_data[i] << ", ";
}
outFile << tensor_data[tensor_data.size() - 1] << "]";
outFile.close();
return 0;
CUDA:
CPU:
jdxin0
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is neededhigh priorityVery important issueVery important issue