-
Notifications
You must be signed in to change notification settings - Fork 13.2k
ggml : fix GGML_F32_VEC_FMA argument order in ggml_vec_mad1_f32 #16307
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Don't think this is correct. The order for the I think this one is wrong: llama.cpp/ggml/src/ggml-cpu/simd-mappings.h Lines 821 to 824 in 6a2c614
|
This one, but if I change that everything else breaks llama.cpp/ggml/src/ggml-cpu/simd-mappings.h Lines 533 to 537 in 6a2c614
|
This is confusing, some places it's used like this: llama.cpp/ggml/src/ggml-cpu/vec.h Line 300 in 05c0380
and other places like this: llama.cpp/ggml/src/ggml-cpu/vec.h Line 248 in 05c0380
|
According to Intel it should be |
It appears that the meaning of
So the change that you propose makes sense now and this is inline with the Intel docs. |
Yes, I think so too, it's just somewhat confusing with the order and naming between |
It seems this codepath is not taken when built for CUDA as the Edit: I guess this means we never really test hardware optimized CPU backend? |
I think the reason we don't see it is because the tests currently have llama.cpp/tests/test-backend-ops.cpp Lines 6156 to 6157 in d9e0e7c
|
Yep, it does, I'll add the test. |
See #16063 (comment)