-
Notifications
You must be signed in to change notification settings - Fork 13.2k
Closed
Labels
bugSomething isn't workingSomething isn't workinghigh priorityVery important issueVery important issue
Description
After the CUDA refactor PR #1703 by @JohannesGaessler was merged i wanted to try it out this morning and measure the performance difference on my ardware.
I use my standard prompts with different models in different sizes.
I use the prebuild versions win-cublas-cu12.1.0-xx64
With the new builds I only get gibberish as a response for all prompts used and all models.
It looks like a random mix of words in different languages.
On my current PC I can only use the win-avx-x64 version, here I still get normal output.
I will use the Cuda-pc again in a few hours, then I can provide sample output or more details.
Am I the only one with this problem?
Thanzex, BlackGlory and djsavvydumduma, mirek190 and vsalibrary
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workinghigh priorityVery important issueVery important issue