Closed
Labels
enhancement (New feature or request), good first issue (Good for newcomers)
Description
Feature Description
The idea is to support converting models that use the GPT2 architecture to GGUF. convert-hf-to-gguf.py should handle GPT2, and llama.cpp should be able to run the converted model.
Motivation
There are quite a few models for low-resource languages or specific use cases that are fine-tuned on the GPT2 architecture.
Possible Implementation
The model structure is quite similar to Starcoder. From my understanding, support could be added fairly easily by changing:
convert-hf-to-gguf.py
- Add a new model class
- Modify the set_gguf_parameters() method [kv heads] and the write_tensors() method [the qkv, up-ffn and down-ffn layers may need to be transposed]
llama.cpp
- Add a new model class
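To make the converter-side steps more concrete, here is a minimal standalone sketch of the two points mentioned above: the head counts and the transposition. It does not use the real convert-hf-to-gguf.py or gguf APIs; the function names `gpt2_hparams` and `maybe_transpose` and the output dictionary keys are hypothetical and only for illustration. It assumes the Hugging Face GPT2 checkpoint layout, where the attention and MLP projections are Conv1D layers that store weights as (in_features, out_features), and the usual GPT2 config keys (`n_layer`, `n_head`, `n_embd`, `n_positions`).

```python
import numpy as np

# Suffixes of Hugging Face GPT2 weights that come from Conv1D layers.
# Conv1D stores weights as (in_features, out_features), the transpose of
# the (out_features, in_features) layout used by a regular linear layer.
CONV1D_WEIGHT_SUFFIXES = (
    ".attn.c_attn.weight",  # fused qkv projection
    ".attn.c_proj.weight",  # attention output projection (also a Conv1D)
    ".mlp.c_fc.weight",     # up-ffn
    ".mlp.c_proj.weight",   # down-ffn
)


def maybe_transpose(name: str, data: np.ndarray) -> np.ndarray:
    """Transpose Conv1D weights; return every other tensor unchanged."""
    if name.endswith(CONV1D_WEIGHT_SUFFIXES):
        return np.ascontiguousarray(data.T)
    return data


def gpt2_hparams(config: dict) -> dict:
    """Collect hyperparameters from a GPT2 config dict.

    The output keys are hypothetical descriptive names, not GGUF key names.
    """
    n_embd = config["n_embd"]
    n_head = config["n_head"]
    return {
        "block_count": config["n_layer"],
        "context_length": config.get("n_positions", config.get("n_ctx")),
        "embedding_length": n_embd,
        # GPT2 uses 4 * n_embd for the FFN when n_inner is missing or null.
        "feed_forward_length": config.get("n_inner") or 4 * n_embd,
        "head_count": n_head,
        # GPT2 has no grouped-query attention, so kv heads equal query heads.
        "head_count_kv": n_head,
        "layer_norm_eps": config["layer_norm_epsilon"],
    }


if __name__ == "__main__":
    # Toy tensor standing in for a fused qkv weight of shape (in, out).
    w = np.zeros((4, 12), dtype=np.float32)
    print(maybe_transpose("h.0.attn.c_attn.weight", w).shape)  # (12, 4)
    print(maybe_transpose("h.0.ln_1.weight", w).shape)         # (4, 12)
```

Whether the transposition is really needed depends on the layout llama.cpp expects for each tensor, so this should be checked against the existing Starcoder conversion rather than taken as given.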
Status
I tried implementing this myself, but I am not deep enough into the topic and find it quite hard to understand the library's structure (is there any good documentation?). So I am probably not able to pull this off by myself, but I am happy to support!
Haarr, timpal0l, dpleus and ekgren