GPT2 Architecture Integration

# Feature Description
The idea is to be able to convert models using the GPT2 architecture into GGUF. The convert-hf-to-gguf.py should include GPT2, as well as llama.cpp for running the model.

# Motivation
There are quite a few models for low resource languages or specific use cases that are fine-tuned on GPT2 architecture.

# Possible Implementation
The structure of models is quite similar to Starcoder. From my understanding, you can modify it quite easily by:

convert-hf-to-gguf.py
- Add a new model class
- Modify the set_gguf_parameters() [kv heads] and write_tensors() [maybe you need to transpose the qkv, up-ffn and down-ffn layer] methods

llama.cpp
- Add an new model class

# Status
I tried implementing that myself, but am not deep enough into the topic and find it quite hard to understand the libraries structure (is there any good documentation). So, I am probably not able to pull this off by myself, but am happy to support!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GPT2 Architecture Integration #4073

Feature Description

Motivation

Possible Implementation

Status

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

GPT2 Architecture Integration #4073

Description

Feature Description

Motivation

Possible Implementation

Status

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions