[WIP] [STEP 2] split compressor into few quantizers #841

n1ck-guo · 2025-09-23T00:25:32Z

quantizer

Replaces the original compressor's quantize method and is responsible for the actual quantization process.
Subclasses (coarse to fine granularity):

mode (RTN, Tune): Different quantization flows and quantize function logic.
model_type (llm, vlm, diffusion): Different calibration methods, data processing, etc.
data_type (gguf, mxfp8, waquanizer): Requires additional algorithms (e.g. imatrix for gguf) or special handling (register_act_max_hook for WA, fused_layer_global_scale for nvfp).
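
To illustrate the layering above, here is a minimal sketch of how the three levels could stack through inheritance. It is not the code in this PR: every class name (`BaseQuantizer`, `RTNQuantizer`, `LLMRTNQuantizer`, `GGUFLLMRTNQuantizer`), the `prepare_calibration` method, and the `extra_steps` attribute are hypothetical. The round-to-nearest logic is a plain-Python stand-in that operates on a list of floats rather than tensors.

```python
from abc import ABC, abstractmethod


class BaseQuantizer(ABC):
    """Hypothetical base: owns the quantize step that used to live on the compressor."""

    def __init__(self, bits: int = 4):
        self.bits = bits

    @abstractmethod
    def quantize(self, weights):
        """Return a quantize-dequantized copy of `weights`."""


class RTNQuantizer(BaseQuantizer):
    """Level 1 (mode): plain round-to-nearest, no tuning loop."""

    def quantize(self, weights):
        qmax = 2 ** (self.bits - 1) - 1
        max_abs = max((abs(w) for w in weights), default=0.0)
        scale = max_abs / qmax if max_abs > 0 else 1.0
        # Symmetric quantization: round to the integer grid, clamp, then rescale.
        return [max(-qmax - 1, min(qmax, round(w / scale))) * scale for w in weights]


class LLMRTNQuantizer(RTNQuantizer):
    """Level 2 (model_type): adds model-specific calibration data handling."""

    def prepare_calibration(self, texts):
        # Placeholder for LLM-specific data processing (assumption, not the PR's logic).
        return [t.strip() for t in texts if t.strip()]


class GGUFLLMRTNQuantizer(LLMRTNQuantizer):
    """Level 3 (data_type): adds format-specific steps around the inherited logic."""

    def quantize(self, weights):
        # Placeholder for a format-specific step (for gguf, the PR mentions imatrix).
        self.extra_steps = ["format_specific_preprocessing"]
        return super().quantize(weights)
```

A caller would pick the most specific class and use the inherited interface, for example `GGUFLLMRTNQuantizer(bits=4).quantize([0.0, 0.5, -1.0])`. Whether the real implementation uses single inheritance like this, mixins, or composition is not stated in the description.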

Signed-off-by: n1ck-guo <[email protected]>

n1ck-guo added 2 commits September 22, 2025 03:17

[STEP 2] split compressor into few quantizers

bfd6882

Signed-off-by: n1ck-guo <[email protected]>

fix

4e9b5ae

Signed-off-by: n1ck-guo <[email protected]>