gguf-quantization

Name: gguf-quantization
Author: davila7

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible quantization from 2-8 bit without GPU requirements.

View Source framework-internals

maintainer

davila7

Updated 1/20/2026

Stars

17577

Forks

1576

quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use gguf-quantization