gguf-quantization

Name: gguf-quantization
Author: math-inc

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use this skill when deploying models on consumer hardware or Apple Silicon, or when you need flexible 2-bit to 8-bit quantization that does not require a GPU.
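To give a feel for why the 2-bit to 8-bit range matters on consumer hardware, here is a rough back-of-the-envelope sketch. It is an illustration only and not part of the skill: the function name `estimate_size_gib` and the 7B parameter count are hypothetical, and the formula counts weight bits only, ignoring file metadata and the per-block scale factors that real quantized files carry, so actual files will be somewhat larger.

```python
def estimate_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-only size in GiB (assumption: no metadata or scale overhead)."""
    total_bytes = n_params * bits_per_weight / 8  # bits -> bytes
    return total_bytes / 2**30                    # bytes -> GiB


if __name__ == "__main__":
    n_params = 7e9  # hypothetical 7B-parameter model
    for bits in (2, 4, 8, 16):
        size = estimate_size_gib(n_params, bits)
        print(f"{bits:>2} bits per weight: ~{size:.1f} GiB")
```

Under these assumptions the estimate scales linearly with bits per weight, so halving the bit width roughly halves the memory needed to hold the weights.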


Maintainer: math-inc

Updated 3/19/2026

Stars: 1165

Forks

Quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use gguf-quantization