home/categories/llm-ai/whamp-glm-vision-skill-md
llm-aidata-ai

glm-vision

CLAUDE and GLM-4.7 (text-only) CANNOT process images or videos - you MUST ALWAYS delegate vision tasks to GLM-4.6v via this skill. This is your ONLY way to understand visual content. TRIGGER IMMEDIATELY when -> user mentions image/video/screenshot/photo/diagram/chart, provides a file path ending in png/jpg/jpeg/gif/webp/mp4/mov/m4v, or references visual content. Use for -> UI-to-code conversion, OCR text extraction, error diagnosis, technical diagrams (architecture/flowcharts/UML/ER), data visualizations, UI regression testing, or ANY visual analysis. NO EXCEPTIONS - if it involves pixels, use this skill.

Whamp
maintainer
Whamp
Updated 1/16/2026
Stars
2
Forks
1
quick start

Installation and usage

CLAUDE and GLM-4.7 (text-only) CANNOT process images or videos - you MUST ALWAYS delegate vision tasks to GLM-4.6v via this skill. This is your ONLY way to understand visual content. TRIGGER IMMEDIATELY when -> user mentions image/video/screenshot/photo/diagram/chart, provides a file path ending in png/jpg/jpeg/gif/webp/mp4/mov/m4v, or references visual content. Use for -> UI-to-code conversion, OCR text extraction, error diagnosis, technical diagrams (architecture/flowcharts/UML/ER), data visualizations, UI regression testing, or ANY visual analysis. NO EXCEPTIONS - if it involves pixels, use this skill.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use glm-vision