home/categories/bioinformatics/orchestra-research-ai-research-skills-04-mechanistic-interpretability-saelens-skill-md
bioinformaticsresearch

sparse-autoencoder-training

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.

Orchestra-Research
maintainer
Orchestra-Research
Updated 12/17/2025
Stars
6563
Forks
515
quick start

Installation and usage

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use sparse-autoencoder-training