sparse-autoencoder-training

Name: sparse-autoencoder-training
Author: Orchestra-Research

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use when discovering interpretable features, analyzing superposition, or studying monosemantic representations in language models.
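As a rough illustration of what "decomposing activations into features" means, the sketch below runs one forward pass of a toy sparse autoencoder in plain Python. It does not use the SAELens API. The function name `sae_forward`, the dimensions, the random weights, and the L1 coefficient are all assumptions made for illustration: a ReLU encoder produces non-negative feature activations, a linear decoder reconstructs the input, and the loss combines reconstruction error with an L1 sparsity penalty.

```python
import random

def matvec(W, x):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def sae_forward(x, W_enc, b_enc, W_dec, b_dec, l1_coeff=1e-3):
    # Encoder: features = ReLU(W_enc x + b_enc), so every feature is >= 0.
    pre = [a + b for a, b in zip(matvec(W_enc, x), b_enc)]
    features = [max(0.0, p) for p in pre]
    # Decoder: reconstruction = W_dec features + b_dec.
    recon = [a + b for a, b in zip(matvec(W_dec, features), b_dec)]
    # Loss: mean squared reconstruction error plus an L1 sparsity penalty.
    mse = sum((r - xi) ** 2 for r, xi in zip(recon, x)) / len(x)
    l1 = sum(abs(f) for f in features)
    return features, recon, mse + l1_coeff * l1

# Hypothetical toy sizes: a 4-dim activation expanded into 16 features.
rng = random.Random(0)
d_model, d_sae = 4, 16
W_enc = [[rng.gauss(0, 0.1) for _ in range(d_model)] for _ in range(d_sae)]
# Decoder initialised as the transpose of the encoder (an illustrative choice).
W_dec = [[W_enc[j][i] for j in range(d_sae)] for i in range(d_model)]
b_enc = [0.0] * d_sae
b_dec = [0.0] * d_model

x = [rng.gauss(0, 1) for _ in range(d_model)]  # stand-in for a model activation
features, recon, loss = sae_forward(x, W_enc, b_enc, W_dec, b_dec)
active = sum(1 for f in features if f > 0)
print(f"{active} of {d_sae} features active, loss = {loss:.4f}")
```

In practice the weights are trained so that only a few features fire per input; this untrained example only shows the shape of the computation.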

Updated: 12/17/2025
Stars: 6563
Forks: 515

quick start

Installation and usage

Installation

$ install --globalskills.sh

Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use sparse-autoencoder-training