home/categories/data-engineering/orchestra-research-ai-research-skills-08-distributed-training-deepspeed-skill-md
data-engineeringdata-ai
deepspeed
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
maintainer
Orchestra-Research
Updated 11/20/2025
Stars
6563
Forks
515
quick start
Installation and usage
Expert guidance for distributed training with DeepSpeed - ZeRO optimization stages, pipeline parallelism, FP16/BF16/FP8, 1-bit Adam, sparse attention
Installation
$ install --globalskills.sh
Usage
Once installed, you can use this skill by running the following command in your terminal:
skills use deepspeed