LLM & AI

Large Language Models and AI agents.

4725 skills

llm-ai
2

prompt-engineer

Craft effective prompts and optimize AI interactions for better results

eddiebe147
data-ai
llm-ai
2

text-to-speech-optimization

Advanced text-to-speech optimization with expressive tone guides, natural speech patterns, prosody control, and human-like conversation techniques to create authentic-sounding AI voices. Use when generating natural speech, creating engaging content, or producing professional voiceovers.

onesmartguy
data-ai
llm-ai
2

nano-banana-pro

Generate images using AI. Use when the user asks to create, generate, or make images, pictures, graphics, illustrations, visuals, or artwork. Also use for image editing with reference images.

idanbeck
data-ai
llm-ai
2

gemini-imagegen

Generate and edit images using the Gemini API (Nano Banana). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.

phrazzld
data-ai
llm-ai
2

ai-multimodal

Process and generate multimedia content using the Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis, up to 9.5 hours), understanding images (captioning, object detection, OCR, visual Q&A, segmentation), processing videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extracting from documents (PDF tables, forms, charts, diagrams, multi-page), and generating images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.

vibery-studio
data-ai
llm-ai
2

ai-image-asset-generator

This skill should be used when generating AI image assets for websites, landing pages, or applications. It automatically analyzes page requirements, generates images using Gemini API, removes backgrounds, converts to SVG for interactivity, and places assets in frontend code. Ideal for creating hero images, icons, backgrounds, product mockups, and infographic elements. Use this skill when users need image assets for their web projects.

Kartikk-26
data-ai
llm-ai
2

comms

Voice communication system for broadcasting updates using ElevenLabs TTS. IMPORTANT: When the user includes voice activation keywords ("with voice comms", "with voice updates", "announce progress", "broadcast updates"), you MUST proactively use this skill to broadcast mission updates as Commander.
Broadcast as Commander when:
- Starting a complex task or deploying squadron agents
- Announcing when squadrons complete their missions
- Summarizing consolidated results from multiple agents
- Making strategic decisions or priority changes
- User explicitly requests voice updates
DO NOT wait to be asked - if voice keywords are present, broadcast proactively!
Available to:
- Main agent (you) - uses Commander voice (Bill, 1.2 speed)
- Squadron sub-agents (Red, Gold, Blue, Green) - use their squadron voices

1Shot-Labs
data-ai
llm-ai
2

writer

Generate content in your authentic voice across emails, blogs, social media, and reports

krishagel
data-ai
llm-ai
2

tweaktune-synthesizer

Interactive assistant for designing and generating tweaktune pipelines to synthesize training data for LLMs. Use when user wants to create synthetic datasets for fine-tuning, generate conversations, function calling data, or structured JSON datasets.

qooba
data-ai
llm-ai
2

brand-guidelines

Applies AI Security Assurance's official brand colors, typography, and voice to any artifact. Use when creating presentations, documents, marketing materials, or any content that should reflect the company's professional cybersecurity identity.

AISecurityAssurance
data-ai
llm-ai
2

speed-read

Display text using RSVP speed reading. Use when presenting explanations, summaries, or any text the user should read quickly. Launches speed-read script with the provided text.

Castrozan
data-ai
llm-ai
2

gemini-imagen

Generate images from text prompts using Google's Gemini Imagen API. This skill should be used when the user requests image creation, generation, or visualization from text descriptions (e.g., "create an image of...", "generate a picture showing...", "make me an image for...").

AgentiveAU
data-ai
llm-ai
2

ai-multimodal

Process and generate multimedia content using the Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis, up to 9.5 hours), understanding images (captioning, object detection, OCR, visual Q&A, segmentation), processing videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extracting from documents (PDF tables, forms, charts, diagrams, multi-page), and generating images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.

toanpv-0639
data-ai
llm-ai
2

talking-head

Create presenter and UGC-style video content. Use when creating educational videos, testimonials, product demos, or personal brand content.

sanky369
data-ai
llm-ai
2

say-narration

Use macOS text-to-speech for agent narration and announcements. Sub-agents announce themselves using different language voices speaking English. Use for multi-agent workflows where each agent has a distinct voice identity.

plurigrid
data-ai
llm-ai
2

gemini-image-gen

Image generation with Google Gemini API. Models: gemini-2.5-flash-image (fast) or gemini-3-pro-image-preview (quality). For social media graphics, marketing, infographics.

freitasp1
data-ai
llm-ai
2

advanced-prompt-crafter

A sophisticated multi-layered prompt engineering system with analysis, optimization, customization, and validation engines for creating high-quality, domain-specific prompts

menoncello
data-ai
llm-ai
2

prompt-architect

Transforms requirements into best-practice prompts following Claude 4.x standards (December 2025). Based on:
- Nate B. Jones's 4 Beginner Moves: Shape, Context, Silent Plan, Self-Check
- Anthropic Claude 4.x best practices: explicitness, contract style, examples beat adjectives
- The "Pipelines over Prompts" philosophy
ACTIVATES AUTOMATICALLY after clarify-spec or on /prompt-architect. Produces a structured, executable prompt that applies all of these best practices.

freitasp1
data-ai
llm-ai
2

prompt

Use when asked about writing or improving AI prompts, instructions, or system messages.

KJone1
data-ai
llm-ai
2

text-cleanup

Comprehensive patterns and techniques for removing AI-generated verbosity and slop

v1truv1us
data-ai
llm-ai
2

image-generator

Generates images using GPT Image 1.5 API. Use when users request image creation, illustration, or visual content.

avivsinai
data-ai
llm-ai
2

openai-image-gen

Generate images via OpenAI gpt-image-1.5 API. Supports batch generation with custom prompts or random prompt sampling. Features transparent backgrounds, multiple output formats (png/jpeg/webp), quality levels, and size options. Use when user wants to generate images, create image batches, explore visual prompts, or needs AI-generated artwork.

LarsEckart
data-ai