prompt-engineer
Craft effective prompts and optimize AI interactions for better results
Advanced text-to-speech optimization with expressive tone guides, natural speech patterns, prosody control, and human-like conversation techniques to create authentic-sounding AI voices. Use when generating natural speech, creating engaging content, or producing professional voiceovers.
Generate images using AI. Use when the user asks to create, generate, or make images, pictures, graphics, illustrations, visuals, or artwork. Also use for image editing with reference images.
Generate and edit images using the Gemini API (Nano Banana). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.
Process and generate multimedia content using the Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis, up to 9.5 hours), understanding images (captioning, object detection, OCR, visual Q&A, segmentation), processing videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extracting from documents (PDF tables, forms, charts, diagrams, multi-page), and generating images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.
This skill should be used when generating AI image assets for websites, landing pages, or applications. It automatically analyzes page requirements, generates images using Gemini API, removes backgrounds, converts to SVG for interactivity, and places assets in frontend code. Ideal for creating hero images, icons, backgrounds, product mockups, and infographic elements. Use this skill when users need image assets for their web projects.
Voice communication system for broadcasting updates using ElevenLabs TTS. IMPORTANT: When the user includes voice activation keywords ("with voice comms", "with voice updates", "announce progress", "broadcast updates"), you MUST proactively use this skill to broadcast mission updates as Commander. Broadcast as Commander when: starting a complex task or deploying squadron agents; announcing that squadrons have completed their missions; summarizing consolidated results from multiple agents; making strategic decisions or priority changes; or the user explicitly requests voice updates. DO NOT wait to be asked: if voice keywords are present, broadcast proactively! Available to: the main agent (you), which uses the Commander voice (Bill, 1.2 speed); and squadron sub-agents (Red, Gold, Blue, Green), which use their squadron voices.
Interactive assistant for designing and generating tweaktune pipelines to synthesize training data for LLMs. Use when user wants to create synthetic datasets for fine-tuning, generate conversations, function calling data, or structured JSON datasets.
Applies AI Security Assurance's official brand colors, typography, and voice to any artifact. Use when creating presentations, documents, marketing materials, or any content that should reflect the company's professional cybersecurity identity.
Display text using RSVP speed reading. Use when presenting explanations, summaries, or any text the user should read quickly. Launches speed-read script with the provided text.
Generate images from text prompts using Google's Gemini Imagen API. This skill should be used when the user requests image creation, generation, or visualization from text descriptions (e.g., "create an image of...", "generate a picture showing...", "make me an image for...").
Create presenter and UGC-style video content. Use when creating educational videos, testimonials, product demos, or personal brand content.
Use macOS text-to-speech for agent narration and announcements. Sub-agents announce themselves using different language voices speaking English. Use for multi-agent workflows where each agent has a distinct voice identity.
Image generation with Google Gemini API. Models: gemini-2.5-flash-image (fast) or gemini-3-pro-image-preview (quality). For social media graphics, marketing, infographics.
A sophisticated multi-layered prompt engineering system with analysis, optimization, customization, and validation engines for creating high-quality, domain-specific prompts
Transforms requirements into best-practice prompts following Claude 4.x standards (December 2025). Based on: Nate B. Jones's 4 Beginner Moves (Shape, Context, Silent Plan, Self-Check); Anthropic Claude 4.x Best Practices (explicitness, contract style, examples beat adjectives); and the Pipelines over Prompts philosophy. ACTIVATES AUTOMATICALLY after clarify-spec or on /prompt-architect. Produces a structured, executable prompt that applies all of these best practices.
Conversation style guidelines. Use when responding to the user.
Comprehensive patterns and techniques for removing AI-generated verbosity and slop
Generates images using GPT Image 1.5 API. Use when users request image creation, illustration, or visual content.
Generate images via OpenAI gpt-image-1.5 API. Supports batch generation with custom prompts or random prompt sampling. Features transparent backgrounds, multiple output formats (png/jpeg/webp), quality levels, and size options. Use when user wants to generate images, create image batches, explore visual prompts, or needs AI-generated artwork.
Generate accurate summaries of long documents and text collections