image-generation
Generate AI images with various themes, styles, and aspect ratios. Use when asked to generate images, create artwork, make graphics, generate backgrounds, or create visual assets.
Generate, edit, or transform images with Gemini Nano Banana using bundled Python scripts (Flash or Pro) including aspect ratio, resolution, image-to-image edits, logo overlays, and reference images. Use when users request image generation, image edits, image-to-image transformations, logo placement, or specific aspect ratios or resolutions.
Use when users request image generation, AI art creation, image editing with Gemini models, need help crafting prompts, or want brand-styled imagery. Handles both direct generation and interactive prompt design.
Creates expert positioning content, social media posts, and marketing materials with soul-aligned messaging for Frank's personal brand and AI coaching business.
Expert prompt engineering for FLUX.1 image generation. Use when users request AI-generated images, artwork, illustrations, or visual content. Converts any visual request into optimized FLUX.1 prompts using layering, descriptive language, technical parameters, and text integration techniques.
This skill should be used when generating pedagogically-aligned slide decks from educational content using NotebookLM. It addresses the convergence toward generic, text-heavy slides by providing structured prompts that create engaging, proficiency-appropriate presentations aligned with specific educational frameworks.
Produce natural, high-quality human-like text. Avoids common AI markers (repetitive patterns, fluff words, false neutrality) in favor of specificity, directness, and genuine voice. Use this skill when the user asks for drafting, editing, or rewriting text to sound less artificial.
Generate and edit images using the Gemini API (Nano Banana). Use this skill when creating images from text prompts, editing existing images, applying style transfers, generating logos with text, creating stickers, product mockups, or any image generation/manipulation task. Supports text-to-image, image editing, multi-turn refinement, and composition from multiple reference images.
Generate or edit images using AI models (FLUX, Gemini). Use for general-purpose image generation including photos, illustrations, artwork, visual assets, concept art, and any image that is not a technical diagram or schematic. For flowcharts, circuits, pathways, and technical diagrams, use the scientific-schematics skill instead.
This skill should be used when the user asks to optimize, polish, or improve text content. It provides text optimization capabilities including polishing, humanization (removing AI tone), content expansion, and content condensation. Automatically removes AI patterns after processing.
Maximize efficiency and minimize costs through strategic token usage while maintaining output quality. Use when managing AI interactions, designing prompts, or reducing costs.
This skill should be used when the user asks to rewrite content, transform style, check style consistency, or imitate writing style. It provides multi-channel content rewriting, style consistency checking, and content imitation capabilities. Supports podcast script generation in both solo and duo modes.
Nano Banana Pro (nano-banana-pro) image generation skill. Use this skill when the user asks to "generate an image", "generate images", "create an image", "make an image", uses "nano banana", or requests multiple images like "generate 5 images". Generates images using Google's Gemini 2.5 Flash for any purpose - frontend designs, web projects, illustrations, graphics, hero images, icons, backgrounds, or standalone artwork. Invoke this skill for ANY image generation request.
Generate or edit images using AI models (FLUX, Gemini). Use for general-purpose image generation including photos, illustrations, artwork, visual assets, concept art, and any image that isn't a technical diagram or schematic. For flowcharts, architecture diagrams, and technical diagrams, use the project-diagrams skill instead.
Generate and edit images using Google Gemini or OpenAI GPT Image models. Automatically used for requests like "generate an image", "create an illustration", "edit this image". Also supports sticker sheet generation and splitting for requests like "create some stickers" or "generate multiple icons and split them". Choose between Gemini (Nano Banana) and OpenAI (gpt-image-1.5) providers.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Process NotebookLM audio files: convert, transcribe, add chapters, and prepare for publishing.
Transform content through eight cognitive lenses for different kinds of understanding. Use when the same concept needs exploration through story, debate, simulation, uncertainty, fiction, embodiment, ritual, or reflection.
Migrate CLAUDE.md into a modular `.claude/rules/` directory structure following Claude Code's rules system. Converts a monolithic CLAUDE.md into organized, path-specific rule files with glob patterns. Use when migrating to the rules system, modularizing project instructions, splitting CLAUDE.md, or organizing memory files. Triggers on "migrate claudemd to rules", "convert claude.md to rules", "modularize claude.md", "split claude.md into rules", "migrate to rules system".
Expert translation methodology and best practices for English-Japanese-Chinese (Traditional) trilingual translation. Use when translating any content between these languages, including handling idioms, cultural nuances, and language-specific expressions. Provides translation workflows and quality assurance methods.
This skill should be used when the user asks to translate content between languages. It supports multiple languages (Chinese, English, Japanese, Korean, French, German, Spanish) with various translation styles (literal, free, professional, casual). Supports automatic polishing and humanization after translation.
Analyze videos using Google's Gemini API - describe content, answer questions, transcribe audio with visual descriptions, reference timestamps, clip videos, and process YouTube URLs. Supports 9 video formats, multiple models (Gemini 2.5/2.0), and context windows up to 2M tokens (6 hours of video).
Process D&D session videos through the complete chunking pipeline. Use when the user requests to process, chunk, transcribe, or analyze a D&D session video file.
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.