
Generate read-aloud audio (text-to-speech) using Google Gemini TTS (gemini-3.1-flash-tts-preview). Automatically detects the mode from the input: single-speaker narration for plain text, and multi-speaker dialogue when the input has two "Name:" speaker labels. Supports 30 prebuilt voices, natural-language style control and audio tags, text or file input, and WAV output. Works with both the Gemini Developer API and Vertex AI.
Generate and edit images using Google Gemini (Nano Banana Pro / Nano Banana 2). Automatically selects the best model based on prompt complexity. Supports text-to-image generation, image editing with reference images, configurable aspect ratios, 1K/2K/4K output, Google Search grounding, and batch generation. Works with both Gemini Developer API and Vertex AI.
Package a skill directory into a distributable `.skill` archive placed on the Desktop. Use when the user asks to "package", "bundle", "zip up", "export", "distribute", or "ship" a skill, or mentions creating a `.skill` file from `~/.claude/skills/<skill-name>/`.
Generate music files using Google Gemini Lyria 3. Automatically selects the best model based on the request / purpose: Lyria 3 Pro for full-length, structured, lyric-bearing songs and Lyria 3 Clip for 30-second clips, loops, jingles, and quick previews. Supports genre / mood / instrument prompting, song-structure tags, custom lyrics, MP3 / WAV output, and works with both the Gemini Developer API and Vertex AI.
Salesforce CLIを使ってSalesforceのデータ操作・管理を行うスキル。 取引先・商談・プロジェクト・外注管理のCRUD操作、SOQLクエリ、パイプライン分析、レポート生成を実行する。 ユーザーがSalesforceのデータを照会・更新・分析したいとき、商談のステージを確認・変更したいとき、 プロジェクトや外注の状況を確認したいとき、売上・粗利・パイプラインのレポートが必要なとき、 取引先や案件の情報を調べたいとき、SOQLクエリを実行したいときに使用する。 「Salesforce」「SF」「商談」「取引先」「パイプライン」「案件」「プロジェクト」「外注」「粗利」 「売上」「受注」「失注」「ステージ」「SOQL」などのキーワードが含まれる場合はこのスキルを使う。 Salesforceに関する質問や操作依頼であれば、明示的にスキル名を言及していなくても積極的にこのスキルを使用すること。
PDF page manipulation toolkit for editing PDF structure. Use when Claude needs to work with PDF files for (1) Deleting pages, (2) Reordering or rearranging pages, (3) Inserting pages from other PDFs, (4) Rotating pages, (5) Splitting PDFs into multiple files, (6) Merging multiple PDFs into one, or any other PDF page manipulation tasks.
Search and recommend izakaya (Japanese-style pubs) and restaurants based on party size, budget, area, and additional preferences. Aggregates reviews from gourmet sites (Tabelog, Hot Pepper Gourmet, Gurunavi) and Google Maps, calculates composite ratings, and provides reservation links and Google Maps directions for quick booking.
Generate videos using Google Gemini Veo 3.1. Defaults to the cost-effective Veo 3.1 Lite model; the premium (Veo 3.1) and Fast models are used only when explicitly requested via --pro / --fast. Supports text-to-video and image-to-video (first frame + optional last frame), 16:9 / 9:16, 720p / 1080p (4k on Pro), 4-8s clips, and 1-4 videos per request. Works with both the Gemini Developer API and Vertex AI.
Produce rich, finished video content with React Remotion by orchestrating the repository's media-generation skills (nanobanana for images, veo for video clips, lyria for BGM, gemini-tts for narration) and composing them on a data-driven Remotion timeline. Follows an approval-gated workflow: first return a video composition plan for the user to approve, then generate assets, compose, run a multimodal self-review loop, and deliver only when the result meets the quality bar. Use when the user wants to "create a video", "make a promo / explainer / social clip", or combine images, video, music, and voiceover into one polished video.