skills/gemini-text/SKILL.md
Generate text content using Google Gemini models via scripts/. Use for text generation, multimodal prompts with images, thinking mode for complex reasoning, JSON-formatted outputs, and Google Search grounding for real-time information. Triggers on "generate with gemini", "use gemini for text", "AI text generation", "multimodal prompt", "gemini thinking mode", "grounded response".
npx skillsauth add akrindev/google-studio-skills gemini-textInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Generate content using Google's Gemini API through executable scripts with advanced capabilities including system instructions, thinking mode, JSON output, and Google Search grounding.
Use this skill when you need to:
Purpose: Full-featured text generation with all Gemini capabilities
When to use:
Key parameters:
| Parameter | Description | Example |
|-----------|-------------|---------|
| prompt | Text prompt (required) | "Explain quantum computing" |
| --model, -m | Model to use | gemini-3-flash-preview |
| --system, -s | System instruction | "You are a helpful assistant" |
| --thinking, -t | Enable thinking mode | Flag |
| --json, -j | Force JSON output | Flag |
| --grounding, -g | Enable Google Search | Flag |
| --image, -i | Image for multimodal | photo.png |
| --temperature | Sampling 0.0-2.0 | 0.7 for creative |
| --max-tokens | Output limit | 1000 |
Output: Generated text string, optionally with grounding sources
node scripts/generate.js "Explain quantum computing in simple terms"
gemini-3-flash-preview (default, fast)node scripts/generate.js "How do I read a file in Python?" --system "You are a helpful coding assistant"
node scripts/generate.js "Analyze the ethical implications of AI in healthcare" --thinking
node scripts/generate.js "Generate a user profile object with name, email, and preferences" --json
node scripts/generate.js "Who won the latest Super Bowl?" --grounding
node scripts/generate.js "Describe what's in this image in detail" --image photo.png
# 1. Create batch requests (gemini-batch skill)
# 2. Generate content
node scripts/generate.js "Create a 500-word blog post about sustainable energy"
# 3. Convert to audio (gemini-tts skill)
| Model | Speed | Intelligence | Context | Best For |
|-------|-------|--------------|---------|----------|
| gemini-3-flash-preview | Fast | High | 1M | General use, agentic tasks (default) |
| gemini-3-pro-preview | Medium | Highest | 1M | Complex reasoning, research |
| gemini-2.5-flash | Fast | Medium | 1M | Stable, reliable generation |
| gemini-2.5-pro | Slow | High | 1M | Code, math, STEM tasks |
| Value | Creativity | Best For | |-------|-----------|----------| | 0.0-0.3 | Low | Code, facts, formal writing | | 0.4-0.7 | Medium | Balanced output | | 0.8-1.0 | High | Creative writing, brainstorming | | 1.0-2.0 | Very High | Highly creative, varied outputs |
| Value | Description | |-------|-------------| | 0 | Disabled (default behavior) | | 512-1024 | Standard reasoning | | 2048+ | Deep analysis (slower, more tokens) |
--json flag)import json; data = json.loads(output)When --grounding is used, the script prints:
npm install @google/genai@latest dotenv@latest
Set environment variable:
export GOOGLE_API_KEY="your-key-here"
# or
export GEMINI_API_KEY="your-key-here"
gemini-3-flash-preview (most available)--max-tokens value# Basic
node scripts/generate.js "Your prompt"
# Persona
node scripts/generate.js "Prompt" --system "You are X"
# Thinking
node scripts/generate.js "Complex task" --thinking
# JSON
node scripts/generate.js "Generate JSON" --json
# Search
node scripts/generate.js "Current event" --grounding
# Multimodal
node scripts/generate.js "Describe this" --image photo.png
references/models.md for detailed model informationdevelopment
Generate speech from text using Google Gemini TTS models via scripts/. Use for text-to-speech, audio generation, voice synthesis, multi-speaker conversations, and creating audio content. Supports multiple voices and streaming. Triggers on "text to speech", "TTS", "generate audio", "voice synthesis", "speak this text".
development
Generate images using Google Gemini and Imagen models via scripts/. Use for AI image generation, text-to-image, creating visuals from prompts, generating multiple images, custom aspect ratios, and high-resolution output up to 4K. Triggers on "generate image", "create image", "imagen", "text to image", "AI art", "nano banana".
development
Upload and manage files using Google Gemini File API via scripts/. Use for uploading images, audio, video, PDFs, and other files for use with Gemini models. Supports file upload, status checking, and file management. Triggers on "upload file", "file API", "upload image", "upload PDF", "upload video", "file management".
development
Generate text embeddings using Gemini Embedding API via scripts/. Use for creating vector representations of text, semantic search, similarity matching, clustering, and RAG applications. Triggers on "embeddings", "semantic search", "vector search", "text similarity", "RAG", "retrieval".