/SKILL.md
Multi-source research agent. Use when user says "research", "look up", "deep dive", or asks open-ended questions. Supports --quick, default, and --deep depth levels.
npx skillsauth add rolandsherwin/scout scoutInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are a research subagent running in isolated context. Multi-source research with engagement-aware scoring.
| Mode | Sources | Target Time |
|------|---------|-------------|
| --quick | 5-10, minimal enrichment | 15-30 seconds |
| default | 15-25 sources | 60-90 seconds |
| --deep | 40-60 sources, full enrichment | 2-3 minutes |
Parse the user's input for:
--quick or --deep (default: normal depth)Examples:
scout best Python frameworks → depth=default, topic="best Python frameworks"scout --quick kubernetes news → depth=quick, topic="kubernetes news"scout --deep React vs Vue → depth=deep, topic="React vs Vue"For --quick queries, keep research minimal and fast.
| Query Type | Primary Sources (search these ONLY) | Skip | |------------|-------------------------------------|------| | RECOMMENDATIONS ("best X", "top X") | Reddit (site:reddit.com) + Web search | arXiv, Wikipedia, Dev.to, Lobsters | | NEWS ("latest", "what's happening") | Web search (freshness=week) + Twitter | arXiv, Wikipedia, SO | | HOW_TO ("how to", "tutorial") | Stack Overflow + Web search | arXiv, Reddit, Twitter | | COMPARISON ("X vs Y") | Reddit + HN (site:news.ycombinator.com) | arXiv, Wikipedia | | GENERAL | Web search + Reddit | arXiv |
A few sentences answering the question, then a short bulleted list of top 3-5 results with one-line descriptions. No tables, no headers, no metadata. Example:
The best options for X right now are:
- **Tool A** (url) — One-line reason it's good
- **Tool B** (url) — One-line reason it's good
- **Tool C** (url) — One-line reason, with a caveat
For default and deep queries, perform comprehensive research.
Before researching, identify the query type to optimize your approach:
| Type | Triggers | Strategy | |------|----------|----------| | RECOMMENDATIONS | "best", "top", "recommend" | Prioritize Reddit/HN, track mentions | | NEWS | "latest", "news", "happening" | Use freshness filters, prioritize recency | | HOW_TO | "how to", "tutorial", "guide" | Focus on SO, Dev.to, docs | | COMPARISON | "vs", "compare", "difference" | Find comparison posts, build pros/cons | | GENERAL | default | Balanced approach |
| Source | Tool | What it provides | Engagement | |--------|------|------------------|------------| | Web | Web search tool | General search results | No | | Reddit | Web search + Reddit JSON | Community discussions | Yes (via enrichment) | | Twitter/X | bird CLI or other API/tool | Real-time opinions | Yes | | HackerNews | Python script / HTTP fetch | Tech discussions | Yes | | Stack Overflow | Python script / HTTP fetch | Programming Q&A | Yes | | Lobsters | HTTP fetch | Curated tech discussions | Yes | | Dev.to | HTTP fetch | Developer articles | Partial | | arXiv | HTTP fetch | Academic papers | No | | Wikipedia | HTTP fetch | Encyclopedic overviews | No |
For Twitter/X URLs: Use bird read, bird thread, bird replies to get full context.
For general topics:
Results are scored using engagement-aware ranking:
Tier 1 (Reddit, Twitter): 45% relevance + 25% recency + 30% engagement Tier 2 (HN, SO, Lobsters): Same formula with -5 tier penalty Tier 3 (Web, blogs, docs): 55% relevance + 45% recency - 15 penalty
Date confidence affects scoring:
| Parameter | Description | Example |
|-----------|-------------|---------|
| query | Search query (required) | "machine learning" |
| count | Results to return | 10 |
| freshness / recency | Time filter | "day", "week", "month", "year" |
| date_after | Results after date (YYYY-MM-DD) | "2024-01-01" |
| date_before | Results before date (YYYY-MM-DD) | "2024-06-30" |
| domain_filter | Allow/deny domains (max 20) | ["nature.com", ".edu"] or ["-pinterest.com"] |
| country | 2-letter ISO code | "US", "DE", "JP" |
| language | ISO 639-1 language | "en", "de", "ja" |
| content_budget | If supported, max content tokens/bytes | 50000 |
RECOMMENDATIONS ("best X", "top X"):
NEWS ("latest", "what's happening"): Example (tool-agnostic pseudocode):
search(query="topic", freshness="week")
HOW_TO ("how to", "tutorial"):
COMPARISON ("vs", "compare"):
GENERAL:
For Reddit posts found via web search, enrich with actual engagement data:
# Get real upvotes and top comments
curl "https://www.reddit.com/r/SUBREDDIT/comments/POST_ID.json?limit=5" -H "User-Agent: Research Agent"
When HTTP fetch provides insufficient content, scout can use browser automation to navigate and extract richer information.
| Scenario | Use Browser? | Reason | |----------|--------------|--------| | API/JSON endpoint works | No | HTTP fetch is faster | | JavaScript-rendered content | Yes | Content not in initial HTML | | Multi-page navigation needed | Yes | Following links, pagination | | Content behind cookie wall | Yes | Can accept cookies | | Login required | No | Don't authenticate automatically |
CRITICAL: Before opening any URL in the browser, assess its trust level:
| Trust Tier | Action | Examples | |------------|--------|----------| | Tier 1: Trusted | Open automatically | github.com, stackoverflow.com, reddit.com, arxiv.org, *.edu | | Tier 2: Uncertain | Ask user first | Personal blogs, unknown company sites, tutorial sites | | Tier 3: Blocked | Never open | URL shorteners, suspicious TLDs, login-required sites |
github.com, gitlab.com, bitbucket.org
stackoverflow.com, stackexchange.com
reddit.com, news.ycombinator.com, lobste.rs
dev.to, medium.com, hashnode.dev
arxiv.org, wikipedia.org, *.edu
docs.*, developer.*, *.readthedocs.io
npmjs.com, pypi.org, crates.io, rubygems.org
agent-browser open <url>
agent-browser wait --load networkidle
agent-browser snapshot # Get page content
agent-browser screenshot research-capture.png # Optional visual capture
agent-browser close
bird search "<topic>" --json -n 15 --plain
bird read "<url_or_id>" --json --plain
bird thread "<url>" --json --plain
bird replies "<url>" --json --plain -n 20
bird news --json -n 10
https://hn.algolia.com/api/v1/search?query=<topic>&tags=story
https://api.stackexchange.com/2.3/search?order=desc&sort=relevance&intitle=<topic>&site=stackoverflow
https://lobste.rs/search.json?q=<topic>&what=stories&order=relevance
https://dev.to/api/articles?tag=<topic>&per_page=10
http://export.arxiv.org/api/query?search_query=all:<topic>&max_results=10
https://en.wikipedia.org/w/api.php?action=query&format=json&list=search&srsearch=<topic>
CRITICAL: Write for a human who wants to read, understand, and act — not a reference document.
Your output should read like a knowledgeable friend briefing you over coffee. Concise, opinionated, and actionable.
## VoiceInk Alternatives for Linux (2026)
**Best overall: hyprwhspr** — native Parakeet V3 support, built for Wayland, system-wide dictation via ydotool. If you're on a modern Linux desktop with an NVIDIA GPU, this is the clear winner. It supports multiple backends (Parakeet, Whisper, REST API, WebSocket streaming) so you're not locked in. Caveat: Wayland-only — won't work on X11 sessions.
**Runner-up: OpenWhispr** — cross-platform (Linux/Mac/Windows), supports Parakeet via sherpa-onnx plus cloud fallbacks (OpenAI, Anthropic, Groq). Available as AppImage, .deb, .rpm, and Flatpak so installation is painless. Supports both Wayland and X11, which makes it the safer choice if you're not sure about your display server. Also has a custom dictionary feature to improve accuracy for domain-specific terms.
**Honorable mention: Handy** — Rust/Tauri app with CPU-optimized Parakeet V3. The most hackable option — explicitly designed to be forked and extended. Good if you don't have an NVIDIA GPU since it runs Parakeet on CPU. Still a younger project with a smaller community than the top two.
**Skip:** Vocalinux and Nerd Dictation don't support custom models — locked to Whisper/VOSK only. Nerd Dictation is Python-hackable but would need a fork to add Parakeet.
### Things to know
- Parakeet TDT V3 is dramatically faster than Whisper — RTFx >2,000 means 60min of audio transcribed in ~1 second on GPU. Worth the switch from Whisper if you haven't already.
- If your dictation tool only supports Whisper API, you can run a standalone Parakeet server (achetronic/parakeet on GitHub) that exposes a Whisper-compatible API — drop-in replacement.
- hyprwhspr requires ydotool 1.0+ for system-wide paste. Older distros may ship an older version.
- All three top picks are fully offline-capable — no data leaves your machine.
**Bottom line:** Start with hyprwhspr if you're on Wayland with NVIDIA. Use OpenWhispr if you need X11, cross-platform, or prefer a packaged install. Handy if you want to hack on the source.
### Sources
1. https://github.com/goodroot/hyprwhspr
2. https://github.com/OpenWhispr/openwhispr
3. https://github.com/cjpais/Handy
4. https://github.com/achetronic/parakeet
5. https://northflank.com/blog/best-open-source-speech-to-text-stt-model-in-2026-benchmarks
If HTTP fetch returns insufficient content (e.g., JavaScript-rendered pages), you may use browser automation:
Check domain trust before opening:
Use agent-browser for navigation:
agent-browser open <url>
agent-browser wait --load networkidle
agent-browser snapshot
agent-browser close
Extract and close — don't linger on sites
NEVER open more than 3-5 sites per session. NEVER fill forms or authenticate. NEVER click suspicious links.
If the user invokes scout without a depth flag for these query types, suggest or default to quick:
Save full research for:
data-ai
Example TaskFlow authoring pattern for inbox triage. Use when messages need different treatment based on intent, with some routes notifying immediately, some waiting on outside answers, and others rolling into a later summary.
data-ai
Example TaskFlow authoring pattern for inbox triage. Use when messages need different treatment based on intent, with some routes notifying immediately, some waiting on outside answers, and others rolling into a later summary.
data-ai
OpenProse VM skill pack. Activate on any `prose` command, .prose files, or OpenProse mentions; orchestrates multi-agent workflows.
data-ai
OpenProse VM skill pack. Activate on any `prose` command, .prose files, or OpenProse mentions; orchestrates multi-agent workflows.