skills/capabilities/reddit-scraper/SKILL.md
Scrape and search Reddit posts using Apify. Use when you need to find Reddit discussions, track competitor mentions, monitor product feedback, discover pain points, or analyze subreddit content. Supports keyword filtering, time-based searches, and subreddit-specific queries.
npx skillsauth add athina-ai/goose-skills reddit-scraper
Install this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
Scrape Reddit posts and comments using the Apify parseforge/reddit-posts-scraper actor.
Requires APIFY_API_TOKEN env var (or --token flag). Install dependency: pip install requests.
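A minimal sketch of how the token could be resolved, assuming the --token flag takes precedence over APIFY_API_TOKEN when both are set (the source does not state the precedence). `resolve_token` is a hypothetical helper for illustration, not a function taken from search_reddit.py.

```python
import os


def resolve_token(cli_token=None, env=None):
    """Return the Apify token from the CLI flag or the environment.

    Assumption: an explicit --token value wins over APIFY_API_TOKEN.
    """
    env = os.environ if env is None else env
    token = cli_token or env.get("APIFY_API_TOKEN")
    if not token:
        raise RuntimeError("Set APIFY_API_TOKEN or pass --token")
    return token
```

Keeping the token in the environment avoids leaking it into shell history, which is presumably why the env var is the preferred option.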
# Top posts from r/growthhacking in last week
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit growthhacking --days 7 --sort top --time week
# Hot posts from multiple subreddits
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit "growthhacking,gtmengineering" --days 7 --sort hot
# Keyword-filtered competitor tracking
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit LLMDevs \
--keywords "Langfuse,Arize,Langsmith" \
--days 30
# Human-readable summary table
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit growthhacking --days 7 --output summary
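The script builds a Reddit listing URL from the subreddit, sort, and time options (e.g. https://www.reddit.com/r/growthhacking/top/?t=week) and runs the parseforge/reddit-posts-scraper actor via REST API.

| Flag | Default | Description |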
|------|---------|-------------|
| --subreddit | required | Subreddit name(s), comma-separated |
| --keywords | none | Keywords to filter (comma-separated, OR logic) |
| --days | 30 | Only include posts from the last N days |
| --max-posts | 50 | Max posts to scrape per subreddit |
| --sort | top | Sort: hot, top, new, rising |
| --time | week | Time window for top sort: hour, day, week, month, year, all |
| --output | json | Output format: json or summary |
| --token | env var | Apify token (prefer APIFY_API_TOKEN env var) |
| --timeout | 300 | Max seconds to wait for the Apify run |
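The --keywords and --days flags describe a post-filtering step that might look like the sketch below. It assumes case-insensitive substring matching against the post title and body, and ISO 8601 `createdAt` timestamps as in the output example; the real script may match differently. `filter_posts` and its helpers are hypothetical names.

```python
from datetime import datetime, timedelta, timezone


def matches_keywords(post, keywords):
    """True if ANY keyword appears in the title or body (OR logic)."""
    if not keywords:
        return True  # no keywords means no keyword filtering
    text = f"{post.get('title') or ''} {post.get('selfText') or ''}".lower()
    return any(kw.lower() in text for kw in keywords)


def within_days(post, days, now=None):
    """True if the post was created within the last `days` days."""
    now = now or datetime.now(timezone.utc)
    created = datetime.fromisoformat(post["createdAt"].replace("Z", "+00:00"))
    return created >= now - timedelta(days=days)


def filter_posts(posts, keywords_arg, days, now=None):
    """Apply the comma-separated keyword filter and the age cutoff."""
    keywords = [k.strip() for k in (keywords_arg or "").split(",") if k.strip()]
    return [
        p for p in posts
        if matches_keywords(p, keywords) and within_days(p, days, now)
    ]
```

Because the keywords combine with OR logic, a longer keyword list widens the result set rather than narrowing it.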
Small or low-traffic subreddits (e.g. r/gtmengineering) may return zero posts with --sort hot because the hot feed is nearly empty. Use --sort top --time week (or month) instead. This scrapes the top-ranked posts over the time window and reliably returns results.
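The workaround for an empty hot feed could be automated roughly as follows. This is a sketch only: `fetch` is a hypothetical callable standing in for whatever runs the scraper and returns a list of posts, and the script itself may not retry automatically.

```python
def fetch_with_fallback(fetch, subreddit):
    """Try the hot feed first; if it is empty, retry with top posts of the week.

    `fetch(subreddit, sort, time)` is a hypothetical callable that
    returns a list of post dicts.
    """
    posts = fetch(subreddit, "hot", None)
    if posts:
        return posts
    # Hot feed was empty (common on low-traffic subreddits): use top/week.
    return fetch(subreddit, "top", "week")
```

Each retry is a second actor run, so this trades extra Apify usage for fewer empty results.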
If calling the Apify API directly (e.g. via curl), note these required fields:
{
  "startUrls": [{"url": "https://www.reddit.com/r/growthhacking/top/?t=week"}],
  "maxPostCount": 50,
  "scrollTimeout": 40,
  "searchType": "posts",
  "proxyConfiguration": {"useApifyProxy": true}
}
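The input above could be assembled programmatically along these lines. This sketch only builds the payload for a single subreddit and does not call the API; `build_start_url` and `build_actor_input` are hypothetical helpers, and appending `?t=` only for the top sort is an assumption based on the flag table.

```python
VALID_SORTS = {"hot", "top", "new", "rising"}
VALID_TIMES = {"hour", "day", "week", "month", "year", "all"}


def build_start_url(subreddit, sort="top", time="week"):
    """Encode sort and time window into a full Reddit listing URL."""
    if sort not in VALID_SORTS:
        raise ValueError(f"unknown sort: {sort}")
    url = f"https://www.reddit.com/r/{subreddit}/{sort}/"
    if sort == "top":
        if time not in VALID_TIMES:
            raise ValueError(f"unknown time window: {time}")
        url += f"?t={time}"  # the time window only applies to the top sort
    return url


def build_actor_input(subreddit, sort="top", time="week", max_posts=50):
    """Return the actor input dict, mirroring the required fields shown above."""
    return {
        "startUrls": [{"url": build_start_url(subreddit, sort, time)}],
        "maxPostCount": max_posts,
        "scrollTimeout": 40,
        "searchType": "posts",
        "proxyConfiguration": {"useApifyProxy": True},  # required by the actor
    }
```

Serializing the result with `json.dumps` yields a body suitable for a direct API call, for example via curl.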
Key differences from other Apify actors:
- Uses startUrls with full Reddit URLs (not a searches array)
- proxyConfiguration is required; omitting it causes an error
- Sort and time window are encoded in the URL (e.g. /top/?t=week), not separate input fields
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit "LLMDevs,MachineLearning,LocalLLaMA" \
--keywords "Langfuse,Arize,Weights & Biases,Langsmith,Braintrust" \
--days 30 --sort top --time month
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit LLMDevs \
--keywords "frustrating,difficult,hard to,wish there was,better way" \
--days 30
python3 skills/reddit-scraper/scripts/search_reddit.py \
--subreddit "LLMDevs,MachineLearning" \
--keywords "YourProductName" \
--days 7 --sort new
When presenting Reddit results to the user, always include the original post URL for every post. This is critical for allowing users to read the full discussion, comments, and context. Never return a summary table without links.
Posts are returned as a JSON array sorted by score. Each post has these fields:
{
  "id": "abc123",
  "title": "Post title",
  "author": "username",
  "subreddit": "growthhacking",
  "score": 42,
  "numComments": 15,
  "createdAt": "2026-02-18T12:00:00.000Z",
  "selfText": "Post body...",
  "url": "https://reddit.com/r/..."
}
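When presenting results, a summary that keeps the link for every post could be rendered as in this sketch. It assumes the post fields shown above; `format_summary` is a hypothetical helper and is not the script's own --output summary format.

```python
def format_summary(posts):
    """Render posts as text, highest score first, with a URL under each entry."""
    lines = []
    for post in sorted(posts, key=lambda p: p.get("score", 0), reverse=True):
        lines.append(
            f"[{post.get('score', 0)}] {post['title']} "
            f"(r/{post['subreddit']}, {post.get('numComments', 0)} comments)"
        )
        lines.append(f"    {post['url']}")  # never drop the original post link
    return "\n".join(lines)
```

Putting the URL on its own line keeps it easy to copy even when titles are long.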
See references/apify-config.md for detailed API configuration, token setup, and rate limits.