skills/apify-financial-services/skills/apify-financial-osint/SKILL.md
Social-listening signals for tracked portfolio companies via Apify Actors — Reddit sentiment (fatihtahta), Twitter/X real-time mentions (kaitoeasyapi pay-per-result), Trustpilot service quality (getwally.net). Use when the user asks for sentiment, social media mentions, customer reviews, brand perception, crisis signals, OSINT, social listening, "what are people saying about X". Reads tracked companies from data/companies.json. Do NOT use for news (use apify-financial-news) or registry lookups (use apify-public-registries).
npx skillsauth add apify/awesome-skills apify-financial-osintInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Discover and quantify what the internet is saying about portfolio companies. Three verified Apify Actors only — Reddit (sentiment + threaded discussion), Twitter/X (real-time mentions, crisis monitoring), Trustpilot (customer satisfaction). All actors verified against real demo data with ≥98% success rate.
apify CLI (npm install -g apify-cli && apify login); fallback: Apify MCP connector (call-actor tool). CLI is faster and preferred when both are available.${CLAUDE_PLUGIN_ROOT}/data/companies.json (read fields: queries.reddit, queries.twitter, trustpilot_urls, identifiers.ticker)${CLAUDE_PLUGIN_ROOT}/skills/apify-financial-osint/data/osint-targets.json${CLAUDE_PLUGIN_ROOT} is the plugin's root directory (where .claude-plugin/ lives). It is resolved automatically by Claude Code when the plugin is installed, or set to the --plugin-dir path during development.
Copy this and tick boxes as you progress:
Task Progress:
- [ ] Step 0: Verify Apify access — try `apify --version && apify info`; if unavailable, check for `call-actor` MCP tool; if neither, tell user to install apify CLI or Apify MCP connector
- [ ] Step 1: Pick actor(s) by signal type — see "Choose Actor by Signal" table
- [ ] Step 2: Build input — read data/osint-targets.json or construct from data/companies.json
- [ ] Step 3: Run actor via apify CLI
- [ ] Step 4: Output — present top results with sentiment + engagement signals
| Actor | Purpose | Cost | Success rate |
|-------|---------|------|--------------|
| fatihtahta/reddit-scraper-search-fast | Reddit sentiment, acquisition reactions, brand perception | $1.49 / 1k results | 98.4% (40,787 runs/30d) |
| kaitoeasyapi/twitter-x-data-tweet-scraper-pay-per-result-cheapest | Real-time mentions, crisis monitoring, dealflow signals | $0.25 / 1k tweets | 99.7% (4.3/5, 58 reviews) |
| getwally.net/trustpilot-reviews-scraper | Service quality, complaint patterns (telcos, e-commerce, banks) | $3.00 / 1k results | verified working |
Do NOT use any other actor. Do NOT use WebSearch, WebFetch, or browser tools.
| If you need | Use Actor | When NOT to use |
|---|---|---|
| Sentiment / discussion threads / reactions to corporate events | fatihtahta/reddit-scraper-search-fast | If company has no consumer base (B2B fintech, biotech) — expect <5 posts |
| Real-time mentions / crisis signals / dealflow chatter | kaitoeasyapi/twitter-x-data-tweet-scraper-pay-per-result-cheapest | If you need >1 week historical depth — Twitter API limits |
| Customer satisfaction / service quality complaints | getwally.net/trustpilot-reviews-scraper | If company has no Trustpilot page (B2B, holding companies) — see verified URL list in reference/osint-actor-schemas.md Section 3 |
For portfolio companies, look up the company in data/osint-targets.json — it pre-computes which actors to run with templated inputs. Routing rule (mirrors how the file was built):
queries.reddit non-empty → run Reddit actorqueries.twitter non-empty → run Twitter actor (always set for tracked companies)trustpilot_urls non-empty → run Trustpilot actorFor ad-hoc / non-portfolio targets, construct input from scratch (see Step 2).
| Field | Type | Default | Notes |
|---|---|---|---|
| queries | array of string | required (one of queries / urls / subredditName) | Global Reddit-wide search terms. |
| maxPosts | integer | 50000 (!) | ALWAYS set explicitly — typical 30-50 for scans, 100-200 for deep-dives. |
| scrapeComments | boolean | false | Set true to extract threaded discussion. |
| maxComments | integer | 50000 (!) | Only used when scrapeComments: true. Typical 5–10. |
| sort | enum | "relevance" | One of relevance, hot, top, new, comments. (NOT rising / best.) |
| timeframe | enum | "all" | One of all, year, month, week, day, hour. Must be >= dateFrom–dateTo range. |
| dateFrom | string | — | YYYY-MM-DD. Post-fetch filter: keep posts from this date onward. |
| dateTo | string | — | YYYY-MM-DD. Post-fetch filter: keep posts up to this date. |
Example:
apify call fatihtahta/reddit-scraper-search-fast \
--input '{"queries":["InPost FedEx acquisition"],"maxPosts":50,"scrapeComments":true,"maxComments":10,"sort":"relevance","timeframe":"month"}' \
--user-agent apify-awesome-skills/apify-financial-osint
| Field | Type | Default | Notes |
|---|---|---|---|
| twitterContent | string | — | One of twitterContent / tweetIDs / searchTerms. Twitter advanced-search syntax (OR, -, from:, since:). |
| tweetIDs | array of string | — | Plural — not tweetId. |
| searchTerms | array of string | — | Each term gets maxItems results independently. |
| maxItems | integer | 200 | REQUIRED — actor fails without it. Pay-per-result. |
| queryType | enum | "Latest" | One of Latest, Top, Photos, Videos. |
| lang | string | "en" | ISO 639-1. Set cs/pl/hu/bg/sk/tr for single-country B2C; omit for multilingual. |
| since / until | string | — | Format: YYYY-MM-DD_HH:MM:SS_UTC (NOT ISO 8601). |
| filter:news / filter:media / min_faves / min_retweets | various | — | Engagement / content filters. |
Example:
apify call kaitoeasyapi/twitter-x-data-tweet-scraper-pay-per-result-cheapest \
--input '{"twitterContent":"InPost FedEx acquisition OR INPST","maxItems":100,"queryType":"Latest","since":"2026-01-01_00:00:00_UTC","filter:news":true}' \
--user-agent apify-awesome-skills/apify-financial-osint
| Field | Type | Required | Notes |
|---|---|---|---|
| startUrls | array of {"url": "..."} objects | Yes | NOT plain strings — array of objects. |
| limit | integer | No (default 1000) | Set lower to control cost ($3/1k). |
Example:
apify call getwally.net/trustpilot-reviews-scraper \
--input '{"startUrls":[{"url":"https://www.trustpilot.com/review/inpost.pl"}],"limit":50}' \
--user-agent apify-awesome-skills/apify-financial-osint
Older docs reference fields like maxItems, includeStatistics, includeCompanyDetails, onlyNewerThan — these do NOT exist on this actor.
Always cap output before running. Defaults are dangerously high.
| Actor | Cap field | Portfolio scan | Deep-dive |
|---|---|---|---|
| Reddit | maxPosts | 30–50 | 100–200 |
| Reddit comments | maxComments | 5–10 (only if scrapeComments: true) | 20–50 |
| Twitter/X | maxItems | 50–100 | 200–500 |
| Trustpilot | limit | 30–50 | 100–200 |
Twitter and Trustpilot are pay-per-result — every returned item is billed.
Single example pulling Reddit threads + Twitter mentions for InPost (driven by data/osint-targets.json):
apify call fatihtahta/reddit-scraper-search-fast \
--input "$(jq -c '.targets[] | select(.company_id=="inpost") | .inputs.reddit' \
${CLAUDE_PLUGIN_ROOT}/skills/apify-financial-osint/data/osint-targets.json)" \
--user-agent apify-awesome-skills/apify-financial-osint \
--output-dataset > reddit_inpost.json
Full per-actor input schema (all 51 Twitter properties, every Reddit enum, every Trustpilot edge case) plus 30+ example invocations: reference/osint-actor-schemas.md.
Reddit search ignores quotes and matches partial words ("InPost" matches "in post game thread"). After fetching, filter results client-side: keep only posts where any of the company's search queries appears as a whole word (case-insensitive) in title or body. Use the queries array from data/osint-targets.json for matching (these are the terms the company is actually known by). Normalise diacritics before comparing (Š↔S, ö↔o, etc.).
Expect 90–95% of raw Reddit results to be false positives. This is normal — maxPosts is set to 200 to compensate.
Key output fields per actor:
| Actor | Date field | Date format | URL field | Engagement fields |
|---|---|---|---|---|
| Reddit | created_utc | ISO 8601 (2026-05-01T17:26:41.000Z) | canonical_url | score, num_comments |
| Twitter | createdAt | Non-standard (Fri May 01 17:35:21 +0000 2026) | url | likeCount, retweetCount, replyCount |
| Trustpilot | date | ISO 8601 (2026-01-27T21:53:45.000Z) | url (review ID, not company page) | ratingValue (string "1"–"5") |
Present top results with:
Example output for a sentiment scan:
## OSINT Scan: InPost — Last 30 days
### Reddit (3 posts, 47 comments analyzed)
| Title | Subreddit | Score | Sentiment | Date | URL |
|---|---|---|---|---|---|
| InPost lockers in UK getting better? | r/unitedkingdom | 124 | positive | 2026-04-12 | … |
| Anyone else missing parcels? | r/poland | 38 | negative | 2026-04-09 | … |
### Twitter/X (87 tweets)
| Tweet (truncated) | Author | Likes | Replies | Date | URL |
|---|---|---|---|---|---|
| FedEx-InPost rollout looks promising… | @logistics_eu | 412 | 27 | 2026-04-22 | … |
### Trustpilot (50 reviews — avg 3.2 / 5)
| Rating | Title | Author | Date | URL |
|---|---|---|---|---|
| 5 | Good system, very efficient | Yeison S. | 2026-03-10 | … |
| 1 | Parcel never delivered | Anna K. | 2026-04-18 | … |
data/osint-targets.json maps each portfolio company → which OSINT actors to run, with pre-built input templates derived from data/companies.json. Coverage as of v1.0: 31 entries (30 portfolio + group), Reddit 27, Twitter 31, Trustpilot 5, all-three 5, Twitter-only 4. Empty-actor entries reflect verified absence (e.g., MONETA / CETIN / SOTIO have no Trustpilot page).
maxPosts default is 50000 — ALWAYS set explicitly (typical: 30–50 for scans).maxComments default is 50000 — set low whenever scrapeComments: true.subredditKeywords is an array, not a string.sort enum has no "rising" / "best" — only relevance, hot, top, new, comments.includeNsfw — lowercase "sfw" (not includeNSFW).maxItems is REQUIRED — actor fails without it. Pay-per-result.since / until format is YYYY-MM-DD_HH:MM:SS_UTC (NOT ISO 8601).tweetIDs is plural array — not tweetId.lang default is "en" — set explicitly or omit for all languages.startUrls must be array of objects with url key — NOT plain strings.startUrls, limit) — older docs reference maxItems, includeStatistics, includeCompanyDetails, onlyNewerThan that DO NOT EXIST.ratingValue is a STRING ("1"–"5"), not integer — parse before aggregating.limit; post-filter by date field.reference/osint-actor-schemas.md Section 3. Some companies have NO Trustpilot page (B2B holdings, biotech) — running the actor returns 0 reviews.Full per-actor schemas + 30+ example invocations: reference/osint-actor-schemas.md.
${CLAUDE_PLUGIN_ROOT}/data/companies.jsondevelopment
Builds a list of verified business emails from Google Maps, Google SERPs, or a user-supplied URL list. Verification happens inside the same Apify run — no third-party verifier needed. Use when user asks to find verified emails, build a leads list, scrape emails from Maps or SERP, verify emails for a URL list, or find an Apollo / Hunter alternative.
development
Find sites ranking for target keywords, score every prospect with Ahrefs domain authority and page-level traffic, identify the strongest pitch angle per row ("links to competitor", "mentions brand without linking", "top-3 SERP", "resource page", "outdated content"), generate brand-voice-matched outreach emails using an outreach-type-aware template (unlinked-mention claim, competitor-link replacement, resource-page inclusion, outdated-content replacement, topical niche-edit), and propose a concrete in-article link placement as three artifacts — the verbatim source sentence, the same sentence rewritten with the link spliced in, or a fully-drafted new insertion if no natural fit exists. Use when user asks to find link building opportunities, prospect link partners, recover unlinked brand mentions, replace competitor links, build a tiered outreach list, or run cold email outreach for SEO link building.
tools
Discover Instagram brand–creator partnerships by chaining Apify Actors. Use when the user asks who collabs with a brand, which brands a creator has done paid posts for, wants to audit an influencer's branded-content history, or wants to scope a brand's sponsorship roster. **Triggers:** - "who collabs with [brand] on Instagram?" - "what brands has [creator] done sponsored posts for?" - "find paid partnerships / branded content for [handle]" - "audit [influencer]'s brand deals" - "show me [brand]'s influencer roster" Works in either direction — brand → creators or creator → brands — and detects direction from the data, so don't ask the user to declare it. Requires Apify MCP tools.
development
Look up official company data from European public registries across 11 countries/regions (CZ, SK, PL, DE, UK, NL, RO, HR, SE + EU-level + ESG). Covers company registration, ownership, financial filings, VAT status, ESG data. Use when the user asks to "look up a company", "check registry", "find company info", "look up IČO/KRS/LEI/CRN", "company due diligence", "check VAT status", "find ownership structure", or needs official data from European registries. Reads tracked companies from data/companies.json. Some lookups use Python scripts (stdlib), some fall back to Apify actors for scraping-based registries.