atris/skills/research-search/SKILL.md
Fast research sweep — arxiv, semantic scholar, github, web. Finds papers, scores relevance, extracts actionable insights, stores to wiki. Triggers on: research search, find papers, latest research, arxiv, what's new in, sweep papers, research sweep.
npx skillsauth add atrislabs/atris research-searchInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Find the latest research on a topic, score it for relevance, extract what you can BUILD with it, store the best finds.
/research <topic> # Sweep a topic, show top results
/research <topic> --ingest # Sweep + store best finds to wiki
/research <topic> --deep <arxiv-url> # Deep-read a specific paper
/research --sweep # Run all topics from program.md
/research --trending # What's hot this week in your areas
Read atris/skills/research/program.md for:
atris/skills/research/results.tsvFor the given topic, search ALL of these sources in parallel (use Agent tool for parallelism):
Source A — arxiv API Run via Bash:
python3 atris/skills/research/arxiv_search.py "<topic>" --after 2025-10-01 --limit 20
Returns JSON array of papers with title, authors, abstract, date, url, categories.
Source B — Semantic Scholar API Run via Bash:
python3 atris/skills/research/scholar_search.py "<topic>" --after 2025-10-01 --limit 20
Returns JSON array with title, authors, abstract, date, url, citation count, venue.
Source C — Web search
Use WebSearch tool: "<topic>" site:arxiv.org OR site:github.com 2025..2026
Source D — GitHub
Use WebSearch tool: "<topic>" site:github.com stars:>100 pushed:>2025-10-01
Merge results from all sources. Deduplicate by title similarity.
For each paper, score 1-10 on:
Compute total = (relevance * 3 + actionability * 3 + recency * 2 + novelty * 2) / 10
Show a ranked table:
# Research Sweep: <topic>
## Date: YYYY-MM-DD | Sources: arxiv, scholar, web, github | Papers found: N
| # | Score | Title | Date | Key Insight | Source |
|---|-------|-------|------|-------------|--------|
| 1 | 9.2 | ... | ... | ... | arxiv |
| 2 | 8.5 | ... | ... | ... | scholar|
For the top 5, show:
For papers the user selects (or top 3 if --ingest):
Write each top paper to atris/wiki/research/<slug>.md:
---
title: <paper title>
source: <arxiv/scholar/github url>
date: <publication date>
relevance_score: <1-10>
last_compiled: <today>
tags: [<topic tags>]
---
# <Paper Title>
**Authors:** ...
**Published:** ...
**URL:** ...
## Core Technique
<one paragraph>
## Key Results
<bullet points with numbers>
## How to Use (Inference-Time)
<practical implementation notes>
## Applies To
<which of our projects benefit>
## Limitations
<what the authors say doesn't work>
Update atris/wiki/index.md with the new pages.
Append to atris/skills/research/results.tsv:
timestamp topic papers_found top_score top_paper source_breakdown
Over time, this log shows which topics are producing the best finds and which sources are most useful.
The research program evolves:
development
Wake a team member by name — 'gm <member>' or 'wake up <member>' — and run ONE closed-loop tick: boot, inbox, claim, one bounded slice, verify, commit+push, proof, receipt. Optionally dispatch the build to an engine (codex/cursor/devin). Triggers on: gm, good morning, wake up <member>, wake the team, run a tick as <member>.
development
Dispatch coding work to an installed terminal agent — Codex, Cursor, or Devin — as an interchangeable worker engine. Claude orchestrates: writes the bounded prompt, the engine builds, Claude verifies and lands. Triggers on: use codex, use cursor, use devin, engine, dispatch to, worker agent, second opinion build.
development
Live RL tuner for skills. Watches skill invocations, reads user reaction, proposes targeted SKILL.md overlay edits, requires explicit approval, writes scorecards. The in-session half of the skill-RL loop (Path B). Triggers on: tune, sharpen, skill feedback, that was shit, that was great, make X better.
development
Run one RL improvement tick on the workspace via POST /api/improve. Ships one verifiable change, scores it, writes the scorecard. The thing you pay for. Triggers on: improve, make this better, ship one thing, run a tick, get smarter.