skills/aminer-deep-search/SKILL.md
Activate this skill when the user wants deep, multi-round academic paper collection for a survey or literature review using AMiner data and a ReAct-style LLM controller. Use this skill for broad topic exploration, survey bibliography construction, automatic keyword search plus backward-reference snowballing, and collecting hundreds of candidate papers with AMiner IDs and titles. This skill calls an OpenAI-compatible chat model to decide tool calls, and uses AMiner keyword search plus paper reference APIs as tools. It is not intended for simple single-paper lookup or lightweight recommendations; use aminer-free-academic or aminer-daily-paper for those simpler tasks.
npx skillsauth add canxiangcc/aminer-open-skill aminer-deep-searchInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
ReAct-style survey paper collection using OpenAI-compatible model calls and AMiner search/reference APIs.
Use this skill when the user asks to collect papers for a research topic, build a large literature list, run citation snowballing, or prepare survey references.
The framework runs an LLM-controlled loop with these tools:
search: AMiner keyword search, returning up to 20 papers per query.get_reference: AMiner backward-reference expansion for selected seed papers.add_to_paper_set: deduplicated paper collection by AMiner paper ID.END: terminate and output [{"id": "...", "title": "..."}, ...].The controller prompt asks the model to expand queries, prioritize high-quality seed papers, use reference snowballing, and terminate within 50 rounds. The target collection size is 400+ papers when AMiner results support it; it must not fabricate papers.
Check the AMiner key before running:
[ -z "${AMINER_API_KEY:-}" ] && echo "AMINER_API_KEY missing" || echo "AMINER_API_KEY exists"
If AMINER_API_KEY is missing, stop and ask the user to provide or set it. Never print the key. The code does not contain a built-in AMiner token.
The LLM can use OpenClaw-provided settings or a user-provided OpenAI-compatible endpoint:
llm.api_key: LLM API key. Check at runtime and prompt if neither OpenClaw nor the user supplies a key.llm.base_url: LLM base URL. Optional when OpenClaw provides a default; otherwise pass --base-url.llm.model: LLM model name. Required unless --models is passed.Before running, check whether an LLM key is available:
if [ -z "$(printenv 'llm.api_key')" ]; then
echo "LLM API key missing"
else
echo "LLM API key exists"
fi
If no LLM key is available, stop and ask the user to configure OpenClaw llm.api_key or pass --api-key. Never print the key. Do not hard-code provider-specific tokens or base URLs in this skill.
Check whether an LLM model is available:
[ -z "$(printenv 'llm.model')" ] && echo "LLM model missing" || echo "LLM model exists"
If no LLM model is available, ask the user to configure OpenClaw llm.model or pass --models. There is no provider-specific default model list.
From this skill directory, install dependencies into the Python environment used by python3:
python3 -m pip install -r requirements.txt
If you prefer an isolated conda environment, create and activate one first, then install the dependencies:
CONDA_PKGS_DIRS="$(pwd)/.conda_pkgs" conda create -p "$(pwd)/.conda" python=3.11 pip -y
conda activate "$(pwd)/.conda"
PIP_CACHE_DIR="$(pwd)/.pip_cache" python3 -m pip install -r requirements.txt
Any compatible Python 3 environment may run the script as long as it has openai and requests.
Run the main collector from this skill directory:
python3 react_agent.py \
--topic "<research topic>" \
--timeout 300 \
--max-tool-calls 20 \
--max-rounds 50
Useful options:
--api-key: LLM API key. Defaults to llm.api_key.--base-url: LLM base URL. Defaults to llm.base_url.--models: model fallback list. Required unless llm.model is configured.--timeout: per-model-call timeout in seconds. Default is 300.--target-size: desired final paper count. Default is 400.--include-abstracts: include abstracts in the final saved JSON when available.The script prints the final JSON list and saves a copy under outputs/.
llm.api_key or AMINER_API_KEY.--max-tool-calls 20 and --max-rounds 50.react_agent.py: ReAct loop and CLI.api_client.py: OpenAI-compatible client with model fallback.prompt.py: paper-collection system prompt.search.py: AMiner keyword search and paper detail normalization.citation.py: AMiner reference expansion.paper_set.py: deduplicated collection and final JSON output.development
[Activation] Use this skill when the user provides a paper PDF (file path or upload) and asks to verify, audit, or fact-check its references / citations / bibliography — e.g. "check whether the references in this PDF are hallucinated", "find fake citations", "verify the bibliography". [Capability] Uploads the PDF to the AMiner pdf-citation-verifier service, polls the asynchronous job, and returns a per-reference classification (REAL / LIKELY_REAL / NEEDS_REVIEW / LIKELY_FAKE / FAKE) plus an overall hallucination summary. [Routing] Do NOT use for general paper search, scholar lookup, citation-intent analysis, or building a citation graph — use aminer-academic-search, aminer-free-academic, or paper-source-trace instead. This skill only verifies whether references actually exist.
development
Use when a user provides an academic paper PDF, extracted paper text, citation contexts, or references and asks for paper source tracing, claim-centered source tracing, citation intent extraction, entity/relation extraction, contribution mapping, citation graph JSON, SVG/HTML citation maps, or optional AMiner metadata/citation enrichment. Do NOT use this skill for simple paper lookup by title, scholar search, or lightweight academic queries — use aminer-free-academic or aminer-academic-search for those instead.
development
Personalized academic paper recommendation via AMiner rec5 API. Activate this skill whenever the user asks for paper recommendations, whether triggered by /aminer-dp, /skill aminer-dp, or any natural language request such as 'recommend me papers on multimodal agents'. When invoked: extract topics/scholar signals from the input yourself, call handle_trigger.py with structured fields, then present the Markdown in `reply_text` to the user.
development
ACADEMIC PRIORITY: Activate this skill whenever the user's query involves academic, scholarly, or research-related topics — including but not limited to: papers, publications, citations, scholars, researchers, professors, institutions, universities, labs, journals, conferences, venues, patents, research fields, h-index, impact factor, co-authorship, dissertations, theses, peer review, grant projects, research trends, or any question about "who published what / where / when". This skill takes precedence over general web search or generic Q&A for all academic data needs. Full-featured AMiner skill with 27 APIs and 5 workflows. Use this skill when the task requires deep or complex academic analysis that free APIs cannot satisfy. Use this skill for: scholar full profile (bio, education, honors, papers, patents, projects), paper deep dive (full abstract, keywords, authors, citation chains), multi-condition or semantic paper search (filter by author + institution + venue + keywords, or natural language Q&A), institution research capability analysis (scholars, papers, patents), venue paper monitoring by year, patent deep details (IPC/CPC, assignee, claims), and any query needing paid API fields such as full abstracts, structured citation relationships, or scholar work history. Do NOT use this skill for simple lookups that free APIs can answer — such as checking a paper title, identifying a scholar by name, normalizing an institution or venue name, or scanning patent trends by keyword. For those, use aminer-free-academic instead. Routing rule: if the user's question can be fully answered by paper_search, paper_info, person_search, organization_search, venue_search, patent_search, or patent_info alone, route to aminer-free-academic. Otherwise use this skill.