skills/seeing-as-agent/SKILL.md
Debugging methodology for LLM tool calls — trace from the model's side first, use runtime evidence over code inference, and follow live request chains for reasoning/thinking bugs.
npx skillsauth add duruii/scientific-skills seeing-as-agentInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Every tool has two users: the human who triggers the agent, and the model that decides how to call it. Design for both.
The model has no memory across turns, no outside world except through tools, non-deterministic output, and a training cutoff. Designing from that existence means asking:
When a tool call goes wrong, resist the urge to fix the code immediately. Instead:
llm_request — name, description, parameters. This is the model's entire action space.tool_use block — the exact arguments it passed. Not what you expected, what it actually sent.For LLM request-shape bugs, tool-call regressions, reasoning_content / thinking mismatches, or any issue involving "was field X really sent?", follow this order:
messages, turn_metrics.llm_request, turn_metrics.llm_response, and the structured trace/log buffer before reading implementation code.For any reasoning_content / thinking-model regression:
POST /api/v1/conversations/:id/messages/stream.StreamMessage -> buildSmartContext -> provider request -> stream parser -> persistence/backfill.testing
Research-grade single-paper analysis with evidence-grounded structured extraction and internal self-evaluation. Use when users ask to summarize or screen one academic paper from an arXiv link/ID or local PDF and need verifiable claims with citations, especially for Chinese-language output to students.
tools
Use browser MCP to access IEEE Xplore through university library proxy, preserve institutional session, run keyword/advanced/journal search, and optionally post-filter by CCF rank (for example CCF-A) with structured output.
testing
Fetch and organize course transcripts from DeepLearning.AI. Use this skill whenever the user mentions DeepLearning.AI courses, wants to download course transcripts, subtitles, or VTT files from a course, or asks to organize lesson transcripts from learn.deeplearning.ai. It does NOT trigger for general video subtitle downloading — only for DeepLearning.AI courses specifically.
development
Query CCF (China Computer Federation) venue rankings for conferences and journals using year-partitioned reference data. Use when users ask for CCF level (A/B/C), category/domain, or rank verification for a venue abbreviation/full name (for example ICML, CVPR, TOCS), or request batch lookup/comparison across venues or years.