archived/skills/extractor/SKILL.md
Archive information extraction - assess archival documents and identify information worth preserving in the knowledge base.
npx skillsauth add nicsuzor/academicops extractorInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Assess archival documents (emails, correspondence, receipts) to identify information worth preserving in the knowledge base. This skill provides judgment criteria only - actual storage is handled by Skill(skill="remember").
Filter noise from archival documents and identify genuinely significant information that should be preserved. Most archival documents have NO long-term value - be highly selective.
For each document, use LLM judgment to answer:
Trust your judgment - you understand context better than regex patterns.
Think: Is this a concrete action or outcome? Or just noise?
Think: Did Nic DO something significant? Or just receive an invitation?
Think: Is this a concrete milestone or application? Or just planning?
Think: Is this the start/milestone of an important relationship? Or routine correspondence?
Think: Will we need this for financial reporting?
Most documents are noise. Skip:
Test: Would I forget this within a week with no consequence?
When uncertain:
#review-classification for manual reviewEXTRACT: "Your paper 'Platform Governance' has been accepted by Nature" → Clear publication outcome
SKIP: "CFP: Submit to Journal of Platform Studies by Dec 1" → Generic invitation, no submission
EXTRACT: "Congratulations, your FT210100263 grant has been awarded $500K" → Grant outcome with specifics
SKIP: "Reminder: Your FT210100263 annual report is due next month" → Administrative reminder
EXTRACT: "Following our chat at IGF, I'd love to collaborate on disinformation research..." → New substantive collaboration starting
SKIP: "You're invited to join our webinar on content moderation" → Mass invitation, not personal
EXTRACT: Email from examiner with detailed feedback on PhD thesis → Supervision milestone
SKIP: "HDR student seminars happening this week" → Generic announcement
EXTRACT: "OSB Case 2025R final decision: Upheld with modifications..." → Actual OSB work product
SKIP: "OSB weekly meeting reminder" → Routine scheduling
When you decide to extract, identify these elements:
Always extract canonical identifiers to allow future reference:
Store identifier inline with extracted information.
Use Skill(skill="remember") to store extracted information in the knowledge base. The remember skill handles:
Store information using the memory server with properly formatted markdown. Tag extracted information appropriately for searchability and future retrieval.
Common approaches for email extraction:
#email, #correspondence#project#timeline, #deadline#statusStorage principles:
#review-classificationtools
Streamlit implementation of the analyst presentation layer. Use when building or updating a Streamlit dashboard that displays pre-computed research data. This is the Streamlit-specific HOW for the tech-agnostic principles in the aops-tools analyst skill — display only, never transform.
tools
Python plotting and statistical-modelling libraries (matplotlib, seaborn, statsmodels) for the analyst presentation and statistical-methodology layers. Use when producing publication-quality figures or fitting statistical models in Python. Library-specific HOW for the tech-agnostic principles in the aops-tools analyst skill.
tools
dbt (data build tool) implementation of the analyst transformation layer. Use when a project has a dbt/ directory or you need to build, test, or document SQL transformations as version-controlled, reproducible dbt models. This is the dbt-specific HOW for the tech-agnostic principles in the aops-tools analyst skill.
development
Core academicOps skill — institutional memory, strategic coordination, workflow routing, and framework governance. Merges butler (chief-of-staff) with framework development conventions.