skills/domains/cs/ai-security-papers-guide/SKILL.md
AI security papers from top-4 security conferences
npx skillsauth add wentorai/research-plugins ai-security-papers-guideInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
A curated collection of AI security papers from the top-4 security conferences: IEEE S&P, ACM CCS, USENIX Security, and NDSS. Covers adversarial attacks, model stealing, data poisoning, privacy attacks, deepfake detection, and LLM security. Organized by year and venue, focusing exclusively on peer-reviewed work from these prestigious venues.
| Venue | Full Name | Focus | |-------|-----------|-------| | S&P | IEEE Symposium on Security and Privacy | Broad security + privacy | | CCS | ACM Conference on Computer and Communications Security | Systems security | | USENIX | USENIX Security Symposium | Systems + network security | | NDSS | Network and Distributed System Security | Network security |
AI Security (BIG4)
├── Adversarial ML
│ ├── Evasion attacks (adversarial examples)
│ ├── Poisoning attacks (backdoors, trojans)
│ ├── Model stealing (extraction, distillation)
│ └── Defenses (certified robustness, detection)
├── Privacy Attacks
│ ├── Membership inference
│ ├── Model inversion
│ ├── Attribute inference
│ └── Training data extraction
├── LLM Security
│ ├── Prompt injection
│ ├── Jailbreaking
│ ├── Data leakage
│ └── Alignment attacks
├── Deepfakes
│ ├── Generation methods
│ ├── Detection techniques
│ └── Watermarking
└── Federated Learning Security
├── Byzantine attacks
├── Gradient leakage
└── Secure aggregation
# Recent highlights
papers_2024_2025 = [
{"title": "Not What You've Signed Up For: "
"Compromising Real-World LLM-Integrated Applications",
"venue": "S&P 2024", "topic": "LLM security"},
{"title": "Prompt Stealing Attacks Against "
"Text-to-Image Generation Models",
"venue": "S&P 2024", "topic": "Prompt extraction"},
{"title": "Backdoor Attacks on Language Models",
"venue": "CCS 2024", "topic": "NLP backdoors"},
{"title": "Membership Inference in LLMs",
"venue": "USENIX 2024", "topic": "Privacy"},
]
for p in papers_2024_2025:
print(f"[{p['venue']}] {p['title']}")
print(f" Topic: {p['topic']}")
### Emerging Areas (2024-2025)
1. **LLM security** — Jailbreaking, prompt injection, agent attacks
2. **Supply chain attacks** — Poisoned models, malicious packages
3. **Multi-modal attacks** — Cross-modal adversarial examples
4. **Agent security** — Attacks on LLM-based autonomous systems
5. **Watermarking** — LLM output detection, IP protection
6. **Unlearning** — Machine unlearning verification and attacks
tools
10 document processing skills. Trigger: extracting text from PDFs, parsing references, document Q&A. Design: parsing pipelines (GROBID, marker) and structured extraction tools.
documentation
Guide to tldraw for infinite canvas whiteboarding and diagram creation
testing
Create graphical abstracts, schematic diagrams, and scientific illustrations
documentation
Create UML diagrams and architecture visualizations with PlantUML