.github/skills/tsh-transcript-processing/SKILL.md
Clean raw workshop or meeting transcripts from small talk, filler words, and off-topic tangents. Extract and structure business-relevant content into a standardized format with discussion topics, key decisions, action items, and open questions.
npx skillsauth add thesoftwarehouse/copilot-collections tsh-transcript-processingInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
This skill helps you clean raw workshop or meeting transcripts and produce a structured, business-relevant document. It removes noise (small talk, greetings, filler words, off-topic tangents) while preserving all actionable and business-critical discussion points.
Use the checklist below and track your progress:
Processing progress:
- [ ] Step 1: Identify transcript format and meeting metadata
- [ ] Step 2: Identify and tag participants
- [ ] Step 3: Remove non-business content
- [ ] Step 4: Group remaining content by discussion topics
- [ ] Step 5: Extract key decisions
- [ ] Step 6: Extract action items and open questions
- [ ] Step 7: Preserve critical raw context
- [ ] Step 8: Save the cleaned transcript
Step 1: Identify transcript format and meeting metadata
Determine the format of the raw transcript:
[Speaker Name]: text)[00:12:34] text)pdf-reader tool to extract text content first)Extract meeting metadata where available:
If metadata is not explicitly stated in the transcript, ask the user to provide it.
Step 2: Identify and tag participants
Scan the transcript for participant names or speaker labels. For each participant:
Step 3: Remove non-business content
Systematically identify and remove:
Important: When in doubt about whether content is business-relevant, keep it. It is better to preserve potentially useful context than to accidentally remove important information.
Step 4: Group remaining content by discussion topics
Analyze the cleaned content and organize it into logical discussion topics:
Step 5: Extract key decisions
Review the structured content and identify explicit and implicit decisions:
Step 6: Extract action items and open questions
Scan for action items:
Scan for open questions:
Step 7: Preserve critical raw context
Identify and preserve exact quotes or passages where the original wording is important:
Place these in a "Preserved Context" section with attribution to the speaker.
Step 8: Save the cleaned transcript
Generate the final output following the ./cleaned-transcript.example.md template.
Save the file to specifications/<workshop-name>/cleaned-transcript.md.
Review the output to ensure:
tsh-task-extracting - uses the cleaned transcript as a primary input for identifying epics and storiesdevelopment
Custom hook and composable patterns — naming, composition, stable return shapes, lifecycle cleanup, and testing strategies. Use when writing reusable logic units (React hooks, Vue composables), refactoring logic into hooks, debugging hook behavior, or reviewing hook implementations.
testing
UI verification criteria, structure checklists, severity definitions, and tolerance rules for comparing implementations against Figma designs. Use for verifying UI matches design, understanding what to check, and determining acceptable differences.
development
Discover and establish technical context before implementing any feature. Prioritize project instructions, existing codebase patterns, and external documentation in that order. Use for any task requiring understanding of project conventions, coding standards, architecture patterns, and established practices before writing code.
development
Analyze extracted epics and user stories for quality gaps, missing edge cases, and improvement opportunities. Runs domain-agnostic analysis passes, optionally enriches findings with Jira board context, and produces accept/reject suggestions that refine the task list before Jira formatting.