Agent skill
transcribe
Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.
Stars
163
Forks
31
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/development/transcribe
SKILL.md
Transcribe
Speech-to-text using Groq Whisper API.
Setup
The script needs GROQ_API_KEY environment variable. Check if already set:
bash
echo $GROQ_API_KEY
If not set, guide the user through setup:
- Ask if they have a Groq API key
- If not, have them sign up at https://console.groq.com/ and create an API key
- Have them add to their shell profile (~/.zshrc or ~/.bashrc):
bash
export GROQ_API_KEY="<their-api-key>" - Then run
source ~/.zshrc(or restart terminal)
Usage
bash
{baseDir}/transcribe.sh <audio-file>
Supported Formats
- m4a, mp3, wav, ogg, flac, webm
- Max file size: 25MB
Output
Returns plain text transcription with punctuation and proper capitalization to stdout.
Didn't find tool you were looking for?