clanker.net
COMMUNICATION inf-sh/skills

speech-to-text

Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation, multi-language, timestamps. Use for: meeting transcription, subtitles, podcast transcripts, voice notes. Triggers: speech to text, transcription, whisper, audio to text, transcribe audio, voice to text, stt, automatic transcription, subtitles generation, transcribe meeting, audio transcription, whisper ai

COMMUNICATION
USE THIS SKILL

DOWNLOAD THE APP TO INSTALL AND USE /speech-to-text ON YOUR DEVICE

Scan to open on your device
QR code for speech-to-text Opens skill content in Expo Go
COMMAND
/speech-to-text
CATEGORY
Communication
REPOSITORY
inf-sh/skills
COMMIT

SKILL PROMPT

--- name: speech-to-text description: "Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation, multi-language, timestamps. Use for: meeting transcription, subtitles, podcast transcripts, voice notes. Triggers: speech to text, transcription, whisper, audio to text, transcribe audio, voice to text, stt, automatic transcription, subtitles generation, transcribe meeting, audio transcription, whisper ai" allowed-tools: Bash(infsh *) --- # Speech-to-Text Transcribe audio to text via [inference.sh](https://inference.sh) CLI. ![Speech-to-Text](https://cloud.inference.sh/u/4mg21r6ta37mpaz6ktzwtt8krr/01jz025e88nkvw55at1rqtj5t8.png) ## Quick Start > Requires inference.sh CLI (`infsh`). Get installation instructions: `npx skills add inference-sh/skills@agent-tools` ```bash infsh login infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://audio.mp3"}' ``` ## Available Models | Model | App ID | Best For | |-------|--------|----------| | Fast Whisper V3 | `infsh/fast-whisper-large-v3` | Fast transcription | | Whisper V3 Large | `infsh/whisper-v3-large` | Highest accuracy | ## Examples ### Basic Transcription ```bash infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "https://meeting.mp3"}' ``` ### With Timestamps ```bash infsh app sample infsh/fast-whisper-large-v3 --save input.json # { # "audio_url": "https://podcast.mp3", # "timestamps": true # } infsh app run infsh/fast-whisper-large-v3 --input input.json ``` ### Translation (to English) ```bash infsh app run infsh/whisper-v3-large --input '{ "audio_url": "https://french-audio.mp3", "task": "translate" }' ``` ### From Video ```bash # Extract audio from video first infsh app run infsh/video-audio-extractor --input '{"video_url": "https://video.mp4"}' > audio.json # Transcribe the extracted audio infsh app run infsh/fast-whisper-large-v3 --input '{"audio_url": "<audi [... prompt truncated for preview ...]