speech-to-text

Here are 4,162 public repositories matching this topic...

ggml-org / whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated Dec 6, 2025
C++

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

Updated Jun 19, 2025
C++

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated Nov 19, 2025
Python

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Oct 21, 2025
Python

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated Nov 6, 2025
TypeScript

jianchang512 / pyvideotrans

Translate the video from one language to another and embed dubbing & subtitles.

text-to-speech speech-to-text video-transition

Updated Nov 24, 2025
Python

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

Updated Sep 22, 2025
Shell

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated Oct 24, 2025
Jupyter Notebook

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Dec 3, 2025
Python

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, and a websocket server/client, with support for 12 programming languages.

Updated Dec 5, 2025
C++

KoljaB / RealtimeSTT

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

python realtime speech-to-text

Updated Jul 11, 2025
Python

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

Updated Nov 19, 2025
Python

Zackriya-Solutions / meeting-minutes

Open-source, Rust-based AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization. 100% local processing, no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 self-hosted AI meeting note taker for macOS & Windows.

windows rust mac ai offline-first self-hosted speech-to-text transcription whisper meeting-minutes meeting-notes privacy-tools parakeet privacy-focused llm whisper-cpp local-ai ollama ai-meeting-assistant

Updated Dec 6, 2025
Rust

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

python tensorflow keras cnn python3 speech-recognition speech-to-text ctc chinese-speech-recognition asrt

Updated Sep 6, 2025
Python

cjpais / Handy

A free, open source, and extensible speech-to-text application that works completely offline.

cross-platform accessibility speech-to-text tauri-v2

Updated Dec 5, 2025
TypeScript

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

multilingual python ai pytorch speech-recognition speech-to-text asr cross-lingual speech-emotion-recognition audio-event-classification aigc llm gpt-4o

Updated Aug 15, 2025
Python

TalAter / annyang

💬 Speech recognition for your site

voice speech speech-recognition speech-to-text

Updated Aug 7, 2024
JavaScript

snakers4 / silero-models

Silero Models: pre-trained text-to-speech models made embarrassingly simple

Updated Dec 5, 2025
Jupyter Notebook

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Nov 26, 2025
Jupyter Notebook

abus-aikorea / voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

text-to-speech translator audiobook podcasts tts speech-synthesis subtitles speech-recognition webui speech-to-text karaoke transcription gradio whisper voice-conversion voice-cloning yt-dlp faster-whisper whisperx

Updated Dec 5, 2025
Python
