funasr

Here are 64 public repositories matching this topic...

modelscope / FunASR

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Updated Jun 21, 2026
Python

FunAudioLLM / SenseVoice

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

multilingual python pytorch audio-analysis speech-recognition speech-to-text transcription asr emotion-detection cross-lingual language-identification speech-emotion-recognition voice-ai llm funasr sensevoice audio-event-detection whisper-alternative

Updated Jun 20, 2026
C

modelscope / FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.

Updated Jun 21, 2026
Python

yan5xu / ququ

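Free, open-source Wispr Flow alternative | A next-generation Chinese desktop voice workflow that integrates local FunASR models with configurable large language models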

open-source electron-app speech-to-text chinese-speech-recognition privacy-first voice-dictation local-processing funasr wispr-flow-alternative ai-text-processing

Updated Oct 8, 2025
JavaScript

harry0703 / AudioNotes

Quickly extracts content from audio and video and organizes it into structured Markdown notes

python ai whisper asr ollama qwen2 funasr

Updated Jul 26, 2024
Python

wwbin2017 / bailing

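Bailing (百聆) is a GPT-4o-style voice conversation bot built from ASR + LLM + TTS. It integrates strong large models such as DeepSeek R1 and connects to openClaw to act as a true personal voice assistant, with latency as low as 800 ms. It runs on low-spec machines such as Macs and supports interruption.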

ai tts openai asr voice-assistant llm chatgpt deepseek funasr gpt-4o chattts openclaw

Updated Apr 6, 2026
Python

FunAudioLLM / Fun-ASR

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

audio pytorch speech-recognition speech-to-text transcription asr speaker-diarization chinese-dialects real-time-asr gguf funasr audio-language-model multilingual-asr fun-asr whisper-alternative 31-languages llm-asr

Updated Jun 20, 2026
C

HG-ha / MTools

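MTools is a powerful multi-purpose desktop application that combines audio/video processing, image editing, text operations, and encoding tools, with built-in AI enhancements. It aims to simplify your workflow and boost productivity.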

tools ai voice speech whisper ppocr funasr

Updated Jun 9, 2026
Python

233stone / vocotype-cli

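VocoType is a privacy-safe voice input tool that runs locally on-device. A hotkey converts speech to text in real time and types it into the current application. It supports speech-to-text MCP, AI text refinement, custom replacement dictionaries, and transcription of recordings and videos, making voice input more efficient and more secure.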

asr funasr

Updated May 16, 2026
Python

TheDeathDragon / LiveTranslate

Real-time audio translation for Windows: captures system audio + mic, runs ASR (Whisper/SenseVoice), and translates via an LLM API with streaming display. Suited to VTubers, livestreamers, and watching foreign-language content.

voice-recognition speech-recognition asr live-translation vtuber real-time-translation vtuber-software funasr subtitle-overlay vuter-subtitle