SenseVoice
Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
Category
AI Video
Quality
84/100
Primary source
GitHub
What is SenseVoice?
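SenseVoice is a multilingual speech understanding model that combines automatic speech recognition (ASR), emotion recognition, and audio event detection. It supports more than 50 languages and uses a non-autoregressive architecture, with a reported inference speed about 15x that of Whisper.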
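To picture how one result can carry a transcript together with language, emotion, and audio event labels, here is a minimal sketch. It assumes the raw model output is a string prefixed with tags such as `<|en|><|HAPPY|><|Speech|><|withitn|>`. That tag layout, the sample string, and the names `parse_tagged` and `RichTranscript` are illustrative assumptions, not part of this listing or a documented API.

```python
import re
from dataclasses import dataclass
from typing import List

# Assumed tag syntax: <|LABEL|>. This is an illustration, not a documented contract.
TAG = re.compile(r"<\|([^|]*)\|>")


@dataclass
class RichTranscript:
    tags: List[str]  # labels in order of appearance, e.g. language, emotion, event
    text: str        # transcript with all tags stripped


def parse_tagged(raw: str) -> RichTranscript:
    """Split a tagged output string into its labels and the plain transcript."""
    tags = TAG.findall(raw)
    text = TAG.sub("", raw).strip()
    return RichTranscript(tags=tags, text=text)


if __name__ == "__main__":
    # Hypothetical sample string, made up for this sketch.
    sample = "<|en|><|HAPPY|><|Speech|><|withitn|>Thanks for calling."
    result = parse_tagged(sample)
    print(result.tags)
    print(result.text)
```

Keeping the tags as an ordered list rather than fixed fields avoids assuming how many labels the model emits or in what order.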
Why consider it
- SenseVoice is listed under AI Video and tagged Video editing, Social media, and Workflows.
- The public repository has 8,701 stars, which gives buyers and builders an extra adoption signal.
- License metadata reads NOASSERTION, meaning no standard license was detected automatically; check the repository's license file before use.
Source & verification
- Verified on Jun 27, 2026 from public source metadata.
- Primary reference: github.com.
- Repository freshness signal: last commit Jun 21, 2026.
Alternative tools
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.
autoclip
AI Video
AutoClip: AI-powered video clipping and highlight generation; a tool for intelligent highlight extraction and editing of derivative content.
FunClip
AI Video
Open-source, accurate and easy-to-use video speech recognition & clipping tool. LLM-based AI clipping integrated.