Releases: k2-fsa/sherpa-onnx
Releases · k2-fsa/sherpa-onnx
v1.12.6
What's Changed
- Support silero-vad v4 exported by k2-fsa by @csukuangfj in #2372
- Support ten vad by @csukuangfj in #2377
- Fix compile errors for Linux by @csukuangfj in #2378
- Add C API for ten-vad by @csukuangfj in #2379
- Add CXX API examples for ten-vad. by @csukuangfj in #2380
- Add JavaScript (WebAssembly) API for ten-vad by @csukuangfj in #2382
- Add JavaScript (node-addon) API for ten-vad by @csukuangfj in #2383
- Add Go API for ten-vad by @csukuangfj in #2384
- Add C# API for ten-vad by @csukuangfj in #2385
- Add Dart API for ten-vad by @csukuangfj in #2386
- Add Swift API for ten-vad by @csukuangfj in #2387
- Add Pascal API for ten-vad by @csukuangfj in #2388
- Add Java/Kotlin API and Android support for ten-vad by @csukuangfj in #2389
- Release v1.12.6 by @csukuangfj in #2390
Full Changelog: v1.12.5...v1.12.6
v1.12.5
What's Changed
- Fix typo CMAKE_EXECUTBLE_LINKER_FLAGS -> CMAKE_EXECUTABLE_LINKER_FLAGS by @linsui in #2344
- Fix testing dart packages by @csukuangfj in #2345
- fix(canary): use dynamo export, single input_ids and avoid 0/1 specialization by @lucaelin in #2348
- Fix TTS for UE by @csukuangfj in #2349
- Update readme to include https://github.com/mawwalker/stt-server by @csukuangfj in #2350
- Add meta data to NeMo canary ONNX models by @csukuangfj in #2351
- Update README to include https://github.com/bbeyondllove/asr_server by @csukuangfj in #2353
- Add C++ runtime and Python API for NeMo Canary models by @csukuangfj in #2352
- Add C/CXX/JavaScript API for NeMo Canary models by @csukuangfj in #2357
- Add Java and Kotlin API for NeMo Canary models by @csukuangfj in #2359
- Upload fp16 onnx model files for FireRedASR by @csukuangfj in #2360
- Fix nemo feature normalization in test code by @csukuangfj in #2361
- Refactor exporting NeMo models by @csukuangfj in #2362
- Add LODR support to online and offline recognizers by @vsd-vector in #2026
- Add CXX examples for NeMo TDT ASR. by @csukuangfj in #2363
- Add Pascal/Go/C#/Dart API for NeMo Canary ASR models by @csukuangfj in #2367
- Release v1.12.5 by @csukuangfj in #2368
New Contributors
Full Changelog: v1.12.4...v1.12.5
v1.12.4
What's Changed
- Refactor release scripts. by @csukuangfj in #2323
- Add TTS engline APKs for more models by @csukuangfj in #2327
- Fix static link without tts by @csukuangfj in #2328
- Fix VAD+ASR C++ example. by @csukuangfj in #2335
- 添加sherpa-onnx-streaming-zipformer-zh-int8-2025-06-30(type=1002) by @wenjietech in #2336
- Support non-streaming zipformer CTC ASR models by @csukuangfj in #2340
- Support linux aarch64 for Dart and Flutter by @csukuangfj in #2342
- Release v1.12.4 by @csukuangfj in #2343
New Contributors
- @wenjietech made their first contribution in #2336
Full Changelog: v1.12.3...v1.12.4
v1.12.3
What's Changed
- Show cmake debug information. by @csukuangfj in #2316
- Remove portaudio-go in Go API examples. by @csukuangfj in #2317
- Support zipformer CTC ASR with whisper features. by @csukuangfj in #2319
- Support Zipformer transducer ASR with whisper features. by @csukuangfj in #2321
- Release v1.12.3 by @csukuangfj in #2322
Full Changelog: v1.12.2...v1.12.3
v1.12.2
What's Changed
- Fix CI for windows by @csukuangfj in #2279
- Add jar for Java 24. by @csukuangfj in #2280
- Add Python API for source separation by @csukuangfj in #2283
- Add link to huggingface space for source separation. by @csukuangfj in #2284
- 解决windows编译isspace函数调用失败问题 by @GlocKieHuan in #2042
- update wasm/vad-asr/assets/README.md for more clear by @guoxiangyang in #2297
- Update TTS Engine APK to support multi-lang by @csukuangfj in #2294
- Add scripts for exporting Piper TTS models to sherpa-onnx by @csukuangfj in #2299
- 修复了C++链接错误,添加-lsherpa-onnx-cxx-api:Update sherpa-onnx-shared.pc.in by @DSOE1024 in #2300
- Fixes #2172 by @csukuangfj in #2301
- Refactor kokoro tts export by @csukuangfj in #2302
- Fix building for Pascal by @csukuangfj in #2305
- Support extra languages in multi-lang kokoro tts by @csukuangfj in #2303
- Update readme to include BreezeApp from MediaTek Research by @csukuangfj in #2313
- Add API to get version information by @csukuangfj in #2309
- Release v1.12.2 by @csukuangfj in #2314
New Contributors
- @GlocKieHuan made their first contribution in #2042
- @guoxiangyang made their first contribution in #2297
- @DSOE1024 made their first contribution in #2300
Full Changelog: v1.12.1...v1.12.2
v1.12.1
What's Changed
- Use jlong explicitly in jni. by @csukuangfj in #2229
- Fix building rknn wheels by @csukuangfj in #2233
- Fix publishing binaries for RKNN by @csukuangfj in #2234
- Export spleeter model to onnx for source separation by @csukuangfj in #2237
- Add C++ runtime for spleeter about source separation by @csukuangfj in #2242
- Add include headers for ANDROID_API,OHOS by @f2janyway in #2251
- JAVA-API: Manual Library Loading Support for Restricted Environments by @Skepller in #2253
- Build APK with replace.fst by @csukuangfj in #2254
- repair rknn wheels by @csukuangfj in #2257
- Update kaldi-native-fbank. by @csukuangfj in #2259
- Fix building sherpa-onnx by @csukuangfj in #2262
- Fix building MFC examples by @csukuangfj in #2263
- Add UVR models for source separation. by @csukuangfj in #2266
- move portaudio common record code to microphone by @mtdxc in #2264
- fixed mfc build error by @mtdxc in #2267
- Add C++ support for UVR models by @csukuangfj in #2269
- Export nvidia/canary-180m-flash to sherpa-onnx by @csukuangfj in #2272
- Update utils.dart by @flying-forever in #2275
- Fix using rknn in multi-threads by @csukuangfj in #2274
- Fix 32-bit arm CI by @csukuangfj in #2276
- Release v1.12.1 by @csukuangfj in #2277
New Contributors
- @f2janyway made their first contribution in #2251
- @Skepller made their first contribution in #2253
- @mtdxc made their first contribution in #2264
- @flying-forever made their first contribution in #2275
Full Changelog: v1.12.0...v1.12.1
source-separation-models
audio_example.wav
is converted from
https://github.com/deezer/spleeter/blob/v1.4.0/audio_example.mp3
with the following command
sox audio_example.mp3 audio_example.wav
UVR models are converted from
https://github.com/TRvlvr/model_repo/releases/tag/all_public_uvr_models
using the code from
https://github.com/k2-fsa/sherpa-onnx/tree/master/scripts/uvr_mdx
v1.12.0
What's Changed
- Fix building wheels for macOS by @csukuangfj in #2192
- Show verbose logs in homophone replacer by @csukuangfj in #2194
- Fix displaying streaming speech recognition results for Python. by @csukuangfj in #2196
- Add real-time speech recognition example for SenseVoice. by @csukuangfj in #2197
- docs: add Open-XiaoAI KWS project by @idootop in #2198
- Add C++ example for streaming ASR with SenseVoice. by @csukuangfj in #2199
- Add C++ example for real-time ASR with nvidia/parakeet-tdt-0.6b-v2. by @csukuangfj in #2201
- Add a link to YouTube video including sherpa-onnx. by @csukuangfj in #2202
- Support sending is_eof for online websocket server. by @csukuangfj in #2204
- Add alsa-based streaming ASR example for sense voice. by @csukuangfj in #2207
- Support homophone replacer in Android asr demo. by @csukuangfj in #2210
- Add a Go implementation of the TTS generation callback by @xiaokuang95 in #2213
- Add Android demo for real-time ASR with non-streaming ASR models. by @csukuangfj in #2214
- Expose dither for JNI by @esavin in #2215
- Add nodejs example for parakeet-tdt-0.6b-v2. by @csukuangfj in #2219
- Add script to build APK for simulated-streaming-asr. by @csukuangfj in #2220
- Release v1.12.0 by @csukuangfj in #2221
New Contributors
Full Changelog: v1.11.5...v1.12.0
v1.11.5
What's Changed
- export parakeet-tdt-0.6b-v2 to sherpa-onnx by @csukuangfj in #2180
- Add C++ runtime for parakeet-tdt-0.6b-v2. by @csukuangfj in #2181
- Avoid NaN in feature normalization. by @csukuangfj in #2186
- Release v1.11.5 by @csukuangfj in #2187
Full Changelog: v1.11.4...v1.11.5
v1.11.4
What's Changed
- Disable strict hotword matching mode for offline transducer by @vsd-vector in #1837
- Comment refinement: Add note about vocoder file for matcha TTS config by @HaoWang0101 in #2106
- Fix a typo in the JNI for Android. by @csukuangfj in #2108
- Generate subtitles with FireRedAsr models by @csukuangfj in #2112
- Use manylinux_2_28_x86_64 to build linux gpu for sherpa-onnx by @csukuangfj in #2123
- Support running sherpa-onnx with RK NPU on Android by @csukuangfj in #2124
- Fix building for HarmonyOS by @csukuangfj in #2125
- cmake build, configurable from env by @KarelVesely84 in #2115
- Expose dither in python API by @nshmyrev in #2127
- Add support for GigaAM-CTC-v2 by @rominf in #2135
- Support Giga AM transducer V2 by @csukuangfj in #2136
- Export kokoro 1.0 int8 models by @csukuangfj in #2137
- Upload more onnx ASR models by @csukuangfj in #2141
- Fix building for open harmonyOS by @csukuangfj in #2142
- online-transducer: reset the encoder toghter with 2 previous output symbols (non-blank) by @KarelVesely84 in #2129
- Fix punctuations for kokoro tts 1.1-zh. by @csukuangfj in #2146
- Fix setting OnlineModelConfig in Java API by @csukuangfj in #2147
- Support decoding multiple streams in Java API. by @csukuangfj in #2149
- Support replacing homonphonic phrases by @csukuangfj in #2153
- Add C and CXX API for homophone replacer by @csukuangfj in #2156
- Add JavaScript API (WASM) for homophone replacer by @csukuangfj in #2157
- Add JavaScript API (node-addon) for homophone replacer by @csukuangfj in #2158
- Fix building without TTS by @csukuangfj in #2159
- Add homonphone replacer example for Python API. by @csukuangfj in #2161
- More fix for building without tts by @csukuangfj in #2162
- Add Swift API for homophone replacer. by @csukuangfj in #2164
- Add C# API for homophone replacer by @csukuangfj in #2165
- Add Kotlin and Java API for homophone replacer by @csukuangfj in #2166
- Add Dart API for homophone replacer by @csukuangfj in #2167
- Add Go API for homophone replacer by @csukuangfj in #2168
- Release v1.11.4 by @csukuangfj in #2169
New Contributors
- @HaoWang0101 made their first contribution in #2106
Full Changelog: v1.11.3...v1.11.4