Releases · k2-fsa/sherpa-onnx

v1.12.6

What's Changed

Support silero-vad v4 exported by k2-fsa by @csukuangfj in #2372
Support ten vad by @csukuangfj in #2377
Fix compile errors for Linux by @csukuangfj in #2378
Add C API for ten-vad by @csukuangfj in #2379
Add CXX API examples for ten-vad. by @csukuangfj in #2380
Add JavaScript (WebAssembly) API for ten-vad by @csukuangfj in #2382
Add JavaScript (node-addon) API for ten-vad by @csukuangfj in #2383
Add Go API for ten-vad by @csukuangfj in #2384
Add C# API for ten-vad by @csukuangfj in #2385
Add Dart API for ten-vad by @csukuangfj in #2386
Add Swift API for ten-vad by @csukuangfj in #2387
Add Pascal API for ten-vad by @csukuangfj in #2388
Add Java/Kotlin API and Android support for ten-vad by @csukuangfj in #2389
Release v1.12.6 by @csukuangfj in #2390

Full Changelog: v1.12.5...v1.12.6

v1.12.5

What's Changed

Fix typo CMAKE_EXECUTBLE_LINKER_FLAGS -> CMAKE_EXECUTABLE_LINKER_FLAGS by @linsui in #2344
Fix testing dart packages by @csukuangfj in #2345
fix(canary): use dynamo export, single input_ids and avoid 0/1 specialization by @lucaelin in #2348
Fix TTS for UE by @csukuangfj in #2349
Update readme to include https://github.com/mawwalker/stt-server by @csukuangfj in #2350
Add meta data to NeMo canary ONNX models by @csukuangfj in #2351
Update README to include https://github.com/bbeyondllove/asr_server by @csukuangfj in #2353
Add C++ runtime and Python API for NeMo Canary models by @csukuangfj in #2352
Add C/CXX/JavaScript API for NeMo Canary models by @csukuangfj in #2357
Add Java and Kotlin API for NeMo Canary models by @csukuangfj in #2359
Upload fp16 onnx model files for FireRedASR by @csukuangfj in #2360
Fix nemo feature normalization in test code by @csukuangfj in #2361
Refactor exporting NeMo models by @csukuangfj in #2362
Add LODR support to online and offline recognizers by @vsd-vector in #2026
Add CXX examples for NeMo TDT ASR. by @csukuangfj in #2363
Add Pascal/Go/C#/Dart API for NeMo Canary ASR models by @csukuangfj in #2367
Release v1.12.5 by @csukuangfj in #2368

New Contributors

@linsui made their first contribution in #2344
@lucaelin made their first contribution in #2348

Full Changelog: v1.12.4...v1.12.5

v1.12.4

What's Changed

Refactor release scripts. by @csukuangfj in #2323
Add TTS engine APKs for more models by @csukuangfj in #2327
Fix static link without tts by @csukuangfj in #2328
Fix VAD+ASR C++ example. by @csukuangfj in #2335
Add sherpa-onnx-streaming-zipformer-zh-int8-2025-06-30 (type=1002) by @wenjietech in #2336
Support non-streaming zipformer CTC ASR models by @csukuangfj in #2340
Support linux aarch64 for Dart and Flutter by @csukuangfj in #2342
Release v1.12.4 by @csukuangfj in #2343

New Contributors

@wenjietech made their first contribution in #2336

Full Changelog: v1.12.3...v1.12.4

v1.12.3

What's Changed

Show cmake debug information. by @csukuangfj in #2316
Remove portaudio-go in Go API examples. by @csukuangfj in #2317
Support zipformer CTC ASR with whisper features. by @csukuangfj in #2319
Support Zipformer transducer ASR with whisper features. by @csukuangfj in #2321
Release v1.12.3 by @csukuangfj in #2322

Full Changelog: v1.12.2...v1.12.3

v1.12.2

What's Changed

Fix CI for windows by @csukuangfj in #2279
Add jar for Java 24. by @csukuangfj in #2280
Add Python API for source separation by @csukuangfj in #2283
Add link to huggingface space for source separation. by @csukuangfj in #2284
Fix isspace function call failure when compiling on Windows by @GlocKieHuan in #2042
Update wasm/vad-asr/assets/README.md for clarity by @guoxiangyang in #2297
Update TTS Engine APK to support multi-lang by @csukuangfj in #2294
Add scripts for exporting Piper TTS models to sherpa-onnx by @csukuangfj in #2299
Fix C++ link error by adding -lsherpa-onnx-cxx-api: Update sherpa-onnx-shared.pc.in by @DSOE1024 in #2300
Fixes #2172 by @csukuangfj in #2301
Refactor kokoro tts export by @csukuangfj in #2302
Fix building for Pascal by @csukuangfj in #2305
Support extra languages in multi-lang kokoro tts by @csukuangfj in #2303
Update readme to include BreezeApp from MediaTek Research by @csukuangfj in #2313
Add API to get version information by @csukuangfj in #2309
Release v1.12.2 by @csukuangfj in #2314

New Contributors

@GlocKieHuan made their first contribution in #2042
@guoxiangyang made their first contribution in #2297
@DSOE1024 made their first contribution in #2300

Full Changelog: v1.12.1...v1.12.2

v1.12.1

What's Changed

Use jlong explicitly in jni. by @csukuangfj in #2229
Fix building rknn wheels by @csukuangfj in #2233
Fix publishing binaries for RKNN by @csukuangfj in #2234
Export spleeter model to onnx for source separation by @csukuangfj in #2237
Add C++ runtime for spleeter source separation by @csukuangfj in #2242
Add include headers for ANDROID_API,OHOS by @f2janyway in #2251
JAVA-API: Manual Library Loading Support for Restricted Environments by @Skepller in #2253
Build APK with replace.fst by @csukuangfj in #2254
repair rknn wheels by @csukuangfj in #2257
Update kaldi-native-fbank. by @csukuangfj in #2259
Fix building sherpa-onnx by @csukuangfj in #2262
Fix building MFC examples by @csukuangfj in #2263
Add UVR models for source separation. by @csukuangfj in #2266
move portaudio common record code to microphone by @mtdxc in #2264
fixed mfc build error by @mtdxc in #2267
Add C++ support for UVR models by @csukuangfj in #2269
Export nvidia/canary-180m-flash to sherpa-onnx by @csukuangfj in #2272
Update utils.dart by @flying-forever in #2275
Fix using rknn in multi-threads by @csukuangfj in #2274
Fix 32-bit arm CI by @csukuangfj in #2276
Release v1.12.1 by @csukuangfj in #2277

New Contributors

@f2janyway made their first contribution in #2251
@Skepller made their first contribution in #2253
@mtdxc made their first contribution in #2264
@flying-forever made their first contribution in #2275

Full Changelog: v1.12.0...v1.12.1

source-separation-models

audio_example.wav is converted from
https://github.com/deezer/spleeter/blob/v1.4.0/audio_example.mp3
with the following command

sox audio_example.mp3 audio_example.wav
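
For anyone scripting this conversion, here is a minimal Python sketch that wraps the same sox call. The helper names build_sox_command and convert_with_sox are hypothetical and not part of sherpa-onnx. The sketch assumes sox is installed and on PATH; decoding MP3 input typically also requires a sox build with MP3 support.

```python
import shutil
import subprocess
from pathlib import Path


def build_sox_command(src: str, dst: str) -> list[str]:
    """Return the argv for `sox SRC DST` (hypothetical helper, same call as the command above)."""
    return ["sox", src, dst]


def convert_with_sox(src: str, dst: str) -> Path:
    """Convert src to dst by invoking sox; raises if sox is missing or exits non-zero."""
    if shutil.which("sox") is None:
        raise FileNotFoundError("sox executable not found on PATH")
    # check=True turns a non-zero sox exit status into CalledProcessError.
    subprocess.run(build_sox_command(src, dst), check=True)
    return Path(dst)
```

Calling convert_with_sox("audio_example.mp3", "audio_example.wav") would then run the command shown above.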

UVR models are converted from
https://github.com/TRvlvr/model_repo/releases/tag/all_public_uvr_models
using the code from
https://github.com/k2-fsa/sherpa-onnx/tree/master/scripts/uvr_mdx

v1.12.0

What's Changed

Fix building wheels for macOS by @csukuangfj in #2192
Show verbose logs in homophone replacer by @csukuangfj in #2194
Fix displaying streaming speech recognition results for Python. by @csukuangfj in #2196
Add real-time speech recognition example for SenseVoice. by @csukuangfj in #2197
docs: add Open-XiaoAI KWS project by @idootop in #2198
Add C++ example for streaming ASR with SenseVoice. by @csukuangfj in #2199
Add C++ example for real-time ASR with nvidia/parakeet-tdt-0.6b-v2. by @csukuangfj in #2201
Add a link to YouTube video including sherpa-onnx. by @csukuangfj in #2202
Support sending is_eof for online websocket server. by @csukuangfj in #2204
Add alsa-based streaming ASR example for sense voice. by @csukuangfj in #2207
Support homophone replacer in Android asr demo. by @csukuangfj in #2210
Add a Go implementation of the TTS generation callback by @xiaokuang95 in #2213
Add Android demo for real-time ASR with non-streaming ASR models. by @csukuangfj in #2214
Expose dither for JNI by @esavin in #2215
Add nodejs example for parakeet-tdt-0.6b-v2. by @csukuangfj in #2219
Add script to build APK for simulated-streaming-asr. by @csukuangfj in #2220
Release v1.12.0 by @csukuangfj in #2221

New Contributors

@idootop made their first contribution in #2198
@esavin made their first contribution in #2215

Full Changelog: v1.11.5...v1.12.0

v1.11.5

What's Changed

export parakeet-tdt-0.6b-v2 to sherpa-onnx by @csukuangfj in #2180
Add C++ runtime for parakeet-tdt-0.6b-v2. by @csukuangfj in #2181
Avoid NaN in feature normalization. by @csukuangfj in #2186
Release v1.11.5 by @csukuangfj in #2187

Full Changelog: v1.11.4...v1.11.5

v1.11.4

What's Changed

Disable strict hotword matching mode for offline transducer by @vsd-vector in #1837
Comment refinement: Add note about vocoder file for matcha TTS config by @HaoWang0101 in #2106
Fix a typo in the JNI for Android. by @csukuangfj in #2108
Generate subtitles with FireRedAsr models by @csukuangfj in #2112
Use manylinux_2_28_x86_64 to build linux gpu for sherpa-onnx by @csukuangfj in #2123
Support running sherpa-onnx with RK NPU on Android by @csukuangfj in #2124
Fix building for HarmonyOS by @csukuangfj in #2125
cmake build, configurable from env by @KarelVesely84 in #2115
Expose dither in python API by @nshmyrev in #2127
Add support for GigaAM-CTC-v2 by @rominf in #2135
Support Giga AM transducer V2 by @csukuangfj in #2136
Export kokoro 1.0 int8 models by @csukuangfj in #2137
Upload more onnx ASR models by @csukuangfj in #2141
Fix building for open harmonyOS by @csukuangfj in #2142
online-transducer: reset the encoder together with 2 previous output symbols (non-blank) by @KarelVesely84 in #2129
Fix punctuations for kokoro tts 1.1-zh. by @csukuangfj in #2146
Fix setting OnlineModelConfig in Java API by @csukuangfj in #2147
Support decoding multiple streams in Java API. by @csukuangfj in #2149
Support replacing homophonic phrases by @csukuangfj in #2153
Add C and CXX API for homophone replacer by @csukuangfj in #2156
Add JavaScript API (WASM) for homophone replacer by @csukuangfj in #2157
Add JavaScript API (node-addon) for homophone replacer by @csukuangfj in #2158
Fix building without TTS by @csukuangfj in #2159
Add homophone replacer example for Python API. by @csukuangfj in #2161
More fix for building without tts by @csukuangfj in #2162
Add Swift API for homophone replacer. by @csukuangfj in #2164
Add C# API for homophone replacer by @csukuangfj in #2165
Add Kotlin and Java API for homophone replacer by @csukuangfj in #2166
Add Dart API for homophone replacer by @csukuangfj in #2167
Add Go API for homophone replacer by @csukuangfj in #2168
Release v1.11.4 by @csukuangfj in #2169

New Contributors

@HaoWang0101 made their first contribution in #2106

Full Changelog: v1.11.3...v1.11.4
