Bug or a feature in streaming_utils.py BatchedFrameASRTDT? Is it possible to run parakeet-tdt-0.6b-v2 in streaming mode? #13763

SatenHarutyunyan · 2025-05-29T07:35:11Z

SatenHarutyunyan
May 29, 2025

In this huggingface discussion https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2/discussions/3
It is said that its possible to use parakeet tdt in streaming mode and an inference bug was detected and corrected. I run
this script but get empty strings as prediction.

# Evaluate Parakeet in streaming mode python examples/asr/asr_chunked_inference/rnnt/speech_to_text_buffered_infer_rnnt.py \ model_path="parakeet_downlaoded/parakeet-tdt-0.6b-v2.nemo" \ audio_dir="audio_dir" \ output_filename="output.json" \ chunk_len_in_secs=0.2 \ total_buffer_in_secs=0.8 \ model_stride=4 \ batch_size=1

Pulled from main, commit=259d684e73c45091f0b6144342133e6ceb7e824c installed with pip install '.[all]'

deepanshu-yadav · 2025-06-09T14:56:04Z

deepanshu-yadav
Jun 9, 2025

It is difficult to do so. But possible. Someone has converted to lightweight integer models https://github.com/k2-fsa/sherpa-onnx/tree/master/scripts/nemo/parakeet-tdt-0.6b-v2 . I made an example https://github.com/deepanshu-yadav/voice-form-filler on how to wrap this around a server and send audio packets and return the final inference. It works okayish on my laptop without any GPU.

0 replies

SatenHarutyunyan · 2025-06-10T04:28:50Z

SatenHarutyunyan
Jun 10, 2025
Author

There is a bug in the script
#13797
Thats why its not working. Will wait for the fix.

0 replies

csukuangfj · 2025-07-15T05:20:32Z

csukuangfj
Jul 15, 2025

Is it possible to run parakeet-tdt-0.6b-v2 in streaming mode?

Definitely possible, without any GPUs, on your phones.

You can find a pre-built Android APK for this model at
https://hf-mirror.com/csukuangfj/sherpa-onnx-apk/blob/main/vad-asr-simulated-streaming/1.12.6/sherpa-onnx-1.12.6-arm64-v8a-simulated_streaming_asr-en-parakeet_tdt_0.6b_v2.apk

More APKs are available at
https://k2-fsa.github.io/sherpa/onnx/android/apk-simulate-streaming-asr.html

Everything is open-sourced at
https://github.com/k2-fsa/sherpa-onnx

See also
https://k2-fsa.github.io/sherpa/onnx/pretrained_models/offline-transducer/nemo-transducer-models.html#sherpa-onnx-nemo-parakeet-tdt-0-6b-v2-int8-english

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bug or a feature in streaming_utils.py BatchedFrameASRTDT? Is it possible to run parakeet-tdt-0.6b-v2 in streaming mode? #13763

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 3 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Bug or a feature in streaming_utils.py BatchedFrameASRTDT? Is it possible to run parakeet-tdt-0.6b-v2 in streaming mode? #13763

Uh oh!

Uh oh!

SatenHarutyunyan May 29, 2025

Replies: 3 comments

Uh oh!

deepanshu-yadav Jun 9, 2025

Uh oh!

SatenHarutyunyan Jun 10, 2025 Author

Uh oh!

csukuangfj Jul 15, 2025

SatenHarutyunyan
May 29, 2025

deepanshu-yadav
Jun 9, 2025

SatenHarutyunyan
Jun 10, 2025
Author

csukuangfj
Jul 15, 2025