A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
-
Updated
Oct 22, 2024 - Python
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd like!
List of direct speech-to-speech translation papers.
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
Applying deep learning to translate animation and re-generate audio.
Official repository for our NeurIPS 2024 paper: DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
cascaded speech-to-speech translation (STST), mapping from source speech in any language to target speech in English
HF Space app for End-to-End Speech-to-Speech Translation from Spanish to English using ESPnet
Tool to generate English AI Dubbing for a YouTube video
Speech to Speech Translation Python
A comparison of E2E and Cascading S2ST systems on the CVSS-C Spanish to English dataset (CommonVoice 4.0)
Add a description, image, and links to the speech-to-speech-translation topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-speech-translation topic, visit your repo's landing page and select "manage topics."