Audio-WestlakeU
Repositories
- Mel-McNet Public
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
- ATST-SED Public
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".
- VINP Public
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification'
- CleanMel Public
PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
- RealMAN Public
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
- RVAE-EM Public
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]
- FS-EEND Public
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors" [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction".
- NBSS Public
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation
- UMA-ASR Public
This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).
- FN-SSL Public
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]