Audio-WestlakeU

Mel-McNet Public
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]

Audio-WestlakeU/Mel-McNet’s past year of commit activity

12 0 0 0 Updated Jun 9, 2025
ATST-SED Public
This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Audio-WestlakeU/ATST-SED’s past year of commit activity

Jupyter Notebook 132 MIT 13 1 0 Updated Jun 5, 2025
VINP Public
Official PyTorch implementation of 'VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification'

Audio-WestlakeU/VINP’s past year of commit activity

Python 18 MIT 3 0 0 Updated May 22, 2025
CleanMel Public
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".

Audio-WestlakeU/CleanMel’s past year of commit activity

Python 60 Apache-2.0 3 0 0 Updated May 22, 2025
RealMAN Public
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Audio-WestlakeU/RealMAN’s past year of commit activity

Python 128 14 4 0 Updated Apr 29, 2025
RVAE-EM Public
Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutive transfer function" [ICASSP2024]

Audio-WestlakeU/RVAE-EM’s past year of commit activity

Python 45 MIT 5 0 0 Updated Mar 6, 2025
FS-EEND Public
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction"

Audio-WestlakeU/FS-EEND’s past year of commit activity

Python 136 MIT 6 5 0 Updated Mar 4, 2025
NBSS Public
The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Audio-WestlakeU/NBSS’s past year of commit activity

Python 278 MIT 34 24 0 Updated Jan 1, 2025
UMA-ASR Public
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).

Audio-WestlakeU/UMA-ASR’s past year of commit activity

Shell 28 5 2 0 Updated Dec 17, 2024
FN-SSL Public
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Audio-WestlakeU/FN-SSL’s past year of commit activity

Python 108 14 2 0 Updated Dec 9, 2024

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Audio-WestlakeU

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!