This project proposes an approach for word-level stress identification from speech using prosodic features, assuming that the corresponding transcribed text is available.
The dataset consists of a CSV file with the following columns:
- 📌 Audio Path – Path to the audio file
- 📜 Transcribed Text – Manually transcribed speech
- 🔤 Stress Labels – Word-level stress annotations (e.g., stressed/unstressed)
You can access the dataset here:
👉 Raw audio files for training
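As a quick orientation, here is a minimal sketch of loading and inspecting this CSV with pandas. The file name and column names used below are assumptions for illustration; adjust them to match the actual headers in the dataset.

```python
# Minimal sketch of loading the dataset CSV with pandas.
# NOTE: the file name and column names are assumptions; match them to the real CSV.
import pandas as pd

df = pd.read_csv("stress_dataset.csv")  # assumed file name

print(df.columns.tolist())  # inspect the actual column names
print(df.head())

# Pair each audio file with its transcript and word-level stress labels.
for _, row in df.iterrows():
    audio_path = row["audio_path"]        # assumed column name
    text = row["transcribed_text"]        # assumed column name
    stress_labels = row["stress_labels"]  # assumed column name, e.g. "1 0 0 1"
```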
File / Directory | Description |
---|---|
setup_env.sh | Shell script to set up the development and training environment |
config.py | Contains all configuration parameters (paths, hyperparameters, etc.) |
dataset.py | Defines a custom PyTorch-compatible dataset class for loading and preprocessing audio data in NeMo-ASR-compatible format |
model.py | Contains the model architecture for stress classification |
train_test.py | Includes PyTorch training and evaluation loop logic |
utils.py | Utility functions for audio loading and prosodic feature extraction |
stress_classification_model.ipynb | Jupyter Notebook entry point to train and test the stress classifier |
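To give a flavor of the prosodic features that utils.py works with, the sketch below extracts frame-level pitch (F0) and RMS energy with `librosa`. It is an illustrative example with assumed names and defaults, not the repository's actual API.

```python
# Illustrative sketch of prosodic feature extraction (pitch + energy).
# The function name and defaults are assumptions, not the actual utils.py API.
import librosa
import numpy as np

def extract_prosodic_features(audio_path: str, sr: int = 16000):
    """Return frame-level F0 (pitch) and RMS energy for one utterance."""
    y, sr = librosa.load(audio_path, sr=sr)

    # Fundamental frequency via probabilistic YIN; unvoiced frames come back as NaN.
    f0, voiced_flag, voiced_prob = librosa.pyin(
        y, fmin=librosa.note_to_hz("C2"), fmax=librosa.note_to_hz("C7"), sr=sr
    )
    f0 = np.nan_to_num(f0)  # zero out unvoiced frames

    # Frame-level RMS energy.
    energy = librosa.feature.rms(y=y)[0]

    return f0, energy
```

Stressed syllables typically show higher pitch, energy, and duration than unstressed ones, which is why frame-level features like these are useful inputs to the classifier.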
- Clone the repository:
  `git clone <repo_url>`
  `cd <repo_directory>`
- Set up the environment:
  - Create a new conda env with Python 3.10.
  - Make the setup script executable and run it:
    `chmod +x setup_env.sh`
    `./setup_env.sh`
- Run the notebook:
  - Open and run `stress_classification_model.ipynb` to train the model.
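For readers who want to see the overall training logic outside the notebook, here is a conceptual, self-contained PyTorch sketch of a binary stressed/unstressed classifier training loop. The feature dimension, model, and dummy data are placeholders; the actual pipeline lives in dataset.py, model.py, and train_test.py.

```python
# Conceptual sketch of a word-level stress classification training loop.
# Dummy data and model; real shapes and modules come from the repository code.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# 256 words, each represented by a 32-dim prosodic feature vector (assumed shape),
# with a binary stressed/unstressed label.
features = torch.randn(256, 32)
labels = torch.randint(0, 2, (256, 1)).float()
loader = DataLoader(TensorDataset(features, labels), batch_size=16, shuffle=True)

model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 1))
criterion = nn.BCEWithLogitsLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

for epoch in range(5):
    total_loss = 0.0
    for x, y in loader:
        optimizer.zero_grad()
        loss = criterion(model(x), y)
        loss.backward()
        optimizer.step()
        total_loss += loss.item()
    print(f"epoch {epoch + 1}: loss = {total_loss / len(loader):.4f}")
```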