NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment (Python, updated Mar 8, 2024)
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Reading list for research topics in Sound AI
[ICASSP 2023] Official TensorFlow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager
Face Recognition in real-world images [ICASSP 2017]
[ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs
This repository implements the HiPAMA architecture, introduced in the paper "Hierarchical Pronunciation Assessment with Multi-Aspect Attention" (ICASSP 2023).
SERAB: a multi-lingual benchmark for speech emotion recognition
Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE).
ICASSP 2019 official LaTeX template
ICASSP 2021: Scene Completeness-Aware Lidar Depth Completion for Driving Scenario
Continual Learning Benchmark for Spoken Keyword Spotting
A regularized version of RBM for unsupervised feature selection.
The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"
2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification