Build software better, together

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

text-to-speech deep-learning pytorch tts speech-synthesis voice-conversion icassp speech-quality quality-of-experience interspeech

Updated Mar 8, 2024
Python

sibozhang / Text2Video

Star

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

avatar video deep-learning tts speech-synthesis gan digital-humanities metaverse talking icassp virtual-humans vid2vid text-to-video talking-head aigc talking-heads talking-face-generation generative-ai

Updated Jun 4, 2023
Python

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated Oct 31, 2024
Python

IBM / TabFormer

Star

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

machine-learning tabular-data pytorch artificial-intelligence transformer gpt bert fraud-detection icassp huggingface credit-card-dataset prsa-dataset credit-card-transaction icassp2021

Updated Aug 12, 2023
Python

soham97 / awesome-sound_event_detection

Star

Reading list for research topics in Sound AI

representation-learning audio-processing zero-shot-learning icassp sound-event-detection interspeech acoustic-scene-classification audio-captioning audio-generation audio-retrieval

Updated Aug 8, 2024

Jiaxin-Ye / TIM-Net_SER

Star

[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

bi-directional emotion-recognition casia icassp emodb speech-emotion-recognition iemocap ravdess savee emovo

Updated May 15, 2024
Python

glam-imperial / EmotionalConversionStarGAN

Star

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

deep-neural-networks deep-learning speech-synthesis generative-adversarial-network data-augmentation emotion-recognition icassp stargan imperial-college-london stargan-vc icassp-2020 augsburg-university imperial-glam

Updated Oct 24, 2021
Python

DmitryRyumin / NewEraAI-Papers

Star

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!

natural-language-processing computer-vision deep-learning text-classification signal-processing image-processing artificial-intelligence video-processing neural-networks emnlp cvpr iccv icassp ismir interspeech mashine-learning

Updated May 18, 2024
Python

XuesongYang / end2end_dialog

Star

ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager

icassp

Updated Jul 7, 2017
Python

fonfonx / FaceRecognition

Star

Face Recognition in real-world images [ICASSP 2017]

python opencv face-recognition landmarks sparse-coding rsc real-world-images lfw icassp

Updated Feb 27, 2017
Python

30stomercury / Interaction-Aware-Attention-Network

Star

[ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs

tensorflow emotion-recognition icassp speech-emotion-recognition icassp-2019

Updated May 17, 2020
Python

doheejin / HiPAMA

Star

This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Multi-Aspect Attention (ICASSP 2023).