/ˈaː.ri/

Aditya

Arie

Nugraha

Research Scientist @ RIKEN-AIP (Japan) — Doctorate in Informatics from the University of Lorraine (France)

prof_pic.jpg

A1-215, Kyotodaigaku-Katsura, Nishikyo-ku, Kyoto 615-8510, Japan

I am a research scientist in the Music Information Intelligence Team at the RIKEN Center for Advanced Intelligence Project (RIKEN-AIP) and a visiting researcher in the Spatio-Temporal Sensing Laboratory at Kyoto University.

Before joining the Music Information Intelligence Team in April 2026, I was a member of the Sound Scene Understanding Team at RIKEN-AIP until its closing in March 2026.

I received a Doctorate in Informatics from the University of Lorraine, France, for my doctoral research on deep neural network-based multichannel audio source separation. The research was conducted at Inria Nancy – Grand-Est, France, under the supervision of Dr. Emmanuel Vincent and Dr. Antoine Liutkus. My doctoral thesis covered applications of source separation to speech enhancement, singing voice separation, musical instrument separation, and noise-robust speech recognition.

My current research interests include audio source separation, speech enhancement, spatial audio, audio-visual scene understanding, and machine learning, with a particular interest in probabilistic modeling, deep learning, and physics-informed approaches to audio intelligence.

news

May 4, 2026 Three Papers at ICASSP 2026: From Missing Samples to Personalized 3D Audio
Apr 1, 2026 A New Chapter at RIKEN-AIP: From Sound Scene Understanding to Music Information Intelligence
Jun 13, 2025 We proudly showcased our AV-SUARA system during a live demonstration at the IPSJ Otogaku Symposium 2025 at Waseda University.
Oct 24, 2023 Our paper “Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning” was presented at IEEE WASPAA 2023.
Aug 20, 2023 Our team provided a tutorial entitled “Foundations, Extensions and Applications of Statistical Multichannel Speech Separation Models” at Interspeech 2023.
Jun 8, 2023 Our paper “Exploiting Sparse Recovery Algorithms for Semi-Supervised Training of Deep Neural Networks for Direction-of-Arrival Estimation” was presented at IEEE ICASSP 2023.
Oct 26, 2022 Our paper “Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments” was presented at IEEE/RSJ IROS 2022.
Sep 21, 2022 Our paper “Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments” was presented at Interspeech 2022.
Sep 7, 2022 We presented two papers at IWAENC 2022: ① “DNN-free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF” and ② “Joint Localization and Synchronization of Distributed Camera-Attached Microphone Arrays for Indoor Scene Analysis”.
May 7, 2022 Our Sound Scene Understanding Team presented two papers at IEEE ICASSP 2022: ① “Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation” and ② “Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation”.

selected publications

  1. ICASSP
    Sampling-Rate-Agnostic Speech Super-Resolution Based on Gaussian Process Dynamical Systems with Deep Kernel Learning
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
  2. WASPAA
    Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning
    In Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
  3. IROS
    Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
    Kouhei Sekiguchi*, Aditya Arie Nugraha*, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
  4. IWAENC
    DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), 2022
  5. ICASSP
    Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
  6. SPL
    Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation
    IEEE Signal Processing Letters, 2021
  7. TASLP

    15th IEEE Signal Processing Society (SPS) Japan Student Journal Paper Award

    Fast Multichannel Nonnegative Matrix Factorization with Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
  8. SPL
    Flow-Based Independent Vector Analysis for Blind Source Separation
    IEEE Signal Processing Letters, 2020
  9. CSL

    ISCA Award for the Best Review Paper published in Computer Speech and Language (2016-2020)

    An analysis of environment, microphone and data simulation mismatches in robust speech recognition
    Computer Speech & Language, 2017
  10. TASLP

    6th IEEE Signal Processing Society (SPS) Japan Young Author Best Paper Award

    Multichannel audio source separation with deep neural networks
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016