Aditya /ˈaː.ri/ Arie Nugraha

Research Scientist @ RIKEN-AIP (Japan) — Doctorate in Informatics from the University of Lorraine (France)

prof_pic.jpg

A1-215, Kyotodaigaku-Katsura, Nishikyo-ku, Kyoto 615-8510, Japan

I am a research scientist in the Music Information Intelligence Team at the RIKEN Center for Advanced Intelligence Project (RIKEN-AIP) and a visiting researcher in the Spatio-Temporal Sensing Laboratory at Kyoto University.

Before joining the Music Information Intelligence Team in April 2026, I was a member of the Sound Scene Understanding Team at RIKEN-AIP until its closing in March 2026.

I received a Doctorate in Informatics from the University of Lorraine, France, for my doctoral research on deep neural network-based multichannel audio source separation. The research was conducted at Inria Nancy – Grand-Est, France, under the supervision of Dr. Emmanuel Vincent and Dr. Antoine Liutkus. My doctoral thesis covered applications of source separation to speech enhancement, singing voice separation, musical instrument separation, and noise-robust speech recognition.

My current research interests include audio source separation, speech enhancement, spatial audio, audio-visual scene understanding, and machine learning, with a particular interest in probabilistic modeling, deep learning, and physics-informed approaches to audio intelligence.

news

selected publications

  1. ICASSP
    Sampling-Rate-Agnostic Speech Super-Resolution Based on Gaussian Process Dynamical Systems with Deep Kernel Learning
    Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2026
  2. WASPAA
    Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning
    Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2023
  3. ICASSP
    Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022
  4. IWAENC
    DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2022
  5. IROS
    Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
    Kouhei Sekiguchi*, Aditya Arie Nugraha*, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Oct 2022
  6. SPL
    Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation
    Yoshiaki Bando, Kouhei Sekiguchi, Yoshiki Masuyama, Aditya Arie Nugraha, Mathieu Fontaine, and Kazuyoshi Yoshii
    IEEE Signal Processing Letters, Aug 2021
  7. SPL
    Flow-Based Independent Vector Analysis for Blind Source Separation
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    IEEE Signal Processing Letters, 2020
  8. TASLP
    Fast Multichannel Nonnegative Matrix Factorization with Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation
    Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Kazuyoshi Yoshii, and Tatsuya Kawahara
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Aug 2020
  9. CSL
    An analysis of environment, microphone and data simulation mismatches in robust speech recognition
    Emmanuel Vincent, Shinji Watanabe, Aditya Arie Nugraha, Jon Barker, and Ricard Marxer
    Computer Speech & Language, Nov 2017
  10. TASLP
    Multichannel audio source separation with deep neural networks
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Sep 2016