publications

This page lists publications in reversed chronological order. Asterisk symbols (*) indicate authors who contributed equally to an article. An up-to-date list is available on Google Scholar.

2026

  1. ICASSP
    Sampling-Rate-Agnostic Speech Super-Resolution Based on Gaussian Process Dynamical Systems with Deep Kernel Learning
    Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2026
  2. ICASSP
    SIRUP: A Diffusion-Based Virtual Upmixer of Steering Vectors for Highly-Directive Spatialization with First-Order Ambisonics
    Emilio Picard, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2026
  3. ICASSP
    Physics-Informed Learning of Neural Scattering Fields Towards Measurement-Free Mesh-To-HRTF Estimation
    Tancrède Martinez, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2026

2025

  1. APSIPA
    Visually-Informed Multichannel Sound Source Separation Based on 3D Gaussian Primitives
    Haruaki Asano, Ryunosuke Nihei, Yoshiaki Bando, Aditya Arie Nugraha, Diego Di Carlo, Hiroyuki Ueda, Yosuke Ito, and Kazuyoshi Yoshii
    In Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Oct 2025
  2. EUSIPCO
    SHAMaNS: Sound Localization with Hybrid Alpha-Stable Spatial Measure and Neural Steerer
    Diego Di Carlo, Mathieu Fontaine, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), Sep 2025
  3. WASPAA
    Physically Informed Spatial Regularization for Sound Event Localization and Detection
    Haocheng Liu, Diego Di Carlo, Aditya Arie Nugraha, Kazuyoshi Yoshii, Gaël Richard, and Mathieu Fontaine
    In Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2025
  4. APSIPA
    Joint Separation and Tracking of Moving Sources with Distributed Microphone Arrays Based on Time-Varying Inertial Spatial Models
    Ryunosuke Nihei, Yoshiaki Bando, Aditya Arie Nugraha, Diego Di Carlo, Hiroyuki Ueda, Yosuke Ito, and Kazuyoshi Yoshii
    In Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Oct 2025

2024

  1. ICASSPW
    Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Direction
    Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing Workshops (ICASSPW), Apr 2024
  2. APSIPA
    Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising
    Yoto Fujita, Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Dec 2024
  3. Interspeech
    RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation
    Liam Kelley, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), Sep 2024
  4. IWAENC
    Joint Audio Source Localization and Separation with Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF
    Yoshiaki Sumura, Diego Di Carlo, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2024

2023

  1. ICASSP
    Exploiting Sparse Recovery Algorithms for Semi-Supervised Training of Deep Neural Networks for Direction-of-Arrival Estimation
    Murtiza Ali, Aditya Arie Nugraha, and Karan Nathwani
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023
  2. EUSIPCO
    Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation
    Yoshiaki Bando, Yoshiki Masuyama, Aditya Arie Nugraha, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), Sep 2023
  3. WASPAA
    Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning
    Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2023

2022

  1. TASLP
    Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
    Mathieu Fontaine, Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, May 2022
  2. TASLP
    Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
    Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii, and Tatsuya Kawahara
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Jul 2022
  3. Interspeech
    Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
    Yicheng Du*, Aditya Arie Nugraha*, Kouhei Sekiguchi*, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), Sep 2022
  4. EUSIPCO
    Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization
    Mathieu Fontaine, Diego Di Carlo, Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), Aug 2022
  5. ICASSP
    Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022
  6. IWAENC
    DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2022
  7. IROS
    Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
    Kouhei Sekiguchi*, Aditya Arie Nugraha*, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Oct 2022
  8. IWAENC
    Joint Localization and Synchronization of Distributed Camera-Attached Microphone Arrays for Indoor Scene Analysis
    Yoshiaki Sumura, Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, and Kazuyoshi Yoshii
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2022

2021

  1. SPL
    Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation
    Yoshiaki Bando, Kouhei Sekiguchi, Yoshiki Masuyama, Aditya Arie Nugraha, Mathieu Fontaine, and Kazuyoshi Yoshii
    IEEE Signal Processing Letters, Aug 2021
  2. Interspeech
    Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation
    Mathieu Fontaine, Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), Aug 2021
  3. ICASSP
    Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation
    Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2021

2020

  1. TASLP
    A Flow-Based Deep Latent Variable Model for Speech Spectrogram Modeling and Enhancement
    Aditya Arie Nugraha, Kouhei Sekiguchi, and Kazuyoshi Yoshii
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
  2. SPL
    Flow-Based Independent Vector Analysis for Blind Source Separation
    Aditya Arie Nugraha, Kouhei Sekiguchi, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
    IEEE Signal Processing Letters, 2020
  3. TASLP
    Fast Multichannel Nonnegative Matrix Factorization with Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation
    Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Kazuyoshi Yoshii, and Tatsuya Kawahara
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Aug 2020
  4. EUSIPCO
    Semi-supervised Multichannel Speech Separation Based on a Phone- and Speaker-Aware Deep Generative Model of Speech Spectrograms
    Yicheng Du, Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii, and Tatsuya Kawahara
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2020
  5. Interspeech
    Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
    Mathieu Fontaine, Kouhei Sekiguchi, Aditya Arie Nugraha, and Kazuyoshi Yoshii
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), Oct 2020
  6. EUSIPCO
    Fast Multichannel Correlated Tensor Factorization for Blind Source Separation
    Kazuyoshi Yoshii, Kouhei Sekiguchi, Yoshiaki Bando, Mathieu Fontaine, and Aditya Arie Nugraha
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2020

2019

  1. TASLP
    Semi-supervised Multichannel Speech Enhancement with a Deep Speech Prior
    Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Kazuyoshi Yoshii, and Tatsuya Kawahara
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Dec 2019
  2. RO-MAN
    Audio-Visual SLAM towards Human Tracking and Human-Robot Interaction in Indoor Environments
    Aaron Chau, Kouhei Sekiguchi, Aditya Arie Nugraha, Kazuyoshi Yoshii, and Kotaro Funakoshi
    In Proceedings of IEEE International Conference on Robot & Human Interactive Communication (RO-MAN), Oct 2019
  3. EUSIPCO
    Cauchy Multichannel Speech Enhancement with a Deep Speech Prior
    Mathieu Fontaine, Aditya Arie Nugraha, Roland Badeau, Kazuyoshi Yoshii, and Antoine Liutkus
    In Proceedings of European Signal Processing Conference (EUSIPCO), Sep 2019
  4. ICASSP
    A Deep Generative Model of Speech Complex Spectrograms
    Aditya Arie Nugraha, Kouhei Sekiguchi, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2019
  5. EUSIPCO
    Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices
    Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), Sep 2019

2018

  1. Deep Neural Network Based Multichannel Audio Source Separation
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    2018

2017

  1. CSL
    An analysis of environment, microphone and data simulation mismatches in robust speech recognition
    Emmanuel Vincent, Shinji Watanabe, Aditya Arie Nugraha, Jon Barker, and Ricard Marxer
    Computer Speech & Language, Nov 2017

2016

  1. TASLP
    Multichannel audio source separation with deep neural networks
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, Sep 2016
  2. EUSIPCO
    Multichannel music separation with deep neural networks
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    In Proceedings of European Signal Processing Conference (EUSIPCO), Aug 2016

2015

  1. ASRU
    Robust ASR using neural network based speech enhancement and feature simulation
    Sunit Sivasankaran, Aditya Arie Nugraha, Emmanuel Vincent, Juan Andrés Morales Cordovilla, Siddharth Dalmia, Irina Illina, and Antoine Liutkus
    In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2015

2014

  1. ASMP
    Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition
    Aditya Arie Nugraha, Kazumasa Yamamoto, and Seiichi Nakagawa
    EURASIP Journal on Audio, Speech, and Music Processing, Apr 2014

2013

  1. APSIPA
    Single channel dereverberation method in logmelspectral domain using limited stereo data for distant speaker identification
    Aditya Arie Nugraha, Kazumasa Yamamoto, and Seiichi Nakagawa
    In Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Oct 2013
  2. SP/IPSJ-SLP
    Single Channel Dereverberation Method by Feature Mapping Using Limited Stereo Data
    Aditya Arie Nugraha, Kazumasa Yamamoto, and Seiichi Nakagawa
    Jul 2013

2012

  1. ASJ
    Improving distant speaker identification robustness using a nonlinear regression based dereverberation method in feature domain
    Aditya Arie Nugraha and Seiichi Nakagawa
    In Proceedings of the Autumn Meeting of Acoustical Society of Japan, Sep 2012

2011

  1. TSSA
    Performance evaluation of audio-video streaming service in Keerom, Papua using integrated audio-video performance test tool
    Yudi Satria Gondokaryono, Yoanes Bandung, Joko Ari Wibowo, Aditya Arie Nugraha, Bryan Yonathan, and Dwi Ramadhianto
    In Proceedings of International Conference on Telecommunication Systems, Services, and Applications (TSSA), Oct 2011

2010

  1. AEEI
    Web based multimedia conference system for digital learning in rural elementary school
    Aska Narendra, Aditya Arie Nugraha, Yoanes Bandung, Armein Z. R. Langi, and Bambang Pharmasetiawan
    Advances in Electrical Engineering and Informatics, 2010