publications

This page lists publications by categories in reversed chronological order. Asterisk symbols (*) indicate authors who contributed equally to an article. An up-to-date list is available on Google Scholar.

2023

  1. WASPAA
    Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning
    Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
  2. EUSIPCO
    Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation
    Yoshiaki Bando, Yoshiki Masuyama, Aditya Arie Nugraha, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2023
  3. ICASSP
    Exploiting Sparse Recovery Algorithms for Semi-Supervised Training of Deep Neural Networks for Direction-of-Arrival Estimation
    Murtiza Ali, Aditya Arie Nugraha, and Karan Nathwani
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023

2022

  1. IROS
    Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments
    Kouhei Sekiguchi*, Aditya Arie Nugraha*, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022
  2. Interspeech
    Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
    Yicheng Du*, Aditya Arie Nugraha*, Kouhei Sekiguchi*, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), 2022
  3. IWAENC
    DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), 2022
  4. IWAENC
    Joint Localization and Synchronization of Distributed Camera-Attached Microphone Arrays for Indoor Scene Analysis
    Yoshiaki Sumura, Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, and Kazuyoshi Yoshii
    In Proceedings of International Workshop on Acoustic Signal Enhancement (IWAENC), 2022
  5. EUSIPCO
    Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization
    Mathieu Fontaine, Diego Di Carlo, Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2022
  6. TASLP
    Autoregressive Moving Average Jointly-Diagonalizable Spatial Covariance Analysis for Joint Source Separation and Dereverberation
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022
  7. TASLP
    Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022
  8. ICASSP
    Flow-Based Fast Multichannel Nonnegative Matrix Factorization for Blind Source Separation
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022

2021

  1. SPL
    Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation
    IEEE Signal Processing Letters, 2021
  2. Interspeech
    Alpha-Stable Autoregressive Fast Multichannel Nonnegative Matrix Factorization for Joint Speech Enhancement and Dereverberation
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), 2021
  3. ICASSP
    Autoregressive Fast Multichannel Nonnegative Matrix Factorization For Joint Blind Source Separation And Dereverberation
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021

2020

  1. Interspeech
    Unsupervised Robust Speech Enhancement Based on Alpha-Stable Fast Multichannel Nonnegative Matrix Factorization
    In Proceedings of Annual Conference of the International Speech Communication Association (Interspeech), 2020
  2. TASLP

    15th IEEE Signal Processing Society (SPS) Japan Student Journal Paper Award

    Fast Multichannel Nonnegative Matrix Factorization with Directivity-Aware Jointly-Diagonalizable Spatial Covariance Matrices for Blind Source Separation
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
  3. TASLP
    A Flow-Based Deep Latent Variable Model for Speech Spectrogram Modeling and Enhancement
    Aditya Arie Nugraha, Kouhei Sekiguchi, and Kazuyoshi Yoshii
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
  4. SPL
    Flow-Based Independent Vector Analysis for Blind Source Separation
    IEEE Signal Processing Letters, 2020
  5. EUSIPCO
    Semi-supervised Multichannel Speech Separation Based on a Phone- and Speaker-Aware Deep Generative Model of Speech Spectrograms
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2020
  6. EUSIPCO
    Fast Multichannel Correlated Tensor Factorization for Blind Source Separation
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2020

2019

  1. TASLP

    17th IEEE Kansai Section Student Paper Award

    Semi-supervised Multichannel Speech Enhancement with a Deep Speech Prior
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2019
  2. RO-MAN

    Best Conference Paper Award

    Audio-Visual SLAM towards Human Tracking and Human-Robot Interaction in Indoor Environments
    Aaron Chau, Kouhei Sekiguchi, Aditya Arie Nugraha, Kazuyoshi Yoshii, and Kotaro Funakoshi
    In Proceedings of IEEE International Conference on Robot & Human Interactive Communication (RO-MAN), 2019
  3. EUSIPCO
    Cauchy Multichannel Speech Enhancement with a Deep Speech Prior
    Mathieu Fontaine, Aditya Arie Nugraha, Roland Badeau, Kazuyoshi Yoshii, and Antoine Liutkus
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2019
  4. EUSIPCO
    Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices
    Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2019
  5. ICASSP
    A Deep Generative Model of Speech Complex Spectrograms
    Aditya Arie Nugraha, Kouhei Sekiguchi, and Kazuyoshi Yoshii
    In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019

2018

  1. Deep Neural Network Based Multichannel Audio Source Separation
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    In Audio Source Separation, 2018

2017

  1. CSL

    ISCA Award for the Best Review Paper published in Computer Speech and Language (2016-2020)

    An analysis of environment, microphone and data simulation mismatches in robust speech recognition
    Computer Speech & Language, 2017

2016

  1. TASLP

    6th IEEE Signal Processing Society (SPS) Japan Young Author Best Paper Award

    Multichannel audio source separation with deep neural networks
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016
  2. EUSIPCO
    Multichannel music separation with deep neural networks
    Aditya Arie Nugraha, Antoine Liutkus, and Emmanuel Vincent
    In Proceedings of European Signal Processing Conference (EUSIPCO), 2016

2015

  1. ASRU
    Robust ASR using neural network based speech enhancement and feature simulation
    Sunit Sivasankaran, Aditya Arie Nugraha, Emmanuel Vincent, Juan Andrés Morales Cordovilla, Siddharth Dalmia, Irina Illina, and Antoine Liutkus
    In Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 2015

2014

  1. ASMP
    Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition
    Aditya Arie Nugraha, Kazumasa Yamamoto, and Seiichi Nakagawa
    EURASIP Journal on Audio, Speech, and Music Processing, 2014

2013

  1. APSIPA
    Single channel dereverberation method in logmelspectral domain using limited stereo data for distant speaker identification
    Aditya Arie Nugraha, Kazumasa Yamamoto, and Seiichi Nakagawa
    In Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013
  2. SP/IPSJ-SLP
    Single Channel Dereverberation Method by Feature Mapping Using Limited Stereo Data
    Aditya Arie Nugraha, Kazumasa Yamamoto, and Seiichi Nakagawa
    Technical Report of Institute of Electronics, Information and Communication Engineers (IEICE), 2013

2012

  1. ASJ
    Improving distant speaker identification robustness using a nonlinear regression based dereverberation method in feature domain
    Aditya Arie Nugraha and Seiichi Nakagawa
    In Proceedings of the Autumn Meeting of Acoustical Society of Japan, 2012

2011

  1. TSSA
    Performance evaluation of audio-video streaming service in Keerom, Papua using integrated audio-video performance test tool
    Yudi Satria Gondokaryono, Yoanes Bandung, Joko Ari Wibowo, Aditya Arie Nugraha, Bryan Yonathan, and Dwi Ramadhianto
    In Proceedings of International Conference on Telecommunication Systems, Services, and Applications (TSSA), 2011

2010

  1. AEEI
    Web based multimedia conference system for digital learning in rural elementary school
    Aska Narendra, Aditya Arie Nugraha, Yoanes Bandung, Armein Z. R. Langi, and Bambang Pharmasetiawan
    Advances in Electrical Engineering and Informatics, 2010