publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2026

  1. IEEE ICASSP
    Few-Shot and Pseudo-Label Guided Speech Quality Evaluation with Large Language Models
    Ryandhimas E. Zezario, Dyah A. M. G. Wisnu, Szu-Wei Fu, and 3 more authors
    In ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
  2. IEEE ICASSP
    Enhancing Speech Intelligibility Prediction for Hearing Aids with Complementary Speech Foundation Model Representations
    Guojian Lin, Xuefei Wang, Ryandhimas E. Zezario, and 1 more author
    In ICASSP 2026 - 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026
  3. IEEE TAI
    NeuroAMP: A Novel End-to-End General Purpose Deep Neural Amplifier for Personalized Hearing Aids
    Shafique Ahmed, Ryandhimas E. Zezario, Hui-Guan Yuan, and 4 more authors
    IEEE Transactions on Artificial Intelligence, 2026

2025

  1. IEEE ASRU
    Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings
    Dyah A. M. G. Wisnu, Ryandhimas E. Zezario, Stefano Rini, and 2 more authors
    In 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2025
  2. IEEE ASRU
    HighRateMOS: Sampling-Rate Aware Modeling for Speech Quality Assessment
    Wenze Ren, Yi-Cheng Lin, Wen-Chin Huang, and 9 more authors
    In 2025 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2025
  3. APSIPA
    Non-Intrusive Intelligibility Prediction for Hearing Aids: Recent Advances, Trends, and Challenges
    Ryandhimas E. Zezario
    In 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2025
  4. APSIPA
    Speech Intelligibility Assessment with Uncertainty-Aware Whisper Embeddings and sLSTM
    Ryandhimas E. Zezario, Dyah A. M. G. Wisnu, Hsin-Min Wang, and 1 more author
    In 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2025
  5. Interspeech
    Feature Importance across Domains for Improving Non-Intrusive Speech Intelligibility Prediction in Hearing Aids
    Ryandhimas E. Zezario, Sabato M. Siniscalchi, Fei Chen, and 2 more authors
    In Interspeech 2025, 2025
  6. Interspeech
    A Study on Speech Assessment with Visual Cues
    Shafique Ahmed, Ryandhimas E. Zezario, Nasir Saleem, and 3 more authors
    In Interspeech 2025, 2025
  7. Clarity
    Non-Intrusive Multi-Branch Speech Intelligibility Prediction using Multi-Stage Training
    Ryandhimas E. Zezario, Szu-Wei Fu, Dyah A. M. G. Wisnu, and 2 more authors
    In The 6th Clarity Workshop on Improving Speech-in-Noise for Hearing Devices (Clarity-2025), 2025
  8. IEEE ICCE-TW
    A Study on Zero-Shot Non-Intrusive Speech Intelligibility for Hearing Aids Using Large Language Models
    Ryandhimas E. Zezario, Dyah A. M. G. Wisnu, Hsin-Min Wang, and 1 more author
    In 2025 IEEE International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan), 2025
  9. IEEE ICASSP
    A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models
    Ryandhimas E. Zezario, Sabato M. Siniscalchi, Hsin-Min Wang, and 1 more author
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  10. IEEE TASLPRO
    HAAQI-Net: A Non-Intrusive Neural Music Audio Quality Assessment Model for Hearing Aids
    Dyah A. M. G. Wisnu, Stefano Rini, Ryandhimas E. Zezario, and 2 more authors
    IEEE Transactions on Audio, Speech and Language Processing, 2025

2024

  1. IEEE SLT
    The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction
    Wen-Chin Huang, Szu-Wei Fu, Erica Cooper, and 5 more authors
    In 2024 IEEE Spoken Language Technology Workshop (SLT), 2024
  2. Interspeech
    Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata
    Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, and 2 more authors
    In Interspeech 2024, 2024
  3. IEEE ICME
    A Study On Incorporating Whisper For Robust Speech Assessment
    Ryandhimas E. Zezario, Yu-Wen Chen, Szu-Wei Fu, and 3 more authors
    In 2024 IEEE International Conference on Multimedia and Expo (ICME), 2024
  4. IEEE ICASSP
    Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model
    Ryandhimas E. Zezario, Bo-Ren Brian Bai, Chiou-Shann Fuh, and 2 more authors
    In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024

2023

  1. IEEE TASLP
    Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features
    Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, and 3 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023

2022

  1. Interspeech
    MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
    Ryandhimas E. Zezario, Szu-Wei Fu, Fei Chen, and 3 more authors
    In Interspeech 2022, 2022
  2. Interspeech
    MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
    Ryandhimas E. Zezario, Fei Chen, Chiou-Shann Fuh, and 2 more authors
    In Interspeech 2022, 2022

2021

  1. EUSIPCO
    Speech Enhancement with Zero-Shot Model Selection
    Ryandhimas E. Zezario, Chiou-Shann Fuh, Hsin-Min Wang, and 1 more author
    In 2021 29th European Signal Processing Conference (EUSIPCO), 2021

2020

  1. APSIPA
    STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model
    Ryandhimas E. Zezario, Szu-Wei Fu, Chiou-Shann Fuh, and 2 more authors
    In 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2020
  2. IEEE TASLP
    Speech Enhancement Based on Denoising Autoencoder With Multi-Branched Encoders
    Cheng Yu*, Ryandhimas E. Zezario*, Syu-Siang Wang, and 5 more authors
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
  3. IEEE ICASSP
    Self-Supervised Denoising Autoencoder with Linear Regression Decoder for Speech Enhancement
    Ryandhimas E. Zezario, Tassadaq Hussain, Xugang Lu, and 2 more authors
    In ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

2019

  1. ISPACS
    Comparative Study of Masking and Mapping Based on Hierarchical Extreme Learning Machine for Speech Enhancement
    Ryandhimas E. Zezario, Join W. C. Sigalingging, Tassadaq Hussain, and 2 more authors
    In 2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS), 2019
  2. Interspeech
    Specialized Speech Enhancement Model Selection Based on Learned Non-Intrusive Quality Assessment Metric
    Ryandhimas E. Zezario, Szu-Wei Fu, Xugang Lu, and 2 more authors
    In Interspeech 2019, 2019

2018

  1. APSIPA
    Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement
    Ryandhimas E. Zezario, Jen-Wei Huang, Xugang Lu, and 3 more authors
    In 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2018

2016

  1. ISCSLP
    Incorporating local environment information with ensemble neural networks to robust automatic speech recognition
    Chia-Yung Hsu, Ryandhimas E. Zezario, Jia-Ching Wang, and 3 more authors
    In 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2016