I am an assistant professor at the Singapore Institute of Technology until July 2025 and will join Duke Kunshan University as a tenure-track assistant professor in August 2025. Prior to that, from 2021 to 2023, I was a postdoctoral researcher at the National Institute of Informatics (NII), Japan, supervised by Prof Junichi Yamagishi. I received my Ph.D. degree from the Institute of Acoustics, Chinese Academy of Sciences/University of Chinese Academy of Sciences, in 2021, supervised by Prof Pengyuan Zhang. In 2018 and 2019, I was a visiting student at the University of Kent, UK, supervised by Prof Ian McLoughlin.
My research interests include speech security, speaker and language recognition, and machine learning. I am a co-organizer of the VoicePrivacy Challenge 2022 and 2024 and of the Attacker Challenge at ICASSP 2025. My work has been published in top international AI conferences and journals such as IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), IEEE Transactions on Information Forensics and Security (TIFS), Neural Networks, Computer Speech and Language (CSL), ICASSP, and INTERSPEECH.
Recruiting
Teaching
- AAI3001, Deep Learning and Computer Vision, SIT undergraduate course
- INF2008, Machine Learning, SIT undergraduate course
Projects as PI
- Biometrics Security
  - 2022-2023 JSPS KAKENHI Grant-in-Aid for Research Start-up (22K21319): Language-independent speaker anonymization with multiple privacy-related attributes, Budget: 2.2M JPY.
  - 2024-2026 MOE AcRF Tier 1 (R-R13-A405-0005): User-centric Personalized Voice Protection, Budget: 150K SGD.
- Large Language Model
  - 2024-2025 MOE Ignition Grant (R-IE3-A405-0005): Generative AI Aided Safety Investigation System, Budget: 180K SGD (including 30K cash contribution from Singapore SMRT).
Patent and License as the First Author
- Japanese patent: Deep-learning based speaker anonymization method.
- Japanese program commercial license granted by Japan Broadcasting Corporation (NHK): VocalGuard, a speaker anonymization program.
Academic Activities
- Organizer: Attacker Challenge at ICASSP 2025, VoicePrivacy Challenge 2024 and 2022.
- Session chair: ICASSP 2024, INTERSPEECH 2023, ISCSLP 2022.
- Reviewer: IEEE TASLP, IEEE TPAMI, IEEE SPL, CSL, ICASSP, INTERSPEECH, etc.
Publications
2025
- The First VoicePrivacy Attacker Challenge, Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, and Junichi Yamagishi, ICASSP, 2025.
- A benchmark for multi-speaker anonymization, Xiaoxiao Miao, Ruijie Tao, Chang Zeng, Xin Wang, IEEE TIFS, 2025.
- Adapting General Disentanglement-Based Speaker Anonymization for Enhanced Emotion Preservation, Xiaoxiao Miao*, Yuxiang Zhang*, Xin Wang, Natalia Tomashenko, Donny Cheng Lock Soh, Ian McLoughlin, accepted by CSL.
2024
- Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches, Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, SLT, 2024.
- InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself, Chang Zeng, Chunhui Wang, Xiaoxiao Miao, Jian Zhao, Zhonglin Jiang, Yong Chen, SLT, 2024.
- The First VoicePrivacy Attacker Challenge Evaluation Plan, Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, Junichi Yamagishi, arXiv, 2024.
- The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation, Michele Panariello, Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Pierre Champion, Hubert Nourtel, Massimiliano Todisco, Nicholas Evans, Emmanuel Vincent, Junichi Yamagishi, TASLP, 2024.
- The VoicePrivacy 2024 Challenge Evaluation Plan, Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco, arXiv preprint, 2024.
- Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances, Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, CSL, 2024.
- SynVox2: Towards a privacy-friendly VoxCeleb2 dataset, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, ICASSP, 2024.
- Libri2Vox Dataset: Target Speaker Extraction with Diverse Speaker Conditions and Synthetic Data, Yun Liu, Xuechen Liu, Xiaoxiao Miao, Junichi Yamagishi, Under Review, 2024.
2023
- VoicePAT: An Efficient Open-source Evaluation Toolkit for Voice Privacy Research, Sarina Meyer*, Xiaoxiao Miao*, Ngoc Thang Vu, IEEE Open Journal of Signal Processing, 2023.
- Speaker Anonymization using Orthogonal Householder Neural Network, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko, TASLP, 2023.
- Speaker-Text Retrieval via Contrastive Learning, Xuechen Liu, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi, arXiv preprint, 2023.
- Hiding speaker’s sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline, Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf, ICASSP, 2023.
2022
- Attention back-end for automatic speaker verification with multiple enrollment utterances, Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi, ICASSP, 2022.
- Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, INTERSPEECH, 2022.
- Language-independent speaker anonymization approach using self-supervised pre-trained models, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko, Odyssey, 2022.
- GuidedMix: An on‐the‐fly data augmentation approach for robust speaker recognition system, Runqiu Xiao, Zhuo Li, Xiaoxiao Miao, Wenchao Wang, Pengyuan Zhang, Electronics Letters, 2022.
2021
- D-MONA: A dilated mixed-order non-local attention network for speaker and language recognition, Xiaoxiao Miao, Ian McLoughlin, Wenchao Wang, Pengyuan Zhang, Neural Networks, 2021.
- Variance Normalised Features for Language and Dialect Discrimination, Xiaoxiao Miao, Ian McLoughlin, Yan Song, Circuits, Systems, and Signal Processing, 2021.
- Adaptive Margin Circle Loss for Speaker Verification, Runqiu Xiao, Xiaoxiao Miao, Wenchao Wang, Pengyuan Zhang, Bin Cai, Liuping Luo, INTERSPEECH, 2021.
2020-2018
- A New Time–Frequency Attention Tensor Network for Language Identification, Xiaoxiao Miao, Ian McLoughlin, Yonghong Yan, Circuits, Systems, and Signal Processing, 2020.
- LSTM-TDNN with convolutional front-end for dialect identification in the 2019 Multi-Genre Broadcast challenge, Xiaoxiao Miao, Ian McLoughlin, arXiv, 2019.
- A New Time-Frequency Attention Mechanism for TDNN and CNN-LSTM-TDNN, with Application to Language Identification, Xiaoxiao Miao, Ian McLoughlin, Yonghong Yan, INTERSPEECH, 2019.
- Improved conditional generative adversarial net classification for spoken language recognition, Xiaoxiao Miao, Ian McLoughlin, Shengyu Yao, Yonghong Yan, SLT, 2018.
Code
- Language-Independent Speaker Anonymization
- Multi-Speaker Anonymization
- Voice Anonymization Toolkit
- VoicePrivacy Challenge 2024
Awards
- Outstanding reviewer award in INTERSPEECH, 2023
- Best paper nomination award in Odyssey, 2022
- 2nd place in the MGB-5 challenge - Fine-grained Arabic Dialect Identification, ASRU, 2019
- Merit Student of the University of Chinese Academy of Sciences, 2018
- Best paper nomination award in NCMMSC, 2017