I am an assistant professor at Duke KunShan University. Prior to that, I was an assistant professor at Singapore Institute of Technology from October 2023 to July 2025 and a postdoctoral researcher at the National Institute of Informatics (NII), Japan, from 2021 to 2023, supervised by Prof Junichi Yamagishi. I received my Ph.D. degree from the Institute of Acoustics, Chinese Academy of Sciences/University of Chinese Academy of Sciences, in 2021, supervised by Prof Pengyuan Zhang. During 2018 and 2019, I was at the University of Kent, UK, as a Visiting Student, supervised by Prof Ian McLoughlin.
My research interest includes speech security, speaker and language recognition, and machine learning. I am a co-organizer of the VoicePrivacy challenge 2022, 2024 and Attacker Challenge at ICASSP 2025. My work has been published at the top international AI conferences and journals such as IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), IEEE Transactions on Information Forensics and Security (TIFS), Neural Networks, Computer Speech and Language (CSL), ICASSP, INTERSPEECH.
Teaching
- AAI3001, Deep Learning and Computer Vision, SIT undergraduate course
- INF2008, Machine Learning, SIT undergraduate course
Projects as PI
- Biometrics Security
- 2022-2023 JSPS KAKENHI Grant-in-Aid for Research Start-up (22K21319): Language-independent speaker anonymization with multiple privacy-related attributes, Budget: 2.2M JPY.
- 2024-2026 MOE AcRF Tier 1 (R-R13-A405-0005): User-centric Personalized Voice Protection, Budget: 150K SGD.
- Large Language Model
- 2024-2025 MOE Ignition Grant (R-IE3-A405-0005): Generative AI Aided Safety Investigation System, Budget: 180K SGD (Including 30K cash contribution from Singapore SMRT).
Patent and License as the First Author
- Japanese patent: Deep-learning based speaker anonymization method.
- Japanese program commercial license granted by Japan Broadcasting Corporation (NHK): VocalGuard, a speaker anonymization program.
Academic Activities
- Organizer: Attacker Challenge at ICASSP 2025, VoicePrivacy Challenge 2024 and 2022.
- Session chair: ICASSP 2024, INTERSPEECH 2023, ISCSLP 2022.
- Reviewer: IEEE TASLP, IEEE TPAMI, IEEE SPL, CSL, ICASSP, INTERSPEECH, etc.
Publication
2025
- SegReConcat: A Data Augmentation Method for Voice Anonymization Attack, Ridwan Arefeen, Xiaoxiao Miao, Tong Rong, Aik Beng Ng, Simon See, APCIPA ASC, 2025.
- Exploring Machine Learning and Language Models for Multimodal Depression Detection, Javier Si Zhao Hong, Timothy Zoe Delaya, Sherwyn Chan Yin Kit, Pai Chet Ng, Xiaoxiao Miao, APCIPA ASC, 2025.
- Speech Emotion Recognition via Entropy-Aware Score Selection, ChenYi Chua, JunKai Wong, Chengxin Chen, Xiaoxiao Miao, APCIPA ASC, 2025.
- SEF-MK: Speaker-Embedding-Free Voice Anonymization through Multi-k-means Quantization, Beilong Tang, Xiaoxiao Miao, Xin Wang, Ming Li, IEEE ASRU, 2025.
- The Risks and Detection of Overestimated Privacy Protection in Voice Anonymisation, Michele Panariello, Sarina Meyer, Pierre Champion, Xiaoxiao Miao, Massimiliano Todisco, Ngoc Thang Vu, Nicholas Evans, 5th Symposium on Security and Privacy in Speech Communication, 2025.
- Localizing Audio-Visual Deepfakes via Hierarchical Boundary Modeling, Xuanjun Chen, Shih-Peng Cheng, Jiawei Du, Lin Zhang, Xiaoxiao Miao, Chung-Che Wang, Haibin Wu, Hung-yi Lee, Jyh-Shing Roger Jang, under review, 2025.
- SecureSpeech: Prompt-based Speaker and Content Protection, Belinda Soh Hui Hui, Xiaoxiao Miao, Xin Wang, IEEE IJCB, 2025.
- Mitigating Language Mismatch in SSL-Based Speaker Anonymization, Zhe Zhang, Wen-Chin Huang, Xin Wang, Xiaoxiao Miao, Junichi Yamagishi, INTERSPEECH, 2025.
- Automated evaluation of children’s speech fluency for low-resource languages, Bowen Zhang, Nur Afiqah Abdul Latiff, Justin Kan, Rong Tong, Donny Soh, Xiaoxiao Miao, Ian McLoughlin, INTERSPEECH, 2025
- LSPnet: an ultra-low bitrate hybrid neural codec, Bowen Zhang, Ian McLoughlin, Xiaoxiao Miao, AS Madhukumar, INTERSPEECH, 2025
- The First VoicePrivacy Attacker Challenge, Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, and Junichi Yamagishi, ICASSP, 2025.
- A benchmark for multi-speaker anonymization, Xiaoxiao Miao, Ruijie Tao, Chang Zeng, Xin Wang, IEEE TIFS, 2025.
- Adapting General Disentanglement-Based Speaker Anonymization for Enhanced Emotion Preservation, Xiaoxiao Miao*, Yuxiang Zhang*, Xin Wang, Natalia Tomashenko, Donny Cheng Lock Soh, Ian Mcloughlin, CSL, 2025.
2024
- Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches, Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, SLT, 2024.
- Instructsing: High-Fidelity Singing Voice Generation Via Instructing Yourself, Chang Zeng, Chunhui Wang, Xiaoxiao Miao, Jian Zhao, Zhonglin Jiang, Yong Chen, SLT, 2024.
- The First VoicePrivacy Attacker Challenge Evaluation Plan, Natalia Tomashenko, Xiaoxiao Miao, Emmanuel Vincent, Junichi Yamagishi, arXiv, 2024.
- The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation, Michele Panariello, Natalia Tomashenko, Xin Wang, Xiaoxiao Miao, Pierre Champion, Hubert Nourtel, Massimiliano Todisco, Nicholas Evans, Emmanuel Vincent, Junichi Yamagishi, TASLP, 2024.
- The VoicePrivacy 2024 Challenge Evaluation Plan, Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco, arXiv preprint, 2024.
- Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances, Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, CSL, 2024.
- SynVox2: Towards a privacy-friendly VoxCeleb2 dataset, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, ICASSP, 2024.
- Libri2Vox Dataset: Target Speaker Extraction with Diverse Speaker Conditions and Synthetic Data, Yun Liu, Xuechen Liu, Xiaoxiao Miao, Junichi Yamagishi, Under Review, 2024.
2023
- VoicePAT: An Efficient Open-source Evaluation Toolkit for Voice Privacy Research, Sarina Meyer*, Xiaoxiao Miao*, Ngoc Thang Vu, IEEE Open Journal of Signal Processing, 2023.
- Speaker Anonymization using Orthogonal Householder Neural Network, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko, TASLP, 2023.
- Speaker-Text Retrieval via Contrastive Learning, Xuechen Liu, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi, arXiv preprint, 2023.
- Hiding speaker’s sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline, Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf, ICASSP, 2023.
2022
- Attention back-end for automatic speaker verification with multiple enrollment utterances, Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi, ICASSP, 2022.
- Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, INTERSPEECH, 2022.
- Language-independent speaker anonymization approach using self-supervised pre-trained models, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia Tomashenko, Odyssey, 2022.
- GuidedMix: An on‐the‐fly data augmentation approach for robust speaker recognition system, Runqiu Xiao, Zhuo Li, Xiaoxiao Miao, Wenchao Wang, Pengyuan Zhang, Electronics Letters, 2022.
2021
- D-MONA: A dilated mixed-order non-local attention network for speaker and language recognition, Xiaoxiao Miao, Ian McLoughlin, Wenchao Wang, Pengyuan Zhang, Neural Networks, 2021.
- Variance Normalised Features for Language and Dialect Discrimination, Xiaoxiao Miao, Ian McLoughlin, Yan Song, Circuits, Systems, and Signal Processing, 2021.
- Adaptive Margin Circle Loss for Speaker Verification, Runqiu Xiao, Xiaoxiao Miao, Wenchao Wang, Pengyuan Zhang, Bin Cai, Liuping Luo, INTERSPEECH, 2021.
2020-2018
- A New Time–Frequency Attention Tensor Network for Language Identification, Xiaoxiao Miao, Ian McLoughlin, Yanyong Hong, Circuits, Systems, and Signal Processing, 2020.
- Lstm-tdnn with convolutional front-end for dialect identification in the 2019 multi-genre broadcast challenge, Xiaoxiao Miao, Ian McLoughlin, arXiv, 2019.
- A New Time-Frequency Attention Mechanism for TDNN and CNN-LSTM-TDNN, with Application to Language Identification, Xiaoxiao Miao, Ian McLoughlin, Yonghong Yan, INTERSPEECH, 2019.
- Improved conditional generative adversarial net classification for spoken language recognition, Xiaoxiao Miao, Ian McLoughlin, Shengyu Yao, Yonghong Yan, SLT, 2018.
Code
- Language-Independent Speaker Anonymization
- Multi-Speaker Anonymization
- Voice Anonymization Toolkit
- VoicePrivacy Challenge 2024
Award
- Outstanding reviewer award in INTERSPEECH, 2023
- Best paper nomination award in Odyssey, 2022
- The 2th place in MGB-5 challenge - Fine-grained Arabic Dialect Identification, ASRU, 2019
- Merit Student of University Chinese Academy of Science, 2018
- Best paper nomination award in NCMMSC, 2017