publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. Tech Report
    Overview of the Amphion Toolkit (v0. 2)
    Jiaqi Li, Xueyao Zhang, Yuancheng Wang, Haorui He, Chaoren Wang, Li Wang, Huan Liao, Junyi Ao, Zeyu Xie, Yiqiao Huang, and  others
    arXiv preprint arXiv:2501.15442, 2025
    TL;DR: This is the technical report for the second version of the Amphion toolkit.

2024

  1. SLT 2024
    Investigating neural audio codecs for speech language model-based speech generation
    Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, and  others
    In 2024 IEEE Spoken Language Technology Workshop (SLT), 2024
  2. SLT 2024
    Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
    Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, and Zhizheng Wu
    In 2024 IEEE Spoken Language Technology Workshop (SLT), 2024
    TL;DR: We collect a 100k hours in-the-wild speech dataset for speech generation.
  3. SLT 2024
    Amphion: an Open-Source Audio, Music, and Speech Generation Toolkit
    Xueyao Zhang*, Liumeng Xue*, Yicheng Gu*, Yuancheng Wang*Jiaqi Li, Haorui He, Chaoren Wang, Songting Liu, Xi Chen, Junan Zhang, Tze Ying Tang, Lexiao Zou, Mingxuan Wang, Jun Han, Kai Chen, Haizhou Li, and Zhizheng Wu
    In 2024 IEEE Spoken Language Technology Workshop (SLT), 2024
    TL;DR: We develop a unified toolkit for audio, music, and speech generation.
  4. ICASSP 2024
    An initial investigation of neural replay simulator for over-the-air adversarial perturbations to automatic speaker verification
    Jiaqi Li, Li Wang, Liumeng Xue, Lei Wang, and Zhizheng Wu
    In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024
  5. ICASSP 2024
    Advsv: An over-the-air adversarial attack dataset for speaker verification
    Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, and Zhizheng Wu
    In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024
  6. Debatts: Zero-shot debating text-to-speech synthesis
    Yiqiao Huang, Yuancheng Wang, Jiaqi Li, Haotian Guo, Haorui He, Shunsi Zhang, and Zhizheng Wu
    arXiv preprint arXiv:2411.06540, 2024

2023

  1. ROME: Testing image captioning systems via recursive object melting
    Boxi Yu, Zhiqing Zhong, Jiaqi Li, Yixing Yang, Shilin He, and Pinjia He
    In Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023