Selected Publications
Dissertations:
Tutorial Books:
- S. J. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw,
X. Liu, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev and P. C.
Woodland. The HTK Book Version 3.4.1, 2009.
Book Chapters:
- M. Tomalin, W. Byrne, A. de Gispert, M. Gales, X. Liu,
P. Woodland et al. ``Improving Speech Transcription for
Machine Translation'', in J. Olive, C. Christianson, and J. McCary,
editors, Handbook of natural language processing and machine
translation. DARPA Global Autonomous Language Exploitation.
Springer, ISBN 978-1-4419-7712-0, 2011.
- S. Matsoukas, T. Ng , L. Nguyen , F. Diehl, M. J. F. Gales, X. Liu,
P. C. Woodland, J-L Gauvain, L. Lamel, A. Messaoudi et al.
``Optimizing Speech-To-Text System Combination for Machine
Translation'', in J. Olive, C. Christianson, and J. McCary,
editors, Handbook of natural language processing and machine
translation. DARPA Global Autonomous Language Exploitation.
Springer, ISBN 978-1-4419-7712-0, 2011.
Journal Papers:
- Shujie Hu, Xurong Xie, Mengzhe Geng, Zengrui Jin, Jiajun Deng, Guinan Li, Yi Wang, Mingyu Cui, Tianzi Wang, Helen Meng, Xunying Liu. Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 32, Pages 3561-3575, 2024. [DOI]
- Zengrui Jin, Mengzhe Geng, Jiajun Deng, Tianzi Wang, Shujie Hu, Guinan Li, Xunying Liu. Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 32, Pages 413-429, 2024. [DOI]
- Guinan Li, Jiajun Deng, Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Mingyu Cui, Helen Meng, Xunying Liu. Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 31, Pages 2707-2723, 2023. [DOI]
- Jiajun Deng, Xurong Xie, Tianzi Wang, Mingyu Cui, Boyang Xue, Zengrui Jin, Guinan Li, Shujie Hu, Xunying Liu. Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 31, Pages 1175-1190, 2023. [DOI]
- Xixin Wu, Hui Lu, Kun Li, Zhiyong Wu, Xunying Liu, Helen Meng. Hiformer: Sequence Modeling Networks with Hierarchical Attention Mechanisms, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 31, Pages 3993-4003, 2023. [DOI]
- Cai Wingfield, Chao Zhang, Barry J. Devereux, Elisabeth Fonteneau, Andrew Thwaites, Xunying Liu, Phil Woodland, William D. Marslen-Wilson, Li Su.
On the similarities of representations in artificial and brain neural networks for speech recognition,
Frontiers in Computational Neuroscience, Volume 16, December, 2022. [DOI]
- Boyang Xue, Shoukang Hu, Junhao Xu, Mengzhe Geng, Xunying Liu, Helen Meng. Bayesian Neural Network Language Modeling for Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 30, Pages 2900-2917, 2022. [DOI]
- Mengzhe Geng, Xurong Xie, Zi Ye, Tianzi Wang, Guinan Li, Shujie Wu, Xunying Liu and Helen Meng. Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 30, Pages 2597-2611, 2022. [DOI]
- Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu and Helen Meng. Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 30, Pages 1093-1107, 2022. [DOI]
- Junhao Xu, Jianwei Yu, Shoukang Hu, Xunying Liu and Helen Meng. Mixed Precision Low-bit Quantisation of Neural Network Language Models for Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 3679-3693, 2021. [DOI]
- Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Zi Ye, Mengzhe Geng, Xunying Liu and Helen Meng. Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 1514-1529, 2021. [DOI]
- Jianwei Yu, Shi-Xiong Zhang, Bo Wu, Shansong Liu, Shoukang Hu, Mengzhe Geng, Xunying Liu, Helen Meng and Dong Yu. Audio-visual Multi-channel Integration and Recognition of Overlapped Speech, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 2067-2082, 2021. [DOI]
- Xurong Xie, Xunying Liu, Tan Lee and Lan Wang. Bayesian Learning for Deep Neural Network Adaptation, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 2096-2110, 2021. [DOI]
- Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu and Helen Meng. Recent Progress in the CUHK Dysarthric Speech Recognition System, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 2267-2281, 2021. [DOI]
- Songxiang Liu, Yuewen Cao, Disong Wang, Xixin Wu, Xunying Liu and Helen Meng. Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling, to appear in IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 1717-1728, 2021. [DOI]
- Xixin Wu, Yuewen Cao, Hui Lu, Songxiang Liu, Shiyin Kang, Zhiyong Wu, Xunying Liu and Helen Meng. Exemplar-based Emotive Speech Synthesis, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 874-886, 2021. [DOI]
- Xixin Wu, Yuewen Cao, Hui Lu, Songxiang Liu, Disong Wang, Zhiyong Wu, Xunying Liu and Helen Meng. Speech Emotion Recognition Using Sequential Capsule Networks, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 29, Pages 3280-3291, 2021. [DOI]
- Rongfeng Su, Xunying Liu, Lan Wang and Jingzhou Yang. Cross-Domain Deep Visual Feature Generation for Mandarin Audio-Visual Speech Recognition, IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 28, Issue 1, December 2020, Pages 185-197. [DOI]
- Xie Chen, Xunying Liu, Yu Wang, Anton Ragni, Jeremy Wong and Mark. J. F. Gales. Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition,
IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 27, Issue 9, September 2019, Pages 1444-1454. [DOI]
- Cai Wingfiled, Li Su, Xunying Liu, Cao Zhang, Philip C. Woodland, Andrew Thwaites, Elizabeth Fonteneau and William D Marslen-Wilson.
Relating Dynamic Brain States to Dynamic Machine States: Human and Machine Solutions to the Speech Recognition Problem,
September 2017,PLoS Computational Biology 13(9):e1005617. [web]
- Xunying Liu, Xie Chen, Yongqiang Wang, Mark J. F. Gales and Philip C. Woodland.
Two Efficient Lattice Rescoring Methods Using Recurrent Neural
Network Language Models,
IEEE/ACM Transactions on Audio, Speech and
Language Processing, Volume 24, Issue 8, August
2016, Pages 1438-1449.
[pdf|web]
- Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales and Philip C. Woodland.
Efficient Training and Evaluation of Recurrent Neural Network
Language Models for Automatic Speech Recognition,
IEEE/ACM Transactions on Audio, Speech and
Language Processing, Volume 24, Issue 11, November
2016, Pages 2146-2157.
[pdf|web]
- Rongfeng Su, Xunying Liu and Lan Wang. Automatic Complexity Control
of Generalized Variable Parameters HMMs for Noise Robust
Speech Recognition, IEEE/ACM Transactions on Audio,
Speech and Language Processing, Volume 23, Issue 1, November
2014, Pages 102-114.
[pdf|web]
- Xunying Liu, Mark J. F. Gales and Philip C. Woodland.
Paraphrastic Language Models.
Computer Speech and Language, Volume 28, Issue 6, November 2014, Pages 1298-1316.
[pdf|web]
- Xunying Liu, Mark J. F. Gales and Philip C. Woodland.
Use of Contexts in Language Model Interpolation and
Adaptation. Computer Speech and Language, Volume 27, Issue 1,
January 2013. [pdf|web]
- Xunying Liu, James L. Hieronymus, Mark J. F. Gales and Philip C. Woodland.
Syllable Language Models for Mandarin Speech
Recognition: Exploiting Character Sequence Models,
Journal of the Acoustical Society of America, Volume 133,
Issue 1, 519-528, January 2013.
[pdf|web]
- Xunying Liu, Mark J. F. Gales and Philip C. Woodland.
Language Model Cross Adaptation For LVCSR System Combination.
Computer Speech and Language, Volume 27, Issue 4,
June 2013.
[pdf|web]
- Ning Cheng, Xunying Liu and Lan Wang.
A Flexible Framework for
HMM Based Noise Robust Speech Recognition Using Generalized
Parametric Space Polynomial Regression, SCIENCE CHINA
Information Sciences, Volume 54, Number 12, 2481-2491, 2011.
- Xunying Liu and Mark J. F. Gales. Automatic
Model Complexity
Control Using Marginalized Discriminative Growth
Functions. IEEE Transactions on Audio, Speech and Language
Processing, Vol. 4, May 2007.
- Thomas Hain, Philip C. Woodland, Gunnar Evermann, Mark J. F. Gales, Xunying Liu,
Gareth L. Moore, Daniel Povey and Lan Wang.
Automatic Transcription of Conversational Telephone Speech, IEEE Transactions on
Speech and Audio Processing, Vol. 13, Nov. 2005.
Conference Papers:
2024
- Jiajun Deng, Xurong Xie, Guinan Li, Mingyu Cui, Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Zhaoqing Li, Xunying Liu. TOWARDS HIGH-PERFORMANCE AND LOW-LATENCY FEATURE-BASED SPEAKER ADAPTATION OF CONFORMER SPEECH RECOGNITION SYSTEMS, IEEE ICASSP2024, Seoul, Korea.
- Zengrui Jin, Xurong Xie, Tianzi Wang, Mengzhe Geng, Jiajun Deng, Guinan Li, Shujie Hu, Xunying Liu. TOWARDS AUTOMATIC DATA AUGMENTATION FOR DISORDERED SPEECH RECOGNITION, IEEE ICASSP2024, Seoul, Korea.
- Huimeng Wang, Zengrui Jin, Mengzhe Geng, Shujie Hu, Guinan Li, Tianzi Wang, Haoning Xu, Xunying Liu. ENHANCING PRE-TRAINED ASR SYSTEM FINE-TUNING FOR DYSARTHRIC SPEECH RECOGNITION USING ADVERSARIAL DATA AUGMENTATION, IEEE ICASSP2024, Seoul, Korea.
- Xueyuan Chen, Yuejiao Wang, Xixin Wu, Disong Wang, Zhiyong Wu, Xunying Liu, Helen Meng. EXPLOITING AUDIO-VISUALFEATURES WITH PRETRAINED AV-HUERT FOR MULTI-MODAL DYSARTHRIC SPEECH RECONSTRUCTION, IEEE ICASSP2024, Seoul, Korea.
- Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng. Cross-speaker encoding network for multi-talker speech recognition, IEEE ICASSP2024, Seoul, Korea.
- Guinan Li, Jiajun Deng, Youjun Chen, Mengzhe Geng, Shujie Hu, Zhe Li, Zengrui Jin, Tianzi Wang, Xurong Xie, Helen Meng, Xunying Liu. Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition, ISCA Interspeech2024, Kos, Greece.
- Tianzi Wang, Xurong Xie, Zhaoqing Li, Shoukang Hu, Zengrui Jing, Jiajun Deng, Mingyu Cui, Shujie Hu, Mengzhe Geng, Guinan Li, Helen Meng, Xunying Liu. Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask, ISCA Interspeech2024, Kos, Greece.
- Zhaoqing Li, Haoning Xu, Tianzi Wang, Shoukang Hu, Zengrui Jin, Shujie Hu, Jiajun Deng, Mingyu Cui, Mengzhe Geng, Xunying Liu. One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model, ISCA Interspeech2024, Kos, Greece.
- Yicong Jiang, Tianzi Wang, Xurong Xie, Juan Liu, Wei Sun, Nan Yan, Hui Chen, Lan Wang, Xunying Liu, Feng Tian. Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition, ISCA Interspeech2024, Kos, Greece.
- Lingwei Meng, Jiawen Kang, Yuejiao Wang, Zengrui Jin, Xixin Wu, Xunying Liu, Helen Meng. Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System, ISCA Interspeech2024, Kos, Greece.
- Jiajun Deng, Yaolong Ju, Jing Yang, Simon Lui, Xunying Liu. EFFICIENT ADAPTER TUNING FOR JOINT SINGING VOICE BEAT AND DOWNBEAT TRACKING WITH SELF-SUPERVISED LEARNING FEATURES, ISMIR2024, San Francisco, CA, USA.
- Shujie HU, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei. WavLLM: Towards Robust and Adaptive Speech Large Language Model, EMNLP2024, Miami, Florida, USA.
- Yicong Jiang, Youjun Chen, Tianzi Wang, Zengrui Jin, Xurong Xie, Hui Chen, Xunying Liu, Feng Tian. Investigation of Cross Modality Feature Fusion for Audio-Visual Dysarthric Speech Assessment, ISCSLP2024, Beijing, China.
- Dongrui Han, Mingyu Cui, Jiawen Kang, Xixin Wu, Xunying Liu, Helen Meng. Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models, ISCSLP2024, Beijing, China.
2023
- Xurong Xie, Xunying Liu, Hui Chen, Hongan Wang. Unsupervised model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition, IEEE ICASSP2023, Rhodes Island, Greece.
- Zengrui Jin, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shujie Hu, Jiajun Deng, Guinan Li, Xunying Liu. Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition, IEEE ICASSP2023, Rhodes Island, Greece.
- Shujie Hu, Xurong Xie, Zengrui Jin, Mengzhe Geng, Yi Wang, Mingyu Cui, Jiajun Deng, Xunying Liu, Helen Meng. Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition, IEEE ICASSP2023, Rhodes Island, Greece.
- Yi Wang, Jiajun Deng, Tianzi Wang, Bo Zheng, Shoukang Hu, Xunying Liu, Helen Meng. Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection, IEEE ICASSP2023, Rhodes Island, Greece.
- Jinchao Li, Kaitao Song, Junan Li, Bo Zheng, Dongsheng Li, Xixin Wu, Xunying Liu, Helen Meng. Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection, IEEE ICASSP2023, Rhodes Island, Greece.
- Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng. A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition, IEEE ICASSP2023, Rhodes Island, Greece.
- Mengzhe Geng, Xurong Xie, Rongfeng Su, Jianwei Yu, Zengrui Jin, Tianzi Wang, Shujie Hu, Zi Ye, Helen Meng, Xunying Liu. On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition, ISCA Interspeech2023, Dublin, Ireland.
- Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Jiajun Deng, Mingyu Cui, Guinan Li, Jianwei Yu, Xurong Xie, Xunying Liu. Use of Speech Impairment Severity for Dysarthric Speech Recognition, ISCA Interspeech2023, Dublin, Ireland.
- Jiajun Deng, Guinan Li, Xurong Xie, Zengrui Jin, Mingyu Cui, Tianzi Wang, Shujie Hu, Mengzhe Geng, Xunying Liu. Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems, ISCA Interspeech2023, Dublin, Ireland.
- Shujie Hu, Xurong Xie, Mengzhe Geng, Mingyu Cui, Jiajun Deng, Guinan Li, Tianzi Wang, Helen Meng, Xunying Liu. Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition, ISCA Interspeech2023, Dublin, Ireland.
- Mingyu Cui, Jiawen Kang, Jiajun Deng, Xi Yin, Yutao Xie, Xie Chen, Xunying Liu. Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems, ISCA Interspeech2023, Dublin, Ireland.
- Tianzi Wang, Shoukang Hu, Jiajun Deng, Zengrui Jin, Mengzhe Geng, Yi Wang, Helen Meng, Xunying Liu. Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition, ISCA Interspeech2023, Dublin, Ireland.
- Zhaoqing Li, Tianzi Wang, Jiajun Deng, Junhao Xu, Shoukang Hu, Xunying Liu. Lossless 4-bit Quantization of Architecture Compressed Conformer ASR Systems on the 300-hr Switchboard Corpus, ISCA Interspeech2023, Dublin, Ireland.
- Helen Meng, Brian Mak, Man-Wai Mak, Helene Fung, Xianmin Gong, Timothy Kwok, Xunying Liu, Vincent C. T. Mok, Patrick Wong, Jean Woo, Xixin Wu, Ka Ho Wong, Shensheng Xu, Naijun Zheng, Ranzo Huang, Jiawen Kang, Xiaoquan Ke, Junan Li, Jinchao Li, Yi Wang. Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders, ISCA Interspeech2023, Dublin, Ireland.
2022
- Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng. AUDIO-VISUAL MULTI-CHANNEL SPEECH SEPARATION, DEREVERBERATION AND RECOGNITION, IEEE ICASSP2022, Singapore.
- Junhao, Jianwei Yu, Xunying Liu, Helen Meng. MIXED PRECISION DNN QUANTIZATION FOR OVERLAPPEDSPEECH SEPARATION AND RECOGNITION, IEEE ICASSP2022, Singapore.
- Shujie Hu, Shansong Liu, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shoukang Hu, Mingyu Cui, Xunying Liu, Helen Meng. EXPLOITING CROSS DOMAIN ACOUSTIC-TO-ARTICULATORY INVERTED FEATURES FOR DISORDERED SPEECH RECOGNITION, IEEE ICASSP2022, Singapore.
- Xixin Wu, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng. Neural Architecture Search for Speech Emotion Recognition, IEEE ICASSP2022, Singapore.
- Naijun Zheng, Na Li, Jianwei Yu, Chao Weng, Dan Su, XunYing Liu, Helen Meng. MULTI-CHANNEL SPEAKER DIARIZATION USING SPATIAL FEATURES FOR MEETINGS, IEEE ICASSP2022, Singapore.
- Disong Wang, Songxiang Liu, Xixin Wu, Hui Lu, Lifa Sun, Xunying Liu, Helen Meng. SPEAKER IDENTITY PRESERVATION IN DYSARTHRIC SPEECH RECONSTRUCTION BY ADVERSARIAL SPEAKER ADAPTATION, IEEE ICASSP2022, Singapore.
- Disong Wang, Shan Yang, Dan Su, Xunying Liu, Dong Yu, Helen Meng. VCVTS: MULTI-SPEAKER VIDEO-TO-SPEECH SYNTHESIS VIA CROSS-MODAL KNOWLEDGE TRANSFER FROM VOICE CONVERSION, IEEE ICASSP2022, Singapore.
- Hang Su, Danyang Zhao, Long Dang, Minglei Li, Xixin Wu, Xunying Liu, Helen Meng. A MULTITASK LEARNING FRAMEWORK FOR SPEAKER CHANGE DETECTION WITH CONTENT INFORMATION FROM UNSUPERVISED SPEECH DECOMPOSITION, IEEE ICASSP2022, Singapore.
- Junhao Xu, Shoukang Hu, Xunying Liu and Helen Meng. Towards Green ASR: Lossless 4-bit Quantization of a Hybrid TDNN System on the 300-hr Swithboard Corpus, ISCA Interspeech2022, Incheon, Korea.
- Jiajun Deng, Xurong Xie, Tianzi Wang, Mingyu Cui, Boyang Xue, Zengrui Jin, Mengzhe Geng, Guinan Li, Xunying Liu and Helen Meng. Confidence Score Based Conformer Speaker Adaptation for Speech Recognition, ISCA Interspeech2022, Incheon, Korea.
- Mingyu Cui, Jiajun Deng, Shoukang Hu, Xurong Xie, Tianzi Wang, Shujie HU, Mengzhe Geng, Boyang Xue, Xunying Liu and Helen Meng. Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems, ISCA Interspeech2022, Incheon, Korea.
- Tianzi Wang, Jiajun Deng, Mengzhe Geng, Zi Ye, Shoukang Hu, Yi Wang, Mingyu Cui, Zengrui Jin, Xunying Liu and Helen Meng. Conformer Based Elderly Speech Recognition System for Alzheimer’s Disease Detection, ISCA Interspeech2022, Incheon, Korea.
- Yi Wang, Tianzi Wang, Zi Ye, Lingwei Meng, Shoukang Hu, Xixin Wu, Xunying Liu and Helen Meng. Exploring linguistic feature and model combination for speech recognition based automatic AD detection, ISCA Interspeech2022, Incheon, Korea.
- Jinchao Li, Shuai Wang, Yang Chao, Xunying Liu and Helen Meng. Context-aware Multimodal Fusion for Emotion Recognition, ISCA Interspeech2022, Incheon, Korea.
- Hui Lu, Disong Wang, Xixin Wu, Zhiyong Wu, Xunying Liu, Helen Meng. Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE, IEEE SLT2022, Doha, Qatar.
2021
- Sirui Xie, Shoukang Hu, Xinjiang Wang, Chunxiao Liu, Jianping Shi, Xunying Liu, Dahua Lin. Understanding the wiring evolution in differentiable neural architecture search, AISTATS2021, San Diego, California, USA.
- Shoukang Hu, Xurong Xie, Shansong Liu, Mingyu Cui, Mengzhe Geng, Xunying Liu, Helen Meng.
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks,
IEEE ICASSP2021, Toronto, Canada.
- Zi Ye, Shoukang Hu, Jinchao Li, Xurong Xie, Mengzhe Geng, Jianwei Yu, Junhao Xu, Boyang Xue, Shansong Liu, Xunying Liu, Helen Meng.
DEVELOPMENT OF THE CUHK ELDERLY SPEECH RECOGNITION SYSTEM FOR NEUROCOGNITIVE DISORDER DETECTION USING THE DEMENTIABANK CORPUS,
IEEE ICASSP2021, Toronto, Canada.
- Junhao Xu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng.
MIXED PRECISION QUANTIZATION OF TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION,
IEEE ICASSP2021, Toronto, Canada.
- Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng.
BAYESIAN TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION,
IEEE ICASSP2021, Toronto, Canada.
- Jinchao Li, Jianwei Yu, Ye Zi, Simon Wong, Manwai Mak, Brian Mak, Xunying Liu, Helen Meng.
A COMPARATIVE STUDY OF ACOUSTIC AND LINGUISTIC FEATURES CLASSIFICATION FOR ALZHEIMER’S DISEASE DETECTION,
IEEE ICASSP2021, Toronto, Canada.
- Naijun Zheng, Na Li, Bo Wu, Meng Yu, JianWei Yu, Chao Weng, Dan Su, Xunying Liu, Helen Meng.
A JOINT TRAINING FRAMEWORK OF MULTI-LOOK SEPARATOR AND SPEAKER EMBEDDING EXTRACTOR FOR OVERLAPPED SPEECH,
IEEE ICASSP2021, Toronto, Canada.
- Disong Wang, Liqun Deng, Yang Zhang, Nianzu Zheng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng.
FCL-TACO2: TOWARDS FAST, CONTROLLABLE AND LIGHTWEIGHT TEXT-TO-SPEECH SYNTHESIS,
IEEE ICASSP2021, Toronto, Canada.
- Xu Li, Na Li, Chao Weng, Xunying Liu, Dan Su, Dong Yu, Helen Meng.
REPLAY AND SYNTHETIC SPEECH DETECTION WITH RES2NET ARCHITECTURE,
IEEE ICASSP2021, Toronto, Canada.
- Jiajun Deng, Fabian Ritter Gutierrez, Shoukang Hu, Mengzhe Geng, Xurong Xie, Zi Ye, Shansong Liu, Jianwei Yu, Xunying Liu, Helen Meng. Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition, ISCA Interspeech2021, Brno, Czech Republic.
- Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng. Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition, ISCA Interspeech2021, Brno, Czech Republic.
- Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng. Adversarial Data Augmentation for Disordered Speech Recognition, ISCA Interspeech2021, Brno, Czech Republic.
- Xurong Xie, Rukiye Ruzi, Xunying Liu, Lan Wang. Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition, ISCA Interspeech2021, Brno, Czech Republic.
- Disong Wang, Liqun Deng, Yu Ting Yeung, Xiao Chen, Xunying Liu and Helen Meng. Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion, ISCA Interspeech2021, Brno, Czech Republic.
- Disong Wang, Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu and Helen Meng. Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion, ISCA Interspeech2021, Brno, Czech Republic.
- Disong Wang, Liqun Deng, Yu Ting Yeung, Xiao Chen, Xunying Liu and Helen Meng. Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization, ISCA Interspeech2021, Brno, Czech Republic.
- Xu Li, Xixin Wu, Hui Lu, Xunying Liu and Helen Meng. Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks, ISCA Interspeech2021, Brno, Czech Republic.
- Hui Lu, Zhiyong Wu, Xixin Wu, Xu Li, Shiyin Kang, Xunying Liu and Helen Meng. VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis, ISCA Interspeech2021, Brno, Czech Republic.
- Disong Wang, Jianwei Yu, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng.
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization,
ISCSLP2021, Hong Kong, China.
- Yuewen Cao, Songxiang Liu, Shiyin Kang, Na Hu, Peng Liu, Xunying Liu, Dan Su, Dong Yu, Helen Meng.
Exploring Cross-lingual Singing Voice Synthesis Using Speech Data,
ISCSLP2021, Hong Kong, China.
2020
- Shoukang Hu, Sirui Xie, Hehui Zheng, Chunxiao Liu, Jianping Shi, Xunying Liu, Dahua Lin.
DSNAS: Direct Neural Architecture Search without Parameter Retraining,
IEEE/CVF CVPR2020, Seattle WA, USA.
- Junhao Xu, Xie Chen, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng.
LOW-BIT QUANTIZATION OF RECURRENT NEURAL NETWORK LANGUAGE MODELS USING ALTERNATING DIRECTION METHODS OF MULTIPLIERS,
IEEE ICASSP2020, Barcelona, Spain.
- Jianwei Yu, Shixiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu.
AUDIO-VISUAL RECOGNITION OF OVERLAPPED SPEECH FOR THE LRS2 DATASET, IEEE Signal Processing Society Travel Grant Winner,
IEEE ICASSP2020, Barcelona, Spain.
- Disong Wang, Jianwei Yu, Xixin Wu, Songxiang Liu, Lifa Sun, Xunying Liu, Helen Meng.
END-TO-END VOICE CONVERSION VIA CROSS-MODAL KNOWLEDGE DISTILLATION FOR DYSARTHRIC SPEECH RECONSTRUCTION,
IEEE ICASSP2020, Barcelona, Spain.
- Xu Li, Jinghua Zhong, Xixin Wu, Jianwei Yu, Xunying Liu, Helen Meng.
ADVERSARIAL ATTACK ON GMM I-VECTOR BASED SPEAKER VERIFICATION SYSTEMS,
IEEE ICASSP2020, Barcelona, Spain.
- Songxiang Liu, Disong Wang, Yuewen Cao, Lifa Sun, Xixin Wu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng.
END-TO-END ACCENT CONVERSION WITHOUT USING NATIVE UTTERANCES,
IEEE ICASSP2020, Barcelona, Spain.
- Yuewen Cao, Songxiang Liu, Xixin Wu, Shiyin Kang, Peng Liu, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng.
Code-Switched Speech Synthesis Using Bilingual Phonetic Posteriorgram with Only Monolingual Corpora,
IEEE ICASSP2020, Barcelona, Spain.
- Xu Li, Jinghua Zhong, Jianwei Yu, Shoukang Wu, Xixin, Wu, Xunying Liu, Helen Meng.
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification,
ISCA SPLC-ODYSSEY2020, Tokyo, Japan.
- Jianwei Yu, Bo Wu, Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng. Audio-visual Multi-channel Recognition of Overlapped Speech, ISCA Interspeech2020, Shanghai, China.
- Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng. Investigation of Data Augmentation Techniques for Disordered Speech Recognition, ISCA Interspeech2020, Shanghai, China.
- Shansong Liu, Xurong Xie, Jianwei Yu, Shoukang Hu, Mengzhe Geng, Rongfeng Su, Shixiong Zhang, Xunying Liu, Helen Meng. Exploiting Cross Domain Visual Feature Generation for Disordered Speech Recognition, ISCA Interspeech2020, Shanghai, China.
- Xu Li, Na Li, Jinghua Zhong, Xixin Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng. Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification, ISCA Interspeech2020, Shanghai, China.
- Naijun Zheng, Xixin Wu, Jinghua Zhong, Xunying Liu, Helen Meng. Speaker-Aware Linear Discriminant Analysis in Speaker Verification, ISCA Interspeech2020, Shanghai, China.
- Songxiang Liu, Yuewen Cao, Shiyin Kang, Na Hu, Xunying Liu, Dan Su, Dong Yu, Helen Meng. Transferring Source Style in Non-Parallel Voice Conversion, ISCA Interspeech2020, Shanghai, China.
2019
- Max W. Y. Lam, Xie Chen, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng
GAUSSIAN PROCESS LSTM RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR SPEECH RECOGNITION, IEEE Signal Processing Society Travel Grant Winner, IEEE ICASSP2019, Brighton, UK.
- Xurong Xie, Xunying Liu, Tan Lee, Shoukang Hu, Lan Wang
BLHUC: BAYESIAN LEARNING OF HIDDEN UNIT CONTRIBUTIONS FOR DEEP NEURAL NETWORK SPEAKER ADAPTATION, Best Student Paper Award, IEEE Signal Processing Society Travel Grant Winner, IEEE ICASSP2019, Brighton, UK.
- Shoukang Hu, Max W. Y. Lam, Xurong Xie, Shansong Liu, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng
BAYESIAN AND GAUSSIAN PROCESS NEURAL NETWORKS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION,
IEEE ICASSP2019, Brighton, UK.
- Jianwei Yu, Max. W. Y. Lam, Xie Chen, Shoukang Hu, Songxiang Liu, Xixin Wu, Xunying Liu, Helen Meng
RECURRENT NEURAL NETWORK LANGUAGE MODEL TRAINING USING NATURAL GRADIENT,
IEEE ICASSP2019, Brighton, UK.
- Xixin Wu, Songxiang Liu, Yuewen Cao, Xu Li, Jianwei Yu, Dongyang Dai, Xi Ma, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng
Speech Emotion Recognition using Capsule Networks,
IEEE ICASSP2019, Brighton, UK.
- Wai-Kim Leung, Xunying Liu, Helen Meng
CNN-RNN-CTC BASED END-TO-END MISPRONUNCIATION DETECTION AND DIAGNOSIS,
IEEE ICASSP2019, Brighton, UK.
- Yuewen Cao, Xixin Wu, Songxiang Liu, Jianwei Yu, Xu Li, Zhiyong Wu, Xunying Liu, Helen Meng
END-TO-END CODE-SWITCHED TTS WITH MIX OF MONOLINGUAL RECORDINGS,
IEEE ICASSP2019, Brighton, UK.
- Shoukang Hu, Xurong Xie, Shansong Liu, Max W. Y. Lam, Jianwei Yu, Xixin Wu, Xunying Liu and Helen Meng. LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition, ISCA Yajie Miao Memorial Grant Winner, ISCA Interspeech2019, Graz, Austria.
- Xurong Xie, Xunying Liu, Tan Lee and Lan Wang. Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features, ISCA Interspeech2019, Graz, Austria.
- Jianwei Yu, Max W. Y. Lam, Shoukang Hu, Xixin Wu, Xu Li, Yuewen Cao, Xunying Liu and Helen Meng. Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models, ISCA Travel Grant Winner, ISCA Interspeech2019, Graz, Austria.
- Max W. Y. Lam, Jun Wang, Xunying Liu, Helen Meng, Dan Su and Dong Yu Extract, Adapt and Recognize: an End-to-end Neural Network for Corrupted Monaural Speech Recognitio, ISCA Interspeech2019, Graz, Austria.
- Shansong Liu, Shoukang Hu, Yi Wang, Jianwei Yu, Rongfeng Su, Xunying Liu and Helen Meng. Exploiting Visual Features using Bayesian Gated Neural Networks for Disordered Speech Recognition, ISCA Student Paper Award Nomination, ISCA Interspeech2019, Graz, Austria.
- Shansong Liu, Shoukang Hu, Xunying Liu and Helen Meng. On the Use of Pitch Features for Disordered Speech Recognition, ISCA Interspeech2019, Graz, Austria.
- Shoukang Hu, Shansong Liu, Hengfai Chang, Mengzhe Geng, Jiani Chen, Wingchung Lau, Kahei To, Jianwei Yu, Kaho Wong, Xunying Liu and Helen Meng. The CUHK Dysarthric Speech Recognition Systems for English and Cantonese, ISCA Interspeech2019, Graz, Austria.
- Songxiang Liu, Yuewen Cao, Xixin Wu, Lifa Sun, Xunying Liu and Helen Meng. Jointly Trained Conversion Model and WaveNet Vocoder for Non-parallel Voice Conversion using Mel-spectrograms and Phonetic Posteriorgrams, ISCA Interspeech2019, Graz, Austria.
- Hang Su, Borislav Dzodzo, Xixin Wu, Xunying Liu and Helen Meng. Unsupervised Methods for Audio Classification from Lecture Discussion Recordings, ISCA Interspeech2019, Graz, Austria.
- Xin Shen, Wai Lam, Xunying Liu and Piji Li. Compressive Multi-document Summarization with Sense-level Concepts, IJCAI2019 Workshop on Bringing Semantic Knowledge into Vision and Text Understanding, Macau, China.
2018
- Xurong Xie, Xunying Liu, Tan Lee, Lan Wang:
Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion,
IEEE ISCSLP 2018:36-40, Taipei, Taiwan.
- M. Lam, X. Liu, H. Meng and K. Tsoi
Drawing-Based Automatic Dementia Screening Using Gaussian Process Markov Chains,
Hawaii International Conference on System Sciences 2018, Honolulu, Hawaii, USA.
- S. Liu, L. Sun, X. Wu, X. Liu and H. Meng
The HCCL-CUHK System for the Voice Conversion Challenge 2018,
Odyssey 2018 The Speaker and Language Recognition Workshop, Les Sables d’Olonne, France.
- X. Liu, S. Liu, J. Sha, J. Yu, Z. Xu, X. Chen and H. Meng
Limited-memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition,
IEEE ICASSP2018, Calgary, Canada.
- X. Wu, L. Sun, S. Kang, S. Liu, Z. Wu, X. Liu and H. Meng
Feature Based Adaptation for Speaking Style Synthesis,
IEEE ICASSP2018, Calgary, Canada.
- S. Mao, X. Li, K. Li, Z. Wu, X. Liu and H. Meng
Unsupervised Discovery of An Extended Phoneme Set in L2 English Speech for Mispronunciation Detection and Diagnosis,
IEEE ICASSP2018, Calgary, Canada.
- J. Yu, X. Xie, S. Hu, S. Liu, M. Lam, X. Wu, K. H. Wong, X. Liu and H. Meng
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus,
ISCA Interspeech2018, Hyderabad, India.
- M. Lam, S. Hu, X. Xie, S. Liu, J. Yu, R. Su, X. Liu and Helen Meng.
Gaussian Process Neural Networks for Speech Recognition,
ISCA Interspeech2018, Hyderabad, India.
- X. Wu, Y. Cao, M. Wang, S. Liu, S. Kang, Z. Wu, X. Liu, D. Su, D. Yu and H. Meng
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis,
ISCA Interspeech2018, Hyderabad, India.
- R. Su, X. Liu and L. Wang
Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription,
ISCA Interspeech2018, Hyderabad, India.
- X. Li, S. Mao, X. Wu, K. Li, X. Liu and H. Meng
Unsupervised discovery of non-native pronunciation patterns in L2 English speech for mispronunciation detection and diagnosis,
ISCA Interspeech2018, Hyderabad, India.
- S. Liu, J. Zhong, L. Sun, X. Wu, X. Liu and H. Meng
Voice Conversion Across Arbitrary Speakers based on a Single Target-Speaker Utterance,
ISCA Interspeech2018, Hyderabad, India.
2017
- R. Li, Z. Wu, X. Liu, H. Meng and L. Cai.
Multi-task Learning Of Structured Output Layer Bidirectional LSTMS For Speech Synthesis,
IEEE ICASSP2017, New Orleans, Louisiana, USA.
- X. Chen, A. Ragni, J. Vasilakes, X. Liu, K. Knill and M.J.F. Gales.
Recurrent Neural Network Language Models For Keyword Search,
IEEE ICASSP2017, New Orleans, Louisiana, USA.
- X. Xie, X. Liu, T. Lee and L. Wang.
RNN-LDA Clustering for Feature Based DNN Adaptation,
ISCA Interspeech2017, Stockholm, Sweden.
- X. Chen, A. Rangi, X. Liu and M.J.F. Gales.
Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition,
ISCA Interspeech2017, Stockholm, Sweden.
- X. Chen, X. Liu, A. Rangi, Y. Wang and M.J.F. Gales.
Future Word Contexts in Neural Network Language Models,
IEEE ASRU2017, Okinawa, Japan.
- R. Su, L. Wang and X. Liu.
Multimodal learning using 3D audio-visual data for audio-visual speech recognition,
IALP2017, Singapore.
2016
- X. Chen, X. Liu, Y. Qian, M. J. F. Gales and P. C. Woodland.
CUED-RNNLM - An Open-Source Toolkit for Efficient Training and Evaluation of Recurrent Neural Network Language Models,
IEEE ICASSP2016, Shanghai, China.
- L. Wang, C. Zhang, P. C. Woodland, M. J. F. Gales, X. Liu and Y. Qian.
Improved DNN Based Segmentation for Multi-genre Broadcast Audio,
IEEE ICASSP2016, Shanghai, China.
- P. Lanchantin, M. J. F. Gales, P. Karanasou, X. Liu, Y. Qian, L. Wang, P. C. Woodland and C. Zhang
Selection of Multi-Genre Broadcast Data for the Training of Automatic Speech Recognition Systems,
ISCA Interspeech2016, San Francisco, California, USA.
- X. Xie, X. Liu and L. Wang
Deep Neural Network Based Acoustic-to-articulatory Inversion Using Phone Sequence Information,
ISCA Interspeech2016, San Francisco, California, USA.
2015
- X. Liu, X. Chen, M. J. F. Gales and P. C. Woodland.
Paraphrastic Recurrent Neural Network Language Models
,
IEEE ICASSP2015, Brisbane, Australia.
- X. Chen, X. Liu, M. J. F. Gales and P. C. Woodland.
Recurrent Neural Network Language Model Training with Noise Contrastive Estimation for Speech Recognition
,
IEEE ICASSP2015, Brisbane, Australia.
- X. Chen, X. Liu, M. J. F. Gales and P. C. Woodland.
Improving the Training and Evaluation Efficiency of Recurrent Neural Network Language Models
,
IEEE ICASSP2015, Brisbane, Australia.
- X. Liu, F. Flego, L. Wang, C. Zhang, M. J. F. Gales and P. C. Woodland.
The Cambridge University 2014 BOLT Conversational Telephone Mandarin Chinese LVCSR System for Speech Translation
,
ISCA Interspeech2015, Dresden, Germany.
- X. Chen, T. Tan, X. Liu, P. Lanchantin, M. J. F. Gales and P. C. Woodland.
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition
,
ISCA Interspeech2015, Dresden, Germany.
- X. Xie, X. Liu, L. Wang and R. Su
Generalized Variable Parameter HMMs Based Acoustic-to-articulatory Inversion
,
ISCA Interspeech2015, Dresden, Germany.
- R. Su, X. Xie, X. Liu and L. Wang
Efficient Use of DNN Bottleneck Features in Generalized Variable Parameter HMMs for Noise Robust Speech Recognition
,
ISCA Interspeech2015, Dresden, Germany.
- X. Chen, X. Liu, M. J. F. Gales and P. C. Woodland.
Investigation of bac-off based interpolation between Recurrent Neural Network and N-Gram Language Models,
IEEE ASRU2015, Scottsdale, Arizona, USA.
- P. C. Woodland, X. Liu, Y. Qian, C. Zhang, P. Karanasou, P. Lanchantin, L. Wang and M. J. F. Gales.
Cambridge University Transcription Systems for the Multi-Genre Broadcast Challenge,
IEEE ASRU2015, Scottsdale, Arizona, USA.
- P. Lanchantin, M. J. F. Gales, P. Karanasou, X. Liu, Y. Qian, L. Wang, P. C. Woodland and C. Zhang
The Development of the Cambridge University Alignment Systems for the Multi-Genre Broadcast Challenge,
IEEE ASRU2015, Scottsdale, Arizona, USA.
- P. Karanasou, M. J. F. Gales, P. Lanchantin, X. Liu, Y. Qian, L. Wang, P. C. Woodland and C. Zhang
Speaker Diarisation and Longotudinal Linking in Multi-Genre Broadcast Data,
IEEE ASRU2015, Scottsdale, Arizona, USA.
- P. Bell, M. J. F. Gales, T. Hain, J. Kilgour, P. Lanchantin, X. Liu, A. McParland, S. Renals, O. Saz, M. Wester, P. C. Woodland
The MGB Challenge: Evaluating Multi-Genre Broadcast Media Recognition,
IEEE ASRU2015, Scottsdale, Arizona, USA.
2014
- X. Liu, M. J. F. Gales and P. C. Woodland.
Paraphrastic Neural Network Language Models,
IEEE ICASSP2014, Florence, Italy.
- X. Liu, Y. Wang, X. Chen, M. J. F. Gales and P. C. Woodland.
Efficient Lattice Rescoring Using Recurrent Neural Network
Language Models, Paper Award Nomination,
IEEE ICASSP2014, Florence, Italy.
- X. Chen, Y. Wang, X. Liu, M. J. F. Gales and P. C. Woodland.
Efficient GPU-based Training of Recurrent Neural
Network Language Models Using Spliced Sentence Bunch,
ISCA Interspeech2014, Singapore.
- X. Xie, R. Su, X. Liu and L. Wang.
Deep Neural Network Bottleneck Features For Generalized Variable Parameter HMMs,
ISCA Interspeech2014, Singapore.
2013
- X. Liu, M. J. F. Gales and P. C. Woodland.
Paraphrastic Language Models and Combination with Neural
Network Language Models,
IEEE ICASSP2013, Vancouver.
- X. Liu, M. J. F. Gales and P. C. Woodland.
Cross-domain Paraphrasing For Improving Language Modelling
Using Out-of-domain Data,
ISCA Interspeech2013, Lyon.
- Y. Li, X. Liu and L. Wang.
Feature Space Generalized Variable Parameter HMMs for Noise
Robust Recognition,
ISCA Interspeech2013, Lyon.
- Y. Long, M.J.F. Gales, P. Lanchantin, X. Liu,
M.S. Seigel and P.C. Woodland.
Improving Lightly Supervised Training for Broadcast Transcription,
ISCA Interspeech2013, Lyon.
- P. Lanchantin, P. Bell, M. J. F. Gales, T. Hain, X.
Liu, Y. Long, J. Quinnell, S. Renals, O. Saz,
M. S. Seigel, P. Swietojanski and P. C. Woodland.
Automatic Transcription of Multi-genre Media Archives,
SLAM@INTERSPEECH 2013: 26-31.
- R. Su, X. Liu and L. Wang.
Automatic Model Complexity Control for Generalized Variable Parameter HMMs,
IEEE ASRU2013, Olomouc.
2012
- X. Liu, M. J. F. Gales and P. C. Woodland.
Paraphrastic Language Models,
ISCA Interspeech2012, Portland, Oregon.
- P. Bell, M. Gales, P. Lanchantin, X. Liu,
Y. Long, S. Renals, P. Swietojanski and P. Woodland.
Transcription of Multi-genre Media Archives Using
Out-of-domain Data,
IEEE SLT2012, Miami, Florida.
- Y. Li, X. Liu and L. Wang.
Structured Modelling Based on Generalized Variable Parameter HMMs,
ISCSLP2012, Hong Kong.
2011
- X. Liu, M. J. F. Gales, J. L. Hieronymus and P. C. Woodland.
Investigation of Acoustic Units for LVCSR Systems,
IEEE ICASSP2011, Prague.
- X. Liu, M. J. F. Gales and P. C. Woodland.
Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation,
ISCA Interspeech2011, Florence, Italy.
- F. Diehl, M. J. F. Gales, X. Liu, M. Tomalin and P. C. Woodland.
Word Boundary Modelling and Full Covariance Gaussians for
Arabic Speech-to-Text Systems,
ISCA Interspeech2011, Florence, Italy.
- N. Cheng, X. Liu and L. Wang
Generalized Variable Parameter HMMs for Noise Robust Speech Recognition,
ISCA Interspeech2011, Florence, Italy.
2006-2010
- R. Sinha, M. J. F. Gales, D. Y. Kim, X. Liu, K. C. Kim
and P. C. Woodland. The CU-HTK Mandarin
Broadcast News Transcription System. IEEE ICASSP2006, Toulouse.
- M.J.F. Gales , X. Liu , R. Sinha , P.C. Woodland ,
K. Yu , S. Matsoukas , T. Ng , K. Nguyen , L. Nguyen , J-L
Gauvain , L. Lamel and A. Messaoudi.
Speech Recognition System Combination For Machine Translation. IEEE
ICASSP2007, Hawaii.
- M. Tomalin, M.J.F. Gales, X. A. Liu, K.C. Sim, R. Sinha,
L. Wang, P.C. Woodland and K. Yu.
Improving Speech Transcription for Mandarin-English Translation. IEEE
ICASSP2007, Hawaii.
- X. Liu, W. J. Byrne, M. J. F. Gales, A. de Gispert,
M. Tomalin, P. C. Woodland and K. Yu.
Discriminative Language Model Adaptation for Mandarin
Broadcast Speech Transcription and Translation,
IEEE ASRU2007, Kyoto.
- X. Liu, M. J. F. Gales and P. C. Woodland.
Context Dependent Language Model Adaptation, ISCA InterSpeech2008,
Brisbane.
- X. Liu, M. J. F. Gales and P. C. Woodland.
Use of Contexts in Language Model Interpolation and Adaptation,
ISCA Interspeech2009, Brighton.
- J. L. Hieronymus, X. Liu, M. J. F. Gales and P. C. Woodland.
Exploiting Chinese Character Models to Improve Speech
Recognition Performance, ISCA Interspeech2009, Brighton.
- X. Liu, M. J. F. Gales, J. L. Hieronymus and P. C. Woodland.
Language Model Combination and Adaptation Using Weighted Finite State Transducers,
IEEE ICASSP2010, Dallas.
- X. Liu, M. J. F. Gales and P. C. Woodland.
Language Model Cross Adaptation For LVCSR System Combination,
Best Paper Award, ISCA Interspeech2010, Makuhari, Japan.
- J. Park, X. Liu, M. J. F. Gales and P. C. Woodland.
Improved Neural Network Based Language Modelling and Adaptation,
ISCA Interspeech2010, Makuhari, Japan.
2003-2006
- X. Liu, M. J .F. Gales and P. C. Woodland. Automatic Complexity
Control for HLDA Systems. IEEE ICASSP2003, Hong Kong.
- X. Liu and M. J. F. Gales. Automatic Model
Complexity Control Using Marginalized Discriminative Growth Functions.
IEEE ASRU 2003, St. Thomas, U.S. Virgine Islands.
- X. Liu and M. J .F. Gales. Model Complexity
Control and Compression Using Discriminative Growth Functions. IEEE
ICASSP2004, Montreal.
- G. Evermann, H. Y. Chan, M. J. F. Gales, T. Hain, X. Liu,
D. Mrva, L. Wang and P. C. Woodland.
Development of
the 2003 CU-HTK Conversational Telephone Speech Transcription System.
IEEE ICASSP2004, Montreal.
- X. Liu, M. J. F. Gales, K. C. Sim and K. Yu. Investigation
of Acoustic Modeling Techniques for LVCSR Systems. IEEE
ICASSP2005, Philadelphia.
- M. J. F. Gales, B. Jia, X. Liu, K. C. Sim, P. C.
Woodland and K. Yu. Development of
the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription
System. IEEE ICASSP2005, Philadelphia.
Technical Reports:
- T. Hain, P. C. Woodland, G. Evermann, X. Liu, G. L.
Moore, D. Povey and L. Wang, Automatic Transcription
of Conversational Telephone Speech. Development of the CU-HTK
2002 System. Cambridge University Engineering Department
Technical Report, CUED/F-INFENG/TR-465. December 2003.
- X. Liu and M. J. F. Gales, Discriminative
Training of Multiple Subspace Projections for Large Vocabulary
Speech Recognition. Cambridge University Engineering
Department Technical Report, CUED/F-INFENG/TR-489. August 2004.
- X. Liu and M. J. F. Gales, Automatic Model
Complexity Control Using Marginalized Discriminative Growth
Functions. Cambridge University Engineering Department
Technical Report, CUED/F-INFENG/TR-490. August 2004.
- X. Liu, W. J. Byrne, M. J. F. Gales and P. C. Woodland et al,
Discriminative Language Model Adaptation for Mandarin
Broadcast Speech Transcription and Translation.
Cambridge University Engineering Department Technical
Report,CUED/F-INFENG/TR-586. Sept. 2007.
- X. Liu, M. J. F. Gales and P. C. Woodland, Use of Contexts in Language
Model Interpolation and Adaptation. Cambridge University Engineering Department
Technical Report, CUED/F-INFENG/TR-630. Februray 2009.
Workshop Papers, Presentations and Invited Talks:
- P. C. Woodland, G. Evermann, M. J. F. Gales, T. Hain, X.
Liu, G. Moore, D. Povey and L. Wang (2002) CU-HTK APRIL 2002
SWITCHBOARD SYSTEM. Rich Transcription Workshop, 2002, Vienna,
Virginia, U.S.
- X. Liu, M. J. F. Gales and P. C. Woodland (2003) Automatic Complexity Control
for HLDA Systems. Hong Kong, ICASSP 2003.
- X. Liu and M. J. F. Gales (2003) Automatic Model
Complexity Control Using Marginalized Discriminative Growth Functions.
DARPA EARS STT technical meeting, Martigny, Switzerland, September,
2003.
- X. Liu and M. J. F. Gales
(2004)
Discriminative Model Complexity
Control. Presentation to DARPA and GCHQ EARS Site Visit, August, 2004.
- M. J. F. Gales, X. Liu, K. C. Sim and K. Yu (2004). Investigation of
Acoustic Modeling Techniques for LVCSR Systems. DARPA EARS Rich
Transcription Workshop 2004.
- M. J. F. Gales, B. Jia, X. Liu, K. C. Sim, P. C.
Woodland and K. Yu (2004). Development of
the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription
System. DARPA EARS Rich Transcription Workshop 2004.
- X. Liu, M. J. F. Gales and P. C. Woodland
(2007). HTK Large
Vocabulary Decoder and Discriminative Training Tools. Part of
presentation on new features of HTK version 3.4, the HTK Meeting,
Hawaii, 2007.
- X. Liu, W. J. Byrne, M. J. F. Gales, P. C. Woodland (2007).
Investigation of System Combination, Improved Minimum Bayes Risk
Decoding, Discriminative Language Model Interpolation and
Adaptation for Machine Translation. Part of joint AGILE team
presentation to DARPA's GALE Site Visit, August, 2007.
- A. de Gispert, X. Liu, W. J. Byrne, M. J. F. Gales and
P. C. Woodland (2008)
Broadcast Speech Transcription and
Translation. Poster at Horizon Seminar: The Thinking Machine?,
Emmanuel College, Cambridge, Mar 2008.
- P. C. Woodland, X. Liu, M. J. F. Gales, K. Yu, T. Ng and
L. Nguyen et al. (2008).
Development of AGILE Chinese STT Systems. AGILE
team presentation to DARPA's GALE Principal Invigilator Meeting,
April, 2008.
- X. Liu, M. J. F. Gales and P. C. Woodland (2008).
Context
Dependent Language Model Interpolation and Adaptation. Presentation
at ISCA Interspeech2008; invited talk at the Chinese University
of Hong Kong and Shenzhen Institute of Advanced technology, Chinese Academy
of Sciences, September, 2008.
- X. Liu, F. Diehl, M. Tomalin M. J. F. Gales and P. C. Woodland
(2009). AGILE Speech to
Text (STT), part of AGILE team presentation to DARPA's GALE Principal
Invigilator Meeting, May, 2009.
|