删除或更新信息,请邮件至freekaoyan#163.com(#换成@)

香港科技大学工学院老师教师导师介绍简介-Brian Kan Wing MAK

本站小编 Free考研考试/2022-01-30

Brian Kan Wing MAK
麥鑑榮
PhD in Computer Science
Oregon Graduate Institute of Science and Technology, 1998

Associate Professor
Department of Computer Science and Engineering



(852) 2358 7012
bmak@ust.hk
Room 3513
Personal Web

Google Scholar
Zx8p8RsAAAAJ

ORCID
0000-0001-6787-5555

ResearcherID
E-5870-2012

Scopus ID
7003925556




Research Interest Publications Projects Teaching Assignment RPG Supervision Space used




Research Interest
Artificial intelligence
Speech recognition and synthesis
Speaker recognition and verification
Sign language recognition and generation
Natural language processing



Publications
All Years 109 2022 0 2021 3 2020 4 2019 2 2018 10 2017 6 2016 84





2021 3

A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer’s Disease Detection
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June, June 2021, p. 6423-6427
Li, Jinchao; Yu, Jianwei; Ye, Zi; Wong, Simon; Mak, Manwai; Mak, Brian Kan Wing; Liu, Xunying; Meng, Helen Conference paper
Non-Parallel Many-To-Many Voice Conversion by Knowledge Transfer from a Text-To-Speech Model
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June, June 2021, p. 5924-5928
Yu, Xinyuan; Mak, Brian Kan Wing Conference paper
On-the-fly Data Augmentation for Text-to-speech Style Transfer
IEEE Automatic Speech Recognition and Understanding Workshop, Cartagena, Colombia, 13-17 December 2021
Chung, Man Hon; Mak, Brian Kan Wing Conference paper

2020 4

Multi-lingual multi-speaker text-to-speech synthesis for voice cloning with online speaker enrollment
Proceedings of the Annual Conference of the International Speech Communication Association, v. 2020, October 2020, p. 2932-2936
Liu, Zhaoyu; Mak, Brian Kan Wing Conference paper
Orthogonal Training for Text-independent Speaker Verification
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2020-May, May 2020, article number 9053198, p. 6584-6588
Zhu, Yingke; Mak, Brian Kan Wing Conference paper
Orthogonality Regularizations for End-to-End Speaker Verification
Proceeding of Odyssey 2020 The Speaker and Language Recognition Workshop / ISCA. ISCA, 2020, p. 17-23
Zhu, Yingke; Mak, Brian Kan Wing Conference paper
Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 12361 LNCS, 2020, p. 172-186
Niu, Zhe; Mak, Brian Kan Wing Conference paper

2019 2

Mixup Learning Strategies for Text-independent Speaker Verification
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September, 2019, p. 4345-4349
Zhu, Yingke; Ko, Tom; Mak, Brian Kan Wing Conference paper
Recurrent Poisson process unit for speech recognition
Proceedings of the AAAI Conference on Artificial Intelligence, v. 33, (1), 2019, p. 6538-6545
Huang, Hengguan; Wang, Hao; Mak, Brian Kan Wing Conference paper

2018 10

Denoised Senone I-Vectors for Robust Speaker Verification
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v. 26, (4), April 2018, p. 820-830
Tan, Zhili; Mak, Man-Wai; Mak, Brian Kan Wing; Zhu, Yingke Article
DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.26, (4), April 2018, p. 700-712
Tan, Zhili; Mak, Man-Wai; Mak, Brian Kan Wing Article
Domain adaptation of end-to-end speech recognition in low-resource settings
2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings, February 2019, article number 8639506, p. 382-388
Samarakoon, Lahiru; Mak, Brian Kan Wing; Lam, Albert Y.S. Conference paper
End-to-End Low-Resource Lip-Reading with Maxout CNN and LSTM
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018-April, 10 September 2018, article number 8462280, p. 2511-2515
Fung, Ho Long; Mak, Brian Kan Wing Conference paper
Fast Derivation of Cross-lingual Document Vectors from Self-attentive Neural Machine Translation Model
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September, 2018, p. 107-111
Li, Wei; Mak, Brian Kan Wing Conference paper
Learning Effective Factorized Hidden Layer Bases Using Student-Teacher Training for LSTM Acoustic Model Adaptation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018-April, September 2018, article number 8462112, p. 5954-5958
Mak, Brian Kan Wing; Samarakoon, Lahiru Thilina; Sim, Khe Chai Conference paper
Multi-Head Attention for End-to-End Neural Machine Translation
018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, May 2019, article number 8706667, p. 250-254
Fung, Ho Long; Mak, Brian Kan Wing Conference paper
Self-attentive Speaker Embeddings for Text-independent Speaker Verification
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September, 2018, p. 3573-3577
Zhu, Yingke; Ko, Tom; Snyder, David; Mak, Brian Kan Wing; Povey, Daniel Conference paper
Subspace Based Sequence Discriminative Training of LSTM Acoustic Models With Feed-forward Layers
2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, May 2019, article number 8706623, p. 136-140
Samarakoon, Lahiru; Mak, Brian Kan Wing; Lam, Albert Y.S. Conference paper
WaveNet MH-SRU: Deep and Wide Multiple-history Simple Recurrent Unit for Speech Recognition
2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, May 2019, article number 8706666, p. 141-145
Huang, Hengguang; Mak, Brian Kan Wing Conference paper

2017 6

An Investigation Into Learning Effective Speaker Subspaces for Robust Unsupervised DNN Adaptation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, 2017, p. 5035-5039, Article number 7953115
Samarakoon, Lahiru Thilina; Sim, Khe Chai; Mak, Brian K W Conference paper
Derivation of Document Vectors from Adaptation of LSTM Language Model
15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, v. 2, 2017, p. 456-461
Li, Wei; Mak, Brian K. W. Conference paper
Learning factorized transforms for unsupervised adaptation of LSTM-RNN acoustic models
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2017, p. 744-748
Samarakoon, Lahiru Thilina; Mak, Brian K W; Sim, Khe Chai Conference paper
Speeding Up Softmax Computations in DNN-Based Large Vocabulary Speech Recognition by Senone Weight Vector Selection
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, 2017, p. 5335-5339, Article number 7953175
Zhu, Yingke; Mak, Brian K W Conference paper
To Improve the Robustness of LSTM-RNN Acoustic Models Using Higher-order Feedback From Multiple Histories
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, International Speech Communication Association, 2017, p. 3862-3866
Huang, Hengguan; Mak, Brian Conference paper
Unsupervised Adaptation of Student DNNs Learned from Teacher RNNs for Improved ASR Performance
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017), v. 2017, December 2017, p. 200-205
Samarakoon, Lahiru Thilina; Mak, Brian Kan Wing Conference paper

2016 2

An Investigation of Adaptation Techniques for Building Acoustic Models for Hearing-impaired Children in a CAPT Application
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, October 2016, article number 7918449
Zhu, Yingke; Mak, Brian Kan Wing Conference paper
Senone I-Vectors for Robust Speaker Verification
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, October 2016, article number 7918462
Tan, Zhili; Zhu, Yingke; Mak, Man-Wai; Mak, Brian K W Conference paper

2015 2

Multitask Learning of Deep Neural Networks for Low-Resource Speech Recognition
IEEE Transactions on Audio, Speech and Language Processing, v. 23, (7), July 2015, article number 7084614, p. 1172-1183
Chen, Dongpeng; Mak, Brian Kan-Wing Article
Distinct Triphone Acoustic Modeling Using Deep Neural Networks
16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015): Speech Beyond Speech Towards a Better Understanding of the Most Important Biosignal, International Speech Communication Association (ISCA), 2015, p. 2645-2649
Chen, Dongpeng; Mak, Brian Kan Wing Conference paper

2014 5

Eigentrigraphemes for under-resourced languages
Speech Communication, v. 56, (1), January 2014, p. 132-141
Ko, Tom Yu Ting; Mak, Brian Kan Wing Article
Joint Acoustic Modeling of Triphones and Trigraphemes by Multi-Task Learning Deep Neural Networks for Low-Resource Speech Recognition
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), v. 2014, 2014, article number 6854673, p. 5592-5596
Chen, Dongpeng; Mak, Brian Kan Wing; Leung, Cheung-Chi; Sivadas, Sunil Conference paper
Joint Sequence Training of Phone and Grapheme Acoustic Model Based on Multi-task Learning Deep Neural Networks
Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech, v. 1-4, 2014, p. 1083-1087
Chen, Dongpeng; Mak, Brian; Sunil, Sivadas Conference paper
Modeling Inter-cluster and Intra-cluster Discrimination Among Triphones
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, October 2014, article number 6936683, p. 103-107
Ko, Tom; Mak, Brian; Chen, Dongpeng Conference paper
Subspace Gaussian Mixture Model with State-dependent Subspace Dimensions
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), Institute of Electrical and Electronics Engineers (IEEE), 2014, p. 1725-1729
Ko, Yu Ting; Mak, Brian Kan Wing; Leung, Cheung-Chi Conference paper

2013 2

Eigentriphones for Context-Dependent Acoustic Modeling
IEEE Transactions on Audio, Speech, and Language Processing, v. 21, (6), June 2013, p. 1285-1294
Ko, Tom; Mak, Brian Article
Distinct Triphone Modeling by Reference Model Weighting
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, October 2013, article number 6639050, p. 7150-7154
Chen, Dongpeng; Mak, Brian K W Conference paper

2012 4

Derivation of eigentriphones by weighted principal component analysis
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, article number 6288819, p. 4097-4100
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper
Speaker-ensemble hidden Markov modeling for automatic speech recognition
2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, 2012, article number 6423532, p. 9-10
Ye, Guoli; Mak, Brian Conference paper
Subspace high-density discrete hidden Markov model for automatic speech recognition
European Signal Processing Conference, 2012, article number 6334110, p. 1643-1647
Ye, Guoli; Mak, Brian Conference paper
Transition Probabilities Are More Important Than We Once Thought
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, article number 6288995, p. 4809-4812
Ye, Guoli; Chen, Dongpeng; Mak, Brian Kan Wing Conference paper

2011 2

A Fully Automated Derivation of State-based Eigentriphones for Triphone Modeling with No Tied States Using Regularization
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2011, 2011, p. 781-784
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper
Eigentriphones: A basis for context-dependent acoustic modeling
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2011, article number 5947452, p. 4892-4895
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper

2010 4

Improving speech recognition by explicit modeling of phone deletions
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, USA, 2010, p. 4858-4861
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper
Problems of modeling phone deletion in conversational speech for speech recognition
2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings, Taiwan, 2010, p. 114-118
Mak, Brian Kan Wing; Ko, Yu Ting Conference paper
Subvector-quantized high-density discrete hidden Markov model and its re-estimation
Proceedings of 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010, November 2010, p. 109-113
Ye, G.; Mak, B. Conference paper
The Use of Subvector Quantization and Discrete Densities for Fast GMM Computation for Speaker Verification
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4, 2010, p. 1481-1484
Ye, Guoli; Mak, Brian Conference paper

2009 3

Maximum Penalized Likelihood Kernel Regression for Fast Adaptation
IEEE Transactions on Audio, Speech and Language Processing, v. 17, (7), September 2009, article number 5165120, p. 1372-1381
Mak, Brian Kan-Wing; Lai, Tsz-Chung; Tsang, Ivor W.; Kwok, James Tin-Yau Article
Automatic estimation of decoding parameters using large-margin iterative linear programming
Proceedings of the 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, 2009, p. 1207-1210
Mak, Brian Kan Wing; Ko, Yu Ting Conference paper
Fast GMM Computation for Speaker Verification Using Scalar Quantization and Discrete Densities
Interspeech 2009: 10th Annual Conference of the International Speech Communication Association 2009, Vols 1-5, 2009, p. 2291-2294
Ye, Guoli; Mak, Brian; Mak, Man-Wai Conference paper

2008 3

Discriminative training by iterative linear programming optimization
2008 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 2008, Mar-Apr, p. 4061-4064
Mak, B.; Ng, B. Conference paper
Min-max Discriminative Training of Decoding Parameters Using Iterative Linear Programming
Proceedings of the 9th Annual Conference of the International Speech Communication Association, INTERSPEECH 2008, Brisbane, Australia, 22-26 September 2008, 2008, p. 915-918
Mak, Brian Kan Wing; Ko, Tom Conference paper
Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions
INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association, 2008, p. 1897-1900
Huang, Chienlin; Ma, Bin; Wu, Chung-Hsien; Mak, Brian; Li, Haizhou Conference paper

2007 5

Kernel eigenspace-based MLLR adaptation
IEEE Transactions on Audio, Speech and Language Processing, v. 15, (3), March 2007, article number 4100690, p. 784-795
Mak, Brian Kan-Wing; Hsiao, Roger Wend-Huu Article
A Model-based estimation of phonotactic language verification performance
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 3, 2007, p. 1521-1524
Wong, K.K.; Siu, M.H.; Mak, B. Conference paper
A Model-based Estimation of Phonotactic Language Verification Performance
Proceedings of Interspeech, pages 186-189, Aug, 2007, Antwerp, Belgium
Wong, Ka Keung; Siu, Manhung; Mak, Brian Conference paper
Boosting with anti-models for automatic language identification
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 3, 2007, p. 1537-1540
Yang, X.; Siu, M.H.; Gish, H.; Mak, B. Conference paper
Robustness of several kernel-based fast adaptation methods on noisy LVCSR
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 1, 2007, p. 445-448
Mak, B.; Hsiao, R. Conference paper

2006 8

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting
IEEE Transactions on Audio, Speech, and Language Processing, v. 14, (4), July 2006
Mak, Brian Kan Wing; Hsiao, Roger; Ho, Simon; Kwok, James Tin-Yau Article
Joint optimization of the frequency-domain and time-domain transformations in deriving generalized static and dynamic MFCCs
IEEE signal processing letters, v. 13, (11), November 2006, p. 707-710
Lai, Yiu-Pong; Siu, Manhung; Mak, Brian Article
Minimization of utterance verification error rate as a constrained optimization problem
IEEE Signal Processing Letters, v. 13, (12), December 2006, p. 760-763
Siu, Man-Hung; Mak, Brian; Au, Wing-Hei Article
A comparison of various adaptation methods for speaker verification with limited enrollment data
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1, 2006
Mak, M.W.; Hsiao, R.; Mak, B. Conference paper
Automatic Audio Indexing and Audio Playback Speed Control as Tools for Language Learning
Lecture Notes in Computer Science, v. 4181, 2006, p. 290-299
Rossiter, D.; Lam, G.; Mak, B. Conference paper
Fast speaker adaption via maximum penalized likelihood kernel regression
2006 IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, ICASSP 2006; Toulouse; France, Volume 1, 2006, Article number 1660191, Pages I997-I1000
Tsang, Ivor W.; Kwok, James Tin-Yau; Mak, Brian; Zhang, Kai; Pan, Jeffrey Junfeng Conference paper
Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, p. 229-232
Mak, Brian; Lai, Tsz-Chung; Hsiao, Roger Conference paper
Unsupervised speaker adaptation using reference speaker weighting
Lecture Notes in Computer Science, v. 4274, 2006, p. 380-389
Lai, T.C.; Mak, B. Conference paper

2005 7

Kernel eigenvoice speaker adaptation
IEEE Transactions on Speech and Audio Processing, v. 13, (5), September 2005, p. 984-992
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Article
Passenger route guidance system for multi-modal transit networks
Journal of Advanced Transportation, v. 39, (3), 2005, p. 271-288
Lo, Hong Kam; Yip, Chun Wing; Mak, Brian Kan Wing Article
Pruning hidden Markov models with optimal brain surgeon
IEEE Transactions on Speech and Audio Processing, v. 13, (5), September 2005, p. 993-1003
Mak, Brian Kan Wing; Chan, Kin Wah Article
A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition
Proceedings 9th European Conference on Speech Communication and Technology, Interspeech 2005-Eurospeech, Lisbon, Portugal, 4-8 September 2005, p. 1797-1800
Hsiao, R.; Mak, B. Conference paper
High-density discrete HMM with the use of scalar quantization indexing
9th European Conference on Speech Communication and Technology, 2005, p. 2121-2124
Mak, B.; Yeung, S.K.A.; Lai, Y.P.; Siu, M. Conference paper
Kernel Eigenspace-based MLLR adaptation using multiple regression classes
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, p. 985-988
Hsiao, R.; Mak, B. Conference paper
Various reference speakers determination methods for embedded kernel Eigenvoice speaker adaptation
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, p. 981-984
Mak, B.; Ho, S. Conference paper

2004 8

An acoustic-phonetic and a model-theoretic analysis of subspace distribution clustering hidden Markov models
International Journal of Speech Technology, v. 7, (1), 2004, p. 55-68
Mak, Brian Kan Wing Article
Discriminative auditory-based features for robust speech recognition
IEEE Transactions on Speech and Audio Processing, v. 12, (1), January 2004, p. 27-36
Mak, Brian Kan Wing; Tam, YC; Li, PQ Article
Discriminative feature transformation by guided discriminative training
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, p. 897-900
Hsiao, R.; Mak, B. Conference paper
Eigenvoice speaker adaptation via composite kernel PCA
Advances in neural information processing systems, v. 16, 2004, p. 1401-1408
Kwok, James Tin-Yau; Mak, Brian Kan Wing; Ho, Simon Ka-Lung Conference paper
Improving Eigenspace-based MLLR Adaptation by Kernel PCA
Proceedings of the International Conference on Spoken Language Processing, Jeju Island, South Korea, October 4-8, 2004, volume I, pages 13-16,
Mak, Brian; Hsiao, Roger Conference paper
Speedup of Kernel Eigenvoice Speaker Adaptation by Embedded Kernel PCA
Proceedings of the International Conference on Spoken Language Processing, , Jeju Island, South Korea, volume IV, pages 2913-2916,
Mak, Brian; Ho, Simon; Kwok, James Conference paper
Study of various composite kernels for kernel eigenvoice speaker adaptation
2004 IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Montreal, Que; Canada, 2004, 17 May 2004 through 21 May 2004; Code 63500, p. 325-328
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Ka-Lung Conference paper
Using kernel PCA to improve eigenvoice speaker adaptation
Proceedings of 2004 International Conference on Machine Learning and Cybernetics, v. 5 / IEEE. Piscataway, NJ : IEEE, 2004, p. 3062-3067
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Ka-Lung Conference paper

2003 4

Discriminative training of auditory filters of different shapes for robust speech recognition
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2, 2003, p. 45-48
Mak, B.; Tam, Y.C.; Hsiao, R. Conference paper
Joint Estimation of Thresholds in a Bi-threshold Verification Problem
European Conference on Speech Communication and Technology, pages 893--896, Sept 1-4
Ho, Simon; Mak, Brian Conference paper
PLASER: Pronunciation Learning via Automatic Speech Recognition
Proceedings of HLT-NAACL Workshop on Building Educational Applications using Natural Language Processing, Edmonton, Canada, May
Mak, Brian; Siu, Man Hung; Ng, Mimi; Tam, Yik Cheung; Chan, Yu Chung; Chan, Kin Wah; Leung, Ka Yee; Ho, Simon; Chong, Fong Ho; Wong, Jimmy; Lo, Jacqueline Conference paper
Pruning Transitions in a Hidden Markov Model with Optimal Brain Surgeon
European Conference on Speech Communication and Technology, pages 2521--2524, Sept 1-4
Mak, Brian; Chan, Kin-Wah Conference paper

2002 5

A mathematical relationship between full-band and multiband mel-frequency cepstral coefficients
IEEE Signal Processing Letters, v. 9, (8), August 2002, p. 241-244
Mak, Brian Kan Wing Article
An alternative approach of finding competing hypotheses for better minimum classification error training
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, p. 101-104
Tam, YC; Mak, B. Conference paper
Discriminative auditory features for robust speech recognition
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1, 2002, p. 381-384
Mak, B.; Tam, Y.C.; Li, Q. Conference paper
Knowledge-based Sense Pruning using the HowNet: An Alternative to Word Sense Disambiguation
Proceedings of the International Symposium of Chinese Spoken Language Processing, August, Taiwan, pp. 189-192
Gan, Kok-Wee; Wang, Chi Yung; Mak, Brian Conference paper
Performance of Discriminatively Trained Auditory Features on Aurora2 and Aurora3
Proceedings of the International Conference on Spoken Language Processing, September, Denver, Colorado, USA, Vol. 1, pp. 33-36
Mak, Brian; Tam, Yik Cheung Conference paper

2001 4

Direct training of subspace distribution clustering hidden Markov model
IEEE Transactions on Speech and Audio Processing, v. 9, (4), May 2001, p. 378-387
Mak, Brian Kan Wing; Bocchieri, E. Article
Subspace distribution clustering hidden Markov model
IEEE Transactions on Speech and Audio Processing, v. 9, (3), 2001, p. 264-275
Bocchieri, E.; Mak, Brian Kan Wing Article
Development of an Asynchronous Multi-band System for Continuous Speech Recognition
Proceedings of the European Conference on Speech Communication and Technology, 1, 575-578
Tam, Yik-Cheung; Mak, Brian Conference paper
Rapid Speaker Adaptation Using MLLR and Subspace Regression Classes
Proceedings of the European Conference on Speech Communication and Technology, 2, 1253-1256
Wong, Kwok-Man; Mak, Brian Conference paper

2000 4

Asynchrony with re-trained transition probabilities improves performance in multi-band speech recognition
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. IV, 2000, p. 149-152
Mak, Brian K.W.; Tam, Y.C. Conference paper
MAP Adaptation with Subspace Regression Classes and Tying
Proceedings of 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 3, 2000, article number 861963, p. 1551-1554
Wong, Kwok-Man; Mak, Brian Conference paper
Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. I, 2000, p. 313-316
Tam, Y.C.; Mak, Brian K.W. Conference paper
Pruning of state-tying tree using Bayesian Information Criterion with multiple mixtures
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. IV, 2000, p. 294-297
Chan, Y.C.; Siu, M.; Mak, Brian K.W. Conference paper

1998 2

Training of context-dependent subspace distribution clustering hidden Markov model
Proceedings of the International Conference on Spoken Language Processing, Sydney, Australia, v. 1, 1998, p. 308-311
Mak, Brian K.W.; Bocchieri, E. Conference paper
Training of subspace distribution clustering hidden Markov model
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle, Washington, USA, v. 2, 1998, p. 673-676
Mak, Brian K.W.; Bocchieri, E. Conference paper

1997 3

Combining ANNs to improve phone recognition
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, v. 4, 1997, p. 3253-3256
Mak, Brian K.W. Conference paper
Stream derivation and clustering schemes for subspace distribution clustering HMM
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Santa Barbara, California, USA, 1997, p. 339-346
Mak, Brian K.W.; Bocchieri, E.; Barnard, E. Conference paper
Subspace distribution clustering for continuous observation density hidden Markov models
Proceedings of the European Conference on Speech Communication and Technology, Rhodes, Greece, v. 1, 1997, p. 107-110
Bocchieri, E.; Mak, Brian K.W. Conference paper

1996 2

Phone clustering using the Bhattacharyya distance
Proceedings of the International Conference on Spoken Language Processing, Philadelphia, USA, v. 4, 1996, p. 2005-2008
Mak, Brian K.W.; Barnard, E. Conference paper
The contribution of consonants versus vowels to word recognition in fluent speech
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Atlanta, Georgia, USA, v. 2, 1996, p. 853-856
Cole, R.; Yan, Y.; Mak, Brian K.W.; Fanty, M.; Bailey, T. Conference paper

1995 1

Tone recognition of isolated Cantonese syllables
IEEE Transactions on Speech and Audio Processing, v. 3, (3), May 1995, p. 204-209
Lee, T.; Ching, P.C.; Chan, L.W.; Cheng, Y.H.; Mak, Brian K.W. Article

1994 1

A robust algorithm for word boundary detection in the presence of noise
IEEE Transactions on Speech and Audio Processing, v. 2, (3), July 1994, p. 406-412
Junqua, J.; Mak, Brian K.W.; Reaves, B. Article

1993 1

An NN based tone classifier for Cantonese
International Joint Conference on Neural Networks, Japan, v. 1, 1993, p. 287-290
Lee, Tan; Ching, P.C.; Chan, L.W.; Mak, Brian K.W. Conference paper

1992 1

A robust speech/non-speech detection algorithm using time and frequency-based features
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, California, USA, v. 1, 1992, p. 269-272
Mak, Brian K.W.; Junqua, J.; Reaves, B. Conference paper

1990 1

Communication parameter tests and parallel back propagation algorithms on iPSC/2 hypercube multiprocessor
Proceedings of the Fifth Distributed Memory Computer Conference, South Carolina, USA, v. 2, 1990, p. 1353-1364
Mak, Brian K.W.; Egecioglu, O. Conference paper





Conference paper 3

A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer’s Disease Detection
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June, June 2021, p. 6423-6427
Li, Jinchao; Yu, Jianwei; Ye, Zi; Wong, Simon; Mak, Manwai; Mak, Brian Kan Wing; Liu, Xunying; Meng, Helen
Non-Parallel Many-To-Many Voice Conversion by Knowledge Transfer from a Text-To-Speech Model
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2021-June, June 2021, p. 5924-5928
Yu, Xinyuan; Mak, Brian Kan Wing
On-the-fly Data Augmentation for Text-to-speech Style Transfer
IEEE Automatic Speech Recognition and Understanding Workshop, Cartagena, Colombia, 13-17 December 2021
Chung, Man Hon; Mak, Brian Kan Wing





Conference paper 4

Multi-lingual multi-speaker text-to-speech synthesis for voice cloning with online speaker enrollment
Proceedings of the Annual Conference of the International Speech Communication Association, v. 2020, October 2020, p. 2932-2936
Liu, Zhaoyu; Mak, Brian Kan Wing
Orthogonal Training for Text-independent Speaker Verification
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2020-May, May 2020, article number 9053198, p. 6584-6588
Zhu, Yingke; Mak, Brian Kan Wing
Orthogonality Regularizations for End-to-End Speaker Verification
Proceeding of Odyssey 2020 The Speaker and Language Recognition Workshop / ISCA. ISCA, 2020, p. 17-23
Zhu, Yingke; Mak, Brian Kan Wing
Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 12361 LNCS, 2020, p. 172-186
Niu, Zhe; Mak, Brian Kan Wing





Conference paper 2

Mixup Learning Strategies for Text-independent Speaker Verification
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2019-September, 2019, p. 4345-4349
Zhu, Yingke; Ko, Tom; Mak, Brian Kan Wing
Recurrent Poisson process unit for speech recognition
Proceedings of the AAAI Conference on Artificial Intelligence, v. 33, (1), 2019, p. 6538-6545
Huang, Hengguan; Wang, Hao; Mak, Brian Kan Wing





Article 2

Denoised Senone I-Vectors for Robust Speaker Verification
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v. 26, (4), April 2018, p. 820-830
Tan, Zhili; Mak, Man-Wai; Mak, Brian Kan Wing; Zhu, Yingke
DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.26, (4), April 2018, p. 700-712
Tan, Zhili; Mak, Man-Wai; Mak, Brian Kan Wing

Conference paper 8

Domain adaptation of end-to-end speech recognition in low-resource settings
2018 IEEE Spoken Language Technology Workshop, SLT 2018 - Proceedings, February 2019, article number 8639506, p. 382-388
Samarakoon, Lahiru; Mak, Brian Kan Wing; Lam, Albert Y.S.
End-to-End Low-Resource Lip-Reading with Maxout CNN and LSTM
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018-April, 10 September 2018, article number 8462280, p. 2511-2515
Fung, Ho Long; Mak, Brian Kan Wing
Fast Derivation of Cross-lingual Document Vectors from Self-attentive Neural Machine Translation Model
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September, 2018, p. 107-111
Li, Wei; Mak, Brian Kan Wing
Learning Effective Factorized Hidden Layer Bases Using Student-Teacher Training for LSTM Acoustic Model Adaptation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2018-April, September 2018, article number 8462112, p. 5954-5958
Mak, Brian Kan Wing; Samarakoon, Lahiru Thilina; Sim, Khe Chai
Multi-Head Attention for End-to-End Neural Machine Translation
018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, May 2019, article number 8706667, p. 250-254
Fung, Ho Long; Mak, Brian Kan Wing
Self-attentive Speaker Embeddings for Text-independent Speaker Verification
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v. 2018-September, 2018, p. 3573-3577
Zhu, Yingke; Ko, Tom; Snyder, David; Mak, Brian Kan Wing; Povey, Daniel
Subspace Based Sequence Discriminative Training of LSTM Acoustic Models With Feed-forward Layers
2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, May 2019, article number 8706623, p. 136-140
Samarakoon, Lahiru; Mak, Brian Kan Wing; Lam, Albert Y.S.
WaveNet MH-SRU: Deep and Wide Multiple-history Simple Recurrent Unit for Speech Recognition
2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings, May 2019, article number 8706666, p. 141-145
Huang, Hengguang; Mak, Brian Kan Wing





Conference paper 6

An Investigation Into Learning Effective Speaker Subspaces for Robust Unsupervised DNN Adaptation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, 2017, p. 5035-5039, Article number 7953115
Samarakoon, Lahiru Thilina; Sim, Khe Chai; Mak, Brian K W
Derivation of Document Vectors from Adaptation of LSTM Language Model
15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, v. 2, 2017, p. 456-461
Li, Wei; Mak, Brian K. W.
Learning factorized transforms for unsupervised adaptation of LSTM-RNN acoustic models
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2017, p. 744-748
Samarakoon, Lahiru Thilina; Mak, Brian K W; Sim, Khe Chai
Speeding Up Softmax Computations in DNN-Based Large Vocabulary Speech Recognition by Senone Weight Vector Selection
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, IEEE, 2017, p. 5335-5339, Article number 7953175
Zhu, Yingke; Mak, Brian K W
To Improve the Robustness of LSTM-RNN Acoustic Models Using Higher-order Feedback From Multiple Histories
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2017, International Speech Communication Association, 2017, p. 3862-3866
Huang, Hengguan; Mak, Brian
Unsupervised Adaptation of Student DNNs Learned from Teacher RNNs for Improved ASR Performance
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017), v. 2017, December 2017, p. 200-205
Samarakoon, Lahiru Thilina; Mak, Brian Kan Wing





Conference paper 2

An Investigation of Adaptation Techniques for Building Acoustic Models for Hearing-impaired Children in a CAPT Application
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, October 2016, article number 7918449
Zhu, Yingke; Mak, Brian Kan Wing
Senone I-Vectors for Robust Speaker Verification
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, October 2016, article number 7918462
Tan, Zhili; Zhu, Yingke; Mak, Man-Wai; Mak, Brian K W





Article 1

Multitask Learning of Deep Neural Networks for Low-Resource Speech Recognition
IEEE Transactions on Audio, Speech and Language Processing, v. 23, (7), July 2015, article number 7084614, p. 1172-1183
Chen, Dongpeng; Mak, Brian Kan-Wing

Conference paper 1

Distinct Triphone Acoustic Modeling Using Deep Neural Networks
16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015): Speech Beyond Speech Towards a Better Understanding of the Most Important Biosignal, International Speech Communication Association (ISCA), 2015, p. 2645-2649
Chen, Dongpeng; Mak, Brian Kan Wing





Article 1

Eigentrigraphemes for under-resourced languages
Speech Communication, v. 56, (1), January 2014, p. 132-141
Ko, Tom Yu Ting; Mak, Brian Kan Wing

Conference paper 4

Joint Acoustic Modeling of Triphones and Trigraphemes by Multi-Task Learning Deep Neural Networks for Low-Resource Speech Recognition
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), v. 2014, 2014, article number 6854673, p. 5592-5596
Chen, Dongpeng; Mak, Brian Kan Wing; Leung, Cheung-Chi; Sivadas, Sunil
Joint Sequence Training of Phone and Grapheme Acoustic Model Based on Multi-task Learning Deep Neural Networks
Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech, v. 1-4, 2014, p. 1083-1087
Chen, Dongpeng; Mak, Brian; Sunil, Sivadas
Modeling Inter-cluster and Intra-cluster Discrimination Among Triphones
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, October 2014, article number 6936683, p. 103-107
Ko, Tom; Mak, Brian; Chen, Dongpeng
Subspace Gaussian Mixture Model with State-dependent Subspace Dimensions
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), Institute of Electrical and Electronics Engineers (IEEE), 2014, p. 1725-1729
Ko, Yu Ting; Mak, Brian Kan Wing; Leung, Cheung-Chi





Article 1

Eigentriphones for Context-Dependent Acoustic Modeling
IEEE Transactions on Audio, Speech, and Language Processing, v. 21, (6), June 2013, p. 1285-1294
Ko, Tom; Mak, Brian

Conference paper 1

Distinct Triphone Modeling by Reference Model Weighting
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, October 2013, article number 6639050, p. 7150-7154
Chen, Dongpeng; Mak, Brian K W





Conference paper 4

Derivation of eigentriphones by weighted principal component analysis
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, article number 6288819, p. 4097-4100
Ko, Yu Ting; Mak, Brian Kan Wing
Speaker-ensemble hidden Markov modeling for automatic speech recognition
2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, 2012, article number 6423532, p. 9-10
Ye, Guoli; Mak, Brian
Subspace high-density discrete hidden Markov model for automatic speech recognition
European Signal Processing Conference, 2012, article number 6334110, p. 1643-1647
Ye, Guoli; Mak, Brian
Transition Probabilities Are More Important Than We Once Thought
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, article number 6288995, p. 4809-4812
Ye, Guoli; Chen, Dongpeng; Mak, Brian Kan Wing





Conference paper 2

A Fully Automated Derivation of State-based Eigentriphones for Triphone Modeling with No Tied States Using Regularization
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2011, 2011, p. 781-784
Ko, Yu Ting; Mak, Brian Kan Wing
Eigentriphones: A basis for context-dependent acoustic modeling
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2011, article number 5947452, p. 4892-4895
Ko, Yu Ting; Mak, Brian Kan Wing





Conference paper 4

Improving speech recognition by explicit modeling of phone deletions
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, USA, 2010, p. 4858-4861
Ko, Yu Ting; Mak, Brian Kan Wing
Problems of modeling phone deletion in conversational speech for speech recognition
2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings, Taiwan, 2010, p. 114-118
Mak, Brian Kan Wing; Ko, Yu Ting
Subvector-quantized high-density discrete hidden Markov model and its re-estimation
Proceedings of 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010, November 2010, p. 109-113
Ye, G.; Mak, B.
The Use of Subvector Quantization and Discrete Densities for Fast GMM Computation for Speaker Verification
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4, 2010, p. 1481-1484
Ye, Guoli; Mak, Brian





Article 1

Maximum Penalized Likelihood Kernel Regression for Fast Adaptation
IEEE Transactions on Audio, Speech and Language Processing, v. 17, (7), September 2009, article number 5165120, p. 1372-1381
Mak, Brian Kan-Wing; Lai, Tsz-Chung; Tsang, Ivor W.; Kwok, James Tin-Yau

Conference paper 2

Automatic estimation of decoding parameters using large-margin iterative linear programming
Proceedings of the 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, 2009, p. 1207-1210
Mak, Brian Kan Wing; Ko, Yu Ting
Fast GMM Computation for Speaker Verification Using Scalar Quantization and Discrete Densities
Interspeech 2009: 10th Annual Conference of the International Speech Communication Association 2009, Vols 1-5, 2009, p. 2291-2294
Ye, Guoli; Mak, Brian; Mak, Man-Wai





Conference paper 3

Discriminative training by iterative linear programming optimization
2008 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 2008, Mar-Apr, p. 4061-4064
Mak, B.; Ng, B.
Min-max Discriminative Training of Decoding Parameters Using Iterative Linear Programming
Proceedings of the 9th Annual Conference of the International Speech Communication Association, INTERSPEECH 2008, Brisbane, Australia, 22-26 September 2008, 2008, p. 915-918
Mak, Brian Kan Wing; Ko, Tom
Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions
INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association, 2008, p. 1897-1900
Huang, Chienlin; Ma, Bin; Wu, Chung-Hsien; Mak, Brian; Li, Haizhou





Article 1

Kernel eigenspace-based MLLR adaptation
IEEE Transactions on Audio, Speech and Language Processing, v. 15, (3), March 2007, article number 4100690, p. 784-795
Mak, Brian Kan-Wing; Hsiao, Roger Wend-Huu

Conference paper 4

A Model-based estimation of phonotactic language verification performance
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 3, 2007, p. 1521-1524
Wong, K.K.; Siu, M.H.; Mak, B.
A Model-based Estimation of Phonotactic Language Verification Performance
Proceedings of Interspeech, pages 186-189, Aug, 2007, Antwerp, Belgium
Wong, Ka Keung; Siu, Manhung; Mak, Brian
Boosting with anti-models for automatic language identification
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 3, 2007, p. 1537-1540
Yang, X.; Siu, M.H.; Gish, H.; Mak, B.
Robustness of several kernel-based fast adaptation methods on noisy LVCSR
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 1, 2007, p. 445-448
Mak, B.; Hsiao, R.





Article 3

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting
IEEE Transactions on Audio, Speech, and Language Processing, v. 14, (4), July 2006
Mak, Brian Kan Wing; Hsiao, Roger; Ho, Simon; Kwok, James Tin-Yau
Joint optimization of the frequency-domain and time-domain transformations in deriving generalized static and dynamic MFCCs
IEEE signal processing letters, v. 13, (11), November 2006, p. 707-710
Lai, Yiu-Pong; Siu, Manhung; Mak, Brian
Minimization of utterance verification error rate as a constrained optimization problem
IEEE Signal Processing Letters, v. 13, (12), December 2006, p. 760-763
Siu, Man-Hung; Mak, Brian; Au, Wing-Hei

Conference paper 5

A comparison of various adaptation methods for speaker verification with limited enrollment data
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1, 2006
Mak, M.W.; Hsiao, R.; Mak, B.
Automatic Audio Indexing and Audio Playback Speed Control as Tools for Language Learning
Lecture Notes in Computer Science, v. 4181, 2006, p. 290-299
Rossiter, D.; Lam, G.; Mak, B.
Fast speaker adaption via maximum penalized likelihood kernel regression
2006 IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, ICASSP 2006; Toulouse; France, Volume 1, 2006, Article number 1660191, Pages I997-I1000
Tsang, Ivor W.; Kwok, James Tin-Yau; Mak, Brian; Zhang, Kai; Pan, Jeffrey Junfeng
Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, p. 229-232
Mak, Brian; Lai, Tsz-Chung; Hsiao, Roger
Unsupervised speaker adaptation using reference speaker weighting
Lecture Notes in Computer Science, v. 4274, 2006, p. 380-389
Lai, T.C.; Mak, B.





Article 3

Kernel eigenvoice speaker adaptation
IEEE Transactions on Speech and Audio Processing, v. 13, (5), September 2005, p. 984-992
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon
Passenger route guidance system for multi-modal transit networks
Journal of Advanced Transportation, v. 39, (3), 2005, p. 271-288
Lo, Hong Kam; Yip, Chun Wing; Mak, Brian Kan Wing
Pruning hidden Markov models with optimal brain surgeon
IEEE Transactions on Speech and Audio Processing, v. 13, (5), September 2005, p. 993-1003
Mak, Brian Kan Wing; Chan, Kin Wah

Conference paper 4

A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition
Proceedings 9th European Conference on Speech Communication and Technology, Interspeech 2005-Eurospeech, Lisbon, Portugal, 4-8 September 2005, p. 1797-1800
Hsiao, R.; Mak, B.
High-density discrete HMM with the use of scalar quantization indexing
9th European Conference on Speech Communication and Technology, 2005, p. 2121-2124
Mak, B.; Yeung, S.K.A.; Lai, Y.P.; Siu, M.
Kernel Eigenspace-based MLLR adaptation using multiple regression classes
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, p. 985-988
Hsiao, R.; Mak, B.
Various reference speakers determination methods for embedded kernel Eigenvoice speaker adaptation
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, p. 981-984
Mak, B.; Ho, S.





Article 2

An acoustic-phonetic and a model-theoretic analysis of subspace distribution clustering hidden Markov models
International Journal of Speech Technology, v. 7, (1), 2004, p. 55-68
Mak, Brian Kan Wing
Discriminative auditory-based features for robust speech recognition
IEEE Transactions on Speech and Audio Processing, v. 12, (1), January 2004, p. 27-36
Mak, Brian Kan Wing; Tam, YC; Li, PQ

Conference paper 6

Discriminative feature transformation by guided discriminative training
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, p. 897-900
Hsiao, R.; Mak, B.
Eigenvoice speaker adaptation via composite kernel PCA
Advances in neural information processing systems, v. 16, 2004, p. 1401-1408
Kwok, James Tin-Yau; Mak, Brian Kan Wing; Ho, Simon Ka-Lung
Improving Eigenspace-based MLLR Adaptation by Kernel PCA
Proceedings of the International Conference on Spoken Language Processing, Jeju Island, South Korea, October 4-8, 2004, volume I, pages 13-16,
Mak, Brian; Hsiao, Roger
Speedup of Kernel Eigenvoice Speaker Adaptation by Embedded Kernel PCA
Proceedings of the International Conference on Spoken Language Processing, , Jeju Island, South Korea, volume IV, pages 2913-2916,
Mak, Brian; Ho, Simon; Kwok, James
Study of various composite kernels for kernel eigenvoice speaker adaptation
2004 IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Montreal, Que; Canada, 2004, 17 May 2004 through 21 May 2004; Code 63500, p. 325-328
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Ka-Lung
Using kernel PCA to improve eigenvoice speaker adaptation
Proceedings of 2004 International Conference on Machine Learning and Cybernetics, v. 5 / IEEE. Piscataway, NJ : IEEE, 2004, p. 3062-3067
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Ka-Lung





Conference paper 4

Discriminative training of auditory filters of different shapes for robust speech recognition
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2, 2003, p. 45-48
Mak, B.; Tam, Y.C.; Hsiao, R.
Joint Estimation of Thresholds in a Bi-threshold Verification Problem
European Conference on Speech Communication and Technology, pages 893--896, Sept 1-4
Ho, Simon; Mak, Brian
PLASER: Pronunciation Learning via Automatic Speech Recognition
Proceedings of HLT-NAACL Workshop on Building Educational Applications using Natural Language Processing, Edmonton, Canada, May
Mak, Brian; Siu, Man Hung; Ng, Mimi; Tam, Yik Cheung; Chan, Yu Chung; Chan, Kin Wah; Leung, Ka Yee; Ho, Simon; Chong, Fong Ho; Wong, Jimmy; Lo, Jacqueline
Pruning Transitions in a Hidden Markov Model with Optimal Brain Surgeon
European Conference on Speech Communication and Technology, pages 2521--2524, Sept 1-4
Mak, Brian; Chan, Kin-Wah





Article 1

A mathematical relationship between full-band and multiband mel-frequency cepstral coefficients
IEEE Signal Processing Letters, v. 9, (8), August 2002, p. 241-244
Mak, Brian Kan Wing

Conference paper 4

An alternative approach of finding competing hypotheses for better minimum classification error training
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, p. 101-104
Tam, YC; Mak, B.
Discriminative auditory features for robust speech recognition
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1, 2002, p. 381-384
Mak, B.; Tam, Y.C.; Li, Q.
Knowledge-based Sense Pruning using the HowNet: An Alternative to Word Sense Disambiguation
Proceedings of the International Symposium of Chinese Spoken Language Processing, August, Taiwan, pp. 189-192
Gan, Kok-Wee; Wang, Chi Yung; Mak, Brian
Performance of Discriminatively Trained Auditory Features on Aurora2 and Aurora3
Proceedings of the International Conference on Spoken Language Processing, September, Denver, Colorado, USA, Vol. 1, pp. 33-36
Mak, Brian; Tam, Yik Cheung





Article 2

Direct training of subspace distribution clustering hidden Markov model
IEEE Transactions on Speech and Audio Processing, v. 9, (4), May 2001, p. 378-387
Mak, Brian Kan Wing; Bocchieri, E.
Subspace distribution clustering hidden Markov model
IEEE Transactions on Speech and Audio Processing, v. 9, (3), 2001, p. 264-275
Bocchieri, E.; Mak, Brian Kan Wing

Conference paper 2

Development of an Asynchronous Multi-band System for Continuous Speech Recognition
Proceedings of the European Conference on Speech Communication and Technology, 1, 575-578
Tam, Yik-Cheung; Mak, Brian
Rapid Speaker Adaptation Using MLLR and Subspace Regression Classes
Proceedings of the European Conference on Speech Communication and Technology, 2, 1253-1256
Wong, Kwok-Man; Mak, Brian





Conference paper 4

Asynchrony with re-trained transition probabilities improves performance in multi-band speech recognition
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. IV, 2000, p. 149-152
Mak, Brian K.W.; Tam, Y.C.
MAP Adaptation with Subspace Regression Classes and Tying
Proceedings of 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 3, 2000, article number 861963, p. 1551-1554
Wong, Kwok-Man; Mak, Brian
Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. I, 2000, p. 313-316
Tam, Y.C.; Mak, Brian K.W.
Pruning of state-tying tree using Bayesian Information Criterion with multiple mixtures
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. IV, 2000, p. 294-297
Chan, Y.C.; Siu, M.; Mak, Brian K.W.





Conference paper 2

Training of context-dependent subspace distribution clustering hidden Markov model
Proceedings of the International Conference on Spoken Language Processing, Sydney, Australia, v. 1, 1998, p. 308-311
Mak, Brian K.W.; Bocchieri, E.
Training of subspace distribution clustering hidden Markov model
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle, Washington, USA, v. 2, 1998, p. 673-676
Mak, Brian K.W.; Bocchieri, E.





Conference paper 3

Combining ANNs to improve phone recognition
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, v. 4, 1997, p. 3253-3256
Mak, Brian K.W.
Stream derivation and clustering schemes for subspace distribution clustering HMM
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Santa Barbara, California, USA, 1997, p. 339-346
Mak, Brian K.W.; Bocchieri, E.; Barnard, E.
Subspace distribution clustering for continuous observation density hidden Markov models
Proceedings of the European Conference on Speech Communication and Technology, Rhodes, Greece, v. 1, 1997, p. 107-110
Bocchieri, E.; Mak, Brian K.W.





Conference paper 2

Phone clustering using the Bhattacharyya distance
Proceedings of the International Conference on Spoken Language Processing, Philadelphia, USA, v. 4, 1996, p. 2005-2008
Mak, Brian K.W.; Barnard, E.
The contribution of consonants versus vowels to word recognition in fluent speech
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Atlanta, Georgia, USA, v. 2, 1996, p. 853-856
Cole, R.; Yan, Y.; Mak, Brian K.W.; Fanty, M.; Bailey, T.





Article 1

Tone recognition of isolated Cantonese syllables
IEEE Transactions on Speech and Audio Processing, v. 3, (3), May 1995, p. 204-209
Lee, T.; Ching, P.C.; Chan, L.W.; Cheng, Y.H.; Mak, Brian K.W.





Article 1

A robust algorithm for word boundary detection in the presence of noise
IEEE Transactions on Speech and Audio Processing, v. 2, (3), July 1994, p. 406-412
Junqua, J.; Mak, Brian K.W.; Reaves, B.





Conference paper 1

An NN based tone classifier for Cantonese
International Joint Conference on Neural Networks, Japan, v. 1, 1993, p. 287-290
Lee, Tan; Ching, P.C.; Chan, L.W.; Mak, Brian K.W.





Conference paper 1

A robust speech/non-speech detection algorithm using time and frequency-based features
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, California, USA, v. 1, 1992, p. 269-272
Mak, Brian K.W.; Junqua, J.; Reaves, B.





Conference paper 1

Communication parameter tests and parallel back propagation algorithms on iPSC/2 hypercube multiprocessor
Proceedings of the Fifth Distributed Memory Computer Conference, South Carolina, USA, v. 2, 1990, p. 1353-1364
Mak, Brian K.W.; Egecioglu, O.





2016 2

An Investigation of Adaptation Techniques for Building Acoustic Models for Hearing-impaired Children in a CAPT Application
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, October 2016, article number 7918449
Zhu, Yingke; Mak, Brian Kan Wing Conference paper
Senone I-Vectors for Robust Speaker Verification
Proceedings of 2016 10th International Symposium on Chinese Spoken Language Processing, ISCSLP 2016, October 2016, article number 7918462
Tan, Zhili; Zhu, Yingke; Mak, Man-Wai; Mak, Brian K W Conference paper

2015 2

Multitask Learning of Deep Neural Networks for Low-Resource Speech Recognition
IEEE Transactions on Audio, Speech and Language Processing, v. 23, (7), July 2015, article number 7084614, p. 1172-1183
Chen, Dongpeng; Mak, Brian Kan-Wing Article
Distinct Triphone Acoustic Modeling Using Deep Neural Networks
16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015): Speech Beyond Speech Towards a Better Understanding of the Most Important Biosignal, International Speech Communication Association (ISCA), 2015, p. 2645-2649
Chen, Dongpeng; Mak, Brian Kan Wing Conference paper

2014 5

Eigentrigraphemes for under-resourced languages
Speech Communication, v. 56, (1), January 2014, p. 132-141
Ko, Tom Yu Ting; Mak, Brian Kan Wing Article
Joint Acoustic Modeling of Triphones and Trigraphemes by Multi-Task Learning Deep Neural Networks for Low-Resource Speech Recognition
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2014), v. 2014, 2014, article number 6854673, p. 5592-5596
Chen, Dongpeng; Mak, Brian Kan Wing; Leung, Cheung-Chi; Sivadas, Sunil Conference paper
Joint Sequence Training of Phone and Grapheme Acoustic Model Based on Multi-task Learning Deep Neural Networks
Proceedings of the Annual Conference of the International Speech Communication Association, Interspeech, v. 1-4, 2014, p. 1083-1087
Chen, Dongpeng; Mak, Brian; Sunil, Sivadas Conference paper
Modeling Inter-cluster and Intra-cluster Discrimination Among Triphones
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, ISCSLP 2014, October 2014, article number 6936683, p. 103-107
Ko, Tom; Mak, Brian; Chen, Dongpeng Conference paper
Subspace Gaussian Mixture Model with State-dependent Subspace Dimensions
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014), Institute of Electrical and Electronics Engineers (IEEE), 2014, p. 1725-1729
Ko, Yu Ting; Mak, Brian Kan Wing; Leung, Cheung-Chi Conference paper

2013 2

Eigentriphones for Context-Dependent Acoustic Modeling
IEEE Transactions on Audio, Speech, and Language Processing, v. 21, (6), June 2013, p. 1285-1294
Ko, Tom; Mak, Brian Article
Distinct Triphone Modeling by Reference Model Weighting
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, October 2013, article number 6639050, p. 7150-7154
Chen, Dongpeng; Mak, Brian K W Conference paper

2012 4

Derivation of eigentriphones by weighted principal component analysis
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, article number 6288819, p. 4097-4100
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper
Speaker-ensemble hidden Markov modeling for automatic speech recognition
2012 8th International Symposium on Chinese Spoken Language Processing, ISCSLP 2012, 2012, article number 6423532, p. 9-10
Ye, Guoli; Mak, Brian Conference paper
Subspace high-density discrete hidden Markov model for automatic speech recognition
European Signal Processing Conference, 2012, article number 6334110, p. 1643-1647
Ye, Guoli; Mak, Brian Conference paper
Transition Probabilities Are More Important Than We Once Thought
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, article number 6288995, p. 4809-4812
Ye, Guoli; Chen, Dongpeng; Mak, Brian Kan Wing Conference paper

2011 2

A Fully Automated Derivation of State-based Eigentriphones for Triphone Modeling with No Tied States Using Regularization
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2011, 2011, p. 781-784
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper
Eigentriphones: A basis for context-dependent acoustic modeling
Proceedings of ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, 2011, article number 5947452, p. 4892-4895
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper

2010 4

Improving speech recognition by explicit modeling of phone deletions
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, USA, 2010, p. 4858-4861
Ko, Yu Ting; Mak, Brian Kan Wing Conference paper
Problems of modeling phone deletion in conversational speech for speech recognition
2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings, Taiwan, 2010, p. 114-118
Mak, Brian Kan Wing; Ko, Yu Ting Conference paper
Subvector-quantized high-density discrete hidden Markov model and its re-estimation
Proceedings of 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010, November 2010, p. 109-113
Ye, G.; Mak, B. Conference paper
The Use of Subvector Quantization and Discrete Densities for Fast GMM Computation for Speaker Verification
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4, 2010, p. 1481-1484
Ye, Guoli; Mak, Brian Conference paper

2009 3

Maximum Penalized Likelihood Kernel Regression for Fast Adaptation
IEEE Transactions on Audio, Speech and Language Processing, v. 17, (7), September 2009, article number 5165120, p. 1372-1381
Mak, Brian Kan-Wing; Lai, Tsz-Chung; Tsang, Ivor W.; Kwok, James Tin-Yau Article
Automatic estimation of decoding parameters using large-margin iterative linear programming
Proceedings of the 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009, 2009, p. 1207-1210
Mak, Brian Kan Wing; Ko, Yu Ting Conference paper
Fast GMM Computation for Speaker Verification Using Scalar Quantization and Discrete Densities
Interspeech 2009: 10th Annual Conference of the International Speech Communication Association 2009, Vols 1-5, 2009, p. 2291-2294
Ye, Guoli; Mak, Brian; Mak, Man-Wai Conference paper

2008 3

Discriminative training by iterative linear programming optimization
2008 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), v. 2008, Mar-Apr, p. 4061-4064
Mak, B.; Ng, B. Conference paper
Min-max Discriminative Training of Decoding Parameters Using Iterative Linear Programming
Proceedings of the 9th Annual Conference of the International Speech Communication Association, INTERSPEECH 2008, Brisbane, Australia, 22-26 September 2008, 2008, p. 915-918
Mak, Brian Kan Wing; Ko, Tom Conference paper
Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions
INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association, 2008, p. 1897-1900
Huang, Chienlin; Ma, Bin; Wu, Chung-Hsien; Mak, Brian; Li, Haizhou Conference paper

2007 5

Kernel eigenspace-based MLLR adaptation
IEEE Transactions on Audio, Speech and Language Processing, v. 15, (3), March 2007, article number 4100690, p. 784-795
Mak, Brian Kan-Wing; Hsiao, Roger Wend-Huu Article
A Model-based estimation of phonotactic language verification performance
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 3, 2007, p. 1521-1524
Wong, K.K.; Siu, M.H.; Mak, B. Conference paper
A Model-based Estimation of Phonotactic Language Verification Performance
Proceedings of Interspeech, pages 186-189, Aug, 2007, Antwerp, Belgium
Wong, Ka Keung; Siu, Manhung; Mak, Brian Conference paper
Boosting with anti-models for automatic language identification
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 3, 2007, p. 1537-1540
Yang, X.; Siu, M.H.; Gish, H.; Mak, B. Conference paper
Robustness of several kernel-based fast adaptation methods on noisy LVCSR
International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007, v. 1, 2007, p. 445-448
Mak, B.; Hsiao, R. Conference paper

2006 8

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting
IEEE Transactions on Audio, Speech, and Language Processing, v. 14, (4), July 2006
Mak, Brian Kan Wing; Hsiao, Roger; Ho, Simon; Kwok, James Tin-Yau Article
Joint optimization of the frequency-domain and time-domain transformations in deriving generalized static and dynamic MFCCs
IEEE signal processing letters, v. 13, (11), November 2006, p. 707-710
Lai, Yiu-Pong; Siu, Manhung; Mak, Brian Article
Minimization of utterance verification error rate as a constrained optimization problem
IEEE Signal Processing Letters, v. 13, (12), December 2006, p. 760-763
Siu, Man-Hung; Mak, Brian; Au, Wing-Hei Article
A comparison of various adaptation methods for speaker verification with limited enrollment data
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1, 2006
Mak, M.W.; Hsiao, R.; Mak, B. Conference paper
Automatic Audio Indexing and Audio Playback Speed Control as Tools for Language Learning
Lecture Notes in Computer Science, v. 4181, 2006, p. 290-299
Rossiter, D.; Lam, G.; Mak, B. Conference paper
Fast speaker adaption via maximum penalized likelihood kernel regression
2006 IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, ICASSP 2006; Toulouse; France, Volume 1, 2006, Article number 1660191, Pages I997-I1000
Tsang, Ivor W.; Kwok, James Tin-Yau; Mak, Brian; Zhang, Kai; Pan, Jeffrey Junfeng Conference paper
Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, p. 229-232
Mak, Brian; Lai, Tsz-Chung; Hsiao, Roger Conference paper
Unsupervised speaker adaptation using reference speaker weighting
Lecture Notes in Computer Science, v. 4274, 2006, p. 380-389
Lai, T.C.; Mak, B. Conference paper

2005 7

Kernel eigenvoice speaker adaptation
IEEE Transactions on Speech and Audio Processing, v. 13, (5), September 2005, p. 984-992
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Article
Passenger route guidance system for multi-modal transit networks
Journal of Advanced Transportation, v. 39, (3), 2005, p. 271-288
Lo, Hong Kam; Yip, Chun Wing; Mak, Brian Kan Wing Article
Pruning hidden Markov models with optimal brain surgeon
IEEE Transactions on Speech and Audio Processing, v. 13, (5), September 2005, p. 993-1003
Mak, Brian Kan Wing; Chan, Kin Wah Article
A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition
Proceedings 9th European Conference on Speech Communication and Technology, Interspeech 2005-Eurospeech, Lisbon, Portugal, 4-8 September 2005, p. 1797-1800
Hsiao, R.; Mak, B. Conference paper
High-density discrete HMM with the use of scalar quantization indexing
9th European Conference on Speech Communication and Technology, 2005, p. 2121-2124
Mak, B.; Yeung, S.K.A.; Lai, Y.P.; Siu, M. Conference paper
Kernel Eigenspace-based MLLR adaptation using multiple regression classes
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, p. 985-988
Hsiao, R.; Mak, B. Conference paper
Various reference speakers determination methods for embedded kernel Eigenvoice speaker adaptation
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, p. 981-984
Mak, B.; Ho, S. Conference paper

2004 8

An acoustic-phonetic and a model-theoretic analysis of subspace distribution clustering hidden Markov models
International Journal of Speech Technology, v. 7, (1), 2004, p. 55-68
Mak, Brian Kan Wing Article
Discriminative auditory-based features for robust speech recognition
IEEE Transactions on Speech and Audio Processing, v. 12, (1), January 2004, p. 27-36
Mak, Brian Kan Wing; Tam, YC; Li, PQ Article
Discriminative feature transformation by guided discriminative training
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, p. 897-900
Hsiao, R.; Mak, B. Conference paper
Eigenvoice speaker adaptation via composite kernel PCA
Advances in neural information processing systems, v. 16, 2004, p. 1401-1408
Kwok, James Tin-Yau; Mak, Brian Kan Wing; Ho, Simon Ka-Lung Conference paper
Improving Eigenspace-based MLLR Adaptation by Kernel PCA
Proceedings of the International Conference on Spoken Language Processing, Jeju Island, South Korea, October 4-8, 2004, volume I, pages 13-16,
Mak, Brian; Hsiao, Roger Conference paper
Speedup of Kernel Eigenvoice Speaker Adaptation by Embedded Kernel PCA
Proceedings of the International Conference on Spoken Language Processing, , Jeju Island, South Korea, volume IV, pages 2913-2916,
Mak, Brian; Ho, Simon; Kwok, James Conference paper
Study of various composite kernels for kernel eigenvoice speaker adaptation
2004 IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Montreal, Que; Canada, 2004, 17 May 2004 through 21 May 2004; Code 63500, p. 325-328
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Ka-Lung Conference paper
Using kernel PCA to improve eigenvoice speaker adaptation
Proceedings of 2004 International Conference on Machine Learning and Cybernetics, v. 5 / IEEE. Piscataway, NJ : IEEE, 2004, p. 3062-3067
Mak, Brian Kan Wing; Kwok, James Tin-Yau; Ho, Simon Ka-Lung Conference paper

2003 4

Discriminative training of auditory filters of different shapes for robust speech recognition
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 2, 2003, p. 45-48
Mak, B.; Tam, Y.C.; Hsiao, R. Conference paper
Joint Estimation of Thresholds in a Bi-threshold Verification Problem
European Conference on Speech Communication and Technology, pages 893--896, Sept 1-4
Ho, Simon; Mak, Brian Conference paper
PLASER: Pronunciation Learning via Automatic Speech Recognition
Proceedings of HLT-NAACL Workshop on Building Educational Applications using Natural Language Processing, Edmonton, Canada, May
Mak, Brian; Siu, Man Hung; Ng, Mimi; Tam, Yik Cheung; Chan, Yu Chung; Chan, Kin Wah; Leung, Ka Yee; Ho, Simon; Chong, Fong Ho; Wong, Jimmy; Lo, Jacqueline Conference paper
Pruning Transitions in a Hidden Markov Model with Optimal Brain Surgeon
European Conference on Speech Communication and Technology, pages 2521--2524, Sept 1-4
Mak, Brian; Chan, Kin-Wah Conference paper

2002 5

A mathematical relationship between full-band and multiband mel-frequency cepstral coefficients
IEEE Signal Processing Letters, v. 9, (8), August 2002, p. 241-244
Mak, Brian Kan Wing Article
An alternative approach of finding competing hypotheses for better minimum classification error training
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, p. 101-104
Tam, YC; Mak, B. Conference paper
Discriminative auditory features for robust speech recognition
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, v. 1, 2002, p. 381-384
Mak, B.; Tam, Y.C.; Li, Q. Conference paper
Knowledge-based Sense Pruning using the HowNet: An Alternative to Word Sense Disambiguation
Proceedings of the International Symposium of Chinese Spoken Language Processing, August, Taiwan, pp. 189-192
Gan, Kok-Wee; Wang, Chi Yung; Mak, Brian Conference paper
Performance of Discriminatively Trained Auditory Features on Aurora2 and Aurora3
Proceedings of the International Conference on Spoken Language Processing, September, Denver, Colorado, USA, Vol. 1, pp. 33-36
Mak, Brian; Tam, Yik Cheung Conference paper

2001 4

Direct training of subspace distribution clustering hidden Markov model
IEEE Transactions on Speech and Audio Processing, v. 9, (4), May 2001, p. 378-387
Mak, Brian Kan Wing; Bocchieri, E. Article
Subspace distribution clustering hidden Markov model
IEEE Transactions on Speech and Audio Processing, v. 9, (3), 2001, p. 264-275
Bocchieri, E.; Mak, Brian Kan Wing Article
Development of an Asynchronous Multi-band System for Continuous Speech Recognition
Proceedings of the European Conference on Speech Communication and Technology, 1, 575-578
Tam, Yik-Cheung; Mak, Brian Conference paper
Rapid Speaker Adaptation Using MLLR and Subspace Regression Classes
Proceedings of the European Conference on Speech Communication and Technology, 2, 1253-1256
Wong, Kwok-Man; Mak, Brian Conference paper

2000 4

Asynchrony with re-trained transition probabilities improves performance in multi-band speech recognition
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. IV, 2000, p. 149-152
Mak, Brian K.W.; Tam, Y.C. Conference paper
MAP Adaptation with Subspace Regression Classes and Tying
Proceedings of 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, v. 3, 2000, article number 861963, p. 1551-1554
Wong, Kwok-Man; Mak, Brian Conference paper
Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. I, 2000, p. 313-316
Tam, Y.C.; Mak, Brian K.W. Conference paper
Pruning of state-tying tree using Bayesian Information Criterion with multiple mixtures
Proceedings of the International Conference on Spoken Language Processing, Beijing, China, v. IV, 2000, p. 294-297
Chan, Y.C.; Siu, M.; Mak, Brian K.W. Conference paper

1998 2

Training of context-dependent subspace distribution clustering hidden Markov model
Proceedings of the International Conference on Spoken Language Processing, Sydney, Australia, v. 1, 1998, p. 308-311
Mak, Brian K.W.; Bocchieri, E. Conference paper
Training of subspace distribution clustering hidden Markov model
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle, Washington, USA, v. 2, 1998, p. 673-676
Mak, Brian K.W.; Bocchieri, E. Conference paper

1997 3

Combining ANNs to improve phone recognition
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, v. 4, 1997, p. 3253-3256
Mak, Brian K.W. Conference paper
Stream derivation and clustering schemes for subspace distribution clustering HMM
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, Santa Barbara, California, USA, 1997, p. 339-346
Mak, Brian K.W.; Bocchieri, E.; Barnard, E. Conference paper
Subspace distribution clustering for continuous observation density hidden Markov models
Proceedings of the European Conference on Speech Communication and Technology, Rhodes, Greece, v. 1, 1997, p. 107-110
Bocchieri, E.; Mak, Brian K.W. Conference paper

1996 2

Phone clustering using the Bhattacharyya distance
Proceedings of the International Conference on Spoken Language Processing, Philadelphia, USA, v. 4, 1996, p. 2005-2008
Mak, Brian K.W.; Barnard, E. Conference paper
The contribution of consonants versus vowels to word recognition in fluent speech
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Atlanta, Georgia, USA, v. 2, 1996, p. 853-856
Cole, R.; Yan, Y.; Mak, Brian K.W.; Fanty, M.; Bailey, T. Conference paper

1995 1

Tone recognition of isolated Cantonese syllables
IEEE Transactions on Speech and Audio Processing, v. 3, (3), May 1995, p. 204-209
Lee, T.; Ching, P.C.; Chan, L.W.; Cheng, Y.H.; Mak, Brian K.W. Article

1994 1

A robust algorithm for word boundary detection in the presence of noise
IEEE Transactions on Speech and Audio Processing, v. 2, (3), July 1994, p. 406-412
Junqua, J.; Mak, Brian K.W.; Reaves, B. Article

1993 1

An NN based tone classifier for Cantonese
International Joint Conference on Neural Networks, Japan, v. 1, 1993, p. 287-290
Lee, Tan; Ching, P.C.; Chan, L.W.; Mak, Brian K.W. Conference paper

1992 1

A robust speech/non-speech detection algorithm using time and frequency-based features
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, California, USA, v. 1, 1992, p. 269-272
Mak, Brian K.W.; Junqua, J.; Reaves, B. Conference paper

1990 1

Communication parameter tests and parallel back propagation algorithms on iPSC/2 hypercube multiprocessor
Proceedings of the Fifth Distributed Memory Computer Conference, South Carolina, USA, v. 2, 1990, p. 1353-1364
Mak, Brian K.W.; Egecioglu, O. Conference paper


No Publications






Teaching Assignment
2021-22 Winter 0 2021-22 Fall 2 2020-21 Summer 2 2020-21 Spring 4 2020-21 Winter 0 2020-21 Fall 3


COMP4981 Final Year Project
RMBI4980 Risk Management and Business Intelligence Capstone Project I


COMP4981 Final Year Project
CPEG4901 Computer Engineering Final Year Project in COMP


COMP2012 Object-Oriented Programming and Data Structures
COMP4981 Final Year Project
RMBI4990 Risk Management and Business Intelligence Capstone Project II
SBMT5710 Artificial Intelligence


COMP2011 Programming with C++
COMP4981 Final Year Project
RMBI4980 Risk Management and Business Intelligence Capstone Project I


No Teaching Assignments


No Teaching Assignments






Research Postgraduate (RPG) Supervision From January 2019 to December 2022 (As of 30 January 2022)


All Supervisions Current RPGs Graduated RPGs




Current RPGs


Doctor of Philosophy ZUO, Ronglai
Computer Science and Engineering( 2020 - )

NIU, Zhe
Computer Science and Engineering( 2018 - )

ZHU, Yingke
Computer Science and Engineering( 2014 - )




Master of Philosophy CHUNG, Man Hon
Computer Science and Engineering( 2020 - )

HUANG, Chun Fung Ranzo
Computer Science and Engineering( 2020 - )





Graduated RPGs


Master of Philosophy LIU, Zhaoyu
Computer Science and Engineering( Completed in 2020 )

YU, Xinyuan
Computer Science and Engineering( Completed in 2020 )









ProjectsFrom January 2020 to December 2022

All Projects 3 Leading Projects 2 Participating Projects 1


Research and Development of Artificial Intelligence in Extraction and Identification of Spoken Language Biomarkers for Screening and Monitoring of Neurocognitive Disorders


以人工智能提取和鑑定口語生物標誌物供神經認知障礙篩查和監測的研究及技術開發 Participating


RGC - Theme-based Research Scheme


Project Team (HKUST)
MAK Brian Kan Wing


2020 -




End-to-end Automatic Sign Language Recognition and Translation of the Hong Kong Sign Language


端到端的自動香港手語識別與翻譯 Leading


RGC - General Research Fund


Project Team (HKUST)
MAK Brian Kan Wing (Lead)


2019 -




Training Big and Deep Neural Networks for Automatic Speech Recognition


訓練用在自動語音辨識上的龐大和深度神經網絡 Leading


RGC - General Research Fund


Project Team (HKUST)
MAK Brian Kan Wing (Lead)


2017 - 2020






相关话题/香港科技大学 工学院