
Pinzhen Chen
I am a final-year PhD student in the School of Informatics at the University of Edinburgh, supervised by Kenneth Heafield and Barry Haddow. I am a member of the large machine translation group, EdinburghNLP, and the Institute for Language, Cognition and Computation.
My research is on neural machine translation and low-resource data augmentation. Recently, I have also been exploring the applications of large language models in multilingual scenarios.
I also go by Patrick, or 陈品桢 in Chinese. [pinzhen.chen@ed.ac.uk | Semantic Scholar | Google Scholar | LinkedIn]
Experience
- 2020-present, University of Edinburgh, PhD student
- 2022, Huawei Noah's Ark Lab, Research Scientist Intern
- 2019, University of Edinburgh, Research Assistant
- 2015-2019, University of Edinburgh, BEng Artificial Intelligence and Software Engineering
- 2018, Goldman Sachs, Technology Analyst Intern
Papers
- Monolingual or multilingual instruction tuning: Which makes a better Alpaca
  Pinzhen Chen, Shaoxiong Ji, Nikolay Bogoychev, Barry Haddow, and Kenneth Heafield. 2023. arXiv preprint.
  [pdf | bib]
- Iterative translation refinement with large language models
  Pinzhen Chen, Zhicheng Guo, Barry Haddow, and Kenneth Heafield. 2023. arXiv preprint.
  [pdf | bib]
- Towards Effective Disambiguation for Machine Translation with Large Language Models
  Vivek Iyer, Pinzhen Chen, and Alexandra Birch. 2023. arXiv preprint.
  [pdf | bib]
- PMIndiaSum: Multilingual and cross-lingual headline summarization for languages in India
  Ashok Urlana, Pinzhen Chen, Zheng Zhao, Shay B. Cohen, Manish Shrivastava, and Barry Haddow. 2023. arXiv preprint.
  [pdf | bib | code and data]
- Exploring data augmentation for code generation tasks
  Pinzhen Chen and Gerasimos Lampouras. 2023. In Findings of the Association for Computational Linguistics: EACL 2023.
  [pdf | bib | poster | talk]
- A unified model for reverse dictionary and definition modelling
  Pinzhen Chen and Zheng Zhao. 2022. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing.
  [pdf | bib | poster | talk | code]
- Edinburgh at SemEval-2022 task 1: Jointly fishing for word embeddings and definitions
  Pinzhen Chen and Zheng Zhao. 2022. In Proceedings of the 16th International Workshop on Semantic Evaluation.
  [pdf | bib | poster | talk | code | best paper honorable mention out of 221]
- Approaching neural Chinese word segmentation as a low-resource machine translation task
  Pinzhen Chen and Kenneth Heafield. 2022. In Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation.
  [pdf | bib | best paper award out of 94]
- To adapt or to fine-tune: A case study on abstractive summarization
  Zheng Zhao and Pinzhen Chen. 2022. In Proceedings of the 21st Chinese National Conference on Computational Linguistics.
  [pdf | bib | poster | code]
- The University of Edinburgh's English-German and English-Hausa submissions to the WMT21 news translation task
  Pinzhen Chen, Jindřich Helcl, Ulrich Germann, Laurie Burchell, Nikolay Bogoychev, Antonio Valerio Miceli Barone, Jonas Waldendorf, Alexandra Birch, and Kenneth Heafield. 2021. In Proceedings of the Sixth Conference on Machine Translation.
  [pdf | bib | poster]
- The highs and lows of simple lexical domain adaptation approaches for neural machine translation
  Nikolay Bogoychev and Pinzhen Chen. 2021. In Proceedings of the Second Workshop on Insights from Negative Results in NLP.
  [pdf | bib | poster]
- Parallel sentence mining by constrained decoding
  Pinzhen Chen, Nikolay Bogoychev, Kenneth Heafield, and Faheem Kirefu. 2020. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
  [pdf | bib | talk | code]
- Character mapping and ad-hoc adaptation: Edinburgh's IWSLT 2020 open domain translation system
  Pinzhen Chen, Nikolay Bogoychev, and Ulrich Germann. 2020. In Proceedings of the 17th International Conference on Spoken Language Translation.
  [pdf | bib]
- ParaCrawl: Web-scale acquisition of parallel corpora
  Marta Bañón, Pinzhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, and Jaume Zaragoza. 2020. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
  [pdf | bib | website]
- Sentence and word weighting for neural machine translation domain adaptation
  Pinzhen Chen. 2019. Undergraduate thesis.
  [pdf | bib]
Professional Services
- Program Committee/Reviewer:
  - ACL Rolling Review (ARR), 2021, 2023
  - Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
  - Joint Conference on Lexical and Computational Semantics (*SEM), 2022, 2023
  - Conference on Machine Translation (WMT), 2021, 2022
  - International Workshop on Semantic Evaluation (SemEval), 2022
- Teaching Assistant at University of Edinburgh:
  - Machine Learning Practical, Mentor and Marker, 2020-21, 2021-22, 2022-23
    - In 2022-23, one research project I supervised was shortlisted for a best project prize donated by IBM UK, 5 out of 88
  - Introductory Applied Machine Learning, Marker, 2020-21, 2021-22
  - Informatics Research Proposal, Tutor, 2020-21
  - System Design Project, Mentor, 2018-19
Personal
I enjoy travelling and cooking. I sometimes play badminton and basketball, as well as board and card games.
Last updated on 22 Sep 2023.