Dr Scott Piao
Senior Lecturer in Computer Science
Profile
I am a Senior Lecturer (Associate Professor) in Computer Science in the School of Computing and Communications at Lancaster University. I hold a PhD (Lancaster University) and a PGCert (Postgraduate Certificate in Academic Practice). My research covers Natural Language Processing, Text Mining, Social Computing and related system development. Since completing my PhD, I have worked mostly at Lancaster University, but I also worked as a Research Associate in the Department of Computer Science (Natural Language Processing Group) at Sheffield University (2000-2002) and in the School of Computer Science (The National Centre for Text Mining) at Manchester University (2006-2009).
Research Overview
My research interests span Natural Language Processing, Text Mining, Social Computing and Data Science. I am interested in developing algorithms and tools for automatically analysing information hidden in language data and applying such techniques in various information systems. Over the past years, I have worked on seven major projects funded by EPSRC, ESRC, AHRC and the EU, and served on the program committees of forty-seven leading international conferences covering Natural Language Processing, Social Computing, Big Data and Corpus Linguistics. I served as an Area Chair for the LREC-COLING 2024 Conference, and I am currently an Area Chair for the COLING 2025 Conference. Recently, I have focused on exploring Large Language Models (LLMs) and generative AI models for the automatic analysis of semantic information in language data.
Web Links
My ORCID ID: 0000-0003-3890-6521
Current Teaching
1) From 2018 - present: CNSCC110: Software Development (Part 1)
2) From 2018 - 2021: CNSCC130: Information Systems (Part 1)
3) From 2019 - present: CNSCC311: Distributed Systems
My Role
2019-present: Director of BJTU Scheme, School of Computing and Communications, Lancaster University.
2019-present: Director of Computer Science Programme, Lancaster University College at Beijing Jiaotong University.
External Roles
- Member of The Association for Computational Linguistics (ACL) (http://www.aclweb.org).
- Consultant on semantic tagger development (January 2018 ~ December 2020) in the Project "Financial Text Analytics in Spanish: Tools and Language Resources", led by The Computational Linguistics Laboratory ('Laboratorio de Lingüística Informática', LLI) of the Autonomous University of Madrid, Spain (www.lllf.uam.es).
- From February 2007 to April 2009, member of OASIS (Organization for the Advancement of Structured Information Standards) UIMA (Unstructured Information Management Architecture) TC.
PhD Supervision Interests
If you are interested in doing PhD research in any of the following areas, you are welcome to contact me: a) Natural Language Processing (NLP) and Text Mining; b) Application of NLP to Digital Health; c) Application of NLP to information systems.
The National Corpus of Contemporary Welsh
01/03/2016 → 31/08/2019
Research
IEEE Global Communications Conference 2024
Participation in conference -Mixed Audience
The 7th Healthcare Text Analytics Conference
Participation in conference -Mixed Audience
Joint Workshop on Multiword Expressions and Universal Dependencies, in LREC-COLING 2024
Participation in workshop, seminar, course
The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Participation in conference - Academic
IEEE Global Communications 2023 Conference
Participation in conference -Mixed Audience
The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics 2023
Participation in conference - Academic
The twelfth International Corpus Linguistics Conference
Participation in conference - Academic
The 19th Workshop on Multiword Expressions in EACL 2023
Participation in workshop, seminar, course
The 17th Conference of the European Chapter of the Association for Computational Linguistics
Participation in conference - Academic
The 2022 IEEE Global Communications Conference
Participation in conference -Mixed Audience
The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing
Participation in conference -Mixed Audience
The Twelfth International Conference on Social Media Technologies, Communication, and Informatics
Participation in conference -Mixed Audience
8th Workshop on Multiword Expressions
Participation in workshop, seminar, course
The 13th Edition of Language Resources and Evaluation Conference
Participation in conference -Mixed Audience
2021 IEEE Global Communications Conference
Participation in conference -Mixed Audience
The 2021 Conference on Empirical Methods in Natural Language Processing
Participation in conference -Mixed Audience
17th Workshop on Multiword Expressions (in ACL-IJCNLP 2021)
Participation in workshop, seminar, course
2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Participation in conference -Mixed Audience
2020 IEEE Global Communications Conference
Participation in conference -Mixed Audience
The 2020 Conference on Empirical Methods in Natural Language Processing
Participation in conference -Mixed Audience
The 12th Language Resources and Evaluation Conference
Participation in conference -Mixed Audience
The Ninth International Conference on Social Media Technologies, Communication, and Informatics 2019
Participation in conference -Mixed Audience
Joint Workshop on Multiword Expressions and WordNet at ACL2019
Participation in conference -Mixed Audience
The 10th International Corpus Linguistics Conference
Participation in conference -Mixed Audience
The North American Chapter of the Association for Computational Linguistics - Human Language Technologies 2019
Participation in conference -Mixed Audience
Hypothesis Generating in Genetics and Biomedical Text Mining
Participation in workshop, seminar, course
The 2019 IEEE Global Communications Conference
Participation in conference -Mixed Audience
Using CorCenCC: Corpws Cenedlaethol Cymraeg Cyfoes – National Corpus of Contemporary Welsh
Participation in workshop, seminar, course
The 2018 IEEE Global Communications Conference
Participation in conference -Mixed Audience
The Eighth International Conference on Social Media Technologies, Communication, and Informatics
Participation in conference -Mixed Audience
The 11th Edition of the Language Resources and Evaluation Conference (LREC2018)
Participation in conference -Mixed Audience
The 1st Financial Narrative Processing Workshop in LREC 2018
Participation in workshop, seminar, course
The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Participation in conference -Mixed Audience
Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions, in COLING 2018
Participation in workshop, seminar, course
The Seventh International Conference on Social Media Technologies, Communication, and Informatics
Participation in conference -Mixed Audience
The Corpus Linguistics Conference 2017
Participation in conference -Mixed Audience
The 13th Workshop on Multiword Expressions at EACL 2017
Participation in workshop, seminar, course
2017 IEEE Global Communications Conference (Globecom2017) (Event)
Membership of committee
Special Issue of the Phraseology and Multiword Expressions (PMWE) book series of Language Science Press (LangSci) (Event)
Membership of committee
The Association for Computational Linguistics (ACL) (External organisation)
Member of an organisation
IEEE GLOBECOM 2016 (Event)
Membership of committee
The 12th Workshop on Multiword Expressions in ACL 2016 Conference
Participation in workshop, seminar, course
The 10th Language Resources and Evaluation Conference (LREC2016)
Participation in conference -Mixed Audience
The Fifth ASE International Conference on Big Data (BigData 2015)
Participation in conference -Mixed Audience
The Fourth ASE International Conference on Social Informatics 2015
Participation in conference -Mixed Audience
The 11th Workshop on Multiword Expressions (MWE 2015) in NAACL 2015
Participation in workshop, seminar, course
The Eighth International Corpus Linguistics Conference (CL2015)
Participation in conference -Mixed Audience
The Seventh IEEE International Conference on Social Computing and Networking (SocialCom 2014)
Participation in conference -Mixed Audience
The Third ASE International Conference on Social Informatics (2014)
Participation in conference -Mixed Audience
The Seventh ASE International Conference on Social Computing (2014)
Participation in conference -Mixed Audience
The Sixth ASE International Conference on Social Computing (2014)
Participation in conference -Mixed Audience
The 10th Workshop on Multiword Expressions in EACL 2014 (Event)
Membership of committee
2013 ASE/IEEE International Conference on Social Computing
Participation in conference -Mixed Audience
The 2013 International Workshop on Social Computing, Network, and Services (SocialComNet 2013)
Participation in workshop, seminar, course
Corpus Linguistics Conference 2013
Participation in conference -Mixed Audience
The 9th Workshop on Multiword Expressions (MWE 2013) Workshop at NAACL 2013
Participation in workshop, seminar, course
The First ASE/IEEE International Conference on Social Informatics
Participation in conference -Mixed Audience
The 2012 ASE/IEEE International Conference on Social Computing (SocialCom 2012)
Participation in conference -Mixed Audience
The Eighth International Conference on Language Resources and Evaluation (LREC 2012)
Participation in conference -Mixed Audience
ACM Transactions on Speech and Language Processing (Journal)
Publication peer-review
The Third IEEE International Conference on Social Computing
Participation in conference -Mixed Audience
Corpus Linguistics 2011 Conference
Participation in conference -Mixed Audience
Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011) Workshop at ACL 2011
Participation in workshop, seminar, course
Multiword Expressions: from Theory to Applications (MWE 2010) Workshop at COLING 2010
Participation in conference -Mixed Audience
The Seventh International Conference on Language Resources and Evaluation (LREC 2010)
Participation in conference -Mixed Audience
The Conference of eLexicography in the 21st Century: New Challenges, New Applications
Participation in conference -Mixed Audience
Multiword Expressions: Identification, Interpretation, Disambiguation and Applications (MWE 2009) at the ACL/IJCNLP 2009 Conference
Participation in conference -Mixed Audience
Corpus Linguistics 2009 Conference
Participation in conference -Mixed Audience
The Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Participation in conference -Mixed Audience
The Sixth SIGHAN Workshop on Chinese Language Processing of the Third International Joint Conference on Natural Language Processing (IJCNLP 2008)
Participation in workshop, seminar, course
International Journal of Language Resources and Evaluation (Journal)
Editorial activity
SCC (Data Science), UCREL - University Centre for Computer Corpus Research on Language
- DSI - Foundations
- SCC (Data Science)
- UCREL - University Centre for Computer Corpus Research on Language