Dr Paul Cook

  • Room: Level: 08 Room: 8.12
  • Building: Doug McDonell Building
  • Campus: Parkville

Research interests

  • Computational linguistics (language technology) with a focus on slang and neologisms (Computational linguistics, Natural language processing, Language technology)

Personal webpage

http://people.eng.unimelb.edu.au/paulcook/

Biography

Dr (Christopher) Paul Cook is a McKenzie Postdoctoral Research Fellow in the Dept of Computing and Information Systems at The University of Melbourne.

Research interests
- Computational linguistics (with a focus on slang and neologisms)
- Natural language processing
- Language technology

Dr Paul Cook was encouraged to apply for the McKenzie Fellowship by a member of staff who examined his PhD thesis. “The opportunity to spend three years doing research at the University of Melbourne with funding that isn’t restricted to any particular project was just too good to pass up,” he says.

“I work in the field of language technology (computer programs that process human language, in the form of text or speech data). My current project is on automatically identifying differences in the ways in which words are used in English dialects. For example, can we automatically identify words, or combinations of words, that are unique to, or have a unique meaning in, a dialect, such as Australian English?

“This project could lead to the discovery of previously-undocumented differences between dialects. This could help to improve dictionaries, which are important tools for language-learners, and also interesting cultural artefacts, and could also lead to the development of language technology tools, such as spelling checkers, that are better suited to particular dialects.”

Recent publications

  1. Lau JH, Cook C, McCarthy D, Gella S, Baldwin T. Learning word sense distributions, detecting unattested senses and identifying novel senses using topic models. 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference. 2014, Vol. 1.
  2. Han B, Cook C, Baldwin T. Text-Based Twitter User Geolocation Prediction. Journal of Artificial Intelligence Research. AI Access Foundation. 2014, Vol. 49.
  3. Salehi B, Cook C, Baldwin T. Using Distributional Similarity of Multi-way Translations to Predict Multiword Expression Compositionality. 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL). ACL Anthology. 2014.
  4. Cook C, Lau J, Rundell M, McCarthy D, Baldwin T. A lexicographic appraisal of an automatic approach for detecting new word-senses. Electronic lexicography in the 21st century: thinking outside the paper. Proceedings of the eLex 2013 conference. Trojina, Institute for Applied Slovene Studies. 2013.
  5. Han B, Cook C, Baldwin T. A Stacking-based Approach to Twitter User Geolocation Prediction. 51st Annual Meeting of the Association for Computational Linguistics. Omnipress. 2013, Vol. 1.
  6. Cook C, Hirst G. Automatically Assessing Whether a Text Is Clichéd, with Applications to Literary Analysis. 9th Workshop on Multiword Expressions (MWE). ACL Anthology. 2013.
  7. Lui M, Cook C. Classifying English Documents by National Dialect. Australasian Language Technology Association Workshop (ALTA). ACL Anthology. 2013, Vol. 11.
  8. Baldwin T, Cook C, Lui M, MacKinlay A, Wang L. How Noisy Social Media Text, How Diffrnt Social Media Sources?. International Joint Conference on Natural Language Processing. ACL Anthology. 2013.
  9. Duong T, Cook C, Bird S, Pecina P. Increasing the quality and quantity of source language data for unsupervised cross-lingual POS tagging. Proceedings of the International Joint Conference on Natural Language Processing. ACL Anthology. 2013.
  10. Han B, Cook C, Baldwin T. Lexical Normalization for Social Media Text. ACM Transactions on Intelligent Systems and Technology. ACM Press. 2013, Vol. 4, Issue 1.
  11. Salehi B, Cook C. Predicting the Compositionality of Multiword Expressions Using Translations in Multiple Languages. *SEM 2013: The Second Joint Conference on Lexical and Computational Semantics. ACL Anthology. 2013, Vol. 1.
  12. Duong T, Cook C, Bird S, Pecina P. Simpler unsupervised POS tagging with bilingual projections. 51st Annual Meeting of the Association for Computational Linguistics. ACL Anthology. 2013, Vol. 2.
  13. Han B, Cook C, Baldwin T. unimelb: Spanish Text Normalisation. Conference of the Spanish Society for Natural Language Processing. SEPLN (Sociedad Española para el Procesamiento del Lenguaje Natural). 2013.
  14. Lau J, Cook C, Baldwin T. unimelb: Topic Modelling-based Word Sense Induction. Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013). Omnipress. 2013, Vol. 2.
  15. Lau J, Cook C, Baldwin T. unimelb: Topic Modelling-based Word Sense Induction for Web Snippet Clustering. Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013). Omnipress. 2013, Vol. 2.

View a full list of publications on the University of Melbourne’s ‘Find An Expert’ profile