Dr Julian Arthur Brooke

  • Room: Level 08, Room 8.12
  • Building: Doug McDonell Building
  • Campus: Parkville

Research interests

  • Computational Linguistics (Stylistics, digital humanities, educational applications)


My general area of interest within computer science is computational linguistics (CL). The research I do is focused mainly on semantics and style, in particular the stylistic aspects of the lexicon, sentiment analysis, and native language identification. I've also been employed to work on projects in other areas of CL, including syntactic and discourse parsing, word prediction, and entity recognition. In my work I use a fairly wide range of statistical and rule-based approaches to CL problems, but I have a certain preference for "big data" methods, i.e. unsupervised and semi-supervised approaches that leverage large internet corpora. I enjoy cross-disciplinary research in general, including applications in education, linguistics, and analysis of literature.

Recent publications

  1. Brooke J, Baldwin T, Hammond A. Bootstrapped Text-level Named Entity Recognition for Literature. 54th Annual Meeting of the Association for Computational Linguistics (ACL). The Association for Computational Linguistics. 2016.
  2. Brooke J, Baldwin T, Uitdenbogerd AL. Melbourne at SemEval 2016 Task 11: Classifying Type-level Word Complexity using Random Forests with Corpus and Word List Features. 10th International Workshop on Semantic Evaluation (SemEval). The Association for Computational Linguistics. 2016.
  3. Tagliamonte SA, Brooke J. A weird (language) tale: Variation and change in the adjectives of strangeness. American Speech. 2014, Vol. 89, Issue 1.
  4. Brooke J, Hirst G. Supervised ranking of co-occurrence profiles for acquisition of continuous lexical attributes. COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers. 2014.
  5. Brooke J, Tsang V, Hirst G, Shein F. Unsupervised multiword segmentation of large corpora using prediction-driven decomposition of n-grams. COLING 2014 - 25th International Conference on Computational Linguistics, Proceedings of COLING 2014: Technical Papers. 2014.
  6. Brooke J, Hirst G. A multi-dimensional Bayesian approach to lexical style. NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference. 2013.
  7. Brooke J, Hirst G. Hybrid Models for Lexical Acquisition of Correlated Styles. International Joint Conference on Natural Language Processing (IJCNLP). 2013.
  8. Brooke J, Hirst G. Measuring Interlanguage: Native Language Identification with L1-influence Metrics. 8th International Conference on Language Resources and Evaluation (LREC). 2012.
  9. Brooke J, Hirst G. Robust, lexicalized native language identification. 24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers. 2012.
  10. Taboada M, Brooke J, Tofiloski M, Voll K, Stede M. Lexicon-based methods for sentiment analysis. Computational Linguistics. MIT Press. 2011, Vol. 37, Issue 2.
  11. Tofiloski M, Brooke J, Taboada M. A syntactic and lexical-based discourse segmenter. ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. 2009.
  12. Brooke J, Tofiloski M, Taboada M. Cross-linguistic sentiment analysis: From English to Spanish. International Conference Recent Advances in Natural Language Processing, RANLP. 2009.
  13. Taboada M, Brooke J, Stede M. Genre-based paragraph classification for sentiment analysis. Proceedings of the SIGDIAL 2009 Conference: 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue. 2009.
  14. Brooke J, Hurst M. Patterns in the stream: Exploring the interaction of polarity, topic, and discourse in a large opinion corpus. International Conference on Information and Knowledge Management, Proceedings. 2009.