School of Computing

FACULTY OF ENGINEERING

 

NLP research group ALUMNI (with Supervisor)

Claire Brierley (EA)Prosody resources and symbolic prosodic features for automated phrase break prediction. 2011
Owen Tregurtha Nancarrow (EA)A comparative study of the tagging of adverbs in modern English corpora. 2011
Majdi Shaker Sawalha (EA)Open-source resources and standards for Arabic word structure analysis: fine grained morphological analysis of Arabic text corpora. 2011
Fangzhong Su (KM,EA) Computational modelling of word sense sentiment. 2011
Noorhan Abbas (EA)Qurany: A Tool to Search for Concepts in the Quran (PDF). 2009
Andrew Roberts (EA)
Grammatical Inference and Corpus Linguistics (PDF). 2008
Eric
Atwell

Corpus Linguistics and Language Learning: Bootstrapping Linguistic Knowledge and Resources from Text (PDF). 2008
Debra
Elliott
(EA,AH)
Corpus-based machine translation evaluation via automated error detection in output texts. 2007
Bayan
Abu Shawar
(EA)
A Corpus Based Approach to Generalise a Chatbot System (PDF). 2005
Bogdan(AH,EA)
Babych
Information Extraction techniques in Machine Translation. 2005
Mandy
Schiffrin
(CS)
Modelling Speech Acts in Conversational Discourse (PDF). 2005
John
Elliott (EA)
Natural Language Learning for the Search for Extraterrestrial Intelligence. 2004
Latifa
Al-Sulaiti
(EA)
Designing and Developing a Corpus of Contemporary Arabic (PDF). 2004
Toshifumi
Oba
(EA)
Using the HTK Speech Recogniser to Analyse Prosody in a Corpus of German Spoken Learner's English (PDF). 2003
Xiao Yuan
Duan
(EA)
Lexical Semantic Association Between Web Documents (PDF). 2002
Menno
van Zaanen
(RB)
Bootstrapping Structure into Language: Alignment-Based Learning (PDF). 2002
Xuegang
Wang
(PM)
Negation in logic and deductive databases (PDF). 2000
George
Demetriou
(EA,CS)
Lexical semantic information processing for large vocabulary human-computer speech communication. 1997
Clive
Souter (EA)
A corpus-trained parser for systemic-functional syntax (PDF). 1996
Adam
Bull
(EA)
The formal description of aerobic dance exercise. 1996
Gavin
Churcher
(EA,CS)
Improving the performance of speech driven applications using linguistic knowledge . 1996
Michael
Schillo
(EA)
Working while driving: corpus based modelling of a natural English voice user-interface to the in-car personal assistant (PDF). 1996
Xiaoda
Zhang
(EA)
MIRTH Chinese and English search engine: a multilingual retrieval tool hierarchy for a World Wide Web virtual corpus. 1996
Nik
Silver
(PM)
Inferencing methods using systemic functional grammar. 1995
Uwe
Jost

(EA)
Probabilistic language modelling for speech recognition. 1995
Simon
Arnfield
(EA)
Prosody and syntax in corpus-based analysis of spoken English. 1994
Alec
Grierson
(RP)
Generating cohesive texts from simulations used in computer-aided instruction. 1994
John S
Hughes
(EA)
Automatically acquiring a classification of words (PDF). 1994
Tim
O'Donoghue
(EA)
Reversing the process of generation in Systemic Grammar. 1993

NATURAL LANGUAGE PROCESSING research group

Language research in Computing is also known as Natural Language Processing , Computational Linguistics , Corpus Linguistics ,or Language Engineering . Central to our research is the computational modelling of language data; a CORPUS is a text dataset representative of the language to be analysed. Our research at Leeds University focusses on bootstrapping linguistic knowledge and resources from text, and is reported in our PUBLICATIONS.
Google Scholar
Language Research Group graduates have gone on to work in Web search, text analytics, translation and language consulting, online news, voice-to-text, the Search for Extra-Terrestrial Intelligence, and, of course, as University academics !

Staff

Eric Atwell Eric Atwell, Senior Lecturer
Corpus Linguistics, Machine Learning and Data Mining with text in English, Arabic, and other languages; Text Analytics applied to Understanding the Quran, detecting terrorist activites, and Healthcare patient records. PUBLICATIONS.
Katja Markert Katja Markert, Senior Lecturer (on sabbatical 2011-12)
Data-intensive, corpus-based and web-based natural language processing, Anaphora Resolution, Figurative Language Resolution, Textual Entailment, Sentiment Analysis. PUBLICATIONS.

Claire Brierley Claire Brierley, Senior Research Fellow
Corpus Linguistics, prosodic phrase break prediction, text analytics applied to detecting terrorist activities PUBLICATIONS.
Owen Johnson Owen Johnson, Senior Fellow
Corpus linguistics for Health Informatics, Healthcare patient records research methods. PUBLICATIONS.
Majdi Sawalha Majdi Sawalha, Research Fellow
A Web-as-Corpus approach to collating teaching resources for Islamic Studies; Corpus-based Arabic morphological analysis. PUBLICATIONS.
Justin Washtell Justin Washtell, Research Fellow
Corpus-based distributional models of lexical semantics PUBLICATIONS.

We collaborate with academic staff in the Centre for Translation Studies :

Tony Hartley Tony Hartley
Evaluation of machine translation systems, Controlled languages, Natural Language Generation, Quality in translation and interpreting, Computer Supported Collaborative Working. PUBLICATIONS
Serge Sharoff Serge Sharoff
Corpus linguistics, Natural Language Understanding, Natural Language Generation, Lexical semantics, Systemic-Functional grammar. PUBLICATIONS

At the German Christmas Market in Leeds
R-to-L: Eric Atwell, Majdi Sawalha, Justin Washtell, Claire Brierley, Fangzhong Su, Josiah Wang, Owen Tregurtha Nancarrow - at the German Christmas Market next to Leeds University

Research Students

STUDENT +Supervisor(s)RESEARCH TOPIC
Amal Alsaif (KM)An Automatic analyser of Discourse structure for Arabic
Samuel Danso (EA,OJ) Text Analytics to Predict Cause of Death in Verbal Autopsies
Kais Dukes (EA)Arabic Language Computing Applied to the Quran
Saman Hina (EA,OJ) SNOMED semantic tagger for medical corpus linguistics
Svitlana Kurella (AH,SS)Methodology for computer-assisted acquisition of reading abilities in L3
Andrew McKinlay (KM) Automatic Detection of Discourse Structure: Relation and Entity Graphs
Noushin Rezapour Asheghi (SS,KM)Genre classification of web pages
Alina Secară (AH,SS)Developing a semi-automatic subtitler's workbench within the systemic functional grammar framework
Abdul-Baquee Muhammad Sharaf (EA) A Computational Model for Knowledge Representation of the Quran
Josiah Wang (ME,KM) Learning Visual Object Recognition from Text
Justin Washtell (EA,KM) The benefits of proximity as opposed to frequency as a basis for modelling language


Kais Dukes - University of Leeds Engineering Postgraduate Researcher of the year 2011

Potential research collaborators and PhD students are very welcome to contact any of the academic staff (Atwell, Markert, Hartley, and Sharoff). Please send us an outline project proposal (see guidelines and example project ideas). You can apply for our PhD programme online.


Selected Publications

or you can select a fuller list
Brierley, C; Atwell, ES Non-Traditional Prosodic Features for Automated Phrase-Break Prediction. Literary and Linguistic Computing Journal, vol. 26, pp.279-284. 2011.LINK
Dukes, K; Atwell, ES; Habash, N Supervised Collaboration for Syntactic Annotation of Quranic Arabic. Language Resources and Evaluation Journal, pp.1-30. 2011.LINK
Rapp, AM; Erb, M; Langohr, K; Markert, K Neural Correlates of Metonymy Comprehension in Schizophrenia in: Schizophrenia Bulletin, vol. 37, pp.150-150. Oxford University Press. 2011.
Brierley, C; Atwell, ES Holy smoke: vocalic precursors of phrase breaks in Milton's Paradise Lost. Literary and Linguistic Computing Journal, vol. 25, pp.137-151. 2010.LINK
DOI
Hina, S; Atwell, ES; Johnson, O Semantic Tagging of Medical Narratives with Top Level Concepts from SNOMED CT Healthcare Data Standard. International Journal of Intelligent Computing Research (IJICR), vol. 1, pp.118-123. 2010.LINK
Markert, K; Nissim, M Data and models for metonymy resolution. Language Resources and Evaluation, vol. 43, pp.123-138. 2009.

Natural Language Processing research seminars at Leeds University

... and links to related Leeds University research groups in Modern Languages, Knowledge Management, Translation Studies, Linguistics and Phonetics, Modern Languages and Cultures, Language Education, English.

Bookmarks for corpus-based linguistics

International Conferences in Language Computing and Computer Science