This is an archived syllabus from 2013-2014
COMP34411 Natural Language Systems syllabus 2013-2014
COMP34411 Natural Language Systems
Level 3
Credits: 10
Enrolled students: 40
Course leader: Allan Ramsay
Additional staff: view all staff
Assessment methods
- 80% Written exam
- 20% Coursework
Semester | Event | Location | Day | Time | Group |
---|---|---|---|---|---|
Sem 1 | Lecture | LF17 | Wed | 10:00 - 10:00 | - |
Sem 1 | Lecture | IT407 | Fri | 13:00 - 13:00 | - |
- Natural Language, Representation and Reasoning
Overview
Enabling computers to use 'natural language' (the kind of language that people use to communicate with one another) is becoming more and more important. It allows people to communicate with them without having to use strange artificial languages and awkward devices like keyboards and mice; and it allows the computer to access the enormous amount of material that is stored as natural language text on the web.This course provides an introduction to this area, mixing theory (if you don't understand the theory of how language works you cannot possibly write programs that understand it) with practice (if you haven't written or played with tools that embody the theory, you can't get a concrete handle on what the theory means).
Aims
The course unit aims to teach the techniques required to extend the theoretical principles of computational linguistics to applications in a number of critical areas.
- To demonstrate how the essential components of pracftical NLP systems are built and modified.
- To introduce the principal applications of NLP, including information retrieval & extraction, spoken language access to software services, and machine translation.
- To explain the major challenges in processing large-scale, real-world natural language.
- To explain the principles underlying speech recognition and synthesis, and to explore the power of 'black box' tools for these tasks.
- To give students an understanding of the issues involved in evaluating NLP systems.
Syllabus
Introduction, motivation, review of NLP principles (1)
Large scale and robust NLP algorithms (3)
Part-of-speech tagging: probabilistic tagging, transformation-based learning
Parsing: chunking, shallow parsing, statistical parsing
Lexical semantics: lexical resources, word sense disambiguation algorithms
Infomation retrieval and extraction (2)
Document matching
Template-filling, free text question answering systems
Summarisation algorithms
Spoken language systems (3)
The nature of speech: vocal tract, acoustic analysis, the phonetics:phonology boundary, local and global phonetic contours
Speech synthesis: formant based synthesis, N-phone based synthesis (coursework 2)
Speech recognition: acoustic features, the role of linguistic constraints
Machine translation (2)
Transfer-based approaches: the MT pyramid, transfer rules
Statistical MT, memory-based MT
Teaching methods
Lectures
11 x 2 hours
Feedback methods
The course contains two pieces of coursework. The first involves writing rules to analyse the structure and/or content of natural language sentences: these rules are tested on a set of examples, and written feedback on their effectiveness is provided.The second exercise involves using speech synthesis software to produce spoken output from input text. All the generated sound files are anonymised and put on the web, and students are required to rank them, with the highest ranked examples being given the highest marks. The task of ranking the examples is part of the exercise, as it carries lessons about the difficulty of evaluating 'soft' computer systems.
Study hours
- Lectures (22 hours)
Employability skills
- Analytical skills
- Problem solving
Learning outcomes
On successful completion of this unit, a student will be able to:
Learning outcomes are detailed on the COMP34411 course unit syllabus page on the School of Computer Science's website for current students.
Reading list
No reading list found for COMP34411.
Additional notes
Course unit materials
Links to course unit teaching materials can be found on the School of Computer Science website for current students.