Skip to navigation | Skip to main content | Skip to footer
Menu
Menu

This is an archived syllabus from 2013-2014

COMP34411 Natural Language Systems syllabus 2013-2014

COMP34411 Natural Language Systems

Level 3
Credits: 10
Enrolled students: 40

Course leader: Allan Ramsay


Additional staff: view all staff

Assessment methods

  • 80% Written exam
  • 20% Coursework
Timetable
SemesterEventLocationDayTimeGroup
Sem 1 Lecture LF17 Wed 10:00 - 10:00 -
Sem 1 Lecture IT407 Fri 13:00 - 13:00 -
Themes to which this unit belongs
  • Natural Language, Representation and Reasoning

Overview

Enabling computers to use 'natural language' (the kind of language that people use to communicate with one another) is becoming more and more important. It allows people to communicate with them without having to use strange artificial languages and awkward devices like keyboards and mice; and it allows the computer to access the enormous amount of material that is stored as natural language text on the web.

This course provides an introduction to this area, mixing theory (if you don't understand the theory of how language works you cannot possibly write programs that understand it) with practice (if you haven't written or played with tools that embody the theory, you can't get a concrete handle on what the theory means).

Aims

The course unit aims to teach the techniques required to extend the theoretical principles of computational linguistics to applications in a number of critical areas.

  • To demonstrate how the essential components of pracftical NLP systems are built and modified.
  • To introduce the principal applications of NLP, including information retrieval & extraction, spoken language access to software services, and machine translation.
  • To explain the major challenges in processing large-scale, real-world natural language.
  • To explain the principles underlying speech recognition and synthesis, and to explore the power of 'black box' tools for these tasks.
  • To give students an understanding of the issues involved in evaluating NLP systems.

Syllabus

Introduction, motivation, review of NLP principles (1)

Large scale and robust NLP algorithms (3)

Part-of-speech tagging: probabilistic tagging, transformation-based learning

Parsing: chunking, shallow parsing, statistical parsing

Lexical semantics: lexical resources, word sense disambiguation algorithms

Infomation retrieval and extraction (2)

Document matching

Template-filling, free text question answering systems

Summarisation algorithms

Spoken language systems (3)

The nature of speech: vocal tract, acoustic analysis, the phonetics:phonology boundary, local and global phonetic contours

Speech synthesis: formant based synthesis, N-phone based synthesis (coursework 2)

Speech recognition: acoustic features, the role of linguistic constraints

Machine translation (2)

Transfer-based approaches: the MT pyramid, transfer rules

Statistical MT, memory-based MT

Teaching methods

Lectures

11 x 2 hours

Feedback methods

The course contains two pieces of coursework. The first involves writing rules to analyse the structure and/or content of natural language sentences: these rules are tested on a set of examples, and written feedback on their effectiveness is provided.

The second exercise involves using speech synthesis software to produce spoken output from input text. All the generated sound files are anonymised and put on the web, and students are required to rank them, with the highest ranked examples being given the highest marks. The task of ranking the examples is part of the exercise, as it carries lessons about the difficulty of evaluating 'soft' computer systems.

Study hours

  • Lectures (22 hours)

Employability skills

  • Analytical skills
  • Problem solving

Learning outcomes

On successful completion of this unit, a student will be able to:

Learning outcomes are detailed on the COMP34411 course unit syllabus page on the School of Computer Science's website for current students.

Reading list

No reading list found for COMP34411.

Additional notes

Course unit materials

Links to course unit teaching materials can be found on the School of Computer Science website for current students.