
COMP38211 Documents and Data on the Web syllabus 2019-2020

Level 3
Credits: 10
Enrolled students: 73

Course leader: Riza Batista-Navarro



Assessment methods

  • 70% Written exam
  • 30% Report

Timetable

Semester           Event    Location  Day  Time           Group
Sem 1 w1,3         Lecture  IT407     Thu  14:00 - 16:00  -
Sem 1 w2,4-5,7-12  Lecture  1.4       Thu  14:00 - 16:00  -
Sem 1 w7-8         DROP-IN  1.8       Thu  16:00 - 17:00  -

Overview

This course unit will enable students to explore principles and techniques that underpin the web, and to investigate how these are applied to provide webs of documents and data. In so doing, the concepts and standards associated with resource identification, access, indexing, classification/categorisation and scalability will be introduced, along with recurring functionalities such as publication and search.

NB: This 10-credit course unit is a reduced version of the 20-credit course unit COMP38120 Documents, Services and Data on the Web, and will replace it from academic year 2019-20.

Aims

This course unit aims to provide insight into, and experience of, techniques for searching and retrieving documents and data on the web. The fundamental drivers, concepts and techniques for using and maintaining the web of documents and data are presented and discussed in workshop settings, while the techniques are applied and evaluated in practice in the laboratory.

Teaching methods

Lectures, workshops, coursework, and face-to-face mentoring by teaching assistants (TAs).

Study hours

  • Lectures (22 hours)
  • Practical classes & workshops (6 hours)

Learning outcomes

Learning outcomes are unknown for COMP38211.

Reading list

COMP38211 does not have a specified reading list.

Additional notes

Indicative Reading List

Manning, Raghavan and Schütze (2008) Introduction to information retrieval, ISBN:9780521865715

Lin and Dyer (2010) Data-intensive text processing with MapReduce, ISBN:9781608453429

Miner and Shook (2012) MapReduce design patterns: building effective algorithms and analytics for Hadoop and other systems, ISBN:9781449327170

Williams (2012) Economics of cloud computing: an overview for decision makers, ISBN:9781587143069

Heath and Bizer (2011) Linked data: evolving the web into a global data space, ISBN:9781608454303