Skip to navigation | Skip to main content | Skip to footer
Menu
Menu

This is an archived syllabus from 2020-2021

COMP38211 Documents and Data on the Web syllabus 2020-2021

COMP38211 Documents and Data on the Web

Level 3
Credits: 10
Enrolled students: 117

Course leader: Goran Nenadic


Additional staff: view all staff

Assessment methods

  • 100% Coursework
Timetable
SemesterEventLocationDayTimeGroup
Sem 1 ONLINE Lecture Fri 15:00 - 17:00 -
Sem 1 w3-12 ONLINE LabORATORY Fri 17:00 - 18:00 -

Overview

This course unit will enable students to explore principles and techniques that underpin the web, and to investigate how these are applied to provide webs of documents and data. In so doing, the concepts and standards associated with resource identification, access, indexing, classification/categorisation and scalability will be introduced, along with recurring functionalities such as publication and search.

This course unit detail provides the framework for delivery in 20/21 and may be subject to change due to any additional Covid-19 impact.  Please see Blackboard / course unit related emails for any further updates.


Aims

This course unit is aimed at providing insights into and experience of techniques relating to searching and retrieving documents and data on the web. Fundamental drivers, concepts and techniques for using and maintaining the web of documents and data are presented and discussed in workshop settings, while techniques in practice are applied and evaluated in the laboratory.

Syllabus

At the end of this course a student will be able to:
 
identify the key properties of web architectures and standards
identify key challenges in indexing and searching documents and data available on the web
discuss how key web properties have been applied to the document web, and distinguish between different techniques for document indexing, classification/categorisation and search
discuss how key web properties have been applied to the web of data, and demonstrate how they underpin data publication and consumption
apply and evaluate techniques employed in cloud services, within the context of tasks dealing with data and document webs

Teaching methods

Lectures, workshops, coursework, face to face mentoring by TAs.

Feedback methods

Workshops, practicals, surgeries, coursework

Study hours

  • Lectures (22 hours)
  • Practical classes & workshops (6 hours)

Learning outcomes

On successful completion of this unit, a student will be able to:

At the end of this course a student will be able to:

  • identify the key properties of web architectures and standards
  • identify key challenges in indexing and searching documents and data available on the web
  • discuss how key web properties have been applied to the document web, and distinguish between different techniques for document indexing, classification/categorisation and search
  • discuss how key web properties have been applied to the web of data, and demonstrate how they underpin data publication and consumption
  • apply and evaluate techniques employed in cloud services, within the context of tasks dealing with data and document webs

 

Reading list

No reading list found for COMP38211.

Additional notes

Indicative Reading List

Manning, Raghavan and Schutze (2008) Introduction to information retrieval, ISBN:9780521865715

Lin and Dyer (2010) Data-intensive text processing with MapReduce, ISBN:9781608453429

Miner and Shook (2012) MapReduce design patterns: building effective algorithms and analytics for Hadoop and other systems, ISBN:9781449327170

Williams (2012) Economics of cloud computing: an overview for decision makers, ISBN:9781587143069

Heath and Bizer (2011) Linked data: evolving the web into a global data space, ISBN:9781608454303