Skip to navigation | Skip to main content | Skip to footer
Menu
Menu

Current postgraduate taught students

COMP60370: Semi-structured Data and the Web (2007-2008)

This is an archived syllabus from 2007-2008

Semi-structured Data and the Web
Level: 6
Credit rating: 15
Pre-requisites: Good familiarity with relational databases and programming
Co-requisites: No Co-requisites
Lecturers: Bijan Parsia, Uli Sattler
Course lecturers: Bijan Parsia

Uli Sattler

Additional staff: view all staff
Timetable
SemesterEventLocationDayTimeGroup
Sem 2 w19-23 Lecture 2.19 Mon 09:00 - 17:00 -
Assessment Breakdown
Exam: 50%
Coursework: 50%
Lab: 0%

Introduction

This course unit is divided into two parts: one on Semi-Structured Data and its use in Web based applications. Semi-structured data focuses on describing and querying data that comes in a format less tightly structured than that found in relational databases. Such data is dominant on the Web, from HTML pages and weblog feeds to SOAP messages and vector graphics and are a key component of Web based applications from Web sites to Web Services.

Aims

The semi-structured data part of the course unit aims to give students
a good overview of the ideas and the techniques which are behind the
description and query mechanisms for semi-structured data. We discuss
semi-structured data and their representation, XML, Schemata for XML
data (DTD and XMLSchema), processing and manipulating XML data (XPath,
XQuery), and some theoretical aspects of XML data
processing.

The Web applications part of the course aims to give students a good
foundation in the variety of architectures and design patterns used to
build robust, interoperable Web based programs. We examine the
two primary general contenders (Representational State Transfer/Web Arch
and Service Oriented Architecture) as well as concrete technology which
embodies and enables these architectures.

Laboratory sessions
will ground the abstract notions on practical cases and tools.

Learning Outcomes

A student completing this course unit should:

1. have an understanding of the foundations semi-structured data and
their representation, XML, Schemata for XML data (DTD and XMLSchema),
processing and manipulating XML data (XPath, XQuery), and some
theoretical aspects of XML data processing. (A)

2. have mastered the basic range of techniques for representing,
modelling, and querying semi-structured data, and be able to use
tools developed for them. (B, C and D)

3. have an understanding of Web and service oriented architectures,
their motivations, applicability, advantages and pitfalls. (A)

4. have an understanding of the principles, methods, design tradeoffs,
technologies and tools that underpin successful Web applications (A)

5. have mastered the basic range of techniques for building Web applications
using standard tooling (B, C and D)

Assessment of Learning outcomes

Learning outcomes (1) and (3) are assessed by examination, learning outcome (2), (4), and (5) by examination and in the laboratory.

Contribution to Programme Learning Outcomes

A2, B2, B3, C2, D3, D4

Syllabus

Part I
Introduction: Semi-structured data.
XML: core concepts
DTDs, a simple schema language for XML documents
XPath, a navigation language for XML documents
XML namespace: a concept ignored so far
XSLT, a transformation language for XML documents
DOM and SAX, a programmatic manipulation language for XML documents
XML Schema, a more expressive schema language for XML documents
XQuery, a query language for XML documents
Storing XML documents in RDBMSs
Part II
Introduction: Web based applications
Web Architecture
URIs, HTTP, and (X)HTML
REST and "Semantic" Markup
Service Oriented Architecture
Message and data formats (e.g., SOAP)
Service description (e.g., WSDL, WS-Policy)
Composition, Orchestration, and Choreography
(Rich) Internet Clients

Reading List

There's no need for the students taking the course to buy any book. However, there are some resources that a student may wish to consult:

W3C documents at http://www.w3.org/TR/...

Special resources


We will use the XML Editor, Firefox and other web browsers as well as a web server + Web Services stack for the laboratory sessions.

Supplementary Text
Title: Data on the Web From Relations to Semistructured Data and XML
Author: Serge Abiteboul, Peter Buneman, Dan Suciu
ISBN: 155860622X
Publisher: Elsevier Science & Technology
Edition:
Year: 1999


Supplementary Text
Title: Building Web Services with Java: Making Sense of XML, SOAP, WSDL, and UDDI: Making Sense of XML, Soap, WSDL and UDDI
Author: S. Graham, D. Davis, S. Simeonov, G. Daniels, P. Brittenham, Y. Nakamura, P. Fremantle, D. Koenig, C. Zentner
ISBN: 0672326418
Publisher: Pearson Education
Edition: 2nd
Year: 2004


Supplementary Text
Title: Essence of XML
Author: J. Simeon and P. Wadler
ISBN:
Publisher: POPL
Edition:
Year: 2003