Manchester Est. 1824
 

Supporting Material for INFO30023: Databases 3

 
 
  • Module Code: INFO30023 (CN-3023)
  • Dept. module Code: CT314
  • Module leader: Dr Goran Nenadic
  • School resposnible: Informatics
  • Credit rating: 15
  • Pre-requisite: CN-1005 (Databases 1)
 
 

News

There will be no lecture on Monday 23rd April 2007. This lecture is rescheduled for Tuesday 1st May 2007, between 2pm - 5pm in Renold/J17.

 
 

Aims

The aim of this module is to provide students with a broad understanding and knowledge of emerging database technologies. Students will learn about data warehousing, OLAP and data mining techniques, XML, textual and multimedia databases, and their role in data and information management.

 
 

Syllabus

  • Emerging database technologies and applications (overview)
  • Data warehousing and OLAP
  • Data mining techniques
  • XML, textual, multimedia and Web databases

 
 

Reading list

  • (textbook) T.M. Connolly, C.E. Begg, Database systems: a practical approach to design, implementation, and management, 4th edition, Addison-Wesley, 2005 (ISBN 0-321-21025-5)
  • (textbook) R. Elmasri & S.B. Navathe, Fundamentals of Database Systems, 4th edition, Addison Wesley, 2004 (ISBN 0-321-20448-4)
  • C.J. Date, An Introduction to Database Systems, 8th ed, Addison Wesley, 2004 (ISBN 0-321-19784-4)
  • L. Dunckley, Multimedia Databases: An Object Relational Approach, Addison-Wesley, 2003 (ISBN 0-201-78899-3)
  • M. Graves, Designing XML Databases, Prentice Hall, 2001, (ISBN 0-13-088901-6)
  • A. Silberschatz, H. Korth, S. Sudarshan, Database System Concepts, 5th edition, McGraw-Hill, 2005 (ISBN 0-07-295886-3)

 
 

Assessment

Assessment consists of 100% examination. The exam consists of one compulsory question (containing short questions) and two long questions (selected from three). Calculators are allowed.

Past examination papers

 
 

 
 

Lecture notes and materials

(lecture notes available during the term time only to the Informatics/CS students)

  • Lecture 1: "Overview of Database Technologies and Architectures" (handouts)

    • Object Data Management Group: http://www.odmg.org
    • Object Database Management Systems, the Resouce Portal for Education and Research: http://www.odbms.org

  • Lecture 2: "Introduction to Data Warehousing" (handouts)

    • Chaudhuri, Dayal: An overview of data warehousing and OLAP technology
    • Case study: EDEKA
    • Case study: Barnes and Noble, Inc.
    • More case studies
    • Oracle9i Warehouse Builder demo
  • Lecture 3: "Introduction to OLAP" (handouts)

    • SQL for Aggregation in Data Warehouses (ORACLE)
    • Microsoft Business Intelligence
    • SQL Server 2000 Resource Kit, Part 5: Data Warehousing
    • OLAP demo from BusinessObjects
    • OLAP report web site

  • Revision 1: Databases, data warehousing and OLAP

  • Lecture 4: "Introduction to Data Mining" (handouts)

    • U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth: From Data Mining to Knowledge Discovery in Databases, AI Magazine 17(3): Fall 1996, pp. 37-54
    • Data mining tutorial (1998)
    • Weka 3: data mining software
    • Weka tutorial
  • Lecture 5: "Mining Association Rules" (handouts)

    • Association rules using WebSphere Commerce Analyzer
    • Association rules tutorial
    • Apriori - Association Rule Induction / Frequent Item Set Mining software
  • Lecture 6: "Introduction to Classification" (handouts)

    • Decision tree tutorial
    • Dtree - Decision and Regression Tree Induction software
    • svmlight: http://www.cs.cornell.edu/People/tj/svm_light/
    • SVMLIB: http://www.csie.ntu.edu.tw/~cjlin/libsvm
    • Precision and recall
  • Lecture 7: "Introduction to Clustering" (handouts - NEW)

    • K-means tutorial
    • Hierarchical clustering example
  • Revision 2: Data mining techniques

  • Lecture 8: "Introduction to Information Extraction and Text Mining" (handouts)

    • Hotho et al.: A Brief Survey of Text Mining
    • M. Hearst: Untangling Text Data Mining
    • Spasic et al.: Text mining and ontologies in biomedicine: Making sense of raw text"
    • http://www.text-mining.org/
    • M. Sharp: Text Mining
    • Text mining tools on the Internet
    • Various case studies: The European Text Analytics Summit 2007
  • Lecture 9: "Introduction to XML Databases" (handouts)

    • XML in 10 points
    • An XML tutorial
    • On-line XML validation

    • XML Database Products: Native XML Databases
    • XQuery/X-Hive demo
    • XQuery!
    • XML Query Use Cases
    • XQuery
  • Lecture 10: "Introduction to Multimedia Databases" (handouts)

    • RIAO conferences on Large-Scale Semantic Access to Content (Text, Image, Video and Sound)
      (see RIAO proceedings 2004)
    • Building the Semantic Web
  • Revision 3: Text, multimedia and XML databases

  • Lecture 11: Revision

 
 

Other resources



  • UK National centre for Text Mining (NaCTeM)
  • www.textmining.net

  • www.w3.org/
  • www.w3schools.com/
  • International conference on Semantics and Digital Media Technologies (SAMT)