Advanced Data Structures

Credits
6
Types
Specialization compulsory (Advanced Computing)
Requirements
This subject has no requirements

Department
CS
Mail
Efficient strategies and techniques to structure data are key in modern computer science for designing fast algorithms that are useful in a variety of everyday applications (such as web archiving, mail servers, network routers, and video games).
This course explores selected topics on fundamental data structures that may be multidimensional, metric, geometric, kinetic, self-adjusting, concurrent, distributed, etc.
For each topic, the course covers the major results and the characteristic methods of analysis, as well as possible directions of research.

Teachers

Person in charge

  • Amalia Duch Brown

Others

  • Conrado Martínez Parra
  • Joaquin Gabarró Vallés
  • Salvador Roura Ferret
  • Xavier Messeguer Peypoch

Weekly hours

Theory
2
Problems
1
Laboratory
0
Guided learning
0
Autonomous learning
6

Competences

Technical Competences of each Specialization

Advanced computing

  • CEE3.1 - Capability to identify computational barriers and to analyze the complexity of computational problems in different areas of science and technology as well as to represent high complexity problems in mathematical structures which can be treated effectively with algorithmic schemes.
  • CEE3.2 - Capability to use a wide and varied spectrum of algorithmic resources to solve high difficulty algorithmic problems.

Generic Technical Competences

Generic

  • CG1 - Capability to apply the scientific method to the study and analysis of phenomena and systems in any area of Computer Science, and to the conception, design and implementation of innovative and original solutions.
  • CG3 - Capacity for mathematical modeling, calculation and experimental design in technology centers and company engineering centers, particularly in research and innovation in all areas of Computer Science.

Transversal Competences

Information literacy

  • CTR4 - Capability to manage the acquisition, structuring, analysis and visualization of data and information in the area of informatics engineering, and critically assess the results of this effort.

Reasoning

  • CTR6 - Capacity for critical, logical and mathematical reasoning. Capability to solve problems in their area of study. Capacity for abstraction: the capability to create and use models that reflect real situations. Capability to design and implement simple experiments, and analyze and interpret their results. Capacity for analysis, synthesis and evaluation.

Basic

  • CB6 - Ability to apply the acquired knowledge and capacity for solving problems in new or unknown environments within broader (or multidisciplinary) contexts related to their area of study.
  • CB8 - Capability to communicate their conclusions, and the knowledge and rationale underpinning these, to both skilled and unskilled public in a clear and unambiguous way.
  • CB9 - Possession of the learning skills that enable the students to continue studying in a way that will be mainly self-directed or autonomous.

Objectives

  1. Become acquainted with the main and classic data structures of central areas of computer science and identify their major properties.
    Related competences: CB8, CB9, CTR4
  2. Become familiar with the mathematical tools usually used to analyze the performance of data structures.
    Related competences: CG3, CEE3.1, CEE3.2, CB9, CTR6
  3. Examine ideas, analysis and implementation details of data structures in order to assess their fitness to different classes of problems.
    Related competences: CG1, CG3, CEE3.1, CEE3.2, CB6, CB8, CB9, CTR4, CTR6
  4. Select, design and implement appropriate data structures to solve given problems.
    Related competences: CG1, CG3, CEE3.1, CEE3.2, CB6, CB9, CTR4, CTR6

Contents

  1. Preliminaries.
    Review of required previous knowledge: asymptotic notation, basic algorithm analysis, arrays, linked lists, stacks and queues, basics of hashing, binary search trees, AVL trees, red-black trees, heaps.
  2. Hashing.
    Hashing: Universal Hashing (construction of hash functions), Cuckoo Hashing (collision resolution strategies), Applications (Bloom Filters).
  3. Heaps.
    Heaps: Binomial Heaps.
  4. Self-adjusting data structures.
    Self-adjusting data structures: List updates, Splay trees.
  5. Randomized data structures.
    Randomized data structures: randomized BSTs, treaps.
  6. Multidimensional and metric data structures, searching in metric spaces, associative retrieval and object representation.
    Multidimensional and metric data structures, searching in metric spaces, associative retrieval and object representation: grid files, kd trees, point quad trees, PR quad trees, octrees.
  7. Geometric and kinetic data structures.
    Geometric and kinetic data structures: interval, segment and partition trees, sweep lines.
    Data structures for points in motion.
  8. Strings.
    Strings: tries, Patricia tries, suffix trees, suffix arrays, BW-transform, FM-index.
  9. External memory / cache oblivious.
    External memory / cache oblivious: models, B-trees, ordered-file maintenance, van Emde Boas layout.
  10. Succinct Data Structures.
    Succinct rank and select operations.
  11. Miscellaneous.
    Miscellaneous: concurrent, distributed, augmented, persistent data structures.
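
To give a flavour of the kind of structure studied, the Bloom filter listed under item 2 can be sketched in a few lines of C++. This is only an illustrative sketch, not course material: the class name BloomFilter, its method names, and the use of std::hash with double hashing (rather than a universal hash family, as covered in the same item) are all assumptions made here for brevity.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Hypothetical minimal Bloom filter: a bit array plus k probe positions per key.
// It can report false positives but never false negatives.
class BloomFilter {
public:
    BloomFilter(std::size_t num_bits, std::size_t num_hashes)
        : bits_(num_bits, false), num_hashes_(num_hashes) {
        assert(num_bits > 0 && num_hashes > 0);
    }

    // Set the k bits associated with the key.
    void insert(const std::string& key) {
        for (std::size_t i = 0; i < num_hashes_; ++i) {
            bits_[position(key, i)] = true;
        }
    }

    // Returns false if the key was definitely never inserted,
    // true if it may have been inserted.
    bool possibly_contains(const std::string& key) const {
        for (std::size_t i = 0; i < num_hashes_; ++i) {
            if (!bits_[position(key, i)]) {
                return false;
            }
        }
        return true;
    }

private:
    // Assumption: the i-th probe is derived by double hashing, h1 + i * h2,
    // from two std::hash values. Unsigned overflow wraps, which is harmless here.
    std::size_t position(const std::string& key, std::size_t i) const {
        const std::size_t h1 = std::hash<std::string>{}(key);
        const std::size_t h2 = std::hash<std::string>{}(key + '#') | 1;
        return (h1 + i * h2) % bits_.size();
    }

    std::vector<bool> bits_;
    std::size_t num_hashes_;
};
```

A filter built as BloomFilter f(1024, 3) answers possibly_contains with true for every key passed to insert. For other keys it usually answers false, with a false-positive rate that depends on the number of bits, the number of hash functions and the number of inserted keys.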

Activities

Teaching methodology

There will be two kinds of classes: theoretical sessions and practical
sessions. On average, two hours a week is dedicated to theory and one
hour a week to exercises. The lecturer will allocate the hours in
accordance with the subject matter.

The theory classes take the form of lectures in which the lecturer sets
out new concepts or techniques and examples illustrating them.
Sessions will consist of a presentation of the main topics of each content item,
based mainly on selected original research papers.
A high level of student participation is expected at each session.
Current lines of research in each topic will be discussed at the end of each topic's presentation.

The practical classes are used to carry out exercises in which students
take an active part. Lecturers set the exercises in advance. Students are
required to submit the exercises and then discuss the various
solutions/alternatives in class.

Evaluation methodology

Grade = 60% FW + 20% FT + 10% SP + 10% Q

FW = Final Work (graded from 0 to 10), in which each participant is required to present a research paper (previously assigned by the lecturer). The presentation consists of:
3-5 minutes of background on the topic of the paper and its motivation.
1 minute of overview of the key ideas of the paper.
15 minutes of presentation of the most important details.
5 minutes of demo of a program that implements the ideas introduced in the paper.

FT = Final Test (graded from 0 to 10), covering all the contents of ADS.

SP = Summaries and Participation (graded from 0 to 10), in which each participant is required to deliver a one-page summary of each of the other participants' presentations and to participate (with questions and comments).

Q = Either quizzes (graded from 0 to 10), one per content item, or three (3) of the following deliverables:

*Notes on one topic, written in LaTeX (complete and well explained).
*Read and summarize one research paper.
*Implement and evaluate experimentally one of the studied data structures.
*Add to Wikipedia an article on a data structure that is not yet covered there.
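
As a quick check of the weighting in the grade formula above, the following C++ sketch computes the final grade from the four components. The function name final_grade and the sample marks used below are illustrative assumptions, not part of the course regulations.

```cpp
#include <cassert>

// Weights taken from the formula above: 60% FW, 20% FT, 10% SP, 10% Q.
// Each component is assumed to be on the 0-10 scale stated in the text.
double final_grade(double fw, double ft, double sp, double q) {
    assert(fw >= 0.0 && fw <= 10.0);
    assert(ft >= 0.0 && ft <= 10.0);
    assert(sp >= 0.0 && sp <= 10.0);
    assert(q >= 0.0 && q <= 10.0);
    return 0.60 * fw + 0.20 * ft + 0.10 * sp + 0.10 * q;
}
```

For example, with FW = 8, FT = 6, SP = 10 and Q = 7, the grade is 0.6 * 8 + 0.2 * 6 + 0.1 * 10 + 0.1 * 7 = 7.7.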

Bibliography

Basic:

Previous capacities

Basic knowledge of the C++ programming language.
Basic knowledge of algorithm analysis methods (in particular asymptotic complexity).
Basic knowledge of elementary data structures such as stacks, queues, linked lists, trees, and graphs as well as of sorting methods such as insertion sort, heap sort, merge sort, and quick sort.