Credits
6
Types
- MDS: Elective
- MIRI: Specialization complementary (Advanced Computing)
- MEI: Elective
Requirements
This subject has not requirements
, but it has got previous capacities
Department
CS
Web
www.cs.upc.edu/~CSN/
Summary of syllabus: (1) Network description: structural properties, types of networks, finding structure in networks. (2) Computational models of networks. (3) Processes on networks: search, diffusion, rumour spreading, etc.
Teachers
Person in charge
- Ramon Ferrer Cancho ( rferrericancho@cs.upc.edu )
Others
- Argimiro Arratia Quesada ( argimiro@cs.upc.edu )
- Marta Arias Vicente ( marias@cs.upc.edu )
Weekly hours
Theory
2
Problems
0
Laboratory
1
Guided learning
0
Autonomous learning
4
Competences
Advanced computing
Specific
Generic
Teamwork
Information literacy
Appropiate attitude towards work
Reasoning
Basic
Contents
-
Introduction. What are networks? Simple measures and models for networks
- Examples of real networks: social networks, information networks, technological networks and biological networks.
- Terminology: points (vertices, nodes, sites, actors) and lines (edges, arcs, links, bonds, ties,...).
- Types of networks: unweighted/weighted networks, undirected/directed networks, ...
- Classic network properties: the small-world phenomenon (distance metrics), heterogeneous degree distribution (power-law degree distribution, high clustering or transitivity.
- Classic network models: the Erdös-Rényi model, the Watts-Strogatz model, the Barabási-Albert model. -
The degree distribution of a network and its analysis
- The empirical and the theoretical degree distribution. Undirected degree, in-degree and out-degree.
- Theoretical distributions: zeta family, binomial distribution.
- Fitting of the degree distribution: visual fitting, linear and non-linear regression and maximum likelihood.
- Introduction to standard model selection: parsimony versus quality of fit, Akaike information criterion. -
Measures of networks
- Distance metrics: geodesic paths, local distance metrics (mean geodesic distance, closeness centrality), global distance metrics (diameter, mean geodesic distance, mean closeness centrality). Algorithms for computing distance.
- Clustering metrics: transitivity, clustering (different metrics). Algorithms for computing clustering.
- Degree correlations: assortative vs disassortative mixing by degree, degree correlation metrics (Pearson correlation vs Spearman rank correlation, mean degree of nearest neighbours. -
Statistical testing of network measures
- Introduction to hypothesis testing: qualitative versus quantitative testing, families of null hypotheses (Erdös-Renyi model, configuration model, switching model,...), p-values.
- Introduction to Monte Carlo testing: general scheme, uniform random number generators, uniformly random permutation, random Erdös-Rényi graph with constant and variable number of edges.
- The configuration or matching model.
- The switching model. -
Advanced measures for networks. Centrality.
Centrality:
- Qualitative notions of centrality of a node.
- Quantitative definitions of centrality: degree centrality, closeness centrality, betweenness centrality, eigenvector centrality and PageRank (google). -
Finding community structure in networks
- Introduction to community structure.
- How to quantify the quality of a community structure: intra-cluster versus inter-cluster density. Other metrics: conductance, expansion, internal density, cut ratio, normalized cut, Flake's out degree fraction .... and modularity.
- Methods for detection of community structure: hierarchical clustering algorithms (agglomerative hierarchical clustering, node similarity measures), Girvan-Newman algorithm, modularity optimization algorithms (the modularity Q, algorithms to maximize modularity, the Louvain method, spectral modularity optimization), graph partitioning algorithms (minimum bisection algorithms, Kernighan-Lin algorithm) and clique percolation method. -
The Minimum Linear Arrangement problem
- Linear arrangements of networks: linear/Euclidean distance, metrics (mean dependency length).
- Mean dependency length in trees: expectation in random linear arrangements, lower bounds, upper bounds (non-crossing trees),...
- The minimum linear arrangement problem: cost function and complexity.
- The minimum linear arrangement of trees: algorithms, particular cases.
- Introduction to crossing theory for trees. Number of crossings: upper bound, expectation in random linear arrangements (the particular case of uniformly random trees). -
Network dynamics
Introduction to network dynamics:
- Classic models that generate networks: the Barabáasi-Albert model (growth and preferential attachment), copying models, fitness based models and optimization models.
- The Barabási-Albert model. The statistical properties: vertex degree as a function of time, the power-law degree distribution.
- The copying model. The statistical properties: vertex degree as a function of time and degree distribution.
- Fitness models. Fitness functions and degree distribution.
- Optimization models. Trade-off between geodesic distance and link density. Cost function.
Advanced network models:
- Modifications of the Barabási-Albert model
- Liu et al's hybrid model.
- Bianconi-Barabási hybrid model.
- Dorogovtsev-Mendes model (accelerated growth).
- Other models. -
Sampling in networks
- Introduction to sampling: motivation and importance and goals.
- Sampling strategies: random node selection, random edge selection, crawling-based.
- Biases of sampling strategies.
- How to compensate for biases or how to reduce them. -
Epidemic spreading in networks
- Introduction to the dynamics of epidemics: relevant questions and the full mixing assumption.
- Classic epidemic models (full mixing): the SI model (the logistic growth equation), the SIR model and the SIS model. Thresholding phenomena.
- Epidemic models over networks: homogeneous network models, scale-free network model for SIS and a general network model for SIS. Thresholding phenomena. -
Percolation and network resilience
- Introduction to percolation: percolation in lattices, giant cluster.
- Network resilience upon node removal:
- Uniform node removal: the configuration model (thresholding phenomena), Erdös-Rényi networs, scale-free networks.
- Non-uniform node removal: random versus targeted attack, exponential versus scale-free network. Size of the giant cluster. -
Other dynamic processes over networks: random walks and diffusion processes; search on networks
Other dynamic processes over networks: random walks and diffusion processes; search on networks
Teaching methodology
The theory sessions will be done primarily by the professor using either the blackboard or projected slides.The lab work will be done in front of the computer. Students are expected to be working on their assignment, and the professor will explain all that is necessary to follow the class in the beginning of the session. Each lab session will be accompanied by a thorough guide describing the work that the students need to do.
All the material relevant for the course will be available from the course's website.
Evaluation methodology
There will be no exam in this course. Grading is done entirely through reports on various tasks throughout the course.Students are expected to hand in 7 lab work reports about two weeks after its corresponding lab session, which count toward 50% of the final grade. We will also have a course project that the students will hand in towards the end of the course that accounts for 50% of the final grade.
The formula to compute the final grade is therefore:
0.1 * ( L1 + L2 + L3 + L4 + L5) + 0.5 * CP - (0.25 * P)
where Li, for i=1..5, stands for the grade for the best 5 lab reports, CP stands for grade of the course project, and P is the nr of lab reports not handed in.
For each lab report not handed in, there is a penalty of 0.25 points towards the final grade. So, if a student does not hand in 2 reports, then 0.5 points will be deducted from the final grade. Grades are over 10.
Bibliography
Basic
-
Networks
- Newman, M.E.J,
Oxford University Press,
2018.
ISBN: 0198805098
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991004164149706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Statistical analysis of network data: methods and models
- Kolaczyk, E.D,
Springer,
2009.
ISBN: 9780387881454
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003831819706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Trends in Cognitive Sciences
- Baronchelli, A., Ferrer-i-Cancho, R., Pastor-Satorras, R., Chater, N. & Christiansen, M. H.,
http://cataleg.upc.edu/record=b1243234~S1*cat -
Dynamical processes on complex networks
- Barrat, A.; Barthélemy, M.; Vespignani, A,
Cambridge University Press,
2008.
ISBN: 9780521879507
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003840329706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Complementary
-
Evolution and structure of the Internet: a statistical physics approach
- Pastor, R.; Vespignani, A,
Cambridge Univeristy Press,
2004.
ISBN: 0521826985
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002674799706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Scale-free networks: complex webs in nature and technology
- Caldarelli, G,
Oxford University Press,
2007.
ISBN: 9780199211517
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003241579706711&context=L&vid=34CSUC_UPC:VU1&lang=ca
Web links
- Supporting page of the course http://www.cs.upc.edu/~CSN/
Previous capacities
ProgrammingAlgorithms and data structures
Linear algebra
Probability and Statistics