Credits
6
Types
Compulsory
Requirements
This subject has not requirements
, but it has got previous capacities
Department
UB
Mail
jabril@ub.edu
Teachers
Person in charge
- Josep Francesc Abril Ferrando ( jabril@ub.edu )
Others
- Maria Tarradas Alemany ( mtarradasi@ub.edu )
Weekly hours
Theory
2
Problems
2
Laboratory
0
Guided learning
0
Autonomous learning
6
Competences
Knowledge
Skills
Competences
Objectives
-
To acquire advanced knowledge on the Computational Genomics Field.
Related competences: K1, K2, K3, K7, -
Understand the computational protocols, which parameters affect the outcomes from the bioinformatics tools used in the field, and properly interpret the results of the analyses.
Related competences: S1, S2, S3, S5, S7, S8, C2, C3, C4,
Contents
-
Introduction to Computational Genomics
Sequences and annotations; basic data formats; sequence ontology; sequence basic stats and biases; sequencing methods (from Sanger to single molecule technologies) and range of applications (DNA-seq, RNA-eq, ChIP-seq, chromatin conformation, ...); sequencing quality and coverage. Sequence repositories (NCBI, EnsEMBL/BioMart, UCSC, others). -
Sequence Analysis
Alphabets and strings; sequence complexity, entropy, and information content; k-mer analysis; repetitive elements (types, detection, and masking). -
Sequence Assembly
Genomes, transcriptomes, and meta-genomes assemblies; reads, contigs, scaffolds, and chromosomes; assembly algorithms, from prefix-suffix alignment to assembly graphs; de-Bruijn graphs; compression-based string matching: mapping reads over assemblies (DNA aligners); assembly assessment metrics, N50, completeness (CEGMA, BUSCO); accuracy assessment of assembler tools (GAGE, Assemblathon) and DNA aligners (RGASP). -
Sequence Models
Consensus sequence. Modeling signals and content: regular expressions, position weight matrices (PWMs), Markov chains, hidden Markov models (HMMs), other models. -
Computational Gene Finding
The genome landscape: signals, exons, genes, regulatory elements, chromatin marks, etc...; comparing gene-finding on prokaryota vs eukaryota; computational gene-finding approaches: ab-initio, similarity-based, homology-based, comparative genomics, NGS; dynamic programming to assemble exons; generalized hidden Markov models (GHMMs); phylogenetic models (phyloHMMs); prediction of non-cannonical features: selenoproteins, pseudogenes, non-coding RNAs, ...; GF accuracy assessment: metrics (sensitivity/specificity, ...), benchmarks (*GASP). -
Regulatory Elements Prediction
Regulatory elements: regulatory programs (network complexity, space and time compartimentalization), transcription factors and transcription factor binding sites (TFBSs), promoters, enhancers; pattern matching (TranFac/Jaspar/Oreganno); pattern discovery (PEAKS, MEME); phylogenetic footprinting; NGS-approaches, decyphering epigenetic code with ChIP-seq; annotating chromatin conformation over genomic sequences. -
Functional Annotation
From sequence to function: genes, transcripts, and proteins; gene ontologies (GO, KO); annotating domains: patterns (PROSITE), profile HMMs (PFAM, RFAM), homology-based approaches: BLAST searches versus NOG models; meta-genomic samples: ecological network functional components, species composition and diversity measures. -
Managing Annotations' Data
Annotation pipelines: manual curation procedures (NCBI, VEGA), Maker, Galaxy, EnsEMBL; visualization paradigms: from gff2ps to circos, from command-line tools to graphical interfaces (Apollo, IGV), genome browsers (EnsEMBL, UCSC-Genome browser, GBrowse/JBrowse); distributed annotation systems; database tracks versus custom tracks.
Activities
Activity Evaluation act
Theoretical expository lectures.
Contents:
Theory
28h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
30h
Practicals Sessions.
Contents:
Theory
0h
Problems
30h
Laboratory
0h
Guided learning
0h
Autonomous learning
30h
Practicals Exercises
Contents:
Theory
0h
Problems
0h
Laboratory
0h
Guided learning
0h
Autonomous learning
30h
Teaching methodology
Conceptual materials necessary to understand the topics of the Computational Genomics field will be delivered as face-to-face lectures (theory sessions). Then, each student will play with example cases and/or protocols to apply some of the ideas exposed in the theory sessions (practical sessions), which can lead to further work outside the classroom in order to complete the requested exercises for the continued evaluation (exercise submissions to the Virtual Campus).Independent individual study and work efforts will be necessary, in very varying amounts depending on the profit and capability of each student, in order to absorb and extend, if needed, the essentials of the concepts provided in class.
Teaching Resources:
+ Lecture notes: Slides will be made available before the classes from Virtual Campus.
+ Practicals: Materials for the exercises will be available from Virtual Campus.
+ Links to further resources will be accessible through the Virtual Campus.
Evaluation methodology
Evaluation of academic performance for this subject will be based on these two blocs:+ Practicals (Continued Evaluation): Students must submit to the Virtual Campus several exercises that will be proposed all along the practical sessions. Details about the formatting and submission procedure will be provided on the first practical session. Students will have about a week to submit each exercise through the links provided on the Virtual Campus. This part does not include any re-evaluation exam, as the scores are based on the assessment of the submitted exercises that must be delivered along the quarter.
+ Lectures (End Term Synthesis Exam): Theoretical lectures will be assessed by a synthesis exam to be realized at the end of the term on the date assigned in the calendar. Only those students failing this exam can present to the Re-Evaluation exam, if they had a minimum score of 2.5 out of 10, also on the date assigned in the calendar for this purpose. The grade of the Re-Evaluation exam will replace that of the Synthesis Test.
With regard to the Honor Code that students agreed to follow, any attempt of copy detected during the exams (End-Term or Re-Evaluation) will imply the FAILURE of the course. Furthermore, tasks to be submitted individually cannot be solved in groups and each student is responsible for her/his deliverables.
The final mark is obtained by summing up the continued evaluation score (60%) and the end term score (40%), once the end term or the reassessment test has been passed.
To pass the course requires a minimum score of 5 out of 10, once all the grades have been aggregated.
Bibliography
Basic
-
Introduction to genomics
- Lesk, Arthur,
Oxford University Press,
2025.
ISBN: 9780198866893
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991005476510506711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Introduction to computational genomics: a case studies approach
- Cristianini, N.; Hahn, M.W.,
Cambridge University Press,
2007.
ISBN: 9780511808982
https://www-cambridge-org.recursos.biblioteca.upc.edu/core/books/introduction-to-computational-genomics/863C62220C06825CE3B8F8E462D0390F -
Bioinformatics: sequence and genome analysis
- Mount, D.W,
Cold Spring Harbor Laboratory Press,
2004.
ISBN: 0879696877
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002934579706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
An Introduction to Bioinformatics Algorithms
- Jones, N.C.; Pevzner, P.A.,
The MIT Press,
2004.
ISBN: 0262101068
https://ebookcentral-proquest-com.recursos.biblioteca.upc.edu/lib/upcatalunya-ebooks/detail.action?pq-origsite=primo&docID=3338847 -
Genómica Computacional
- García, E.B,
UOC,
2013.
ISBN: 9788490299104
https://elibro.net/es/lc/upcatalunya/titulos/56698
Complementary
-
Encyclopedia of Bioinformatics and Computational Biology
- Ranganathan, S.; Nakai, K.; Schönbach, C.; Gribskov, M. (editors),
Elsevier Inc,
2019.
-
Concise Encyclopaedia of Bioinformatics and Computational Biology
- Hancock, J.M.; Zvelebil, M.J. (editors),
Wiley,
2013.
ISBN: 9780470978726
https://ebookcentral-proquest-com.recursos.biblioteca.upc.edu/lib/upcatalunya-ebooks/detail.action?pq-origsite=primo&docID=1565907 -
Methods for Computational Gene Prediction
- Majoros ,W.H,
Cambridge University Press,
2007.
ISBN: 1107710839
https://www-cambridge-org.recursos.biblioteca.upc.edu/core/books/methods-for-computational-gene-prediction/28E6CFE22A004983A3D42363EDF96A71 -
Handbook of Hidden Markov Models in Bioinformatics
- Gollery, M,
Chapman & Hall/CRC Press,
2008.
ISBN: 9781584886846
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003597229706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids
- Durbin, R.; Eddy, S.R.; Krogh, A.; Mitchinson, G.,
Cambridge University Press,
1998.
ISBN: 0521629713
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002516379706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Algorithms on strings, trees, and sequences : computer science and computational biology
- Gusfield, D,
Cambridge University Press,
1997.
ISBN: 0521585198
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991001989459706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Bioinformatics: a practical guide to the analysis of genes and proteins
- Baxenavis, A.D.; Bader, G.D.; Wishart, D.S,
Wiley,
2020.
ISBN: 9781119335580
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991005476510006711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Discovering genomics, proteomics, and bioinformatics
- Campbell, A.M.; Heyer, L.J.,
Benjamin Cummings,
2007.
ISBN: 0805382194
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991005476509906711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Bioinformatics and functional genomics
- Pevsner, J,
Wiley-Blackwell,
2009.
ISBN: 9780470085851
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003867879706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Developing bioinformatics computer skills
- Gibas, C.; Jambeck, P,
O¿Reilly,
2001.
ISBN: 1565926641
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991002291529706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
Sequence analysis in a nutshell: a guide to tools and databases
- Markel, S.; Leon, D,
O'Reilly,
2003.
ISBN: 059600494X
https://discovery.upc.edu/discovery/fulldisplay?docid=alma991003499539706711&context=L&vid=34CSUC_UPC:VU1&lang=ca -
UNIX and Perl to the Rescue!
- Bradnam, K.; Korf, I,
Cambridge University Press,
2012.
ISBN: 1139368575
https://www-cambridge-org.recursos.biblioteca.upc.edu/core/books/unix-and-perl-to-the-rescue/783D7CE1B7C65E27FD2E79197D2C0EF2