Bioinformatics and Computational Biology

A.Y. 2023/2024
Overall hours
BIO/10 BIO/11 INF/01
Learning objectives
The course introduces recently developed computational approaches for studying biological systems, with a focus on biotechnological applications: identifying essential genes (Tn-Seq and network analysis) or genes involved in processes of interest (Tn-Seq), together with methods for studying gene regulation (ChIP-Seq, small RNAs). Building on this, we will discuss how to engineer ecosystems (community engineering) and how metabolic optimization can be achieved (model-guided metabolic engineering).
The course also provides an introduction to computational methods for characterizing protein structures and to their biochemical basis.
Expected learning outcomes
Students will learn the computational techniques underlying the identification of important genes in Tn-Seq datasets and the structural analysis of networks, with the aim of identifying genes that can be manipulated for specific objectives. Techniques for studying gene regulation (ChIP-Seq, sRNA) will also be discussed.

In the part of the course devoted to the computational study of proteins, students will learn the biochemical and biophysical principles underlying the algorithms that predict secondary and tertiary structure, structural disorder, and protein dynamics. Students will also run a series of prediction tests themselves, learning to use structure analysis and prediction programs.
Course syllabus and organization

Single session

Lesson period
Prerequisites for admission
Basic knowledge of genetics, molecular biology and biochemistry.
Assessment methods and Criteria
Bioinformatics: students will perform bioinformatics analyses, describing methods employed and results obtained in a lab notebook. At the exam, students will discuss the notebook with the teacher, and the grade will depend on their understanding of the methods employed as well as the results obtained.

Computational biology: written and oral test.
Course syllabus
Introduction: definition and aims of Bioinformatics. Genome projects and next-generation sequencing. Gene and genome annotations. A bioinformatic view of the structure of protein-coding genes: exons, introns, promoters, and alternative splicing. The structure of mature eukaryotic mRNAs. Primary and specialized biological databases. Genome browsers. Definition of sequence similarity, homology, orthology, and paralogy. Global and local alignments. Scoring matrices for nucleotide and amino acid sequence alignments (PAM and BLOSUM). BLAST sequence similarity search: algorithm and usage. Multiple sequence alignments. Expression data and RNA-Seq. Functional gene annotation and gene ontology.
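As a small illustration of the global alignment topic listed above, the following Python sketch computes a Needleman-Wunsch global alignment score. It is not course material: the function name and the scoring values (match, mismatch, and a linear gap penalty) are assumptions chosen for the example, and a real analysis would use a substitution matrix such as PAM or BLOSUM instead of a fixed match/mismatch score.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the Needleman-Wunsch global alignment score of a and b.

    Scoring values are illustrative assumptions; gaps are penalised linearly.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]

    # First column and first row: aligning a prefix against only gaps.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    # Fill the matrix: each cell is the best of a (mis)match or a gap.
    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            diagonal = score[i - 1][j - 1] + pair
            gap_in_b = score[i - 1][j] + gap
            gap_in_a = score[i][j - 1] + gap
            score[i][j] = max(diagonal, gap_in_b, gap_in_a)

    # The bottom-right cell holds the score of the best end-to-end alignment.
    return score[-1][-1]
```

A local (Smith-Waterman) variant would differ mainly in clamping each cell at zero and taking the maximum over the whole matrix rather than the bottom-right cell.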
Teaching methods
Theoretical lectures will alternate with practical computer exercises.
Teaching Resources
Slides and handouts will be shared with students.
Reference textbook (suggested):
M. Helmer Citterich, F. Ferrè, G. Pavesi, C. Romualdi, G. Pesole, Fondamenti di bioinformatica, Zanichelli editore 2018
Computational Biology
Course syllabus
1. Exploring function and regulation
a. Transposon insertion mutagenesis for the discovery of essential or otherwise important genes
b. ChIP-Seq, transcription factor binding sites and gene regulatory networks
c. Metagenomics and metatranscriptomics
2. Small RNAs in bacteria: mechanisms of action, function, and dynamical behavior of small genetic circuits implementing sRNA-mediated regulation.
3. Introduction to network theory with applications in Biology (+Practice in R).
4. Protein Structure and its analysis
a. The main chemical and geometrical properties of protein structures will be presented: secondary structure (alpha helices, beta sheets, and coils) and tertiary structure. The TIM barrel will be used as an example of the versatility of a protein fold.
b. Covalent and non-covalent interactions that are fundamental for protein folding: peptide bonds, salt bridges, van der Waals interactions, and hydrogen bonds. The role of water in protein folding.
c. Computer analysis of protein structures to verify several of the protein properties discussed during the course.
d. The evolution of the structure of globular proteins, membrane proteins, and intrinsically disordered proteins will be accompanied by tests of protein structure prediction.
e. Structure prediction by homology modelling
f. Molecular dynamics simulations
g. Structure prediction and refinement by molecular dynamics
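As a small illustration of the network theory topic in item 3 of the syllabus above, the following sketch computes normalised degree centrality on a toy undirected network and picks the most connected node. It is not course material: the course practice uses R, whereas this sketch is in Python, and the function name, the gene names, and the edges are hypothetical values invented for the example.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Return normalised degree centrality for an undirected network.

    Each node's degree is divided by (n - 1), the maximum possible degree.
    """
    neighbours = defaultdict(set)
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops in this simple sketch
        neighbours[u].add(v)
        neighbours[v].add(u)

    n = len(neighbours)
    if n < 2:
        return {node: 0.0 for node in neighbours}
    return {node: len(adj) / (n - 1) for node, adj in neighbours.items()}


# Hypothetical toy interaction network (gene names are placeholders).
toy_edges = [
    ("geneA", "geneB"),
    ("geneA", "geneC"),
    ("geneA", "geneD"),
    ("geneD", "geneE"),
]

centrality = degree_centrality(toy_edges)
hub = max(centrality, key=centrality.get)  # node with the most neighbours
```

Degree is only the simplest centrality measure; whether a highly connected node corresponds to an essential gene has to be checked against experimental data such as Tn-Seq.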
Teaching methods
Lectures supported by projected material, plus interactive computer sessions. Students will be encouraged to participate actively in lessons and discussions and to improve their skills by analysing the cited literature. Attendance at all lessons is strongly recommended.
Teaching Resources
For the exam we will mainly refer to the slides, which will be available for download from the Ariel website after each lesson.

The following is a list of articles/books that students can use to explore in more detail some of the topics discussed in the lessons.

