Introduction to Machine Learning

Anno accademico 2020/2021
Docente Francesco Raimondi, Franco Flandoli, Giacomo De Palma

Didattica integrativa

Esercitazioni

Modalità d'esame

Prova orale

Prerequisiti

Programma del corso

Module 1 (20hrs) Foundations of supervised and unsupervised classification and neural networks (Prof. Flandoli and Dr. De Palma)

1)      Introduction to machine learning: problem class, assumptions, evaluation criteria, model type, parameter fitting, algorithm

2)      (unsupervised learning) Principal Component Analysis (foundations and implementation by software R)

3)      (u. l.) complements on PCA, few ideas about clustering.

4)      (supervised learning) multiple linear regression (foundations and implementation by R)

5)      (s. l.) logistic regression and classification by regression

6)      (s. l.) classification by linear perceptron.

7)      Feature representation, gradient descent, stochastic gradient descent

8)      Neural networks: basic element, networks, activation function, back-propagation, training

9)      Neural networks: loss function, optimization, regularization

10)   Convolutional neural networks: filters, max pooling, architectures

 

Module 2 (20hrs) Application of Machine Learning algorithms in Bioinformatics and Life Sciences  (Dr. Raimondi)

1)      protein structure/function prediction using machine learning

2)      protein structure prediction using deep learning and co-evolution

3)      application of graph neural network for the prediction of protein interaction networks

4)      deep learning applications to genomics :DNA motif discovery(with hands-on in Python)

5)      deep learning applications to genomics: variant intepretation

6)      deep learning applications to genomics: single cell RNAseq analysis and interpretation

7)      multiomics data integration: (MOFA hands-on in R or Python)

8)      additional hands-on sessions using R and Python workflows for the analysis and classification of omics data.

Seminars in Bioinformatics and Machine Learning

12/04 at  17.00: Guanming Wu (OHSU, USA), "Reactome: An Open Knowledgebase of Human Pathways"
23/04 at  11.00:  Pier Luigi Martelli (UniBO, Italy), "Machine learning tools for predicting subcellular localization of proteins starting from sequence."
28/04 at 18.00: Mohammed AlQuraishi (Columbia University, USA), "Machine-learned molecular models for a structural systems biology."
07/05 at 11.00 Francesco Iorio (Human Technopole and Wellcome Sanger, Italy/UK), "CRISPR-cas9 screens and multi-omic data integration for identifying cancer dependencies and new therapeutic targets"
11/05 at 16.00  Cristoph Bock (CEMM, Austria), "Looking into the past and future of cells:Single-cell sequencing and high-throughput analysis
of epigenetic cell states in cancer and immunology"
23/06 at 12.00, Patrick Aloy (IRB Barcelona, Spain), TBD

Riferimenti bibliografici

Ad hoc selected scientific papers