GATK Workshop 2023

13 - 17 Nov Granada, Spain

From reads to diagnostic and secondary findings

5-day workshop

About this workshop

This workshop will focus on the core steps involved in calling variants from Illumina next generation sequencing data using the Genome Analysis Toolkit (GATK).

You will learn about best practices in calling both germline and somatic variants: single nucleotide variants (SNVs), short insertion/deletions (indels), and structural variants (SVs) with a focus on copy number variants (CNVs).

We will also cover considerations to take when calling variants on the mitochondrial genome, as well as variant calling from single-cell RNA-seq data and long reads sequencing data. We will cover how the data structures provided by GATK can help you process large datasets in parallel and at scale. Although this workshop focuses on human data, the majority of the concepts and approaches apply to non-human data, and we will cover some adaptations needed in those situations.

The course is followed by a tutorial on tertiary analysis (diagnostic and secondary findings) using the tools developed by the Platform of Computational Medicine.

Pre-registration is closed.

Selected candidates will be notified soon.

Organizers

Andalusian Platform for Computational Medicine, Fundación Progreso y Salud, Departamento de Ingeniería de Computadores, Automática y Robótica, Universidad de Granada, CIBERER, Broad Institute

Instructors

Broad Institute of MIT and Harvard - GATK team (GATK workshop)

Lee Lichtenstein

Director

Computational Methods at Broad Institute of MIT and Harvard

George Grant

Principal Software Engineer

Data Science Platform

Andrey Smirnov

Machine Learning Engineer

Cambridge, Massachusetts

Andalusian Platform for Computational Medicine (tutorial on tertiary analysis)

Javier Pérez Florido

Postdoctoral Researcher

Fundación Progreso y Salud

Rosario Carmona Muñoz

Postdoctoral Researcher

Fundación Progreso y Salud

José Luis Fernandez Rueda

Engineer of the Andalusian Platform for Computational Medicine

Fundación Progreso y Salud

Target audience

The course is aimed at life scientists interested in genomic variant analysis and its applications. We don’t assume prior experience in this topic, but as per the requirements below it is essential to have command line experience and a basic understanding of sequencing technologies.

This course is also suitable for participants with prior experience in other types of ‘omics’ data analysis (e.g. RNA-seq), who would like to learn about this topic.

Prerequisites

Familiarity with the basic terms and concepts of genetics and short-read sequencing technologies.
Familiarity with the Unix command line environment is essential.
Participants must bring their own laptops.

Schedule

Day 1 - Introductions and Germline Variant Discovery

09:00 - 10:45

OPENING

09:00Registration and Welcome
09:25Welcome! An intro to GATK
09:45Intro to Sequencing Data
10:15Intro to Data Pre-Processing

10:45 - 12:10

GETTING STARTED WITH PIPELINING

10:45Intro to Pipelining
11:15Coffee Break
11:30[Guided Demo] Terra for the first time

12:10 - 12:50

GERMLINE VARIANT DISCOVERY & JOINT CALLING

12:10Intro to Germline Variant Discovery

12:50 - 17:00

HAPLOTYPE CALLER

12:50HaplotypeCaller
13:20Lunch Break (incl. time to walk to venue)
14:40[Exercise] Hands-on with HaplotypeCaller
15:40Coffee Break
15:55Joint Calling
16:25Audience questions, and adjourn Day 1

Day 2 - Variant Filtering & Evaluation and Long Reads

09:25 - 10:25

MORE ON GERMLINE VARIANT DISCOVERY

09:25gnomAD
09:55[Exercise] Abbreviated Case Study: Germline Variants

10:25 - 10:55

VARIANT FILTERING

10:25Intro to Variant Filtering

10:55 - 14:55

REFINING & EVALUATING

10:55Genotype Refinement
11:25Coffee Break
11:40Call set Evaluation
12:00[Exercise] Hands-on with VQSR
13:20Lunch Break (incl. time to walk to venue)
14:40[Exercise] Hands-on with VQSR

14:55 - 15:55

LONG READS

14:55Processing Long Reads
15:40Coffee Break

15:55 - 17:00

YET MORE ON GERMLINE VARIANT DISCOVERY

15:55Intro to DRAGEN-GATK
16:10Audience questions, and adjourn Day 2

Day 3 - Somatic Variants and CNVs

09:25 - 10:10

RETURNING TO REFINING & EVALUATING

09:25[Exercise] Running refinement and evaluation tools

10:10 - 10:40

SOMATIC VARIANT DISCOVERY

10:10Intro to Somatic Variant Discovery

10:40 - 12:25

SOMATIC SNVS

10:40Somatic SNVs and Indels
11:10Coffee Break
11:25[Exercise] Messing around with Mutect2

12:25 - 15:35

SOMATIC CNAS

12:25Intro to Somatic CNAs
13:20Lunch Break (incl. time to walk to venue)
14:40[Exercise] Hands-on with Somatic CNAs
15:20Coffee Break

15:35 - 17:00

GERMLINE CNVS

15:35Germline Copy Number Variation
16:25Audience questions, and adjourn Day 3

20:00

NETWORKING DINNER AT CARMEN DE LA VICTORIA

Day 4 - Mitochondria, Germline CNVs, and scRNA-seq

09:25 - 10:45

GERMLINE CNVS

09:25[Exercise] Hands-on with the gCNV pipeline

10:45 - 11:30

MITOCHONDRIA

10:45[Guided Demo] Mitochondria Data
11:15Coffee Break

11:30 - 12:30

STRUCTURAL VARIANTS

11:30Intro to calling SVs with GATK-SV

12:30 - 15:45

SINGLE-CELL RNA-SEQ ANALYSIS

12:30Intro to single-cell RNA-seq analysis
13:20Lunch Break (incl. time to walk to venue)
14:40[Exercise] Single-cell RNA-seq analysis
15:30Coffee Break

15:45 - 16:15

BONUS TOPIC

15:45Machine Learning for Health (ML4H)

16:15 - 17:00

WORKSHOP WRAP-UP

16:15Workshop Wrap-Up and Survey
16:25Audience questions, and adjourn the GATK Workshop

Day 5 - Analysis of diagnostic and secondary findings

10:00 - 14:00

10:00General remarks
10:30Introduction to openCGA and MMP for variant prioritzation
11:15Coffee Break
11:45Self-paced variant prioritization with MMP
13:00Secondary findings
13:30Wrap-up and workshop closing

Course application

Pricing

€180

Registration includes access to the course, coffee break (monday to friday) and lunch (monday to thrusday)

Venue

Venue and social events

IAVANTE CMAT
CMAT (Complejo Multifuncional Avanzado de Simulación e Innovación Tecnológica). Averroes room
Health Sciences Technology Park (Parque Tecnológico de la Salud)
21, Avenida de la Investigación, 18016 Granada
Location

How To Get Here

Línea IAVANTE - Fundación Progreso y Salud - Sede Granada
Public transport is the best way to get to the venue:

By Bus

Lines 4 and U3
More info about buses

plano buses

By Tramway

Facilities & Accommodations

Hotels with a close tramway stop

Hotel Occidental Granada ****

Calle Alhamar, 46s 18004 Granada (Tramway stop: Recogidas)
Tel: (+34) 958 290 303
10 minutes by car to CMAT.

Hotel Andalucía Center ****

Avenida Fuente Nueva, s/n 18002 Granada (Tramway stop: Hípica)
Tel: (+34) 958 181 500
8 minutes by car to CMAT.

Hotel Abades Nevada Palace ****

C/ de la Sultana, 3 - Granada (Tramway stop: Palacio Deportes)
Tel: (+34) 902 222 570
4 minutes by car to CMAT

Other hotels

Hotel Urban Dream Nevada ***

Calle Benjamín Franklin, 3
18100 Armilla (Granada)
Tel: (+34) 958 985 020
7 minutes by car to CMAT

Hotel Ibis Granada *

CALLE GRAHAM BELL, 3
IN FRONT OF CARREFOUR GRANADA
HOTEL CIUDAD 1* 18100
Armilla (Granada)
Tel: (+34) 958 184 250
9 minutes by car to CMAT

Sponsors