Variant Analysis with GATK Prerequisites
This workshop will focus on the core steps involved in calling variants with the Broad’s Genome Analysis Toolkit, using the “Best Practices” developed by the GATK team. You will learn why each step is essential to the variant discovery process, what are the operations performed on the data at each step, and how to use the GATK tools to get the most accurate and reliable results out of your dataset.
In the course of this workshop, we highlight key functionalities such as the GVCF workflow for joint variant discovery in cohorts, RNAseq specific processing, and somatic variant discovery using MuTect2. We also preview capabilities of the upcoming GATK version 4, including a new workflow for CNV discovery.
The workshop is composed of one day of lectures (including many opportunities for Q&A) and one day of handson training. On the first day, we explain the rationale, theory and application of our Best Practices for Variant Discovery in high-throughput sequencing data. On the second day, we walk attendees through handson exercises that teach how to manipulate the standard data formats involved in variant discovery and how to apply GATK tools appropriately to various use cases and data types. In the course of these exercises, we demonstrate useful tips and tricks for interacting with GATK, dealing with problems, and using thirdparty tools such as Samtools, IGV and RStudio.
Please note that this workshop is focused on human data analysis. The majority of the materials presented does apply equally to non human data, and we will address some questions regarding adaptations that are needed for analysis of non human data, but we will not go into much detail on those points.
The timetable can be found here.
Please note that if you are not eligible for a University of Cambridge Raven account you will need to Book or register Interest by linking here.
- The lecture based component of the workshop is aimed at a mixed audience of people who are new to the topic of variant discovery or to GATK, seeking an introductory course into the tools, or who are already GATK users seeking to improve their understanding of and proficiency with the tools.
- The handson component is aimed at novice to intermediate users who are seeking detailed guidance with GATK and related tools.
- Graduate students, Postdocs and Staff members from the University of Cambridge, Affiliated Institutions and other external Institutions or individuals
- Further details regarding eligibility criteria are available here
- Further details regarding the charging policy are available here
- Familiarity with the basic terms and concepts of genetics and genomics.
- Basic familiarity with the command line environment is required
- Sufficient UNIX experience might be obtained from one of the many UNIX tutorials available online.
Number of sessions: 2
# | Date | Time | Venue | Trainer | |
---|---|---|---|---|---|
1 | Mon 13 Jun 2016 09:30 - 17:30 | 09:30 - 17:30 | Bioinformatics Training Room, Craik-Marshall Building | map | Geraldine Van der Auwera |
2 | Tue 14 Jun 2016 09:30 - 17:00 | 09:30 - 17:00 | Bioinformatics Training Room, Craik-Marshall Building | map | Geraldine Van der Auwera |
Bioinformatics, ChIP-seq, Data handling, Data mining, Data visualisation, Exome sequencing, Genomics, RNA-seq, Sequence variations
After this course you should be able to:
- Understand the overall variant discovery workflow rationale and requirements
- Understand key methods and functionalities in light of the latest research
- Apply the Best Practices tools and workflow to a real data set
- Interpret analysis results and troubleshoot common problems
During this course you will learn about:
- Variant detection for next-generation DNA sequencing
- Data pre-processing
- Variant discovery
- Variant functional annotation and evaluation
Presentations, demonstrations and practicals
2
Once a year
Booking / availability