New Tutorial: VGP assembly pipeline


Posted on: 14 March 2022

We are proud to announce that, as a result of our collaboration with the Vertebrate Genomes Project (VGP), a new tutorial describing the VGP assembly pipeline is now available in the Galaxy Training Network. The VGP aims to generate high-quality, near-error-free, gap-free, chromosome-level, haplotype-phased, annotated reference genome assemblies for every vertebrate species.

Figure 1: VGP Pipeline 2.0. The pipeline starts with the assembly of the HiFi reads into contigs, yielding the primary and alternate assemblies. Then, duplicated and erroneously assigned contigs are removed with purge_dups. Finally, Bionano optical maps and Hi-C data are used to generate a scaffolded primary assembly.

The tutorial is organized in four sections: genome profiling, HiFi phased assembly, post-assembly processing and hybrid scaffolding. During the genome profiling stage, several tools based on the analysis of k-mer frequencies are used to infer the properties of the genome. After that, a draft assembly is generated from high-accuracy PacBio HiFi long reads. In the third stage, the initial assembly is processed to identify and reassign allelic contigs. Finally, in the last stage, the assembled contigs are joined into scaffolds by using two additional technologies: Bionano optical maps and Hi-C data.
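To give a feel for what "analysis of k-mer frequencies" means in the genome profiling stage, here is a minimal Python sketch. It is not the tooling used in the tutorial: the function names `kmer_counts` and `kmer_spectrum` and the toy reads are made up for illustration, and the sketch ignores details that dedicated k-mer counters handle, such as merging a k-mer with its reverse complement. It only shows the basic idea of counting every k-mer in the reads and then summarising how many distinct k-mers occur at each multiplicity.

```python
from collections import Counter


def kmer_counts(reads, k):
    """Count every k-mer (substring of length k) across all reads."""
    counts = Counter()
    for read in reads:
        # A read of length n contains n - k + 1 overlapping k-mers.
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def kmer_spectrum(counts):
    """Map each multiplicity to the number of distinct k-mers seen that often."""
    return Counter(counts.values())


# Toy reads, invented for illustration only (not real sequencing data).
reads = ["ACGTACGT", "CGTACGTA", "GTACGTAC"]

counts = kmer_counts(reads, k=4)
spectrum = kmer_spectrum(counts)

# Print the k-mer spectrum: multiplicity, then number of distinct k-mers.
for multiplicity in sorted(spectrum):
    print(multiplicity, spectrum[multiplicity])
```

On real data, the shape of this spectrum (for example, the position and height of its peaks) is the kind of signal that profiling tools fit a model to in order to estimate properties of the genome.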

View Material
Assembly of vertebrate genomes
