Galaxy Europe monitor COG-UK Covid-19 genomic data

Galaxy Europe monitor COG-UK Covid-19 genomic data

Galaxy Europe has started to monitor COG-UK data. COG-UK is the COVID-19 Genomics UK Consortium, which was set up in 2020 in response to the Coronavirus pandemic.  The consortium sequences SARS-CoV-2 genomes, providing vital information on transmission and tracking of viral variants. This work has been crucial in the UK’s response to the pandemic and to our ability to identify and track new variants with such success. 

Galaxy Europe is now able to use this database to monitor the genomes being sequenced for important data such as provenance and variants present. The protocol means that new samples in the database are processed every three hours and the results are then put into public storage and integrated into Beacon by the Centre of Genomic Regulation. Beacon is an ELIXIR tool to find SARS-CoV-2 variability at genomic, amino acid and motif level and allows researchers to search for things such as specific genomic variants and explore associated metadata.

WorkflowHub, another project which ELIXIR-UK is heavily involved in, hosts the public validated workflows that this project uses. It is important when producing publicly available data that the whole process is well documented in order for the results to be trusted and reproducible, and so WorkflowHub plays an important role in this project. 

The workflows produced during this project can be used to process samples from other countries, and Galaxy Europe is offering to help set this up for other countries as a next step in monitoring SARS-CoV-2 genomic data. This continuous analysis system is automated using the BioBlend software library, which is maintained at the Earlham Institute, one of the ELIXIR-UK member organisations.

The team behind this project have also provided the resources for people who don’t use Galaxy to contribute via a simple GitHub. This should make it easy for people to analyse viral sequences even if they don’t want to run their own workflows. 

All through the Coronavirus pandemic there has been a clear need for high quality genomic data and associated metadata which is freely available for researchers working in this area. It has enabled the tracking of transmission in care homes and hospitals, and allowed early identification of new strains. The UK has led the way in this area and the data we have produced is now being used to demonstrate this new way of monitoring the data from Galaxy Europe.

Find the documented process here 

Read more about the validated workflows

Find a summary of all the data processed in this dashboard

Read about ELIXIR-UK’s Covid-19 efforts