Metagenomics: Tackling the new, the unknown and the roadblocks

Photo of Dr. Patricia Simner participating in a panel at ID Lab Con.

By Shannon Mahoney, senior specialist, Advanced Molecular Detection, APHL

With the recent COVID-19 pandemic and mpox outbreak, public health laboratories have found themselves in the critical position of having to rapidly adapt to emergent situations impacting a global population. For that reason, the APHL ID Lab Con session “Metagenomic Approaches to Infectious Disease Surveillance and Diagnostics” proved to be very popular. The panel of bioinformaticians, moderated by Patricia Simner, PhD, D(ABMM), Johns Hopkins University School of Medicine, provided attendees with insight into how state laboratories, the US Centers for Disease Control and Prevention (CDC) and academia have been working to combat new and evolving challenges in public health.

Metagenomics in Emergent Public Health Scenarios

Jared Johnson, PhD, former APHL Fellow and currently with the Michigan Bureau of Laboratories opened the session detailing his work utilizing shotgun metagenomics to establish a genomic surveillance methodology during the 2022 mpox outbreak.

Per guidelines from CDC, biosafety protocols regarding mpox are largely dependent on their identified clade, as infections with phylogenetically distinct viral strains can greatly affect the rate of mortality during an outbreak. The mpox virus is composed of two clades: Clade I, which is identified as a Select Agent by CDC and has a fatality rate of around 10% and Clade II, which has two definitive subsets and is rarely fatal, with a 99% recovery rate. Identifying the clade of a mpox-positive specimen plays a large role in helping to ensure the safety of not only the testing laboratory’s laboratory staff, but the communities that could be widely impacted by the outbreak. These factors led the state’s virology section to take action, implementing a method that could help sequence the emergent virus.

The benefit of a metagenomic approach allowed them to rapidly develop a surveillance method that didn’t need to be continuously altered once implemented, permitting for them to sequence without the threat of a decrease in the taxonomic classification quality. By developing a shotgun metagenomics method, which was sequenced on the Illumina NextSeq 550 System and analyzed using open-source alignment and clustering tools Nextclade and Mash, Michigan hopes to promote a robust sequencing technique commonly used to help identify continuously evolving viruses.

Bioinformatics in Resource-Limited Environments

A critical component of public health is being able to identify emergent threats in any region or country. However, one might begin to wonder how a resource-limited laboratory can manage infectious disease outbreaks, where access to technology, computing or personnel often provide a major roadblock.

That was the topic addressed by Brian Merritt’s talk on a threat-agnostic sequencing software toolkit developed by Johns Hopkins University’s Applied Physical Laboratory. Built around open-source workflow and analysis programs nf-core and Basestack, the toolkit provides a myriad of benefits to organizations constrained by hardware accessibility and technical limitations of staff.

The toolkit supports both short-read and long-read sequencing operations, and features interactive quality metrics, real-time reporting and taxonomic filtering, and a CLIA-style report detailing organism identification. No command line knowledge is needed to utilize the toolkit and its ability to be deployed from Docker reduces the need for high-end computing—promoting it as an easily deployable and widely accessible platform.

The Hopkins team has already tested and deployed the toolkit in a variety of use cases, teaming up with health partners in Cambodia and the NIH’s Fogarty International Center to introduce the platform in over 25 countries so far. In the future, they hope to implement more improvements such as expanding integrated module management, enlarging cloud integration, adding additional bioinformatic modules, and more.

Using Culture-independent Metagenomics to Tackle Foodborne Illness

One of the key advantages of metagenomics stands with its ability to provide a more comprehensive view of microbial communities when compared to traditional culture-based methods, which tend to be significantly more time-consuming and biased toward specific types of organisms. Similarly, culture-independent diagnostic testing (CIDT) also fails to identify novel pathogens. When compared to culture-based testing and CIDT, metagenomics tends to serve as an improved methodology when working with outbreaks of unknown etiology.

The advantages of culture-independent metagenomics weren’t lost on Andrew Huang, PhD, of CDC’s Enteric Diseases Laboratory Branch (EDLB), who discussed his unit’s Undiagnosed Diarrheal Illness (UNDI) project, aimed at tackling the frequency of unknown agents that cause one in every four foodborne illness outbreaks.

The project sought to develop a pipeline to aid in pathogen discovery for organisms with strong epidemiological links, but negative or inconsistent CIDT results. This led the CDC EDLB team to establishment a two-tier pipeline. The first tier serves as a rapid, inexpensive triage step targeting markers of known diarrheal pathogens (such as Shigella, Campylobacter, and Salmonella) that weren’t initially defined by CIDT.

The second tier, which would be utilized for specimen rendered negative or indeterminate in the first tier, applies a pathogen-agnostic shotgun metagenomic approach designed to identify shared genomic material for a small number of outbreak samples. It utilizes both reference-based tools (such as MIDAS2) and reference-free tools (such as composition based tools like MetaBat and MaxBin) to attempt to differentiate potential known pathogens that can evade CIDTs from the identification of a novel pathogen fueling a foodborne illness outbreak.

To further develop the pipeline, the CDC EDLB team are currently looking for state public health partners to collaborate with in order to aid in the collection of foodborne outbreak stool samples.

The post Metagenomics: Tackling the new, the unknown and the roadblocks appeared first on APHL Blog.

Two new positions: Senior Statistical Geneticist and Bioinformatician

Two new positions are available in my Infectious Disease Genomics group at the Big Data Institute, University of Oxford.

A Senior Postdoctoral Statistical Geneticist to jointly lead the implementation, design and application of new statistical tools for genome-wide association studies, lead the biological interpretation of key findings, develop methodologies and supervise junior group members. This post would suit a candidate with a PhD and relevant post-doctoral experience including direct experience in statistical genetics. Candidates without post-doctoral experience may be considered for a less senior appointment.

A Bioinformatician to provide expertise for computationally intensive analyses including genome-wide association studies and RNAseq studies of differential gene expression, as well as contributing to informatics projects as part of a wider collaboration with national biomedical cohorts. This post would suit a candidate with either a post-graduate degree related to Bioinformatics, Statistics, and Computing or equivalent experience in industry.

The application deadline for both posts is Noon GMT on Friday 7th January 2022.

Enhancing detection of newborn screening conditions via data analytics

Enhancing detection of newborn screening conditions via data analytics | www.APHLblog.org

For over 50 years, newborn screening programs across the United States have implemented laboratory screening and follow-up programs to detect and report infants at high risk for rare diseases. As we look towards the future, current testing challenges will likely become more pronounced with the anticipated addition of new conditions to the Recommended Uniform Screening Panel (RUSP), increasing sophistication of testing platforms and methodologies, and greater complexity of biomarker profiles.

Building the data analytic capacity of newborn screening programs will help support the analysis and interpretation of patient data, providing tools and resources to create efficiencies in time-intensive program activities.

APHL and the Newborn Screening and Molecular Biology Branch of the Centers for Disease Control and Prevention (CDC) are exploring solutions aimed at improving the interpretation of laboratory tests by expanding data analytic capacity in the following ways:

  • Increasing state newborn screening programs’ capacity to evaluate and interpret laboratory test data by providing Newborn Screening Bioinformatics Fellows
  • Creating a Newborn Screening Data Analytic Workgroup focused on sharing and harmonizing best practices and solutions
  • Enhancing data-driven decision making in the newborn screening community by designing and developing data science resources to address newborn screening-specific data challenges

In March 2019, APHL and CDC hosted a national meeting in Atlanta, GA to broaden their efforts, engage state newborn screening programs in a collective data analytics initiative, and discuss progress toward enhanced disease detection utilizing improved data analytics resources and technologies specific to newborn screening.

The meeting provided a forum for participants to discuss the needs around biochemical and molecular screening methodologies and their related data analytics requirements, as well as the value of data to improving health outcomes.

This national dialogue will help guide CDC development of an in-house data analytics resource that will improve the interpretation of biochemical and molecular test results.

This activity was supported by Cooperative Agreement #NU60OE000103-04 funded by the Centers for Disease Control and Prevention. Its contents are solely the responsibility of the authors and do not necessarily represent the official views of CDC or the Department of Health and Human Services.

The post Enhancing detection of newborn screening conditions via data analytics appeared first on APHL Lab Blog.

Lab Culture Ep. 17: Exploring bioinformatics: From fellow to full time in Virginia

Lab Culture Ep. 17: Exploring bioinformatics: From fellow to full time in Virginia | www.APHLblog.org

Kevin Libuit went from the APHL-CDC Bioinformatics Fellowship to a contractor to working full-time as a bioinformatician at the Virginia state lab (VA Division of Consolidated Laboratory Services (DCLS)). First he talks about when he discovered bioinformatics as a field and how the fellowship propelled his career. Then Kevin takes the mic and interviews Dr. Denise Toney, director of Virginia DCLS, about the value and growing need for bioinformaticians in public health labs.

 

 

Kevin G. Libuit, M.S.
Bioinformatics Lead Scientist, Division of Consolidated Laboratory Services (DCLS), Virginia Department of General Services

Denise Toney, PhD
Director, Division of Consolidated Laboratory Services (DCLS), Virginia Department of General Services

Links:

APHL-CDC Fellowships

APHL-CDC Bioinformatics Fellowships

Virginia Division of Consolidated Laboratory Services (DCLS)

APHL Off the Bench (new Facebook group!)

 

The post Lab Culture Ep. 17: Exploring bioinformatics: From fellow to full time in Virginia appeared first on APHL Lab Blog.

NIH Data Science Collaborative Hackathon April 16 – 18, 2018

The NCBI will assist with a data science hackathon to take place on the NIH Campus in Bethesda, Maryland, from April 16-18, 2018. The hackathon will focus on tools for advanced analysis of biomedical datasets including text, images, next generation … Continue reading

Bioinformatics paper uses NCBI open data to analyze drug response

A study (PMID: 28158543) published in the July 2017 issue of Bioinformatics collects, classifies and analyzes single nucleotide variants (SNVs) that may affect response to currently approved drugs. They identified 2,640 SNVs of interest, most of which occur rarely in populations (minor … Continue reading

North Carolina Research Triangle Hackathon March 12-14, 2018

The UNC Curriculum in Bioinformatics and Computational Biology and NCBI will host a data science hackathon from March 12-14, 2018 on the campus of the University of North Carolina at Chapel Hill. Projects addressed during the hackathon will involve general … Continue reading

NCBI to assist in Southern California genomics hackathon in January

From January 10-12, 2018, the NCBI will help with a bioinformatics hackathon in Southern California hosted by San Diego State University. The hackathon will focus on advanced bioinformatics analysis of next generation sequencing data, proteomics, and metadata. This event is … Continue reading