BER launches Environmental System Science Program. Visit our new website under construction!

U.S. Department of Energy Office of Biological and Environmental Research

BER Research Highlights

A Viral Gold Rush
Published: May 01, 2019
Posted: March 16, 2020

Scientists develop a new tool to find viruses in complex genomic datasets.

The Science
Researchers developed open-source software that can classify viruses in ways that previous tools could not. Scientists have limited data on viruses that they cannot grow in laboratories. That lack of information makes the viruses especially hard to classify. This new system uses viral genes to separate out viruses that are difficult to distinguish from each other into distinct groups. This separation is a key step in organizing and isolating viruses that are particularly interesting to scientists. Tests using information from known viruses have shown the new software to be very accurate.

The Impact
Research on viruses is an important frontier in environmental science. In fact, viruses that invade bacteria and archaea are most likely critical to all ecosystems. Every environment contains myriad viruses that scientists cannot grow in the laboratory. However, the lack of a framework that can classify large numbers of viruses and includes viruses’ relationship with their hosts holds back progress in this area. This software tool provides a new standard for classifying viruses that scientists have detected in DNA from field and other environmental samples.

Classification of environmental viruses, specifically uncultivated viral genomes (called UVIGS) is a key step to organizing the virosphere and isolating viral groups of potential interest. Single-gene or full-genome phylogenies are commonly used to classify viruses within a known framework of virus classification. However, a high rate of gene exchange in and between bacterial viruses (i.e., phages) makes it difficult to classify highly divergent phages with the limited data available. A team of researchers developed vConTACT 2.0, an open-source, community-available, network-based software application to establish prokaryotic virus taxonomy that scales to thousands of uncultivated virus genomes or fragments, while integrating multiple confidence scores for all taxonomic predictions. Performance tests show the predictions of the new software with currently classified viruses to be very accurate (International Committee on Taxonomy of Viruses: >91% genus-level assignments at 97% accuracy). This approach can also resolve highly recombinogenic taxa through an integrated distance-based hierarchical approach, and remaining discrepancies likely will require changes to current viral taxonomy guides. vConTACT 2.0 also automatically classified 1,364 previously unclassified reference viruses. The software application can be scaled to modern metagenomic datasets with a robust reference network and could potentially uncover thousands more viral sequences. Together, these efforts provide a systematic reference network and a robust, scalable taxonomic analysis tool that is critically needed by the research community.

BER Program Managers
Dawn Adin
U.S. Department of Energy Office of Science, Office of Biological and Environmental Research
Biological Systems Science Division (SC-23.2)
Foundational Genomics Research

Ramana Madupu
U.S. Department of Energy Office of Science, Office of Biological and Environmental Research
Biological Systems Science Division (SC-23.2)
Foundational Genomics Research, Computational Biosciences, and DOE Joint Genome Institute

Principal Investigators
Matthew Sullivan
Ohio State University

Jennifer Pett-Ridge
Lawrence Livermore National Laboratory

Funding was provided in part by the Office of Biological and Environmental Research Genomic Science program’s Soil Microbiome Scientific Focus Area, within the U.S. Department of Energy (DOE) Office of Science, award to Lawrence Livermore National Laboratory; National Science Foundation Biological Oceanography awards; and a Gordon and Betty Moore Foundation Investigator Award to M. B. Sullivan. Funding was provided to J. R. Brister by the Intramural Research Program of the U.S. National Institutes of Health (NIH) National Library of Medicine. The work conducted by the DOE Joint Genome Institute is supported by the DOE Office of Science. This work was also funded in part through Battelle Memorial Institute’s prime contract with the NIH National Institute of Allergy and Infectious Diseases.

Jang, H. B., B. Bolduc, O. Zablocki, J. H. Kuhn, S. Roux, E. M. Adriaenssens, J. R. Brister, A. M. Kropinski, M. Krupovic, R. Lavigne, D. Turner, and M. Sullivan. “Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks.” Nature Biotechnology 37, 632–39 (2019). [DOI:10.1038/s41587-019-0100-8].

Topic Areas:

  • Research Area: Genomic Analysis and Systems Biology
  • Research Area: Microbes and Communities
  • Research Area: Computational Biology, Bioinformatics, Modeling

Division: SC-33.2 Biological Systems Science Division, BER


BER supports basic research and scientific user facilities to advance DOE missions in energy and environment. More about BER

Recent Highlights

Mar 23, 2021
Molecular Connections from Plants to Fungi to Ants
Lipids transfer energy and serve as an inter-kingdom communication tool in leaf-cutter ants&rsqu [more...]

Mar 19, 2021
Microbes Use Ancient Metabolism to Cycle Phosphorus
Microbial cycling of phosphorus through reduction-oxidation reactions is older and more widespre [more...]

Feb 22, 2021
Warming Soil Means Stronger Microbe Networks
Soil warming leads to more complex, larger, and more connected networks of microbes in those soi [more...]

Jan 27, 2021
Labeling the Thale Cress Metabolites
New data pipeline identifies metabolites following heavy isotope labeling.

Analysis [more...]

Aug 31, 2020
Novel Bacterial Clade Reveals Origin of Form I Rubisco

  • All plant biomass is sourced from the carbon-fixing enzyme Rub [more...]

List all highlights (possible long download time)