International teams assess computational tools to speed up annotations.
The accurate annotation of protein function from genomic sequences is key to understanding biological processes at the molecular level. However, experimental characterization of protein function is challenging and costly and thus cannot keep pace with the amount of sequencing data being produced.
Numerous computational methods to predict protein function have been developed over the past decade to address the growing divide between sequence data and protein functional annotations. Recently, the scientific community came together to provide an unbiased evaluation of these new methods. This effort, named Critical Assessment of Protein Function Annotation (CAFA), consisted of 30 international teams of scientists who evaluated various computational methods on a target set of 866 protein sequences from 11 species, both eukaryotic and prokaryotic.
The organizers gave the research community four months to provide computational predictions of protein function, and then CAFA assessors obtained experimental validations of the targeted protein functions. The results suggest that predicting protein function is difficult because proteins can behave differently depending on environmental factors, such as pH, temperature, or the presence of interacting partners. This was evident across all targets studied, although predictions of molecular function (e.g., protein binding) outperformed predictions of biological processes (e.g., dynamics as a function of temperature). The CAFA community concluded that one way to improve annotation would be to integrate a variety of experimental evidence and data into new computational methods.
School of Informatics and Computing, Indiana University
The CAFA activity and Automated Function Prediction Special Interest Group meeting at the ISMB 2011 conference were supported jointly by the U.S. National Institutes of Health (grant R13 HG006079-01A1) and the Office of Biological and Environmental Research within the U.S. Department of Energy’s Office of Science (grant DE-SC0006807TDD).
Radivojac, P., et al. “A large-scale evaluation of computational protein function prediction,” Nature Methods 10(3), 221-227 (2013). [DOI: 10.1038/nmeth.2340]
SC-23.2 Biological Systems Science Division, BER
BER supports basic research and scientific user facilities to advance DOE missions in energy and environment. More about BER
May 10, 2019
Quantifying Decision Uncertainty in Water Management via a Coupled Agent-Based Model
Considering risk perception can improve the representation of human decision-making processes in age [more...]
May 09, 2019
Projecting Global Urban Area Growth Through 2100 Based on Historical Time Series Data and Future Scenarios
Study provides country-specific urban area growth models and the first dataset on country-level urba [more...]
May 05, 2019
Calibrating Building Energy Demand Models to Refine Long-Term Energy Planning
A new, flexible calibration approach improved model accuracy in capturing year-to-year changes in bu [more...]
May 03, 2019
Calibration and Uncertainty Analysis of Demeter for Better Downscaling of Global Land Use and Land Cover Projections
Researchers improved the Demeter model’s performance by calibrating key parameters and establi [more...]
Apr 22, 2019
Representation of U.S. Warm Temperature Extremes in Global Climate Model Ensembles
Representation of warm temperature events varies considerably among global climate models, which has [more...]
List all highlights (possible long download time)