Showing posts with label INTERPRETING GENOMES. Show all posts
Showing posts with label INTERPRETING GENOMES. Show all posts

Tuesday, July 22, 2014

RESEARCH USING SUPERCOMPUTER THAT COULD LINK GENES TO TRAITS AND DISEASES

FROM:  NATIONAL SCIENCE FOUNDATION 
"Bottom-up" proteomics

NSF-funded supercomputer helps researchers interpret genomes
Tandem protein mass spectrometry is one of the most widely used methods in proteomics, the large-scale study of proteins, particularly their structures and functions.

Researchers in the Marcotte group at the University of Texas at Austin are using the Stampede supercomputer to develop and test computer algorithms that let them more accurately and efficiently interpret proteomics mass spectrometry data.

The researchers are midway through a project that analyzes the largest animal proteomics dataset ever collected (data equivalent to roughly half of all currently existing shotgun proteomics data in the public domain). These samples span protein extracts from a wide variety of tissues and cell types sampled across the animal tree of life.

The analyses consume considerable computing cycles and require the use of Stampede's large memory nodes, but they allow the group to reconstruct the 'wiring diagrams' of cells by learning how all of the proteins encoded by a genome are associated into functional pathways, systems, and networks. Such models let scientists better define the functions of genes, and link genes to traits and diseases.

"Researchers would usually analyze these sorts of datasets one at a time," Edward Marcotte said. "TACC let us scale this to thousands."

Search This Blog

Translate

White House.gov Press Office Feed