May 21, 2019

After GWAS studies, how to narrow the search for genes?

by Nancy Fliesler, Children's Hospital Boston

Genome-wide association studies (GWAS) look at large populations to find genes that contribute to common, multi-gene traits like height or obesity. These comprehensive investigations frequently turn up large numbers of tiny genetic variations that show up more often in people who are tall, obese, etc. But this association doesn't mean the variant actually helps cause the trait; it could just be going along for the ride.

So which genes should scientists prioritize for further investigation? Numerous computational algorithms are available to help distill GWAS results, each using different criteria and assumptions. But it's been hard to know which one to pick.

Most methods used to evaluate such algorithms can bias investigators toward genes that are already well-characterized, steering them away from opportunities to discover something truly new. Other methods require access to independent reference data that aren't always readily available.

"We have different prioritization algorithms, but we don't actually know how to decide which one is best," says Rebecca Fine, a PhD candidate at Harvard Medical School who has been working on this problem. "We didn't want to have to rely on a previous 'gold standard' or bring in anything other than the original GWAS data."

Fine and Joel Hirschhorn, MD, PhD, chief of endocrinology at Boston Children's Hospital, have developed what they believe is an effective, unbiased method called Benchmarker, described in the American Journal of Human Genetics earlier this month.

Borrowing from machine learning

Borrowing the machine-learning concept of "cross-validation," Benchmarker enables investigators to use the GWAS data itself as its own control. The idea is to take the GWAS dataset and single out one chromosome. The algorithm being benchmarked then uses the data from the remaining 21 chromosomes (all but X and Y) to make predictions about what genes on the single chromosome are most likely to contribute to the trait being investigated. As this process is repeated for each chromosome in turn, the genes that the algorithm has flagged are pooled. The algorithm is then validated by comparing this group of prioritized genes with the original GWAS results.

"You train the algorithm on the GWAS with one chromosome withheld, then go back to that chromosome and ask whether those genes were actually associated with a strong p-value in the original GWAS results," explains Fine. "While these p-values don't represent the exact 'right answers,' they do tell you roughly where some true genetic associations are. The end product is an evaluation of how each algorithm performed."

Benchmarking Benchmarker

Putting this approach through its paces for 20 separate traits, Fine, Hirschhorn and colleagues conclude that combining multiple strategies often gives the best results. They also found evidence that certain algorithms perform best when looking for genes for certain traits.

"We expect that many more algorithms will be developed to answer the key next question after GWAS: which genes and variants are causally related to human traits and diseases," says Hirschhorn. "The Benchmarker approach can be a great help as an unbiased way to figure out which algorithms to use to answer this question."

More information: Rebecca S. Fine et al, Benchmarker: An Unbiased, Association-Data-Driven Strategy to Evaluate Gene Prioritization Algorithms, The American Journal of Human Genetics (2019). DOI: 10.1016/j.ajhg.2019.03.027

Journal information: American Journal of Human Genetics

Provided by Children's Hospital Boston

Citation: After GWAS studies, how to narrow the search for genes? (2019, May 21) retrieved 18 April 2024 from https://medicalxpress.com/news/2019-05-gwas-narrow-genes.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New computational method predicts genes likely to be causal in disease

5 shares

Feedback to editors

Researchers discover new therapeutic target for non-small cell lung cancer

4 hours ago

Immune cells carry a long-lasting 'memory' of early-life pain

4 hours ago

Cannabis legalization and rising sales have not contributed to increase in substance abuse, study finds

4 hours ago

No negative impact from prolonged eye patching on child's development or family stress levels

4 hours ago

COVID-19 booster immunity lasts much longer than primary series alone, study shows

5 hours ago

Study finds that human neuron signals flow in one direction

6 hours ago

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

6 hours ago

Scientists identify airway cells that sense aspirated water and acid reflux

6 hours ago

Environment may influence metacognitive abilities more than genetics

7 hours ago

Contracting RSV before age two can cause long-term lung changes and impairment

7 hours ago

Load comments (0)

After GWAS studies, how to narrow the search for genes?

Borrowing from machine learning

Benchmarking Benchmarker

Researchers discover new therapeutic target for non-small cell lung cancer

Immune cells carry a long-lasting 'memory' of early-life pain

Cannabis legalization and rising sales have not contributed to increase in substance abuse, study finds

No negative impact from prolonged eye patching on child's development or family stress levels

COVID-19 booster immunity lasts much longer than primary series alone, study shows

Study finds that human neuron signals flow in one direction

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

Scientists identify airway cells that sense aspirated water and acid reflux

Environment may influence metacognitive abilities more than genetics

Contracting RSV before age two can cause long-term lung changes and impairment

New computational method predicts genes likely to be causal in disease

New approach will help geneticists identify genes responsible for complex traits

Study reveals genetic basis of quantitative traits and diseases in Japanese population

Researchers identify causal variants in blood cells and tie them with genetic mechanisms

GIANT study reveals giant number of genes linked to height

Study identifies novel genetic factors for colorectal cancer risk

Environment may influence metacognitive abilities more than genetics

Mutations in noncoding DNA become functional in some cancer-driving genes

Large genomic study finds tri-ancestral origins for Japanese population

Siblings with unique genetic mutation help scientists progress drug search for type 1 diabetes

Scientists uncover 95 regions of the genome linked to PTSD

Shape-shifting cancer cell discovery reveals potential skin cancer drug targets

Phys.org

Tech Xplore

Science X

After GWAS studies, how to narrow the search for genes?

Borrowing from machine learning

Benchmarking Benchmarker

Researchers discover new therapeutic target for non-small cell lung cancer

Immune cells carry a long-lasting 'memory' of early-life pain

Cannabis legalization and rising sales have not contributed to increase in substance abuse, study finds

No negative impact from prolonged eye patching on child's development or family stress levels

COVID-19 booster immunity lasts much longer than primary series alone, study shows

Study finds that human neuron signals flow in one direction

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

Scientists identify airway cells that sense aspirated water and acid reflux

Environment may influence metacognitive abilities more than genetics

Contracting RSV before age two can cause long-term lung changes and impairment

Related Stories

New computational method predicts genes likely to be causal in disease

New approach will help geneticists identify genes responsible for complex traits

Study reveals genetic basis of quantitative traits and diseases in Japanese population

Researchers identify causal variants in blood cells and tie them with genetic mechanisms

GIANT study reveals giant number of genes linked to height

Study identifies novel genetic factors for colorectal cancer risk

Recommended for you

Environment may influence metacognitive abilities more than genetics

Mutations in noncoding DNA become functional in some cancer-driving genes

Large genomic study finds tri-ancestral origins for Japanese population

Siblings with unique genetic mutation help scientists progress drug search for type 1 diabetes

Scientists uncover 95 regions of the genome linked to PTSD

Shape-shifting cancer cell discovery reveals potential skin cancer drug targets

Newsletter sign up

Donate and enjoy an ad-free experience