Saturday, April 06, 2013

Genetic prediction: autism

Some time ago I posted on a striking claim of genetic prediction for autism risk that appeared in Nature Molecular Psychiatry:
Predicting the diagnosis of autism spectrum disorder using gene pathway analysis (Nature Molecular Psychiatry)

Abstract
Autism spectrum disorder (ASD) depends on a clinical interview with no biomarkers to aid diagnosis. The current investigation interrogated single-nucleotide polymorphisms (SNPs) of individuals with ASD from the Autism Genetic Resource Exchange (AGRE) database. SNPs were mapped to Kyoto Encyclopedia of Genes and Genomes (KEGG)-derived pathways to identify affected cellular processes and develop a diagnostic test. This test was then applied to two independent samples from the Simons Foundation Autism Research Initiative (SFARI) and Wellcome Trust 1958 normal birth cohort (WTBC) for validation. Using AGRE SNP data from a Central European (CEU) cohort, we created a genetic diagnostic classifier consisting of 237 SNPs in 146 genes that correctly predicted ASD diagnosis in 85.6% of CEU cases. This classifier also predicted 84.3% of cases in an ethnically related Tuscan cohort; however, prediction was less accurate (56.4%) in a genetically dissimilar Han Chinese cohort (HAN). Eight SNPs in three genes (KCNMB4, GNAO1, GRM5) had the largest effect in the classifier with some acting as vulnerability SNPs, whereas others were protective. Prediction accuracy diminished as the number of SNPs analyzed in the model was decreased. Our diagnostic classifier correctly predicted ASD diagnosis with an accuracy of 71.7% in CEU individuals from the SFARI (ASD) and WTBC (controls) validation data sets. In conclusion, we have developed an accurate diagnostic test for a genetically homogeneous group to aid in early detection of ASD. While SNPs differ across ethnic groups, our pathway approach identified cellular processes common to ASD across ethnicities. Our results have wide implications for detection, intervention and prevention of ASD.
From my comments:
The approach taken here first selects 775 SNPs of interest based on pathway information (not considered in standard GWAS) and then only requires 5E-03 significance. A linear predictor is formed from the 237 SNPs that pass this threshold. The ultimate test is, of course, whether the predictor actually works on (independent) validation samples. Once you have a statistically valid predictor, it doesn't matter how you arrived at it.
This recent letter to the editor of Molecular Psychiatry claims that the predictive power came from the fact that the case and control groups had slightly different ancestral origin (via neuroskeptic):
... cases have more diverse ancestral origins within Europe than controls. The putative risk alleles are more common in the Northeastern than in the Northwestern Europe, whereas the putative protective alleles reflect the opposite trend.
But I don't understand how this explains the moderate success of the classifier in the Chinese cohort -- I'll have to look more carefully. Here are the title and abstract of the new paper.
Population structure confounds autism genetic classifier

T G Belgard, I Jankovic, J K Lowe and D H Geschwind

Abstract
A classifier was recently reported to predict with 70% accuracy if an individual has an autism spectrum disorder using 237 single-nucleotide polymorphisms (SNPs).1 Biomarkers, genetic or otherwise, that would facilitate earlier autism spectrum disorder diagnosis are crucial; therefore, these results warrant careful scrutiny. One potential confounder of such genetic studies is bias when cases and controls have different ancestral origins.

3 comments:

  1. http://blogs.discovermagazine.com/neuroskeptic/2013/04/05/genetic-test-for-autism-criticized/#.UWEOHcVXYix

    ReplyDelete
  2. jeffhsu33:32 PM

    Yes, population structure can lead to spurious associations (duh!); however, there are undoubtedly some ancestral origin components that lead to disease susceptibility. Different groups may have different phenotypes and correcting for it (via principle components) may reduce the signal. This is especially true for complex phenotypes with many causative loci. It looks like North Eastern Europeans have higher rates of autism http://sfari.org/news-and-opinion/news/2011/researchers-track-down-autism-rates-across-the-globe. The reason the prediction may work for Chinese as well, is that North Eastern Europeans share significant ancestry vis-a-vis other Europeans. Not saying that the original study is right, but we cannot simply dismiss group differences.

    ReplyDelete
  3. The Chinese comparison had a mere 33 cases & 169 controls (read: huge confidence intervals). Also, the only metric given is accuracy - a classifier could have achieved 84% accuracy in this Chinese comparison just by classifying all of these samples as not having autism. "56.4% accuracy" is uninterpretable without more information.

    ReplyDelete