Pessimism of the Intellect, Optimism of the Will Favorite posts | Manifold podcast | Twitter: @hsu_steve
Thursday, February 08, 2024
Lecture: Fermi Paradox, AI, Simulation Question — Manifold #53
Sunday, October 29, 2023
The Future of Intelligence: An Interview with Steve Hsu (The Latecomer Magazine)
Wednesday, June 28, 2023
Embryo Selection: Healthy Babies vs Bad Arguments
Polygenic screening and its discontents
... But monogenic and chromosomal screening can only address a part of disease risk because most health conditions that afflict people are polygenic, meaning they are not simply caused by one gene or by a chromosomal abnormality. Instead, they are caused by a huge number of small additive effects dispersed throughout the genome. For example, cancer, schizophrenia, and diabetes can be best predicted by models using tens of thousands of genes.
A polygenic risk score (PRS) looks at a person’s DNA to see how many variants they have associated with a particular disease. Like BRCA1, polygenic risk scores are typically not determinative: “Polygenic screening is not a diagnosis: It is a prediction of relative future risk compared to other people.” In other words, someone with BRCA1 has a higher risk than someone without, and someone with a high breast cancer PRS has a higher risk than someone with a lower breast cancer PRS. But in principle, BRCA1 is just one gene out of thousands contributing to a PRS, with each bit contributing a small part of a total risk estimate. ...
... Recently, a group of European scientists argued that polygenic screening should not be available to couples because it will lead to stigmatization, exacerbate inequalities, or lead to confusion by parents about how to weigh up information about risks before they decide which embryo to implant. These are indeed challenges, but they are not unique to embryo selection using polygenic scores, and they are not plausible arguments for restricting the autonomy of parents who wish to screen their embryos for polygenic traits. Furthermore, from an ethical perspective, it is unconscionable to deny polygenic screening to families with a history of any disease whose risk can be reduced by this lifesaving technology.
Many new technologies are initially only available to people with more money, but these first adopters then end up subsidizing research that drives costs down and quality up. Many other medical choices involve complexity or might result in some people being stigmatized, but this is a reason to encourage genetic counseling and to encourage social tolerance. It is not a reason to marginalize, stigmatize, or criminalize IVF mothers and fathers who wish to use the best available science to increase the chances that their children will be healthy and happy.This is a comment on the article:
1) They don't want to admit that some people are better than others, inherently. Boo hoo.
2) You put a scorecard of embryos in front of everyone, and everyone has a pretty good ballpark estimate of which are better and which are worse. Nobody is going to pretend equality is true when they are choosing their kids genes.
3) So bad feels.
4) Must therefore retard all human progress and cause immense suffering because don't want to deal with bad feels.
That's the anti-polygenic argument in a nutshell. I don't expect it to be very effective. At best it will cause it to take a bit longer before poor people have access.
Sunday, November 13, 2022
Smart Leftists vs Dumb Leftists
Smart Leftists vs Dumb Leftists:
— steve hsu (@hsu_steve) November 13, 2022
Who qualifies as former?
How about Soros, the top donor to the Democratic Party?
A few years ago I gave a talk on genomics (including of cognitive ability) to top leadership at Soros Fund Management.
Reaction: overwhelmingy positive 🤔 1/
Smart Leftists know things which they cannot admit in public.
— steve hsu (@hsu_steve) November 13, 2022
Dumb Leftists fight over these things like retards scuffling in the hallway.
2/
See, e.g., https://t.co/iiqc9TM5ue pic.twitter.com/IUIKp5Y60L
If you are not a billionaire, but want to know what I told the Soros people, see the talks I've given at Berkeley Innovative Genomics Institute, DeepMind, OpenAI, Janelia Farms (HHMI), Allen Institute, Cold Spring Harbor, etc.
— steve hsu (@hsu_steve) November 13, 2022
Links below. 3/https://t.co/xrvxOm1Ld4
Note, the forward looking statements I made in these talks - which were considered bold by the academic genomics community at the time - have almost all been proven correct. 4/https://t.co/vVq4m1Hr11https://t.co/bHYr7vfhdAhttps://t.co/ONdOKyDmQc
— steve hsu (@hsu_steve) November 13, 2022
The impact of progess in polygenic prediction is not confined to academic science.
— steve hsu (@hsu_steve) November 13, 2022
It is going to change how humans reproduce, and even the course of human evolution. 5/https://t.co/Z1fr1zMhD3 pic.twitter.com/iJx5yoZRaR
Tuesday, October 25, 2022
American Society of Human Genetics (ASHG) 2022 Posters
New results from Taiwan Precision Medicine Initiative, a mega-biobank (>500k already genotyped).
— steve hsu (@hsu_steve) October 25, 2022
Diabetes predictor trained in TW as powerful as EUR-trained PRS. Breast Cancer PRS validated in TW. More!
Poster for Amer. Soc. Hum. Genetics#ASHG2022 pic.twitter.com/kU7RmHGlVi
Monte Carlo projections for PGS performance in current and future biobanks.
— steve hsu (@hsu_steve) October 25, 2022
Also methods comparison between sparse training algos: L1, PRScs, elastic net, ..
All of Us and TPMI (Taiwan) will close PGS gap for non-Euro ancestry groups!
Amer. Soc. Hum. Genetics#ASHG2022 pic.twitter.com/Pu7UfeaKEN
Thursday, October 06, 2022
Jeffrey Sachs: Lessons from the COVID Commission, Lab Leak Questions, and Nord Stream — Manifold Episode 21
Tuesday, September 20, 2022
Sibling Variation in Phenotype and Genotype: Polygenic Trait Distributions and DNA Recombination Mapping with UK Biobank and IVF Family Data (medRxiv)
Sibling Variation in Phenotype and Genotype: Polygenic Trait Distributions and DNA Recombination Mapping with UK Biobank and IVF Family Data
L. Lello, M. Hsu, E. Widen, and T. Raben
We use UK Biobank and a unique IVF family dataset (including genotyped embryos) to investigate sibling variation in both phenotype and genotype. We compare phenotype (disease status, height, blood biomarkers) and genotype (polygenic scores, polygenic health index) distributions among siblings to those in the general population. As expected, the between-siblings standard deviation in polygenic scores is \sqrt{2} times smaller than in the general population, but variation is still significant. As previously demonstrated, this allows for substantial benefit from polygenic screening in IVF. Differences in sibling genotypes result from distinct recombination patterns in sexual reproduction. We develop a novel sibling-pair method for detection of recombination breaks via statistical discontinuities. The new method is used to construct a dataset of 1.44 million recombination events which may be useful in further study of meiosis.
Here are some figures illustrating the variation of polygenic scores among siblings from the same family.
The excerpt below describes the IVF family highlighted in blue above:
Among the families displayed in these figures, at position number 15 from the left, we encounter an interesting case of sibling polygenic distribution relative to the parents. In the family all siblings have significantly higher Health Index score than the parents. This arises in an interesting manner: the mother is a high-risk outlier for condition X and the father is a high-risk outlier for condition Y. (We do not specify X and Y, out of an abundance of caution for privacy, although the patients have consented that such information could be shared.) Their lower overall Health Index scores result from high risk of conditions X (mother) and Y (father). However, the embryos, each resulting from unique recombination of parental genotypes, are normal risk for both X and Y and each embryo has much higher Health Index score than the parents.This case illustrates well the potential benefits from PGS embryo screening.
The second part of the paper introduces a new technique that directly probes DNA recombination -- the molecular mechanism responsible for sibling genetic differences. See figure above for some results. The new method detects recombination breaks via statistical discontinuities in pairwise comparisons of DNA regions.
...This new sibling-pair method can be applied to large datasets with many thousands of sibling pairs. In this project we created a map of roughly 1.44 million recombination events using UKB genomes. Similar maps can now be created using other biobank data, including in non-European ancestry groups that have not yet received sufficient attention. The landmark deCODE results were obtained under special circumstances: the researchers had access to data resulting from a nationwide project utilizing genealogical records (unusually prevalent in Iceland) and widespread sequencing. Using the sibling-pair method results of comparable accuracy can be obtained from existing datasets around the world -- e.g., national biobanks in countries such as the USA, Estonia, China, Taiwan, Japan, etc.
Monday, September 05, 2022
Lunar Society (Dwarkesh Patel) Interview
Thursday, June 23, 2022
Polygenic Health Index, General Health, and Disease Risk
Polygenic Health Index, General Health, and Disease Risk
We construct a polygenic health index as a weighted sum of polygenic risk scores for 20 major disease conditions, including, e.g., coronary artery disease, type 1 and 2 diabetes, schizophrenia, etc. Individual weights are determined by population-level estimates of impact on life expectancy. We validate this index in odds ratios and selection experiments using unrelated individuals and siblings (pairs and trios) from the UK Biobank. Individuals with higher index scores have decreased disease risk across almost all 20 diseases (no significant risk increases), and longer calculated life expectancy. When estimated Disability Adjusted Life Years (DALYs) are used as the performance metric, the gain from selection among 10 individuals (highest index score vs average) is found to be roughly 4 DALYs. We find no statistical evidence for antagonistic trade-offs in risk reduction across these diseases. Correlations between genetic disease risks are found to be mostly positive and generally mild. These results have important implications for public health and also for fundamental issues such as pleiotropy and genetic architecture of human disease conditions.
https://www.medrxiv.org/content/10.1101/2022.06.15.22276102v1
The days of our years are threescore years and ten; and if by reason of strength they be fourscore years, yet is their strength labour and sorrow; for it is soon cut off, and we fly away
Psalm 90:10
Sunday, May 29, 2022
Genomic Prediction in Bloomberg
Bloomberg: Simone Collins knew she was pregnant the moment she answered the phone. ... Embryo 3, the fertilized egg that Collins and her husband, Malcolm, had picked, could soon be their daughter—a little girl with, according to their tests, an unusually good chance of avoiding heart disease, cancer, diabetes, and schizophrenia.
This isn’t a story about Gattaca-style designer babies. No genes were edited in the creation of Collins’s embryo. The promise, from dozens of fertility clinics around the world, is just that the new DNA tests they’re using can assess, in unprecedented detail, whether one embryo is more likely than the next to develop a range of illnesses long thought to be beyond DNA-based predictions. It’s a new twist on the industry-standard testing known as preimplantation genetic testing, which for decades has checked embryos for rare diseases, such as cystic fibrosis, that are caused by a single gene.
One challenge with leading killers like cancer and heart disease is that they’re usually polygenic: linked to many different genes with complex interactions. Patients such as Collins can now take tests that assess thousands of DNA data points to decode these complexities and compute the disease risks. Genomic Prediction, the five-year-old New Jersey company that handled the tests for her fertility clinic, generates polygenic risk scores, predicting in percentage terms each embryo’s chances of contracting each disease in the panel, plus a composite score for overall health. Parents with multiple embryos can then weigh the scores when deciding which one to implant.
...
This new form of genetic embryo testing appears to move humanity one step closer to control of its evolution. The $14 billion IVF industry brings more than 500,000 babies into the world each year, and with infertility rates rising, the market is expected to more than double this decade. Companies including Genomic Prediction bet many going into that process have seen enough loved ones suffer from a polygenic disease to want risk scoring.
[ Note I think the number of IVF babies born worldwide each year is more like 1 million, but there is some uncertainty in estimates. ]
...
In December, Genomic Prediction doubled its venture funding to about $25 million and says it will use the cash to expand and add to its testing panel. Boston IVF, one of the biggest fertility networks in the US, recently started offering Genomic Prediction’s polygenic testing to its patients, says CEO David Stern. “Like anything else, you have early adopters,” he says. “We have had patients who worked in the biotech field or the Harvard milieu who came in and asked for it.” Stern predicts that, like egg freezing, polygenic embryo testing will grow slowly at first, but steadily, and eventually demand will reflect the powerful appeal of lowering a child’s odds for disease.
...
Believers such as Collins and her husband support government subsidies for fertility and parenthood but aren’t interested in any conversation about slowing down. “This is about the people who care about giving their children every opportunity,” she says. “I do not believe that law or social norms are going to stop parents from giving their kids advantages.”
... improved success rates resulting from higher accuracy in aneuploidy screening of embryos will affect millions of families around the world, and over 60% of all IVF families in the US.
The SNP array platform allows very accurate genotyping of each embryo at ~1 million locations in the genome, and the subsequent bioinformatic analysis produces a much more accurate prediction of chromosomal normality than the older methods.
Millions of embryos are screened each year using PGT-A, about 60% of all IVF embryos in the US.
Klaus Wiemer is the laborator director for Poma Fertility near Seattle. He conducted this study independently, without informing Genomic Prediction.
There are ~3000 embryos in the dataset, all biopsied at Poma and samples allocated to three testing labs A,B,C using the two different methods. The family demographics (e.g., maternal age) were similar in all three groups. Lab B is Genomic Prediction and A,C are two of the largest IVF testing labs in the world, using NGS.
The results imply lower false-positive rates, lower false-negative rates, and higher accuracy overall from our methods. These lead to a significantly higher pregnancy success rate.
The new technology has the potential to help millions of families all over the world.
Thursday, May 05, 2022
Raghuveer Parthasarathy: Four Physical Principles and Biophysics -- Manifold podcast #11
key holez • 2 days ago
It was a fascinating episode, and I immediately went out and ordered the book! One question that came to mind: given how much of the human genome is dedicated to complex regulatory mechanisms and not proteins as such, it seems unintuitive to me that so much of heritability seems to be additive. I would have thought that in a system with lots of complicated,messy on/off switches, small genetic differences would often lead to large phenotype differences -- but if what I've heard about polygenic prediction is right, then, empirically, assuming everything is linear seems to work just fine (outside of rare variants, maybe). Is there a clear explanation for how complex feedback patterns give rise to linearity in the end? Is it just another manifestation of the central limit theorem...?
steve hsu
This is an active area of research. It is somewhat surprising even to me how well linearity / additivity holds in human genetics. Searches for non-linear effects on complex traits have been largely unsuccessful -- i.e., in the sense that most of the variance seems to be controlled by additive effects. By now this has been investigated for large numbers of traits including major diseases, quantitive traits such as blood biomarkers, height, cognitive ability, etc.
One possible explanation is that because humans are so similar to each other, and have passed through tight evolutionary bottlenecks, *individual differences* between humans are mainly due to small additive effects, located both in regulatory and coding regions.
To genetically edit a human into a frog presumably requires many changes in loci with big nonlinear effects. However, it may be the case that almost all such genetic variants are *fixed* in the human population: what makes two individuals different from each other is mainly small additive effects.
Zooming out slightly, the implications for human genetic engineering are very positive. Vast pools of additive variance means that multiplex gene editing will not be impossibly hard...This topic is discussed further in the review article: https://arxiv.org/abs/2101.05870
Friday, March 11, 2022
Genomic Prediction’s Stephen Hsu: Making superhumans will be possible (Sunday Times podcast)
Genomic Prediction’s Stephen Hsu: Making superhumans will be possible
The Sunday Times’ tech correspondent Danny Fortson brings on Stephen Hsu, co-founder of Genomic Prediction, to talk about the plummeting price of genomic sequencing (5:00), predicting height and cancer (9:10), mining biobanks (14:25), scoring embryos (19:00), why investors are staying anonymous (28:00), the need for a society-wide discussion (32:30), when he was accused of being a eugenicist (37:25), how powerful genetic prediction can be (43:15), genetic engineering (49:45), and why Denmark is the future (59:30).
Thursday, February 24, 2022
ManifoldOne Podcast Episode #5: Shai Carmi (Hebrew University): Polygenic risk scores & embryo screening
Thursday, February 03, 2022
ManifoldOne podcast Episode#2: Steve Hsu Q&A
Thursday, January 06, 2022
BOLA2 Copy Number Variation: Phenotype Effects From A Human Accelerated Region
Allen Institute: ... of the known 3,171 human accelerated regions, 99 percent of these human-specific mutations fall into "non-coding" regions of DNA, or regions of DNA that don't contain instructions for making a protein. Many of them are in stretches of our genome known as enhancers, regions which regulate nearby genes, and about half of those are nestled in enhancers that are active in the developing human brain.Our analysis of DNA regions used in predictors for common diseases and complex human traits found that large portions of phenotype variance reside in non-coding regions. This has important consequences for pleiotropy and for our understanding of genetic architecture.
This figure is from the Supplement (p.62) of a recent Nature paper describing a high quality genome sequence obtained from the toe of a female Neanderthal who lived in the Altai mountains in Siberia. Interestingly, copy number variation at 16p11.2 is one of the structural variants identified in a recent deCODE study as related to IQ depression; see earlier post Structural genomic variants (CNVs) affect cognition.
From the Supplement (p.62):
Of particular interest is the modern human-specific duplication on 16p11.2 which encompasses the BOLA2 gene. This locus is the breakpoint of the 16p11.2 micro-deletion, which results in developmental delay, intellectual disability, and autism5,6. We genotyped the BOLA2 gene in 675 diverse human individuals sequenced to low coverage as part of the 1000 Genome Project Phase I7 to assess the population distribution of copy numbers in homo-sapiens (Figure S8.3). While both the Altai Neandertal and Denisova individual exhibit the ancestral diploid copy number as seen in all the non-human great apes, only a single human individual exhibits this diploid copy number state.
Phenome-wide Burden of Copy-Number Variation in the UK Biobank (2019)
16p11.2 C deletion: "We observe significant increases, on the order of one standard deviation, in weight, BMI, hip and waist circumference, reticulocyte count, and Cystatin C measures for these individuals. The larger 593 kb CNV associates with similar measures of body size and fat, as well as hypertension, diabetes/HbA1c, and abdominal hernia. These results are also indicative of effects due to developmental delay; namely, decreased measures of memory, higher Townsend deprivation (an index of material deprivation which considers employment, home/auto ownership, and household overcrowding in a person's neighborhood) ..."
Note the effect sizes, e.g., on Townsend deprivation index, are extremely large, roughly 1 SD. The effect size for Prospective Memory score (related to ability to read, remember, and execute directions) is 2 SD!
Medical consequences of pathogenic CNVs in adults: analysis of the UK Biobank (2019)
Population percentage in parenthesis:
See also:
The Human-Specific BOLA2 Duplication Modifies Iron Homeostasis and Anemia Predisposition in Chromosome 16p11.2 Autism Individuals (2019)
Quantifying the Effects of 16p11.2 Copy Number Variants on Brain Structure: A Multisite Genetic-First Study (2018)
Monday, October 18, 2021
Embryo Screening and Risk Calculus
Carmi et al. find a ~50% risk reduction for schizophrenia from selecting the lowest risk embryo from a set of 5. For a selection among 2 embryos the risk reduction is ~30%. (We obtain a very similar result using empirical data: real adult siblings with known phenotype.)
Visscher et al. find the following results, see Table 1 and Figure 2 in their paper. To their credit they compute results for a range of ancestries (European, E. Asian, African). We have performed similar calculations using siblings but have not yet published the results for all ancestries.
Relative Risk Reduction (RRR):
Hypertension: 9-18% (ranges depend on specific ancestry)
Type 2 Diabetes: 7-16%
Coronary Artery Disease: 8-17%
Absolute Risk Reduction (ARR):
Hypertension: 4-8.5% (ranges depend on specific ancestry)
Type 2 Diabetes: 2.6-5.5%
Coronary Artery Disease: 0.55-1.1%
Saturday, October 09, 2021
Leo Szilard, the Intellectual Bumblebee (lecture by William Lanouette)
Wikipedia: Leo Szilard was a Hungarian-American physicist and inventor. He conceived the nuclear chain reaction in 1933, patented the idea of a nuclear fission reactor in 1934, and in late 1939 wrote the letter for Albert Einstein's signature that resulted in the Manhattan Project that built the atomic bomb.How Alexander Sachs, acting on behalf of Szilard and Einstein, narrowly convinced FDR to initiate the atomic bomb project: Contingency, History, and the Atomic Bomb.
I was just about to lock the door of my hotel room and go to bed when there was a knock on the door and there stood a Russian officer and a young Russian civilian. I had expected something of this sort ever since the President signed the terms of unconditional surrender and the Russians landed a token occupation force in New York. The officer handed me something that looked like a warrant and said that I was under arrest as a war criminal on the basis of my activities during the Second World War in connection with the atomic bomb. There was a car waiting outside and they told me that they were going to take me to the Brookhaven National Laboratory on Long Island. Apparently, they were rounding up all the scientists who had ever worked in the field of atomic energy ...This story was translated into Russian and it had a large impact on Andrei Sakharov, who showed it to his colleague Victor Adamsky:
A number of us discussed it. It was about a war between the USSR and the USA, a very devastating one, which brought victory to the USSR. Szilard and a number of other physicists are put under arrest and then face the court as war criminals for having created weapons of mass destruction. Neither they nor their lawyers could make up a cogent proof of their innocence. We were amazed by this paradox. You can’t get away from the fact that we were developing weapons of mass destruction. We thought it was necessary. Such was our inner conviction. But still the moral aspect of it would not let Andrei Dmitrievich and some of us live in peace.
Ted Puck was a distinguished biologist, originally trained in physics. ‘With the greatest possible reluctance I have come to the conclusion that it is not possible for me personally to work with you scientifically,’ he wrote Szilard. ‘Your mind is so much more powerful than mine that I find it impossible when I am with you to resist the tremendous polarizing forces of your ideas and outlook.’ Puck feared his ‘own flow of ideas would slow up & productivity suffer if we were to become continuously associated working in the same place and the same general kind of field.’ Puck said, ‘There is no living scientist whose intellect I respect more. But your tremendous intellectual force is a strain on a limited person like myself.’Puck was a pioneer in single cell cloning, aided in part by Szilard:
When Szilard saw in 1954 that biologists Philip Marcus and Theodore Puck were having trouble growing individual cells into colonies, he concluded that “since cells grow with high efficiency when they have many neighbors, you should not let a single cell know it’s alone”. This was no flippant excursion into psychobiology. Rather, Szilard’s idea to use a layered feeder dish worked, while the open dish had not (Lanouette, 1992: 396–397).
“I have … recorded” in my Nobel lecture, said Monod, “how it was Szilard who decisively reconciled me with the idea (repulsive to me, until then) that enzyme induction reflected an anti-repressive effect, rather than the reverse, as I tried, unduly, to stick to.”
Thursday, July 22, 2021
Embryo Screening for Polygenic Disease Risk: Recent Advances and Ethical Considerations (Genes 2021 Special Issue)
Embryo Screening for Polygenic Disease Risk: Recent Advances and Ethical Considerations
L. Tellier, J. Eccles, L. Lello, N. Treff, S. Fishel, S. Hsu
Genes 2021, 12(8), 1105
https://doi.org/10.3390/genes12081105
Machine learning methods applied to large genomic datasets (such as those used in GWAS) have led to the creation of polygenic risk scores (PRSs) that can be used identify individuals who are at highly elevated risk for important disease conditions, such as coronary artery disease (CAD), diabetes, hypertension, breast cancer, and many more. PRSs have been validated in large population groups across multiple continents and are under evaluation for widespread clinical use in adult health. It has been shown that PRSs can be used to identify which of two individuals is at a lower disease risk, even when these two individuals are siblings from a shared family environment. The relative risk reduction (RRR) from choosing an embryo with a lower PRS (with respect to one chosen at random) can be quantified by using these sibling results. New technology for precise embryo genotyping allows more sophisticated preimplantation ranking with better results than the current method of selection that is based on morphology. We review the advances described above and discuss related ethical considerations.I excerpt from the paper below.
Over a million babies are born each year via IVF [1,2]. It is not uncommon for IVF parents to have more than one viable embryo from which to choose, as typical IVF cycles can produce four or five. The embryo that is transferred may become their child, while the others might not be used at all. We refer to this selection problem as the “embryo choice problem”. In the past, selections were made based on criteria such as morphology (i.e., rate of development, symmetry, general appearance) and chromosomal normality as determined by aneuploidy testing.
Recently, large datasets of human genomes together with health and disease histories have become available to researchers in computational genomics [3]. Statistical methods from machine learning have allowed researchers to build risk predictors (e.g., for specific disease conditions or related quantitative traits, such as height or longevity) that use the genotype alone as input information. Combined with the precision genotyping of embryos, these advances provide significantly more information that can be used for embryo selection to IVF parents.
In this brief article, we provide an overview of the advances in genotyping and computational genomics that have been applied to embryo selection. We also discuss related ethical issues, although a full discussion of these would require a much longer paper. ...
Ethical considerations:
For further clarification, we explore a specific scenario involving breast cancer. It is well known that monogenic BRCA1 and BRCA2 variants predispose women to breast cancer, but this population is small—perhaps a few per thousand in the general population. The subset of women who do not carry a BRCA1 or BRCA2 risk variant but are at high polygenic risk is about ten times as large as the BRCA1/2 group. Thus, the majority of breast cancer can be traced to polygenic causes in comparison with commonly tested monogenic variants.
For BRCA carrier families, preimplantation screening against BRCA is a standard (and largely uncontroversial) recommendation [39]. The new technologies discussed here allow a similar course of action for the much larger set of families with breast cancer history who are not carriers of BRCA1 or BRCA2. They can screen their embryos in favor of a daughter whose breast cancer PRS is in the normal range, avoiding a potentially much higher absolute risk of the condition.
The main difference between monogenic BRCA screening and the new PRS screening against breast cancer is that the latter technology can help an order of magnitude more families. From an ethical perspective, it would be unconscionable to deny PRS screening to BRCA1/2-negative families with a history of breast cancer. ...
On Simon Fishel's experiences as an IVF pioneer (see here):
Today millions of babies are produced through IVF. In most developed countries roughly 3-5 percent of all births are through IVF, and in Denmark the fraction is about 10 percent! But when the technology was first introduced with the birth of Louise Brown in 1978, the pioneering scientists had to overcome significant resistance. There may be an alternate universe in which IVF was not allowed to develop, and those millions of children were never born.
Wikipedia: ...During these controversial early years of IVF, Fishel and his colleagues received extensive opposition from critics both outside of and within the medical and scientific communities, including a civil writ for murder.[16] Fishel has since stated that "the whole establishment was outraged" by their early work and that people thought that he was "potentially a mad scientist".[17]
I predict that within 5 years the use of polygenic risk scores will become common in some health systems (i.e., for adults) and in IVF. Reasonable people will wonder why the technology was ever controversial at all, just as in the case of IVF.
Figure below from our paper. EHS = Embryo Health Score.
Tuesday, June 29, 2021
Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank (published version)
( biomarkers | SNPs ) and ( disease risk | biomarkers )
Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank
Erik Widen 1,*,Timothy G. Raben 1, Louis Lello 1,2,* and Stephen D. H. Hsu 1,2
1 Department of Physics and Astronomy, Michigan State University, 567 Wilson Rd, East Lansing, MI 48824, USA
2 Genomic Prediction, Inc., 675 US Highway One, North Brunswick, NJ 08902, USA
*Authors to whom correspondence should be addressed.
Academic Editor: Sulev Koks
Genes 2021, 12(7), 991; https://doi.org/10.3390/genes12070991 (registering DOI)
Received: 30 March 2021 / Revised: 22 June 2021 / Accepted: 23 June 2021 / Published: 29 June 2021
(This article belongs to the Special Issue Application of Genomic Technology in Disease Outcome Prediction)
Abstract
We use UK Biobank data to train predictors for 65 blood and urine markers such as HDL, LDL, lipoprotein A, glycated haemoglobin, etc. from SNP genotype. For example, our Polygenic Score (PGS) predictor correlates ∼0.76 with lipoprotein A level, which is highly heritable and an independent risk factor for heart disease. This may be the most accurate genomic prediction of a quantitative trait that has yet been produced (specifically, for European ancestry groups). We also train predictors of common disease risk using blood and urine biomarkers alone (no DNA information); we call these predictors biomarker risk scores, BMRS. Individuals who are at high risk (e.g., odds ratio of >5× population average) can be identified for conditions such as coronary artery disease (AUC∼0.75), diabetes (AUC∼0.95), hypertension, liver and kidney problems, and cancer using biomarkers alone. Our atherosclerotic cardiovascular disease (ASCVD) predictor uses ∼10 biomarkers and performs in UKB evaluation as well as or better than the American College of Cardiology ASCVD Risk Estimator, which uses quite different inputs (age, diagnostic history, BMI, smoking status, statin usage, etc.). We compare polygenic risk scores (risk conditional on genotype: PRS) for common diseases to the risk predictors which result from the concatenation of learned functions BMRS and PGS, i.e., applying the BMRS predictors to the PGS output.
Monday, April 05, 2021
Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank
Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank
Erik Widen, Timothy G. Raben, Louis Lello, Stephen D.H. Hsu
doi: https://doi.org/10.1101/2021.04.01.21254711
We use UK Biobank data to train predictors for 48 blood and urine markers such as HDL, LDL, lipoprotein A, glycated haemoglobin, ... from SNP genotype. For example, our predictor correlates ~ 0.76 with lipoprotein A level, which is highly heritable and an independent risk factor for heart disease. This may be the most accurate genomic prediction of a quantitative trait that has yet been produced (specifically, for European ancestry groups). We also train predictors of common disease risk using blood and urine biomarkers alone (no DNA information). Individuals who are at high risk (e.g., odds ratio of > 5x population average) can be identified for conditions such as coronary artery disease (AUC ~ 0.75), diabetes (AUC ~ 0.95), hypertension, liver and kidney problems, and cancer using biomarkers alone. Our atherosclerotic cardiovascular disease (ASCVD) predictor uses ~10 biomarkers and performs in UKB evaluation as well as or better than the American College of Cardiology ASCVD Risk Estimator, which uses quite different inputs (age, diagnostic history, BMI, smoking status, statin usage, etc.). We compare polygenic risk scores (risk conditional on genotype: (risk score | SNPs)) for common diseases to the risk predictors which result from the concatenation of learned functions (risk score | biomarkers) and (biomarker | SNPs).
Blog Archive
Labels
- physics (420)
- genetics (325)
- globalization (301)
- genomics (295)
- technology (282)
- brainpower (280)
- finance (275)
- american society (261)
- China (249)
- innovation (231)
- ai (206)
- economics (202)
- psychometrics (190)
- science (172)
- psychology (169)
- machine learning (166)
- biology (163)
- photos (162)
- genetic engineering (150)
- universities (150)
- travel (144)
- podcasts (143)
- higher education (141)
- startups (139)
- human capital (127)
- geopolitics (124)
- credit crisis (115)
- political correctness (108)
- iq (107)
- quantum mechanics (107)
- cognitive science (103)
- autobiographical (97)
- politics (93)
- careers (90)
- bounded rationality (88)
- social science (86)
- history of science (85)
- realpolitik (85)
- statistics (83)
- elitism (81)
- talks (80)
- evolution (79)
- credit crunch (78)
- biotech (76)
- genius (76)
- gilded age (73)
- income inequality (73)
- caltech (68)
- books (64)
- academia (62)
- history (61)
- intellectual history (61)
- MSU (60)
- sci fi (60)
- harvard (58)
- silicon valley (58)
- mma (57)
- mathematics (55)
- education (53)
- video (52)
- kids (51)
- bgi (48)
- black holes (48)
- cdo (45)
- derivatives (43)
- neuroscience (43)
- affirmative action (42)
- behavioral economics (42)
- economic history (42)
- literature (42)
- nuclear weapons (42)
- computing (41)
- jiujitsu (41)
- physical training (40)
- film (39)
- many worlds (39)
- quantum field theory (39)
- expert prediction (37)
- ufc (37)
- bjj (36)
- bubbles (36)
- mortgages (36)
- google (35)
- race relations (35)
- hedge funds (34)
- security (34)
- von Neumann (34)
- meritocracy (31)
- feynman (30)
- quants (30)
- taiwan (30)
- efficient markets (29)
- foo camp (29)
- movies (29)
- sports (29)
- music (28)
- singularity (27)
- entrepreneurs (26)
- conferences (25)
- housing (25)
- obama (25)
- subprime (25)
- venture capital (25)
- berkeley (24)
- epidemics (24)
- war (24)
- wall street (23)
- athletics (22)
- russia (22)
- ultimate fighting (22)
- cds (20)
- internet (20)
- new yorker (20)
- blogging (19)
- japan (19)
- scifoo (19)
- christmas (18)
- dna (18)
- gender (18)
- goldman sachs (18)
- university of oregon (18)
- cold war (17)
- cryptography (17)
- freeman dyson (17)
- smpy (17)
- treasury bailout (17)
- algorithms (16)
- autism (16)
- personality (16)
- privacy (16)
- Fermi problems (15)
- cosmology (15)
- happiness (15)
- height (15)
- india (15)
- oppenheimer (15)
- probability (15)
- social networks (15)
- wwii (15)
- fitness (14)
- government (14)
- les grandes ecoles (14)
- neanderthals (14)
- quantum computers (14)
- blade runner (13)
- chess (13)
- hedonic treadmill (13)
- nsa (13)
- philosophy of mind (13)
- research (13)
- aspergers (12)
- climate change (12)
- harvard society of fellows (12)
- malcolm gladwell (12)
- net worth (12)
- nobel prize (12)
- pseudoscience (12)
- Einstein (11)
- art (11)
- democracy (11)
- entropy (11)
- geeks (11)
- string theory (11)
- television (11)
- Go (10)
- ability (10)
- complexity (10)
- dating (10)
- energy (10)
- football (10)
- france (10)
- italy (10)
- mutants (10)
- nerds (10)
- olympics (10)
- pop culture (10)
- crossfit (9)
- encryption (9)
- eugene (9)
- flynn effect (9)
- james salter (9)
- simulation (9)
- tail risk (9)
- turing test (9)
- alan turing (8)
- alpha (8)
- ashkenazim (8)
- data mining (8)
- determinism (8)
- environmentalism (8)
- games (8)
- keynes (8)
- manhattan (8)
- new york times (8)
- pca (8)
- philip k. dick (8)
- qcd (8)
- real estate (8)
- robot genius (8)
- success (8)
- usain bolt (8)
- Iran (7)
- aig (7)
- basketball (7)
- free will (7)
- fx (7)
- game theory (7)
- hugh everett (7)
- inequality (7)
- information theory (7)
- iraq war (7)
- markets (7)
- paris (7)
- patents (7)
- poker (7)
- teaching (7)
- vietnam war (7)
- volatility (7)
- anthropic principle (6)
- bayes (6)
- class (6)
- drones (6)
- econtalk (6)
- empire (6)
- global warming (6)
- godel (6)
- intellectual property (6)
- nassim taleb (6)
- noam chomsky (6)
- prostitution (6)
- rationality (6)
- academia sinica (5)
- bobby fischer (5)
- demographics (5)
- fake alpha (5)
- kasparov (5)
- luck (5)
- nonlinearity (5)
- perimeter institute (5)
- renaissance technologies (5)
- sad but true (5)
- software development (5)
- solar energy (5)
- warren buffet (5)
- 100m (4)
- Poincare (4)
- assortative mating (4)
- bill gates (4)
- borges (4)
- cambridge uk (4)
- censorship (4)
- charles darwin (4)
- computers (4)
- creativity (4)
- hormones (4)
- humor (4)
- judo (4)
- kerviel (4)
- microsoft (4)
- mixed martial arts (4)
- monsters (4)
- moore's law (4)
- soros (4)
- supercomputers (4)
- trento (4)
- 200m (3)
- babies (3)
- brain drain (3)
- charlie munger (3)
- cheng ting hsu (3)
- chet baker (3)
- correlation (3)
- ecosystems (3)
- equity risk premium (3)
- facebook (3)
- fannie (3)
- feminism (3)
- fst (3)
- intellectual ventures (3)
- jim simons (3)
- language (3)
- lee kwan yew (3)
- lewontin fallacy (3)
- lhc (3)
- magic (3)
- michael lewis (3)
- mit (3)
- nathan myhrvold (3)
- neal stephenson (3)
- olympiads (3)
- path integrals (3)
- risk preference (3)
- search (3)
- sec (3)
- sivs (3)
- society generale (3)
- systemic risk (3)
- thailand (3)
- twitter (3)
- alibaba (2)
- bear stearns (2)
- bruce springsteen (2)
- charles babbage (2)
- cloning (2)
- david mamet (2)
- digital books (2)
- donald mackenzie (2)
- drugs (2)
- dune (2)
- exchange rates (2)
- frauds (2)
- freddie (2)
- gaussian copula (2)
- heinlein (2)
- industrial revolution (2)
- james watson (2)
- ltcm (2)
- mating (2)
- mba (2)
- mccain (2)
- monkeys (2)
- national character (2)
- nicholas metropolis (2)
- no holds barred (2)
- offices (2)
- oligarchs (2)
- palin (2)
- population structure (2)
- prisoner's dilemma (2)
- singapore (2)
- skidelsky (2)
- socgen (2)
- sprints (2)
- star wars (2)
- ussr (2)
- variance (2)
- virtual reality (2)
- war nerd (2)
- abx (1)
- anathem (1)
- andrew lo (1)
- antikythera mechanism (1)
- athens (1)
- atlas shrugged (1)
- ayn rand (1)
- bay area (1)
- beats (1)
- book search (1)
- bunnie huang (1)
- car dealers (1)
- carlos slim (1)
- catastrophe bonds (1)
- cdos (1)
- ces 2008 (1)
- chance (1)
- children (1)
- cochran-harpending (1)
- cpi (1)
- david x. li (1)
- dick cavett (1)
- dolomites (1)
- eharmony (1)
- eliot spitzer (1)
- escorts (1)
- faces (1)
- fads (1)
- favorite posts (1)
- fiber optic cable (1)
- francis crick (1)
- gary brecher (1)
- gizmos (1)
- greece (1)
- greenspan (1)
- hypocrisy (1)
- igon value (1)
- iit (1)
- inflation (1)
- information asymmetry (1)
- iphone (1)
- jack kerouac (1)
- jaynes (1)
- jazz (1)
- jfk (1)
- john dolan (1)
- john kerry (1)
- john paulson (1)
- john searle (1)
- john tierney (1)
- jonathan littell (1)
- las vegas (1)
- lawyers (1)
- lehman auction (1)
- les bienveillantes (1)
- lowell wood (1)
- lse (1)
- machine (1)
- mcgeorge bundy (1)
- mexico (1)
- michael jackson (1)
- mickey rourke (1)
- migration (1)
- money:tech (1)
- myron scholes (1)
- netwon institute (1)
- networks (1)
- newton institute (1)
- nfl (1)
- oliver stone (1)
- phil gramm (1)
- philanthropy (1)
- philip greenspun (1)
- portfolio theory (1)
- power laws (1)
- pyschology (1)
- randomness (1)
- recession (1)
- sales (1)
- skype (1)
- standard deviation (1)
- starship troopers (1)
- students today (1)
- teleportation (1)
- tierney lab blog (1)
- tomonaga (1)
- tyler cowen (1)
- venice (1)
- violence (1)
- virtual meetings (1)
- wealth effect (1)


























