Would you like a latte of common variants or an expresso shot of rare variants?
A preprint on the association of Alzheimer’s PRS with cognitive functions in the UK biobank has some interesting results. It seems that the effect sizes of the association of Alzheimer’s PRS with certain cognitive functions such as fluid intelligence and matrix completion become larger after adjusting for APOE-e4 allele status (albeit with larger standard errors, I assume). Although these differences were not discussed in the paper, I think this is interesting, and it suggests that the polygenic effect of the Alzheimer’s PRS get attenuated when other strong sources of variations such as APOE-e4 allele exist. I have seen this in my own work. In an analysis in the iPSYCH cohort, I found that the association of educational attainment polygenic score with school performance is weaker in individuals with ADHD and ASD compared to individuals from the general population, and I reasoned that this might be due to that many of the ADHD and ASD individuals included in my analysis might have large effect rare variants (e.g. deletion CNVs or rare protein truncating variants) that might have attenuated the polygenic effect of the common variants. A similar example I came across recently: heritability of cardiomyopathy is higher in individuals without family history compared to individuals with family history, and this might be due to that the family history here serves as a proxy for large effect mutations.
Another way to look at these results is through the lens of the liability threshold model. According to this model, individuals with rare variants carry significantly lower polygenic risk score than individuals without rare variants (as has been shown in many studies, e.g. schizophrenia). If you take the cardiomyopathy example above, it might be that the common variant heritability in family history positive individuals is low because such individuals might carry less number of common risk variants (under the liability threshold model).
Either way, it’s clear that there exists a negative relationship among cases between common and rare variants (yes, I can hear you. It’s not a true negative relationship, it’s just a collider bias). This negative relationship can be leveraged to identify rare variants. For example, you have a cohort of 100,000 individuals who were all genotyped; you’d like to sequence a subset of the cohort, say 20,000 individuals, to identify rare variants for a disease. You could construct polygenic risk score for the disease and pick up the ones who have low polygenic risk score, but still have the disease. Under the liability threshold model, these individuals have the disease despite low polygenic score because they hold other risk factors that increased their liability for the disease, one of which might be rare variants. This has been indeed recently demonstrated in the UK Biobank sample.













