Skip to main content

The physical and psychiatric health conditions related to autism genetic scores, across genetic ancestries, sexes and age-groups in electronic health records

Abstract

Background

Although polygenic scores (PGS) for autism have been related to various psychiatric and medical conditions, most studies to date have been conducted in research ascertained populations. We aimed to identify the psychiatric and physical conditions associated with autism PGS in a health care setting.

Methods

We computed PGS for 12,383 unrelated participants of African genetic ancestry (AF) and 65,363 unrelated participants of European genetic ancestry (EU) from Vanderbilt’s de-identified biobank. Next, we performed phenome wide association studies of the autism PGS within these two genetic ancestries.

Results

Seven associations surpassed the Bonferroni adjusted threshold for statistical significance (p = 0.05/1374 = 3.6 × 10−5) in the EU participants, including mood disorders (OR (95%CI) = 1.08(1.05 to 1.10), p = 1.0 × 10−10), autism (OR (95%CI) = 1.34(1.24 to 1.43), p = 1.2 × 10–9), and breast cancer (OR (95%CI) = 1.09(1.05 to 1.14), 2.6 × 10−5). There was no statistical evidence for PGS-phenotype associations in the AF participants. Conditioning on the diagnosis of autism or on median body mass index (BMI) did not impact the strength of the reported associations. Although we observed some sex differences in the pattern of associations, there was no significant interaction between sex and autism PGS. Finally, the associations between autism PGS and autism diagnosis were stronger in childhood and adolescence, while the associations with mood disorders and breast cancer were stronger in adulthood.

Discussion

Our findings indicate that autism PGS is not only related to autism diagnosis but may also be related to adult-onset conditions, including mood disorders and some cancers.

Conclusions

Our study raises the hypothesis that genes associated with autism may also increase the risk for cancers later in life. Future studies are necessary to replicate and extend our findings.

Background

Autism is a heterogeneous and complex neurodevelopmental diagnosis that is characterized by difficulty with certain social interactions, restricted interests and/or repetitive behaviors, as well as features that affect school and work performance and may impact other areas of life. While language delay or difficulty is often a feature of autism, there is marked variation, in language and cognitive development, as well as variation in co-occurring psychiatric and medical conditions [26].

Co-occurring conditions are estimated to affect at least 70% of people with autism [20]. A recent meta-analysis provided pooled prevalence estimates of 28% for attention deficit hyperactivity disorder, 20% for anxiety disorders, 13% for sleep related disorders, 11% for depressive disorders, 9% for obsessive–compulsive disorder and 5% for bipolar disorders and 4% for schizophrenia spectrum disorders [20]. Other commonly reported medical conditions include infections, obesity, neurologic, and immunologic conditions [8, 10]. Recent reports suggest that long diagnostic delays may be accompanied by “masking” or “camouflaging”, which can have detrimental effects on lifetime mental health [1, 2]. Importantly, genetic signatures of autism can be leveraged to disentangle the relationship between biologically mediated co-occurring conditions and those that co-occur as a consequence of the common experiences of autistic people.

Genome-Wide Association studies (GWAS) demonstrate that autism is polygenic, i.e., it is associated with thousands of genetic variants of small effect [13]. The effect of these polymorphisms can be summed to result in a cumulative polygenic score for autism. Polygenic scores do not map perfectly to autism diagnoses. Autism as a clinical condition results from a complex interplay of genetic and non-genetic factors and receiving a diagnosis of autism also requires access to appropriate expertise. PGS for autism capture only part of the common variant genetic contribution to autism [15]. Some studies have shown that autism is not only phenotypically, but also genetically correlated with other neurodevelopmental conditions, including ADHD, major depression, and positively correlated with intelligence and educational attainment [3, 5, 13]. Thus, PGS are a useful tool to investigate whether and how genes associated with autism are also related to other psychiatric and medical conditions.

The majority of studies using PGS, however, have only tested the associations between autism PGS and co-occurring conditions in volunteer or research ascertained settings, rather than in health care settings. For example, a study in 334,976 volunteer participants from the UK biobank tested the associations between autism PGS and a number of general health and mental health characteristics and found associations with eight general health related outcomes and one mental health related outcome [22]. However, this study did not examine a wide variety of clinical diagnoses, nor did it examine how associations may change in different age groups, or across genetic ancestries.

The wealth of clinical information in Electronic Health Records (EHR) linked with genetic data provides the ability to conduct phenome-wide association scans (pheWAS) across a wide range of psychiatric and medical conditions. Using Vanderbilt’s biobank (BioVU) that is linked to de-identified medical records, we conducted a pheWAS to identify the physical and mental health conditions that are associated with autism PGS across genetic ancestries, sexes, and age-groups.

Methods

Vanderbilt University Medical Center EHR

The project was approved by Vanderbilt’s Institutional Review Board (IRB) (#190418). The phenotypic data originated from the synthetic derivative (SD), a de-identified clinical data repository of the EHR at VUMC, that includes records of over 3.2 million patients, and is updated regularly. The data includes billing codes from the International Classification of Diseases, 9th and 10th editions (ICD-9 and ICD-10), laboratory values, medication history, Current Procedural Terminology (CPT) codes, and clinical reports. BioVU is the associated biobank that accrues DNA samples from more than 280,000 of these patients [28].

Genotyping data

The genotype data for a total of 94,474 samples were imputed using the Michigan Imputation server [9] and the Haplotype Reference Consortium (HRC) panel. The data were filtered to biallelic Single Nucleotide Polymorphisms (SNPs) only, and filtered to include SNPs with an R2 > = 0.3, and minor allele frequency > = 0.005. SNPs with a Hardy–Weinberg Equilibrium P value < 10e−10 were removed. To assign primarily European genetic ancestries as well as African ancestries, principal components analysis (PCA) was performed including all samples and the 1000 genomes reference populations. Details on ancestry assignment, imputation, and further quality control procedures are described elsewhere [11]. Cryptic relatedness was addressed by removing one individual of each pair for which pihat > 0.2. The final samples included 12,383 unrelated participants (median age of record (mean(SD) = 39.2(21.2), 61%females) of African (AF) ancestries and 65,363 unrelated participants of European (EU) ancestries (median age of record (mean(SD) = 48.4(22.2), 55.6% females) (also see Table 1).

Table 1 Basic demographics of the samples

Autism polygenic score

We constructed PGS based on the latest publicly available autism GWAS summary statistics [13]. The autism GWAS (training set) included 18,381 cases and 27,929 controls. The PGS per BioVU participant was computed using a continuous shrinkage prior (CS) to SNP effect sizes using the PRS_CS software [12] for participants of EU ancestries and the PRS_CSx [29] for participants of AF ancestries.

Statistical analyses

The PGS were Z-score standardized with a mean of 0 and a standard deviation of 1. We ran two ancestry stratified PheWAS, one in the participants of EU genetic ancestries and and one in the participants of AF genetic ancestries. Outcomes in the PheWAS are based on phecodes that are comprised of ICD-codes grouped based on similarity [6]. The PheWAS analyses included 648 phecodes for the AF ancestry participants and 1374 phecodes for the EU ancestry participants with a minimum of 100 case patients. Individuals are assigned as a “case” for each phecode outcome if there are at least two instances of a corresponding ICD code map v1.2 [30] within their health record. Controls are assigned by the absence of the case defining ICD codes. Fewer phecodes met the minimum case number for inclusion in the AF ancestry population due to the smaller total number of individuals. PheWAS covariates included sex (defined as sex reported in the EHR), median age for an individual’s the medical record, current age (to control for cohort effects), and top ten principal components generated from genotype data (to control for population stratification). The p value threshold for statistical significance was Bonferroni corrected for multiple testing based on the number of tests in each PheWAS (i.e., 0.05/648 tests in the AF genetic ancestry = 7.7 × 10−5, and 0.05/1374 tests in the EU ancestry = 3.6 × 10−5). There were no statistically significant associations in the AF genetic ancestry, and therefore, no further analyses were conducted. Two conditional analyses were performed in which we further co-varied for (1) median BMI and (2) autism diagnosis in the EU ancestry.

We also conducted a sex stratified PheWAS in the EU ancestry sample. We then tested for significant differences in effect estimates between males and females for the phenotypes with evidence of a main effect in either sex, using an interaction term applied to the PheWAS model (Sex * PGS).

As an exploratory analysis, we tested associations of the autism PGS across different age groups. The age bin selection was based approximate life stages described in a previous study [25]. Specifically, the selected age (in years) bins were: 0 to 11 (children), 12 to 18 (teens), 19 to 25 (young adults, and college-age students), 26 to 40 (adults), 41 to 60 (middle aged adults), and 61 to 100 (older adults). Each PheWAS was adjusted for sex, median EHR age, current age and 10 PCs.

Results

African genetic ancestry population

PheWAS

No associations passed the Bonferroni significance threshold (p = 0.05/648 tests = 7.7 × 10−5). Due to the lack of significant associations, no further analyses were performed (Table 2, Fig. 1, Supplementary Table S1).

Table 2 Summary of PheWAS results
Fig. 1
figure 1

Manhattan plot of pheWAS results for the autism PGS in individuals of African genetic ancestries. The y-axis shows the -log10 transformed p values multiplied by the direction of effect. The dots represent the phecodes which are grouped along the X axis by phecode category. Size of the dot corresponds to the effect size. The colors of the dots indicate the phecode categories, which are explained in the figure. The dotted red line signifies the Bonferroni-corrected threshold for statistical significance

European genetic ancestry population

PheWAS

Seven associations surpassed the Bonferroni significance threshold (p = 0.05/1374 = 3.6 × 10−5), including mood disorders (OR (95%CI) = 1.08(1.05 to 1.10), p = 1.0 × 10−10), autism (OR (95%CI) = 1.34(1.24 to 1.43), p = 1.2 × 10−9), breast cancer (OR (95%CI) = 1.09(1.05 to 1.14), 2.6 × 10−5), depression (OR (95%CI) = 1.07(1.04 to 1.09), p = 3.1 × 10−8, pervasive developmental disorders (OR (95%CI) = 1.15(1.10 to 1.20), p = 2.7 × 10−7), malignant neoplasm of female breast (OR (95%CI) = 1.10(1.05 to 1.14), p = 2.1 × 10−5), and breast cancer [female] (OR (95%CI) = 1.09(1.05 to 1.14), p = 2.6 × 10−5) (Table 2, Fig. 2, Supplementary Table S2). The association with autism yielded the largest effect size, followed by pervasive developmental disorders (OR (95%CI) = 1.15 (1.10 to 1.20), 2.7 × 10−7).

Fig. 2
figure 2

Manhattan plot of autism PGS in individuals of European genetic ancestries. The y-axis shows the -log10 transformed p values multiplied by the direction of effect. The dots represent the phecodes which are grouped along the X axis by phecode category. Size of the dot corresponds to the effect size. The colors of the dots indicate the phecode categories, which are explained in the figure. The dotted red line signifies the Bonferroni-corrected threshold for statistical significance

Conditioning on BMI

When BMI was added as a covariate in the PheWAS, all previously significant associations remained largely unchanged (Supplementary Table S3).

Conditioning on autism diagnosis

When autism was added as a covariate in the PheWAS, all previously significant associations remained largely unchanged, apart from the associations with autism and pervasive developmental disorders which, as expected, were no longer significant (Supplementary Table 4).

Sex-stratified PheWAS

Autism PGS was associated with autism in males (OR (95%CI) = 1.28(1.17 to 1.38), p = 6.9 × 10−6) (Table 3, Supplementary Table 5, Fig. 3) and females (OR (95%CI) = 1.56(1.36 to 1.75), p = 9,4 × 10−6) (Table 3, Supplementary Table 6, Fig. 4). In females, autism PGS was also associated with mood disorders (OR (95%CI) = 1.09 (1.06 to 1.12), p = 9.3 × 10−10), depression (OR (95%CI) = 1.09 (1.06 to 1.12), p = 3.8 × 10−8), malignant neoplasm of female breast (OR (95%CI) = 1.10 (1.05 to 1.14), p = 2.23 × 10−5), and breast cancer (OR (95%CI) = 1.09 (1.05 to 1.14), p = 2.7 × 10−5). The sex-interaction test indicated that there was no statistical evidence of differences between the sexes once the baseline prevalence of the condition in the sample was taken into account (Table 3, Supplementary Table S7). However, we do observe that the ORs for autism PGS are notably higher in females with an autism diagnosis (OR = 1.56) compared to males with an autism diagnosis (OR = 1.28) suggesting that autistic females may have to reach a higher threshold of genetic likelihood before they come to clinical attention.

Table 3 Sex-stratified results in the European genetic ancestry population
Fig. 3
figure 3

Manhattan plot of autism PGS in males of European genetic ancestries. The y-axis shows the -log10 transformed p-values multiplied by the direction of effect. The dots represent the phecodes which are grouped along the X axis by phecode category. Size of the dot corresponds to the effect size. The colors of the dots indicate the phecode categories, which are explained in the figure. The dotted red line signifies the Bonferroni-corrected threshold for statistical significance

Fig. 4
figure 4

Manhattan plot of autism PGS in females of European genetic ancestries. The y-axis shows the -log10 transformed p-values multiplied by the direction of effect. The dots represent the phecodes which are grouped along the X-axis by phecode category. Size of the dot corresponds to the effect size. The colors of the dots indicate the phecode categories, which are explained in the figure. The dotted red line signifies the Bonferroni-corrected threshold for statistical significance

Age-stratified PheWAS

Birth–11-year age group

Autism PGS was associated with autism diagnosis in this age group (OR (95%CI) = 1.28 (1.16 to 1.40), p = 3.0 × 10−5) (Table 4, Supplementary Table 8). No other associations surpassed the Bonferroni significance threshold.

Table 4 PheWAS results by age-group (European genetic ancestry population)

12–18-year age group

Similar to the 0 to 11 age group, autism PGS was associated with autism (OR (95%CI) = 1.36 (1.22 to 1.50), p = 1.9 × 10−5) (Table 4, Supplementary Table S9). No other associations surpassed the Bonferroni significance threshold.

19 to 25-year age group

There was no evidence for associations in this age group (Table 4, Supplementary Table S10).

26 to 40-year age group

Autism PGS was associated with mood disorders (OR (95%CI) = 1.22 (1.07 to 1.17, p = 1.1 × 10−5), and depression (OR (95%CI) = 1.13 (1.07 to 1.19), p = 3.4 × 10−5) in this age group (Table 4, Supplementary Table S11).

41 to 60-year age group

Autism PGS was associated with mood disorders (OR (95%CI) = 1.09 (1.06 to 1.13, p = 7.7 × 10−8) and depression (OR (95%CI) = 1.09 (1.05 to 1.12), p = 2.8 × 10−6), while there was a negative association with electrolyte imbalance (OR (95%CI) = 0.93, 0.89 to 0.96), p = 2.8 × 10−5) (Table 4, Supplementary Table S12).

61 to 100-year age group

There was no evidence for associations in this age group (Table 4, Supplementary Table S13).

Discussion

Our study is the first to test the associations of autism PGS with psychiatric and medical conditions in an EHR context in a tertiary health care setting. There were seven medical diagnoses associated with autism PGS in the EU participants, including autism. These findings suggest that autism, as defined by ICD codes in the EHR, does reflect the genetic architecture of autism as defined in other independent samples. We did not estimate this association in the AF participants, as there were only 59 participants with an ICD diagnosis of autism in this sample (compared to 476 in the EU sample), and the minimum number we set for including case patients in the PheWAS is 100.

Autism PGS was also associated with mood disorders in the EHR, an association that is not surprising given that autism is both phenotypically and genetically correlated with depressive symptoms and major depression [13, 21]. Our study is the first to show that this association is not as strong in childhood and adolescence as it is in adulthood. This could be explained by the fact that the age of onset of mood disorders ranges from mid to late adolescence to the early 40 s, with the median being around the early to mid-20 s [19]. As a result of this, the sample size of mood disorders differs across the different age groups. For instance, the number of cases of mood disorders in adulthood are twice the number of cases in adolescence. Although women are twice as likely to be diagnosed with depression [4], most but not all the genetic background related to depression is shared across sexes [18]. Our results are consistent with this pattern (i.e., the association between the autism PGS and mood disorders is not different across sexes). Our results contrast with a previous autism PGS–PheWAS study in 334,976 participants from the UK biobank that did not find evidence for associations between autism PGS and depression. One limitation that the authors of that study noted is that the UK Biobank participants are less likely to be diagnosed with psychiatric disorders compared to the general UK population, and this may be a potential explanation the authors did not find associations between the autism PGS and any psychiatric-related outcome [22]. Importantly, the links with mood disorders remained significant even after adjusting for an autism diagnosis. While this observation is consistent with pleiotropy it is important to note that the adult diagnostic rate was low (0.7%) and undiagnosed autism may be a confounding factor. Indeed, prior studies find that adults with undiagnosed autism, and thus less access to accommodations and resources, are also more likely to engage in camouflaging [24] and that camouflaging increases risk for mood disorders and anxiety [17].

Autism PGS was associated with a slight increase in breast cancer risk in our study. While epidemiologically there is limited evidence of an association between autism and cancer [8], there is known genetic overlap between these two conditions [7]. For example, the tumor suppressor gene, the Phosphatase and Tensin Homolog (PTEN), has been implicated in both autism [14, 23] and a range of cancers, including breast cancer [31]. Despite the known links between BMI and autism [16] and BMI and breast cancer [27], the associations between autism PGS and breast cancer remained virtually unchanged after adjusting for BMI. Our findings indicate that genes involved in autism may also increase risk for breast cancer later in life. Few studies have focused on the medical conditions of adults with developmental disabilities including autism. Thus, additional research is needed to replicate and extend our findings.

One limitation of our study is the low sample size of the AF participants that likely affected our power. Similarly, sample size difference across age groups may also have reduced power and increased the type II error rate. Another limitation of our study is that our data comes from a single medical center, Vanderbilt University Medical Center, located in Nashville, Tennessee. Thus, the data included in our study is unlikely to be representative of the entire population. However, it should be noted that our study included a large and diverse sample of patients due to the broad catchment of the Vanderbilt Affiliated Health Network (VHAN) which includes outpatient clinics in Tennessee and eight bordering states. As with all research collections, there is a sampling bias. In this case, the sampling bias includes individuals who seek medical care at one or more Vanderbilt affiliated clinics. For example, individuals with reliable transportation, close proximity to VHAN clinics, and resources to cover the cost of medical care are more likely to be represented in the data. Importantly, the VHAN child and adolescent clinics accept both public and private health insurance which reduces the impact of insurance status on entry into the medical system for children. Nevertheless, there are limits to the generalizability of our findings because of the sampling strategy. Collider bias could result from selection of people into a health care setting. However, this is only likely to have a significant impact on the results shown here if autism polygenic scores are also correlated with the sample selection. To the best of our knowledge, our biobank is not enriched for individuals with a family history of autism, mood disorders and/or breast cancer, nor are individuals with these conditions more likely to participate in the biobank. Additionally, autism polygenic scores are not clinically assessed, which reduces the potential for PGS to influence health care, thus reducing the potential for collider bias. Lastly, with regards to ascertainment bias, it is possible that individuals with autism are more likely to be given certain additional diagnoses. For instance, individuals with autism are more likely to be coded with developmental delays and may be more likely to come to clinical attention for other behavioral or psychiatric conditions that are often diagnosed and documented in the EHR. However, it should be noted that we did not specifically restrict our sample to individuals with autism. Instead, we examined the autism polygenic score across the entire biobank, which again reduces the potential for ascertainment bias as the only ascertained variables included entry into the EHR and the biobank. Lastly, we also adjusted for potential confounding factors such as age, sex, and the first 10 principal components of genetic ancestry. Finally, future studies in other cohorts/datasets are needed to replicate our findings.

Conclusions

Our study is the first to examine the associations between autism PGS and psychiatric and medical conditions in the EHR of a tertiary care center. We replicated the association between autism PGS and autism, and also identified associations with mood disorders and breast cancer. Our findings indicate that genes involved in autism may also increase risk for breast cancer later in life. Future studies are essential to replicate our findings.

Availability of data and materials

Due to data sharing restrictions related to privacy concerns in the EHR, the datasets generated from our hospital population will not be publicly available; however, all scripts used in the study are available upon request.

References

  1. Bargiela S, Steward R, Mandy W. The experiences of late-diagnosed women with autism spectrum conditions: an investigation of the female autism phenotype. J Autism Dev Disord. 2016;46(10):3281–94. https://doi.org/10.1007/s10803-016-2872-8.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Belcher HL, Morein-Zamir S, Stagg SD, Ford RM. Shining a light on a hidden population: social functioning and mental health in women reporting autistic traits but lacking diagnosis. J Autism Dev Disord. 2022. https://doi.org/10.1007/s10803-022-05583-2.

    Article  PubMed  Google Scholar 

  3. Brainstorm C, Anttila V, Bulik-Sullivan B, Finucane HK, Walters RK, Bras J, et al. Analysis of shared heritability in common disorders of the brain. Science. 2018;360(6395). https://doi.org/10.1126/science.aap8757

  4. Bromet E, Andrade LH, Hwang I, Sampson NA, Alonso J, de Girolamo G, et al. Cross-national epidemiology of DSM-IV major depressive episode. BMC Med. 2011;9:90. https://doi.org/10.1186/1741-7015-9-90.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh PR, et al. An atlas of genetic correlations across human diseases and traits. Nat Genet. 2015;47(11):1236–41. https://doi.org/10.1038/ng.3406.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Carroll RJ, Bastarache L, Denny JC. R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment. Bioinformatics. 2014;30(16):2375–6. https://doi.org/10.1093/bioinformatics/btu197.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Crawley JN, Heyer WD, LaSalle JM. Autism and cancer share risk genes, pathways, and drug targets. Trends Genet. 2016;32(3):139–46. https://doi.org/10.1016/j.tig.2016.01.001.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Croen LA, Zerbo O, Qian Y, Massolo ML, Rich S, Sidney S, Kripke C. The health status of adults on the autism spectrum. Autism. 2015;19(7):814–23. https://doi.org/10.1177/1362361315577517.

    Article  PubMed  Google Scholar 

  9. Das S, Forer L, Schönherr S, Sidore C, Locke AE, Kwong A, et al. Next-generation genotype imputation service and methods. Nat Genet. 2016;48(10):1284–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Davignon MN, Qian Y, Massolo M, Croen LA. Psychiatric and medical conditions in transition-aged individuals with ASD. Pediatrics. 2018;141(Suppl 4):S335–45. https://doi.org/10.1542/peds.2016-4300K.

    Article  PubMed  Google Scholar 

  11. Dennis JK, Sealock JM, Straub P, Lee YH, Hucks D, Actkins KE, et al. Clinical laboratory test-wide association scan of polygenic scores identifies biomarkers of complex disease. Genome Medicine. 2021;13(1):6. https://doi.org/10.1186/s13073-020-00820-8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Ge T, Chen C-Y, Ni Y, Feng Y-CA, Smoller JW. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat Commun. 2019;10(1):1776. https://doi.org/10.1038/s41467-019-09718-5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H, et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet. 2019. https://doi.org/10.1038/s41588-019-0344-8.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Hansen-Kiss E, Beinkampen S, Adler B, Frazier T, Prior T, Erdman S, et al. A retrospective chart review of the features of PTEN hamartoma tumour syndrome in children. J Med Genet. 2017;54(7):471–8. https://doi.org/10.1136/jmedgenet-2016-104484.

    Article  PubMed  Google Scholar 

  15. Havdahl A, Niarchou M, Starnawska A, Uddin M, van der Merwe C, & Warrier V. Genetic contributions to autism spectrum disorder. Psychol Med. 2021:1–14. https://doi.org/10.1017/S0033291721000192

  16. Hill AP, Zuckerman KE, Fombonne E. Obesity and autism. Pediatrics. 2015;136(6):1051–61. https://doi.org/10.1542/peds.2015-1437.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Hull L, Levy L, Lai MC, Petrides KV, Baron-Cohen S, Allison C, et al. Is social camouflaging associated with anxiety and depression in autistic adults? Mol Autism. 2021;12(1):13. https://doi.org/10.1186/s13229-021-00421-1.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Kendler KS, Ohlsson H, Lichtenstein P, Sundquist J, Sundquist K. The genetic epidemiology of treated major depression in Sweden. Am J Psychiatry. 2018;175(11):1137–44. https://doi.org/10.1176/appi.ajp.2018.17111251.

    Article  PubMed  Google Scholar 

  19. Kessler RC, Bromet EJ. The epidemiology of depression across cultures. Annu Rev Public Health. 2013;34:119–38. https://doi.org/10.1146/annurev-publhealth-031912-114409.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Lai MC, Kassee C, Besney R, Bonato S, Hull L, Mandy W, et al. Prevalence of co-occurring mental health diagnoses in the autism population: a systematic review and meta-analysis. Lancet Psychiatry. 2019;6(10):819–29. https://doi.org/10.1016/S2215-0366(19)30289-5.

    Article  PubMed  Google Scholar 

  21. Lee PH, Anttila V, Won H, Feng Y-CA, Rosenthal J, Zhu Z, et al. Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell. 2019;179(7):1469-1482. e1411.

    Article  Google Scholar 

  22. Leppert B, Millard LAC, Riglin L, Davey Smith G, Thapar A, Tilling K, et al. A cross-disorder PRS-pheWAS of 5 major psychiatric disorders in UK Biobank. PLoS Genet. 2020;16(5):1008185. https://doi.org/10.1371/journal.pgen.1008185.

    Article  CAS  Google Scholar 

  23. McBride KL, Varga EA, Pastore MT, Prior TW, Manickam K, Atkin JF, Herman GE. Confirmation study of PTEN mutations among individuals with autism or developmental delays/mental retardation and macrocephaly. Autism Res. 2010;3(3):137–41. https://doi.org/10.1002/aur.132.

    Article  PubMed  Google Scholar 

  24. Milner V, Colvert E, Mandy W, Happe F. A comparison of self-report and discrepancy measures of camouflaging: exploring sex differences in diagnosed autistic versus high autistic trait young adults. Autism Res. 2023;16(3):580–90. https://doi.org/10.1002/aur.2873.

    Article  PubMed  Google Scholar 

  25. Niarchou M, Sealock JM, Straub P, Sanchez-Roige S, Sutcliffe JS, Davis LK. A phenome-wide association study of polygenic scores for attention deficit hyperactivity disorder across two genetic ancestries in electronic health record data. Am J Med Genet B Neuropsychiatr Genet. 2022;189(6):185–95.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Organization, W. H. International classification of diseases for mortality and morbidity statistics (11th Revision). 2018.

    Google Scholar 

  27. Renehan AG, Tyson M, Egger M, Heller RF, Zwahlen M. Body-mass index and incidence of cancer: a systematic review and meta-analysis of prospective observational studies. Lancet. 2008;371(9612):569–78. https://doi.org/10.1016/S0140-6736(08)60269-X.

    Article  PubMed  Google Scholar 

  28. Roden DM, Pulley JM, Basford MA, Bernard GR, Clayton EW, Balser JR, Masys DR. Development of a large-scale de-identified DNA biobank to enable personalized medicine. Clin Pharmacol Ther. 2008;84(3):362–9.

    Article  CAS  PubMed  Google Scholar 

  29. Ruan Y, Anne Feng Y-C, Chen C-Y, Lam M, Sawa A, Martin A R, et al. Improving Polygenic Prediction in Ancestrally Diverse Populations. medRxiv. 2021. 2020.2012.2027.20248738. https://doi.org/10.1101/2020.12.27.20248738

  30. Wu P, Gifford A, Meng X, Li X, Campbell H, Varley T, et al. Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation. JMIR Med Inform. 2019;7(4):e14325. https://doi.org/10.2196/14325.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Yehia L, Keel E, Eng C. The Clinical Spectrum of PTEN Mutations. Annu Rev Med. 2020;71:103–16. https://doi.org/10.1146/annurev-med-052218-125823.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We would like to thank the Synthetic Derivative: The project described was supported by the National Center for Research Resources, Grant UL1 RR024975-01, and is now at the National Center for Advancing Translational Sciences, Grant 2 UL1 TR000445-06. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Funding

5P50HD103537-02, sub-project ID: 7966 to LKD.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization of the study: M.N. and L.K.D., data analyses: M.N., and T.M.F, data interpretation: all authors; writing, editing, and reviewing drafts: all authors. All authors read and approved the final manuscripts.

Corresponding authors

Correspondence to Maria Niarchou or Lea K. Davis.

Ethics declarations

Ethics approval and consent to participate

The project was approved by Vanderbilt’s Institutional Review Board (IRB) (#190418).

Consent for publication

Not applicable. Our study has a non-human subjects determinations.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Supplementary Table S1. PheWAS results of the African American genetic ancestry population. Supplementary Table S2. PheWAS results of the European genetic ancestry population. Supplementary Table S3. PheWAS results conditional on BMI in the European genetic ancestry population. Supplementary Table S4. PheWAS results conditioning on autism diagnosis. Supplementary Table S5. PheWAS results in males of European genetic ancestry. Supplementary Table S6. PheWAS in females within the European genetic ancestry population. Supplementary Table S7. Results of sex-interaction analysis in the European genetic ancestry population. Supplementary Table S8. PheWAS results in the 0 to 11 age group (European genetic ancestry population). Supplementary Table S9. PheWAS results in the 12 to 18 age group (European genetic ancestry population). Supplementary Table S10. PheWAS results in the 19 to 25 age group (European genetic ancestry population). Supplementary Table S11. PheWAS results in the 26 to 40 age group (European genetic ancestry population). Supplementary Table S12. PheWAS results in the 41 to 60 age group (European genetic ancestry population). Supplementary Table S13. PheWAS results in the 61 to 100 age group (European genetic ancestry population).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Niarchou, M., Miller-Fleming, T., Malow, B.A. et al. The physical and psychiatric health conditions related to autism genetic scores, across genetic ancestries, sexes and age-groups in electronic health records. J Neurodevelop Disord 15, 18 (2023). https://doi.org/10.1186/s11689-023-09485-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s11689-023-09485-x

Keywords