- Open Access
Rare copy number variations affecting the synaptic gene DMXL2 in neurodevelopmental disorders
Journal of Neurodevelopmental Disordersvolume 11, Article number: 3 (2019)
Ultra-rare genetic variants, including non-recurrent copy number variations (CNVs) affecting important dosage-sensitive genes, are important contributors to the etiology of neurodevelopmental disorders (NDDs). Pairing family-based whole-genome sequencing (WGS) with detailed phenotype data can enable novel gene associations in NDDs.
We performed WGS of six members from a three-generation family, where three individuals each had a spectrum of features suggestive of a NDD. CNVs and sequence-level variants were identified and further investigated in disease and control databases.
We identified a novel 252-kb deletion at 15q21 that overlaps the synaptic gene DMXL2 and the gene GLDN. The microdeletion segregated in NDD-affected individuals. Additional rare inherited and de novo sequence-level variants were found that may also be involved, including a missense change in GRIK5. Multiple CNVs and loss-of-function sequence variants affecting DMXL2 were discovered in additional unrelated individuals with a range of NDDs.
Disruption of DMXL2 may predispose to NDDs including autism spectrum disorder. The robust interpretation of private variants requires a multifaceted approach that incorporates multigenerational pedigrees and genome-wide and population-scale data.
Rare genetic variants can contribute to the etiology of common neurodevelopmental disorders (NDDs) such as autism spectrum disorder (ASD) [1,2,3,4,5,6,7], attention-deficit/hyperactivity disorder (ADHD) [8, 9], intellectual disability (ID) [10,11,12], and schizophrenia [13,14,15,16,17]. Marked genetic heterogeneity contributed to the recommendation that genome-wide chromosomal microarray (CMA) be a first-tier genetic test for individuals with selected NDDs [10, 14]. CMA along with whole-genome sequencing (WGS) technology have allowed increasing detection of genomic alterations such as copy number variations (CNVs) affecting important developmental genes [10, 18,19,20]. These variants are difficult to adjudicate through parental testing alone, as even established “genomic disorders” can demonstrate highly variable neurodevelopmental and neuropsychiatric expression [21, 22]. Many are inherited from a putatively unaffected parent, who may have sub-clinical traits or a history of symptoms suggestive of an undiagnosed psychiatric condition . WGS represents a comprehensive platform for detection of coding and non-coding sequence-level, CNV, structural, and mitochondrial variation [1, 24,25,26].
There are two main contemporary strategies for discovering NDD genes based on CNVs [18, 27]. One begins with a unique variant in a proband and involves a traditional family-based study design with deep phenotyping and curation of all genomic variants. The other involves searching for genes or loci that are overrepresented in large datasets of rare genetic variation assembled through clinical testing or research consortia. In this study, we combined these “depth” and “breadth” approaches to characterize a small non-recurrent CNV of uncertain clinical significance. We sequenced the genomes of six individuals from a single multiplex NDD family and propose that a novel 15q21 microdeletion is the likely genetic lesion involved, because of haploinsufficiency of a synaptic scaffolding protein encoded by DMXL2.
Family recruitment and phenotyping
The adult proband contacted our research group requesting to participate in our ongoing genetic studies of ASD, after clinical CMA testing had revealed a 15q21 deletion of uncertain significance. History provided by the proband was supplemented by review of all available lifetime medical and psychiatric records. The proband and five family members across three generations provided DNA samples for WGS, and written informed consent was obtained for all participants. After the initial submission of this manuscript, one additional family member provided a saliva sample for targeted genetic testing.
WGS methods are as described in detail elsewhere [1, 24]. In brief, WGS was performed at The Centre for Applied Genomics (Toronto, Canada) using DNA extracted from whole blood. Sequencing was performed with the Illumina HiSeq X system and following Illumina’s recommended protocols. Base calling and data analysis were performed using Illumina HiSeq Analysis Software (HAS) version 2-188.8.131.521. Reads were mapped to the hg19 reference sequence using Isaac alignment software (Isaac alignment software: SAAC00776.15.01.27), and single nucleotide variants (SNVs) and small indel variants were detected using the Isaac variant caller [Isaac Variant Caller (Starling): 184.108.40.206]. These variants were annotated using a custom pipeline based on ANNOVAR . Rare variants were defined as those with ≤ 1% frequency in large public control databases [29,30,31]. CNVs were detected using the read-depth methods ERDS  and CNVnator  (using a window size of 500 bp). High-quality CNVs were defined as those greater than 1 kb and detected by both ERDS and CNVnator with more than 50% reciprocal overlap , and rare CNVs as those with ≤ 1% frequency in the Autism Speaks MSSNG dataset (probands and parents) . We also annotated CNVs with respect to overlap with gold standard variants in the Database of Genomic Variants (DGV) [35, 36]. For further targeted review of candidate CNVs, we used a control dataset comprised of 10,851 unrelated subjects, with a majority being of European ancestry, who were genotyped on multiple microarray platforms including the Affymetrix Genome-wide Human SNP Array 6.0, Illumina HumanOmni2.5, and Affymetrix CytoScan HD [37, 38]. All genome coordinates in this manuscript refer to NCBI Build 37 (UCSC hg19). Variants of interest were confirmed by PCR and Sanger sequencing (primers available on request) or by CMA.
Review of public and private databases
We conducted a comprehensive review of public, and non-public clinical and research laboratory, databases of genomic variation in disease (primarily NDD) to identify additional individuals with DMXL2, GLDN, and GRIK5 variants. We focused on smaller (< 1 Mb and < 15 genes) CNVs predicted to disrupt any of these genes (i.e., deletions impacting the coding region of the gene, and intragenic duplications), as well as predicted null SNVs and small indels. The denominator of individuals with data available for review is estimated to have been ~ 100,000. This included > 30,000 postnatal cases from Canadian laboratory clinical CMA databases, > 20,000 schizophrenia cases with CMA data from the Psychiatric Genomics Consortium , > 18,000 cases with CMA data in DECIPHER , > 23,000 cases from the Lineagen Inc. CMA database (Salt Lake City, Utah, USA), and > 6000 cases with CMA data from our ongoing ASD studies and/or WGS data from MSSNG .
Family members demonstrated variable features of NDDs
The proband (II-2; Fig. 1) is a 34-year-old female of Dutch ancestry with a reported history of difficulties with sensory processing and physical contact as a child. During adolescence, symptoms across a broad range of domains were reported, including conduct symptoms, attention deficit, anxiety including panic attacks, and fluctuating mood. She completed preparatory middle-level vocational education (voorbereidend middelbaar beroepsonderwijs or VMBO), the lowest level educational stream in the Netherlands. At age 28 years, Wechsler Adult Intelligence Scale—Third Edition (WAIS-III) showed a marked and unusual  difference between verbal IQ (VIQ; 79) and performance IQ (PIQ; 100).
At about age 20, she received pharmacological as well as non-pharmacological treatments for symptoms of anxiety and panic. After the birth of her first child, at age 25, she was treated briefly with an antidepressant for postpartum depression. She was diagnosed with ADHD at age 27 and treated with methylphenidate. At age 33, two independent psychiatric assessments confirmed the previous diagnosis of ADHD. In addition, several aspects of ASD were mentioned and endorsed by questionnaire reports, including reduced empathic abilities, difficulties in social interaction and communication, and difficulties coping with change in her routines. Results of the Social Responsiveness Scale for Adults (reported by self and husband) and the Pervasive Developmental Disorders Screening Questionnaire were in the clinical range. However, these symptoms were evaluated as insufficient to justify a formal diagnosis of ASD. Fluctuations in mood and poor coping skills were interpreted at times as aspects of a borderline personality disorder. A subsequent psychiatric assessment at age 34 years concluded that, in retrospect, ASD was a justified diagnosis based on all available information.
The early developmental trajectory of one of the proband’s daughters (III-1; Fig. 1) was marked by mild cognitive delay and a mixed language disorder (as assessed with the Clinical Evaluation of Language Fundamentals—Fourth Edition). Motor milestones were unremarkable. Her medical history was notable for a febrile seizure at age 3 years and recurrent otitis media, which was associated with temporary hearing loss. She was reported to be a highly anxious infant with a high threshold for pain and hypersensitivity to sensory input. She continued to display moderate behavioral issues as a toddler, both at home and in school, with social and communicative difficulties as well as oppositional, attentional, and anxious features. Early clinical diagnoses included developmental language disorder and disorder of childhood not otherwise specified. At 7 years 11 months, full scale IQ was assessed to be 85, with VIQ 87 and PIQ 86, using the Wechsler Intelligence Scale for Children—Third Edition (WISC-III). The most recent child psychiatric assessment performed at age 9.5 years recorded symptoms including low reciprocity, inadequate social behaviors, and behavioral rituals. Although she scored just below the clinical threshold of the Autism Diagnostic Observation Schedule (ADOS), the overall picture was deemed to justify a clinical diagnosis of ASD, in addition to the language disorder.
The proband’s father (I-1; Fig. 1) was described as having longstanding difficulties with impulsivity/anger, obsessiveness, and other distressing characterological traits that interfered with his functioning. He had similar educational achievement to his daughter and has maintained stable employment without advancement or promotion. He has had at least three “nervous breakdowns” in his life that required him to take a leave of absence from his work. On the basis of this limited history, he was coded prior to WGS as likely affected with a psychiatric disorder and/or NDD. The proband’s other daughter (III-2; Fig. 1) has had normal development, and there are no concerns for ASD. She was diagnosed with Pediatric Acute-onset Neuropsychiatric Syndrome (PANS) after developing motor tics post viral infection. Although the tics initially resolved after a short course of treatment with methylprednisolone, treatment of a recurrence is ongoing. She was coded prior to WGS as unaffected with a NDD. The other two members of the family displayed in Fig. 1 (I-2 and II-1; Fig. 1) are reported to have no major learning difficulties, diagnosed psychiatric conditions, or features of a NDD. There is no known history of unexplained hearing loss in the family  nor of infertility or recurrent (> 2) miscarriage, delayed puberty, hypothyroidism, peripheral neuropathy, or short stature . Facial photographs of the proband and her two daughters are available upon request; there are no notable dysmorphic features.
The familial microdeletion implicates the candidate gene DMXL2
WGS identified the novel 262-kb loss at chromosome 15q21.2 (Fig. 2) in the proband that had been detected on CMA. The deletion was inherited from her father and transmitted to her affected daughter (Fig. 1). This deletion overlaps the entire DMXL2 gene (GenBank: NM_001174116.1) and the first three exons of GLDN (GenBank: NM_181789.3). There were no segmental duplication repeats within 200 kb of the breakpoints. No additional rare CNVs of interest were shared amongst these three family members, or between just the proband and her affected daughter.
DMXL2 encodes DmX-like protein 2, previously known as Rabconnectin-3. This is a component of the human neocortex post-synaptic density (PSD) proteome  that may serve as a scaffold protein on synaptic vesicles, and it is a target of the fragile X mental retardation protein (FMRP) [44, 45]. Multiple lines of evidence, including conditional heterozygous deletion of Dmxl2 in mouse neurons, suggest that the DMXL2 protein functions in the brain in neuronal and endocrinological homeostasis . The Exome Aggregation Consortium (ExAC) probability of loss-of-function intolerance (pLI) of DMXL2 is 1.00, with tenfold fewer observed variants than anticipated . DMXL2 is in a copy number-stable region of the genome; no deletions of DMXL2 were found in the DGV gold standard variants (Fig. 2) or in our CNV control datasets (see the “Methods” section) . The DMXL2 gene contains “brain-critical exons” . Two de novo missense variants in DMXL2 were previously observed in 2517 simplex ASD probands from the Simons Simplex Collection (SSC), with no de novo events in this gene observed in their unaffected siblings [5, 6]. There is also evidence of enrichment for rare sequence variants in DMXL2 in major depressive disorder . With respect to the other gene overlapped by the 15q21 loss CNV, GLDN encodes the protein gliomedin, a secreted cell adhesion molecule involved in the formation of the nodes of Ranvier . In contrast to DMXL2, the ExAC pLI is 0.00 and a deletion was observed in one of our control individuals, suggesting there may not be overt clinical consequences for a heterozygous null allele. Biallelic loss of function of GLDN causes a lethal congenital contracture syndrome (MIM #617194), and as expected, carrier parents were described as unaffected .
Additional rare sequence variants may shape expression
Considering SNVs and small indels, there were no likely pathogenic or pathogenic variants  in established NDD risk genes in any of the six family members. A total of 198 high-quality, rare, predicted damaging variants were shared by the three family members coded as affected (I-1, II-2, III-1; Fig. 1), including 89 that were absent in the proband’s unaffected daughter (III-2). One of these 89 is a missense variant of uncertain significance in GRIK5 [NM_001301030: c.1840G>A: p.(Ala614Thr)]. It is absent in control databases, affects a highly conserved amino acid, and is predicted to be damaging by in silico programs including SIFT (score 0), PolyPhen-2 (score 0.977), MutationTaster (probability 0.999), and CADD (scaled C-score 32). GRIK5 encodes an understudied auxiliary subunit of the kainate receptor (glutamate receptor, ionotropic, kainate 5). Dysregulation of other components of this receptor has been associated with psychiatric and neurodevelopmental diseases [50,51,52,53,54]. In ExAC, GRIK5 is constrained with respect to loss-of-function (pLI = 0.92) and missense (z = 3.85) variants. There are no coding CNVs in our control databases impacting GRIK5. Similar to DMXL2, prior evidence for an association with ASD is derived from the SSC and Autism Sequencing Consortium, where there were three de novo missense variants in the GRIK5 gene in simplex ASD probands and none in the unaffected SSC siblings [4,5,6].
We also identified three high confidence de novo exonic variants. In the proband (II-2), there was a 2-bp frameshift deletion in exon 10 of ZC3H14 [NM_024824.4: c.1339_1340del: p.(Leu447Alafs*8)] and a synonymous substitution in exon 3 of TRPC4 that was not predicted to affect splicing. ZC3H14 plays a key role in neurodevelopmental processes [55, 56]. The gene is associated with an autosomal recessive form of ID (MIM #617125) , but is also constrained against loss-of-function variants (ExAC pLI = 1), suggesting that it may be sensitive to haploinsufficiency. However, the de novo frameshift variant was transmitted only to the proband’s unaffected daughter (III-2; Fig. 1). The affected daughter (III-1) has a novel de novo missense variant in exon 3 of DLGAP3 [NM_001080418.2: c.33 T>A: p.(His11Gln)]. This gene is a member of the DLGAP family that has been studied in relation to various neuropsychiatric disorders . However, in silico predictions suggest the missense change here will have a benign impact. No de novo exonic variants were identified in the unaffected daughter (III-2).
Genetic variants in additional individuals pinpoint DMXL2 as a risk gene for NDD
Much larger deletions encompassing the 15q21.2 locus are associated with ID and other features [59, 60]. A comprehensive review of public databases identified a single comparable 274-kb 15q21 loss (Fig. 2), in DECIPHER . The individual is a 21-year-old Belgian male with diagnoses of mild ID, ASD, and ADHD (case 4 in Table 1). Growth parameters were within normal limits (weight 71.5 kg, height 168 cm, head circumference 56.4 cm), and there were no major congenital anomalies or notable dysmorphic features. Inheritance of the CNV could not be assessed as the proband was adopted. Seven other individuals with CNVs predicted to disrupt DMXL2 were identified in private clinical laboratory databases (cases 5–11 in Table 1), with available phenotype data supporting a primary NDD phenotype in most. Additional CNVs disrupting DMXL2 were identified in schizophrenia probands (cases 12–13 in Table 1) in the PGC , and novel loss-of-function SNVs were identified in ASD probands (cases 14–17 in Table 1) in the Autism Speaks MSSNG study . Similar efforts to identify variants in GRIK5 were not successful, with only a single exonic deletion found in the MSSNG cohort (data not shown).
We employed WGS and a multifaceted strategy to characterize a CNV of uncertain clinical significance. Diverse and converging lines of evidence suggest that haploinsufficiency of DMXL2 is a risk factor for NDDs. The available data indicate variable expressivity and possibly incomplete penetrance (or at least age-related penetrance). Families exhibiting an apparently heritable but broad phenotype of NDD symptoms, including members with and without a clinical diagnosis of ASD, can expand our knowledge of the genetically related spectrum of disease. One of the most pressing challenges in the field is to understand the typically high degree of variable neuropsychiatric expression associated with risk variants [61, 62]. The range of symptomatology observed in this family is reminiscent of what is often observed with genomic disorders and non-recurrent large CNVs, and it should further highlight the impact of rare inherited genetic variants [18, 22, 63, 64]. In some cases, there is emerging evidence for additional deleterious variants elsewhere in the genome that may act as modifiers [65, 66]. We also identified de novo and inherited sequence variants of potential relevance in this family, including a rare missense change in GRIK5. The latter was not as compelling of a candidate variant as the microdeletion overlapping DMXL2 because of (i) the predicted less-severe nature of the genetic lesion, (ii) the current absence of overt functional or model organism data at the gene level, and (iii) our inability to adequately replicate the finding in the population-scale NDD cohorts. Nonetheless, it could still be a contributor to the risk for NDD in this family. A limitation of our study was that the relatively small nuclear families precluded the identification of an individual with one, but not both, of the variants.
Our findings provide a clinical impetus to now develop and test functional hypotheses regarding disease mechanism(s). The Dmxl2 gene is best studied in the gonadotropin-releasing hormone neurons [42, 67], where low expression in mice impedes normal dendritic development . However, the phenotypes observed in knockout mice are not entirely attributable to deficient Dmxl2 in that neuronal cell population . Dmxl2+/− mice also demonstrate neuroanatomical differences in the corpus callosum . At least four of ten experimentally proven protein interactors of DMXL2  are encoded by established or candidate genes for NDDs: CYFIP2 [70, 71], DYNC1H1 [72, 73], MATR3 , and NCKAP1 [5, 75, 76]. Both CYFIP2 and NCKAP1 shape the formation of dendritic spines via the WAVE actin-remodeling complex [77, 78]. DMXL2 protein dosage insufficiency may therefore result in structural (e.g., abnormal dendritic spines morphology) and/or functional (e.g., impaired transmission) synaptic consequences in humans, as have been observed in other genetic forms of ID, ASD, and schizophrenia [79, 80].
It remains difficult in clinical practice to interpret novel inherited CNVs, which are often labeled variants of uncertain clinical significance . Incomplete penetrance and epistasis are likely underappreciated. Most approaches to CNV adjudication focus on the overlapped genes and disregard the remainder of the genome , in part because CMA is incapable of identifying smaller and balanced genetic variants. Whereas WGS is more comprehensive in detection [1, 24,25,26], sometimes also revealing complicating data for clinical interpretation, as was the case with the GRIK5 variant. Updated guidelines are required for interpreting CNVs that take into account sequence variants . Specific consideration will need to be given with respect to how to determine if two or more variants are major contributors to disease within a specific individual. The risk profile from common variants with a polygenic effect may also become clinically relevant in time, even in those individuals with a monogenic diagnosis or genomic disorder [82, 83].
Variant inheritance patterns in multigenerational pedigrees can be highly informative, with the caveat that customary assumptions (e.g., de novo suggests pathogenic, and inherited from putatively unaffected parent suggests benign) are imperfect [84,85,86]. The availability of samples and detailed phenotype data from three generations of a family, including offspring and parents of an individual with ASD or other NDD phenotypes, is uncommon . There is substantial evidence that different rare disease-causing variants can segregate within the same family [1, 3]. Our approach here of examining multiple rare penetrant variants across generations might reveal new genes for NDD. This may especially be the case for genes involved in higher functioning forms of ASD, perhaps including DMXL2. Our study design also allowed for the observation that the proband’s de novo variant in ZC3H14 was not inherited by her daughter with NDD. In contrast, both the 15q21 deletion and the GRIK5 variant did segregate with the apparent NDD phenotypes, albeit in a simple pedigree where the a priori probability was p = 0.125 for any single variant .
These results support DMXL2 as a candidate gene for a spectrum of NDDs that includes ASD. As for virtually all (even highly penetrant) risk variants, additional modifiers of expression remain to be discovered. The robust interpretation of ultra-rare variants identified by CMA or other testing requires incorporation of WGS from affected families and population-scale data. Although currently expensive for clinical diagnostics , in time, we expect the same utility will be found as in this research study and WGS will become a first-tier clinical genetic test for NDDs.
Autism Diagnostic Observation Schedule
Autism spectrum disorder
Copy number variation
Database of Genomic Variants
Psychiatric Genomics Consortium
Performance intelligence quotient
Single nucleotide variant
Verbal intelligence quotient
Wechsler Adult Intelligence Scale—Third Edition
Wechsler Intelligence Scale for Children—Third Edition
Yuen RKC, Merico D, Bookman M, Howe LJ, Thiruvahindrapuram B, Patel RV, Whitney J, Deflaux N, Bingham J, Wang Z, et al. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat Neurosci. 2017;20(4):602–11.
Tammimies K, Marshall CR, Walker S, Kaur G, Thiruvahindrapuram B, Lionel AC, Yuen RK, Uddin M, Roberts W, Weksberg R, et al. Molecular diagnostic yield of chromosomal microarray analysis and whole-exome sequencing in children with autism spectrum disorder. JAMA. 2015;314(9):895–903.
Yuen RK, Thiruvahindrapuram B, Merico D, Walker S, Tammimies K, Hoang N, Chrysler C, Nalpathamkalam T, Pellecchia G, Liu Y, et al. Whole-genome sequencing of quartet families with autism spectrum disorder. Nat Med. 2015;21(2):185–91.
De Rubeis S, He X, Goldberg AP, Poultney CS, Samocha K, Cicek AE, Kou Y, Liu L, Fromer M, Walker S, et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature. 2014;515(7526):209–15.
Iossifov I, O'Roak BJ, Sanders SJ, Ronemus M, Krumm N, Levy D, Stessman HA, Witherspoon KT, Vives L, Patterson KE, et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature. 2014;515(7526):216–21.
Krumm N, Turner TN, Baker C, Vives L, Mohajeri K, Witherspoon K, Raja A, Coe BP, Stessman HA, He ZX, et al. Excess of rare, inherited truncating mutations in autism. Nat Genet. 2015;47(6):582–8.
Uddin M, Tammimies K, Pellecchia G, Alipanahi B, Hu P, Wang Z, Pinto D, Lau L, Nalpathamkalam T, Marshall CR, et al. Brain-expressed exons under purifying selection are enriched for de novo mutations in autism spectrum disorder. Nat Genet. 2014;46(7):742–7.
Williams NM, Zaharieva I, Martin A, Langley K, Mantripragada K, Fossdal R, Stefansson H, Stefansson K, Magnusson P, Gudmundsson OO, et al. Rare chromosomal deletions and duplications in attention-deficit hyperactivity disorder: a genome-wide analysis. Lancet. 2010;376(9750):1401–8.
Lionel AC, Crosbie J, Barbosa N, Goodale T, Thiruvahindrapuram B, Rickaby J, Gazzellone M, Carson AR, Howe JL, Wang Z, et al. Rare copy number variation discovery and cross-disorder comparisons identify risk genes for ADHD. Sci Transl Med. 2011;3(95):95ra75.
Miller DT, Adam MP, Aradhya S, Biesecker LG, Brothman AR, Carter NP, Church DM, Crolla JA, Eichler EE, Epstein CJ, et al. Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies. Am J Hum Genet. 2010;86(5):749–64.
Cooper GM, Coe BP, Girirajan S, Rosenfeld JA, Vu TH, Baker C, Williams C, Stalker H, Hamid R, Hannig V, et al. A copy number variation morbidity map of developmental delay. Nat Genet. 2011;43(9):838–46.
Gilissen C, Hehir-Kwa JY, Thung DT, van de Vorst M, van Bon BW, Willemsen MH, Kwint M, Janssen IM, Hoischen A, Schenck A, et al. Genome sequencing identifies major causes of severe intellectual disability. Nature. 2014;511(7509):344–7.
Xu B, Ionita-Laza I, Roos JL, Boone B, Woodrick S, Sun Y, Levy S, Gogos JA, Karayiorgou M. De novo gene mutations highlight patterns of genetic and neural complexity in schizophrenia. Nat Genet. 2012;44(12):1365–9.
Costain G, Lionel AC, Merico D, Forsythe P, Russell K, Lowther C, Yuen T, Husted J, Stavropoulos DJ, Speevak M, et al. Pathogenic rare copy number variants in community-based schizophrenia suggest a potential role for clinical microarrays. Hum Mol Genet. 2013;22(22):4485–501.
Gulsuner S, Walsh T, Watts AC, Lee MK, Thornton AM, Casadei S, Rippey C, Shahin H, Consortium on the Genetics of S, Group PS, et al. Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell. 2013;154(3):518–29.
Fromer M, Pocklington AJ, Kavanagh DH, Williams HJ, Dwyer S, Gormley P, Georgieva L, Rees E, Palta P, Ruderfer DM, et al. De novo mutations in schizophrenia implicate synaptic networks. Nature. 2014;506(7487):179–84.
Marshall CR, Howrigan DP, Merico D, Thiruvahindrapuram B, Wu W, Greer DS, Antaki D, Shetty A, Holmans PA, Pinto D, et al. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat Genet. 2017;49(1):27–35.
Cook EH Jr, Scherer SW. Copy-number variations associated with neuropsychiatric conditions. Nature. 2008;455(7215):919–23.
Riggs ER, Ledbetter DH, Martin CL. Genomic variation: lessons learned from whole-genome CNV analysis. Curr Genet Med Rep. 2014;2:146–50.
Hehir-Kwa JY, Pfundt R, Veltman JA. Exome sequencing and whole genome sequencing for the detection of copy number variation. Expert Rev Mol Diagn. 2015;15(8):1023–32.
Vorstman JA, Ophoff RA. Genetic causes of developmental disorders. Curr Opin Neurol. 2013;26(2):128–36.
Lowther C, Costain G, Baribeau DA, Bassett AS. Genomic disorders in psychiatry-what does the clinician need to know? Curr Psychiatry Rep. 2017;19(11):82.
Woodbury-Smith M, Paterson AD, Thiruvahindrapduram B, Lionel AC, Marshall CR, Merico D, Fernandez BA, Duku E, Sutcliffe JS, O'Conner I, et al. Using extended pedigrees to identify novel autism spectrum disorder (ASD) candidate genes. Hum Genet. 2015;134(2):191–201.
Lionel AC, Costain G, Monfared N, Walker S, Reuter MS, Hosseini SM, Thiruvahindrapuram B, Merico D, Jobling R, Nalpathamkalam T, et al. Improved diagnostic yield compared with targeted gene sequencing panels suggests a role for whole-genome sequencing as a first-tier genetic test. Genet Med. 2018;20(4):435–43.
Stavropoulos DJ, Merico D, Jobling R, Bowdin S, Monfared N, Thiruvahindrapuram B, Nalpathamkalam T, Pellecchia G, Yuen RKC, Szego MJ, et al. Whole genome sequencing expands diagnostic utility and improves clinical management in pediatric medicine. NPJ Genom Med. 2016;1:15012.
Noll AC, Miller NA, Smith LD, Yoo B, Fiedler S, Cooley LD, Willig LK, Petrikin JE, Cakici J, Lesko J, et al. Clinical detection of deletion structural variants in whole-genome sequences. NPJ Genom Med. 2016;1:16026.
Buchanan JA, Scherer SW. Contemplating effects of genomic structural variation. Genet Med. 2008;10(9):639–47.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164.
Genomes Project C, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA. A map of human genome variation from population-scale sequencing. Nature. 2010;467(7319):1061–73.
Tennessen JA, Bigham AW, O'Connor TD, Fu W, Kenny EE, Gravel S, McGee S, Do R, Liu X, Jun G, et al. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science. 2012;337(6090):64–9.
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, O'Donnell-Luria AH, Ware JS, Hill AJ, Cummings BB, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91.
Zhu M, Need AC, Han Y, Ge D, Maia JM, Zhu Q, Heinzen EL, Cirulli ET, Pelak K, He M, et al. Using ERDS to infer copy-number variants in high-coverage genomes. Am J Hum Genet. 2012;91(3):408–21.
Abyzov A, Urban AE, Snyder M, Gerstein M. CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011;21(6):974–84.
Trost B, Walker S, Wang Z, Thiruvahindrapuram B, MacDonald JR, Sung WWL, Pereira SL, Whitney J, Chan AJS, Pellecchia G, et al. A comprehensive workflow for read depth-based identification of copy-number variation from whole-genome sequence data. Am J Hum Genet. 2018;102(1):142–55.
MacDonald JR, Ziman R, Yuen RK, Feuk L, Scherer SW. The database of genomic variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 2014;42(Database issue):D986–92.
Zarrei M, MacDonald JR, Merico D, Scherer SW. A copy number variation map of the human genome. Nat Rev Genet. 2015;16(3):172–83.
Uddin M, Thiruvahindrapuram B, Walker S, Wang Z, Hu P, Lamoureux S, Wei J, MacDonald JR, Pellecchia G, Lu C, et al. A high-resolution copy-number variation resource for clinical and population genetics. Genet Med. 2015;17(9):747–52.
Uddin M, Pellecchia G, Thiruvahindrapuram B, D’Abate L, Merico D, Chan A, Zarrei M, Tammimies K, Walker S, Gazzellone MJ, et al. Indexing effects of copy number variation on genes involved in developmental delay. Sci Rep. 2016;6:28663.
Firth HV, Richards SM, Bevan AP, Clayton S, Corpas M, Rajan D, Van Vooren S, Moreau Y, Pettett RM, Carter NP. DECIPHER: database of chromosomal imbalance and phenotype in humans using Ensembl resources. Am J Hum Genet. 2009;84(4):524–33.
Iverson GL, Woodward TS, Green P. Base rates of WAIS-R VIQ-PIQ differences in 1593 psychiatric inpatients. J Clin Psychol. 2001;57(12):1579–87.
Chen DY, Liu XF, Lin XJ, Zhang D, Chai YC, Yu DH, Sun CL, Wang XL, Zhu WD, Chen Y, et al. A dominant variant in DMXL2 is linked to nonsyndromic hearing loss. Genet Med. 2017;19(5):553–8.
Tata B, Huijbregts L, Jacquier S, Csaba Z, Genin E, Meyer V, Leka S, Dupont J, Charles P, Chevenne D, et al. Haploinsufficiency of Dmxl2, encoding a synaptic protein, causes infertility associated with a loss of GnRH neurons in mouse. PLoS Biol. 2014;12(9):e1001952.
Bayes A, van de Lagemaat LN, Collins MO, Croning MD, Whittle IR, Choudhary JS, Grant SG. Characterization of the proteome, diseases and evolution of the human postsynaptic density. Nat Neurosci. 2011;14(1):19–21.
Ascano M Jr, Mukherjee N, Bandaru P, Miller JB, Nusbaum JD, Corcoran DL, Langlois C, Munschauer M, Dewell S, Hafner M, et al. FMRP targets distinct mRNA sequence elements to regulate protein expression. Nature. 2012;492(7429):382–6.
Darnell JC, Van Driesche SJ, Zhang C, Hung KY, Mele A, Fraser CE, Stone EF, Chen C, Fak JJ, Chi SW, et al. FMRP stalls ribosomal translocation on mRNAs linked to synaptic function and autism. Cell. 2011;146(2):247–61.
Pirooznia M, Wang T, Avramopoulos D, Potash JB, Zandi PP, Goes FS. High-throughput sequencing of the synaptome in major depressive disorder. Mol Psychiatry. 2016;21(5):650–5.
Eshed Y, Feinberg K, Poliak S, Sabanay H, Sarig-Nadir O, Spiegel I, Bermingham JR Jr, Peles E. Gliomedin mediates Schwann cell-axon interaction and the molecular assembly of the nodes of Ranvier. Neuron. 2005;47(2):215–29.
Maluenda J, Manso C, Quevarec L, Vivanti A, Marguet F, Gonzales M, Guimiot F, Petit F, Toutain A, Whalen S, et al. Mutations in GLDN, encoding Gliomedin, a critical component of the nodes of Ranvier, are responsible for lethal arthrogryposis. Am J Hum Genet. 2016;99(4):928–33.
Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, Grody WW, Hegde M, Lyon E, Spector E, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med. 2015;17(5):405–24.
Griswold AJ, Ma D, Cukier HN, Nations LD, Schmidt MA, Chung RH, Jaworski JM, Salyakina D, Konidari I, Whitehead PL, et al. Evaluation of copy number variations reveals novel candidate genes in autism spectrum disorder-associated pathways. Hum Mol Genet. 2012;21(15):3513–23.
Casey JP, Magalhaes T, Conroy JM, Regan R, Shah N, Anney R, Shields DC, Abrahams BS, Almeida J, Bacchelli E, et al. A novel approach of homozygous haplotype sharing identifies candidate genes in autism spectrum disorder. Hum Genet. 2012;131(4):565–79.
Takenouchi T, Hashida N, Torii C, Kosaki R, Takahashi T, Kosaki K. 1p34.3 deletion involving GRIK3: further clinical implication of GRIK family glutamate receptors in the pathogenesis of developmental delay. Am J Med Genet A. 2014;164A(2):456–60.
Aller MI, Pecoraro V, Paternain AV, Canals S, Lerma J. Increased dosage of high-affinity kainate receptor gene grik4 alters synaptic transmission and reproduces autism spectrum disorders features. J Neurosci. 2015;35(40):13619–28.
Micheau J, Vimeney A, Normand E, Mulle C, Riedel G. Impaired hippocampus-dependent spatial flexibility and sociability represent autism-like phenotypes in GluK2 mice. Hippocampus. 2014;24(9):1059–69.
Bienkowski RS, Banerjee A, Rounds JC, Rha J, Omotade OF, Gross C, Morris KJ, Leung SW, Pak C, Jones SK, et al. The conserved, disease-associated RNA binding protein dNab2 interacts with the fragile X protein ortholog in Drosophila neurons. Cell Rep. 2017;20(6):1372–84.
Rha J, Jones SK, Fidler J, Banerjee A, Leung SW, Morris KJ, Wong JC, Inglis GAS, Shapiro L, Deng Q, et al. The RNA-binding protein, ZC3H14, is required for proper poly(A) tail length control, expression of synaptic proteins, and brain function in mice. Hum Mol Genet. 2017;26(19):3663–81.
Pak C, Garshasbi M, Kahrizi K, Gross C, Apponi LH, Noto JJ, Kelly SM, Leung SW, Tzschach A, Behjati F, et al. Mutation of the conserved polyadenosine RNA binding protein, ZC3H14/dNab2, impairs neural function in Drosophila and humans. Proc Natl Acad Sci U S A. 2011;108(30):12390–5.
Rasmussen AH, Rasmussen HB, Silahtaroglu A. The DLGAP family: neuronal expression, function and role in brain disorders. Mol Brain. 2017;10(1):43.
Tempesta S, Sollima D, Ghezzo S, Politi V, Sinigaglia B, Balducci F, Celso B, Restuccia A, Stefani M, Cernetti R, et al. Mild mental retardation in a child with a de novo interstitial deletion of 15q21.2q22.1: a comparison with previously described cases. Eur J Med Genet. 2008;51(6):639–45.
Velazquez-Wong AC, Ruiz Esparza-Garrido R, Velazquez-Flores MA, Huicochea-Montiel JC, Cardenas-Conejo A, Miguez-Munoz CP, Araujo-Solis MA, Salamanca-Gomez F, Arenas-Aranda DJ. Clinical and molecular characterization of a patient with 15q21.2q22.2 deletion syndrome. Cytogenet Genome Res. 2014;144(3):183–9.
Vorstman JAS, Parr JR, Moreno-De-Luca D, Anney RJL, Nurnberger JI Jr, Hallmayer JF. Autism genetics: opportunities and challenges for clinical translation. Nat Rev Genet. 2017;18(6):362–76.
Takumi T, Tamada K. CNV biology in neurodevelopmental disorders. Curr Opin Neurobiol. 2018;48:183–92.
Sato D, Lionel AC, Leblond CS, Prasad A, Pinto D, Walker S, O'Connor I, Russell C, Drmic IE, Hamdan FF, et al. SHANK1 deletions in males with autism spectrum disorder. Am J Hum Genet. 2012;90(5):879–87.
Lowther C, Costain G, Melvin R, Stavropoulos DJ, Lionel AC, Marshall CR, Scherer SW, Bassett AS. Adult expression of a 3q13.31 microdeletion. Mol Cytogenet. 2014;7(1):23.
Girirajan S, Rosenfeld JA, Coe BP, Parikh S, Friedman N, Goldstein A, Filipink RA, McConnell JS, Angle B, Meschino WS, et al. Phenotypic heterogeneity of genomic disorders and rare copy-number variants. N Engl J Med. 2012;367(14):1321–31.
Bassett AS, Lowther C, Merico D, Costain G, Chow EWC, van Amelsvoort T, McDonald-McGinn D, Gur RE, Swillen A, Van den Bree M, et al. Rare genome-wide copy number variation and expression of schizophrenia in 22q11.2 deletion syndrome. Am J Psychiatry. 2017;174(11):1054–63.
Tata BK, Harbulot C, Csaba Z, Peineau S, Jacquier S, de Roux N. Rabconnectin-3alpha is required for the morphological maturation of GnRH neurons and kisspeptin responsiveness. Sci Rep. 2017;7:42463.
Kannan M, Bayam E, Wagner C, Rinaldi B, Kretz PF, Tilly P, Roos M, McGillewie L, Bar S, Minocha S, et al. WD40-repeat 47, a microtubule-associated protein, is essential for brain development and autophagy. Proc Natl Acad Sci U S A. 2017;114(44):E9308–17.
Li KW, Chen N, Klemmer P, Koopmans F, Karupothula R, Smit AB. Identifying true protein complex constituents in interaction proteomics: the example of the DMXL2 protein complex. Proteomics. 2012;12(15–16):2428–32.
Abekhoukh S, Bardoni B. CYFIP family proteins between autism and intellectual disability: links with Fragile X syndrome. Front Cell Neurosci. 2014;8:81.
Nakashima M, Kato M, Aoto K, Shiina M, Belal H, Mukaida S, Kumada S, Sato A, Zerem A, Lerman-Sagie T, et al. De novo hotspot variants in CYFIP2 cause early-onset epileptic encephalopathy. Ann Neurol. 2018;83(4):794–806.
Poirier K, Lebrun N, Broix L, Tian G, Saillour Y, Boscheron C, Parrini E, Valence S, Pierre BS, Oger M, et al. Mutations in TUBG1, DYNC1H1, KIF5C and KIF2A cause malformations of cortical development and microcephaly. Nat Genet. 2013;45(6):639–47.
Jamuar SS, Lam AT, Kircher M, D'Gama AM, Wang J, Barry BJ, Zhang X, Hill RS, Partlow JN, Rozzo A, et al. Somatic mutations in cerebral cortical malformations. N Engl J Med. 2014;371(8):733–43.
Johnson JO, Pioro EP, Boehringer A, Chia R, Feit H, Renton AE, Pliner HA, Abramzon Y, Marangi G, Winborn BJ, et al. Mutations in the Matrin 3 gene cause familial amyotrophic lateral sclerosis. Nat Neurosci. 2014;17(5):664–6.
Anazi S, Maddirevula S, Salpietro V, Asi YT, Alsahli S, Alhashem A, Shamseldin HE, AlZahrani F, Patel N, Ibrahim N, et al. Expanding the genetic heterogeneity of intellectual disability. Hum Genet. 2017;136(11–12):1419–29.
Freed D, Pevsner J. The contribution of mosaic variants to autism spectrum disorder. PLoS Genet. 2016;12(9):e1006245.
Han K, Chen H, Gennarino VA, Richman R, Lu HC, Zoghbi HY. Fragile X-like behaviors and abnormal cortical dendritic spines in cytoplasmic FMR1-interacting protein 2-mutant mice. Hum Mol Genet. 2015;24(7):1813–23.
De Rubeis S, Pasciuto E, Li KW, Fernandez E, Di Marino D, Buzzi A, Ostroff LE, Klann E, Zwartkruis FJ, Komiyama NH, et al. CYFIP1 coordinates mRNA translation and cytoskeleton remodeling to ensure proper dendritic spine formation. Neuron. 2013;79(6):1169–82.
Jiang YH, Ehlers MD. Modeling autism by SHANK gene mutations in mice. Neuron. 2013;78(1):8–27.
Sudhof TC. Synaptic neurexin complexes: a molecular code for the logic of neural circuits. Cell. 2017;171(4):745–69.
Kearney HM, Thorland EC, Brown KK, Quintero-Rivera F, South ST, Working Group of the American College of Medical Genetics Laboratory Quality Assurance C. American College of Medical Genetics standards and guidelines for interpretation and reporting of postnatal constitutional copy number variants. Genet Med. 2011;13(7):680–5.
Merico D, Zarrei M, Costain G, Ogura L, Alipanahi B, Gazzellone MJ, Butcher NJ, Thiruvahindrapuram B, Nalpathamkalam T, Chow EW, et al. Whole-genome sequencing suggests schizophrenia risk mechanisms in humans with 22q11.2 deletion syndrome. G3 (Bethesda). 2015;5(11):2453–61.
Niemi MEK, Martin HC, Rice DL, Gallone G, Gordon S, Kelemen M, McAloney K, McRae J, Radford EJ, Yu S, et al. Common genetic variants contribute to risk of rare severe neurodevelopmental disorders. Nature. 2018;562(7726):268–71.
Vermeesch JR, Balikova I, Schrander-Stumpel C, Fryns JP, Devriendt K. The causality of de novo copy number variants is overestimated. Eur J Hum Genet. 2011;19(11):1112–3.
Costain G. Parental expression is overvalued in the interpretation of rare inherited variants. Eur J Hum Genet. 2015;23(1):4–7.
Giorgio E, Ciolfi A, Biamino E, Caputo V, Di Gregorio E, Belligni EF, Calcia A, Gaidolfi E, Bruselles A, Mancini C, et al. Whole exome sequencing is necessary to clarify ID/DD cases with de novo copy number variants of uncertain significance: two proof-of-concept examples. Am J Med Genet A. 2016;170(7):1772–9.
Jarvik GP, Browning BL. Consideration of cosegregation in the pathogenicity classification of genomic variants. Am J Hum Genet. 2016;98(6):1077–81.
Tsiplova K, Zur RM, Marshall CR, Stavropoulos DJ, Pereira SL, Merico D, Young EJ, Sung WWL, Scherer SW, Ungar WJ. A microcosting and cost-consequence analysis of clinical genomic testing strategies in autism spectrum disorder. Genet Med. 2017;19(11):1268–75.
The authors thank the family for their participation, and staff and trainees at The Centre for Applied Genomics. The authors wish to acknowledge the resources of MSSNG (www.mss.ng), Autism Speaks and The Centre for Applied Genomics at The Hospital for Sick Children, Toronto, Canada. They thank the participating families for their time and contributions to this database, as well as the generosity of the donors who supported this program. This study also makes use of data generated by the DECIPHER community. A full list of centres who contributed to the generation of the data is available from http://decipher.sanger.ac.uk and via email from firstname.lastname@example.org. Funding for the project was provided by the Wellcome Trust.
Additional acknowledgements regarding the control dataset are as follows:
(1) Funding support for the Study of Addiction: Genetics and Environment (SAGE) was provided through the NIH Genes, Environment and Health Initiative [GEI] (U01 HG004422). SAGE is one of the genome-wide association studies funded as part of the Gene Environment Association Studies (GENEVA) under GEI. Assistance with phenotype harmonization and genotype cleaning, as well as with general study coordination, was provided by the GENEVA Coordinating Center (U01 HG004446). Assistance with data cleaning was provided by the National Center for Biotechnology Information. Support for collection of datasets and samples was provided by the Collaborative Study on the Genetics of Alcoholism (COGA; U10 AA008401), the Collaborative Genetic Study of Nicotine Dependence (COGEND; P01 CA089392), and the Family Study of Cocaine Dependence (FSCD; R01 DA013423). Funding support for genotyping, which was performed at the Johns Hopkins University Center for Inherited Disease Research, was provided by the NIH GEI (U01HG004438), the National Institute on Alcohol Abuse and Alcoholism, the National Institute on Drug Abuse, and the NIH contract “High throughput genotyping for studying the genetic contributions to human disease” (HHSN268200782096C). The datasets used for the analyses described in this manuscript were obtained from dbGaP at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000092.v1.p1 through dbGaP accession number phs000092.v1.p1.
(2) The authors acknowledge the contribution of data from Genetic Architecture of Smoking and Smoking Cessation accessed through dbGaP. Funding support for genotyping, which was performed at the Center for Inherited Disease Research (CIDR), was provided by 1 X01 HG005274-01. CIDR is fully funded through a federal contract from the National Institutes of Health to The Johns Hopkins University, contract number HHSN268200782096C. Assistance with genotype cleaning, as well as with general study coordination, was provided by the Gene Environment Association Studies (GENEVA) Coordinating Center (U01 HG004446). Funding support for collection of datasets and samples was provided by the Collaborative Genetic Study of Nicotine Dependence (COGEND; P01 CA089392) and the University of Wisconsin Transdisciplinary Tobacco Use Research Center (P50 DA019706, P50 CA084724). The datasets used for the analyses described in this manuscript were obtained from dbGaP at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000404.v1.p1 through dbGaP accession number phs000404.v1.p1.
(3) A dataset used for the analyses described in this manuscript were obtained from the NEI Refractive Error Collaboration (NEIREC). Funding support for NEIREC was provided by the National Eye Institute. We would like to thank NEIREC participants and the NEIREC Research Group for their valuable contribution to this research. The datasets used for the analyses described in this manuscript were obtained from dbGaP at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000303.v1.p1 through dbGaP accession number phs000303.v1.p1.
(4) Funding support for the “CIDR Visceral Adiposity Study” was provided through the Division of Aging Biology and the Division of Geriatrics and Clinical Gerontology, NIA. The CIDR Visceral Adiposity Study includes a genome-wide association study funded as part of the Division of Aging Biology and the Division of Geriatrics and Clinical Gerontology, NIA. Assistance with phenotype harmonization and genotype cleaning, as well as with general study coordination, was provided by Heath ABC Study Investigators. The datasets used for the analyses described in this manuscript were obtained from dbGaP at https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000169.v1.p1 through dbGaP accession number phs000169.v1.p1.
This study was funded by The Centre for Applied Genomics, The Hospital for Sick Children, Genome Canada, Autism Speaks, the Canadian Institutes of Health Research (CIHR), the Canadian Institutes for Advanced Research (CIFAR), Ontario Brain Institute, the Government of Ontario, Canada Foundation for Innovation (CFI), and the University of Toronto McLaughlin Centre. S.W.S. holds the GlaxoSmithKline-CIHR Endowed Chair in Genome Sciences at The Hospital for Sick Children and the University of Toronto. The funding bodies played no role in the design of the study, the collection, analysis, and interpretation of data, or the writing of the manuscript.
Availability of data and materials
The dataset generated during the current study will be deposited in an open-access repository.
Ethics approval and consent to participate
This study was approved by our local institutional Research Ethics Board, and consent to participate was provided by all participants.
Consent for publication
The participants provided consent for publication of this manuscript.
S.W.S. is on the Scientific Advisory Committees of Deep Genomics and Population Bio, and Lineagen licenses intellectual property based on his discoveries from The Hospital for Sick Children. A.P., M.A.S., and H.T. are employees of Lineagen. The remaining authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.