Biological insights from multi-omic analysis of 31 genomic risk loci for adult hearing difficulty
Authors:
Gurmannat Kalra aff001; Beatrice Milon aff003; Alex M. Casella aff001; Brian R. Herb aff001; Elizabeth Humphries aff001; Yang Song aff001; Kevin P. Rose aff001; Ronna Hertzano aff001; Seth A. Ament aff001; Kevin P. Rose aff002
Authors place of work:
Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff001; Program in Molecular Medicine, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff002; Department of Otorhinolaryngology-Head & Neck Surgery, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff003; Physician Scientist Training Program, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff004; Program in Molecular Epidemiology, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff005; Department of Anatomy and Neurobiology, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff006; Department of Psychiatry, University of Maryland School of Medicine, Baltimore, MD, United States of America
aff007
Published in the journal:
Biological insights from multi-omic analysis of 31 genomic risk loci for adult hearing difficulty. PLoS Genet 16(9): e32767. doi:10.1371/journal.pgen.1009025
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pgen.1009025
Summary
Age-related hearing impairment (ARHI), one of the most common medical conditions, is strongly heritable, yet its genetic causes remain largely unknown. We conducted a meta-analysis of GWAS summary statistics from multiple hearing-related traits in the UK Biobank (n = up to 330,759) and identified 31 genome-wide significant risk loci for self-reported hearing difficulty (p < 5x10-8), of which eight have not been reported previously in the peer-reviewed literature. We investigated the regulatory and cell specific expression for these loci by generating mRNA-seq, ATAC-seq, and single-cell RNA-seq from cells in the mouse cochlea. Risk-associated genes were most strongly enriched for expression in cochlear epithelial cells, as well as for genes related to sensory perception and known Mendelian deafness genes, supporting their relevance to auditory function. Regions of the human genome homologous to open chromatin in epithelial cells from the mouse were strongly enriched for heritable risk for hearing difficulty, even after adjusting for baseline effects of evolutionary conservation and cell-type non-specific regulatory regions. Epigenomic and statistical fine-mapping most strongly supported 50 putative risk genes. Of these, 39 were expressed robustly in mouse cochlea and 16 were enriched specifically in sensory hair cells. These results reveal new risk loci and risk genes for hearing difficulty and suggest an important role for altered gene regulation in the cochlear sensory epithelium.
Keywords:
Chromatin – Genetic loci – Mammalian genomics – Genome-wide association studies – Medical risk factors – Cochlea – Single nucleotide polymorphisms
Introduction
Age-related hearing impairment (ARHI) is characterized by a decline of auditory function due to impairments in the cochlear transduction of sound signals and affects approximately 25% of those aged 65–74 and 50% aged 75 and older[1]. Causes of ARHI and related forms of adult-onset hearing difficulty include a complex interplay between cochlear aging, noise exposure, genetic predisposition, and other health co-morbidities. Anatomical and physiological evidence suggest that these forms of hearing difficulty arise most commonly from damage to cochlear sensory epithelial cells, particularly inner and outer hair cells. Some forms of hearing difficulty also arise from damage to non-epithelial cells in the cochlea, including spiral ganglion neurons and cells of the stria vascularis.
Twin and family studies suggest that 25–75% of risk for ARHI is due to heritable causes[2]. Mutations in >100 genes cause monogenic deafness or hearing loss disorders[3]. However, a substantial fraction of patients with ARHI do not have a mutation in any known deafness gene, suggesting that additional genetic causes remain to be discovered. Common genetic variation may contribute to these unexplained cases of hearing difficulty, but specific risk variants remain poorly characterized. Genome-wide association studies (GWAS) of hearing-related traits, including ARHI, tinnitus, and increased hearing thresholds, have identified in aggregate approximately 50 genome-wide significant risk loci[4–8], with the largest study of adult hearing difficulty reporting 44 risk loci[9]. Positional candidate genes at these risk loci include TRIOBP, a gene associated with prelingual nonsyndromic hearing loss[4]; ISG20, encoding a protein involved in interferon signaling[4]; PCDH20, a member of the cadherin family[5]; and SLC28A3, a nucleoside transporter[5]. In addition, several studies have reported suggestive associations near GRM7, encoding a metabotropic glutamate receptor[6,8].
The identification of risk loci for ARHI is merely the first step toward understanding the biological mechanisms by which variants at these loci influence hearing loss. The majority of GWAS risk loci that have been identified for ARHI contain no protein-coding SNPs, making it difficult to infer the causal genes. These findings are consistent with GWAS of other common traits, which have further demonstrated that very frequently the gene closest to the risk-associated SNPs is not causal[10–13]. A logical hypothesis is that the causal risk variants at many of these loci influence gene regulation. However, this hypothesis has not been tested in the context of ARHI, largely because the effects of gene regulatory variants are often cell type-specific, and there is very little information about the locations of enhancers and promoters in relevant cell types in the human cochlea.
Here, we generated mRNA-seq, ATAC-seq, and single-cell RNA-seq from cochlear epithelial and non-epithelial cells of neonatal mice, and we used these data to predict causal variants and genes and disease-relevant cell types at risk loci for ARHI. This analysis also utilized a new multi-trait analysis of publicly available GWAS summary statistics from hearing-related traits in the UK Biobank (n up to 330,759), which supported 31 risk loci for hearing difficulty, of which eight have not been described in peer-reviewed publications. Our results indicate that heritable risk for hearing difficulty is enriched in genes and putative enhancers expressed in sensory epithelial cells, as well as for common variants near Mendelian hearing loss genes. Statistical and epigenomic fine-mapping most strongly supported 50 putative risk genes at these loci, predicting both protein-coding and gene regulatory mechanisms for ARHI.
Results
Heritability of hearing-related traits in the UK Biobank
The UK Biobank is a population-based, prospective study with over 500,000 participants in Britain, aged 40–69 years when recruited in 2006–2010[14]. The study has collected genome-wide genotyping data as well as phenotypic data for thousands of traits, including multiple hearing-related traits. Recently, Wells et al.[9] reported GWAS of hearing difficulty and hearing aid use in this population. However, the UK Biobank resource also includes additional hearing-related traits, including tinnitus, and the molecular mechanisms remain poorly characterized. Here, we applied multi-trait meta-analysis and multi-omic fine-mapping to gain insight into the genetic architecture of biological mechanisms of hearing difficulty.
As a starting point for our analysis, we considered which of the hearing-related traits in the UK Biobank have significant heritability explained by the genotyped and imputed single-nucleotide polymorphisms (SNPs). We used publicly available summary statistics from GWAS in up to 337,000 UK Biobank participants performed by the Neale lab at Massachusetts General Hospital (http://www.nealelab.is/uk-biobank/). We examined heritability of 31 manually selected hearing-related traits, including 14 self-reported traits and 17 traits derived from International Statistical Classification of Diseases and Related Health Problems (ICD-10) codes (Fig 1A; S1 Table). Four traits, all self-reported answers to survey questions, had statistically significant heritability (h2) explained by genotyped and imputed single nucleotide polymorphisms (SNPs), based on linkage disequilibrium (LD) score regression (LDSC[15]), after correction for multiple testing (raw p-values < 2.1x10-5; alpha = 0.05). The most significantly heritable trait was a subject’s response to the question, “Do you find it difficult to follow a conversation if there is background noise (such as TV, radio, children playing)?” (N = 330,759, prevalence = 38%, SNP heritability [h2] = 0.086, h2 p-value = 7.4e-65; henceforth, background noise problems). The second most heritable trait was the response to the question, “Do you have any difficulty with your hearing?” (N = 323,978, prevalence = 26%, h2 = 0.076, h2 p-value = 1.2e-32; henceforth, hearing difficulty/problems). Third, the response to the question, “Do you use a hearing aid most of the time?” (prevalence = 5.1%, h2 = 0.093, h2 p-value = 3.4e-8; henceforth, Hearing aid user). Fourth, the response to the question, “Do you get or have you had noises (such as ringing or buzzing) in your head or in one or both ears that lasts for more than five minutes at a time?” (henceforth, tinnitus, where the most heritable response was “yes, now most or all of the time”; N = 109,411, prevalence = 6.6%, h2 = 0.137, h2 p-value = 1.3e-7). In their published analysis of the UK Biobank hearing traits, in a separate analysis using many of these same data, Wells et al.[9] previously reported GWAS for hearing aid use and for a trait that combines responses to the hearing difficulty and background noise questions and defines cases as those who responded “yes” to both questions. The GWAS of the UK Biobank tinnitus data has not been reported previously.
We downloaded the GWAS summary statistics for the four most heritable hearing-related traits and used LDSC to study the mean χ2 statistics, estimating the proportion of inflation due to polygenic heritability versus confounding. As expected, quantile-quantile plots indicate substantial deviation of χ2 statistics from a null distribution (S1 Fig). Background noise problems (Intercept = 1.031, Int. p = 1.3x10-3) and hearing difficulty (Intercept = 1.018; Int. p = 3.7x10-2) had significant LDSC intercept terms, suggesting some confounding, whereas the intercept terms for hearing aid use and tinnitus were not significant. Reassuringly, for all four traits, LDSC intercepts ascribe >90% of the inflation in the mean χ2 to polygenic heritability rather than to confounding. These results suggest that hearing-related traits in the UK Biobank are heritable and highly polygenic. Moreover, the significant heritability and low confounding provide empirical validation for the Neale lab’s analytical approach, despite certain limitations in curation and quality control that are inherent to large-scale analyses. Based on these results, we chose to use the GWAS summary statistics provided by the Neale lab for these four traits as a starting point for downstream analyses. A detailed description of these traits and analyses is provided in Methods as well as in accompanying web resources from the Neale lab and UK Biobank.
We note that the proportion of risk explained by genotyped and imputed SNPs was <10% for all of these traits, considerably less than the broad-sense heritability of ARHI based on twin and family studies. As with other complex traits, there are many potential sources for this missing heritability, including contributions from rare variants and gene x environment and gene x gene interactions. In this case, we must also consider the limitations of measuring a complex and clinically heterogeneous trait based on a small number of self-reported questions, as compared to a clinical diagnosis of ARHI based on precise measurements of auditory function.
Genetic correlations among hearing-related and non-hearing-related traits
Next, we assessed whether risk for hearing-related traits in the UK Biobank arises from shared or distinct genetic factors. Using LDSC, we found that all pairs of hearing-related traits were genetically correlated (all rg ≥ 0.37; all p-values < 7.1x10-8; S2 Table). Genetic correlations were strongest between the two most significantly heritable traits, hearing difficulty and background noise problems (rg = 0.81). These results suggest substantial shared genetic architecture among these hearing-related traits.
In addition, we assessed genetic correlations between the UK Biobank hearing-related traits and GWAS of 234 non-hearing traits, available via LD Hub[16]. As expected, genetic correlations among hearing traits were stronger than genetic correlations between hearing traits and non-hearing traits. In addition, we detected significant genetic correlations (False Discovery Rate < 5%) between hearing-related traits and 14 of the 234 non-hearing traits (Fig 1B, S2 Table). Eleven of the 14 genetically correlated traits are psychiatric and personality traits, including positive genetic correlations of hearing difficulty with major depressive disorder and insomnia and a negative genetic correlation of hearing difficulty with subjective well-being. These positive genetic associations of hearing difficulty with depression-related traits is consistent with the recent report by Wells et al.[9] In addition, we detected positive genetic correlations between hearing difficulty and two anthropomorphic traits: obesity and waist circumference. Finally, we detected a negative genetic correlation between hearing difficulty and the age at first childbirth, a proxy for educational attainment and cognition. Interestingly, the genetic correlations with psychiatric and personality traits, were stronger for hearing difficulty than for hearing aid use, which could reflect differences in the underlying genetic architecture of these hearing-related traits or merely the stronger heritability of the hearing difficulty phenotype. Genetic correlations typically arise from diverse direct and indirect relationships, yet, remarkably, many of these correlations reflect known comorbidities and risk factors for hearing loss[17,18].
Genomic risk loci and replication in independent cohorts
Leveraging the shared heritability among the four selected hearing-related traits, we performed a multi-trait analysis with MTAG[19] (Multi-Trait Analysis of GWAS). MTAG uses GWAS summary statistics from multiple traits and can boost statistical power when the traits are genetically correlated. MTAG uses bivariate LDSC to account for sample overlap between traits, making it suitable for an analysis of multiple traits measured in overlapping subjects, as in the UK Biobank. The original GWAS summary statistics for hearing difficulty included genome-wide significant associations (p < 5x10-8) of hearing difficulty with 779 SNPs at 22 approximately LD-independent genomic loci (Fig 1C). Following joint analysis with MTAG, we identified genome-wide significant associations of hearing difficulty with 988 genotyped and imputed SNPs, located at 31 approximately LD-independent genomic loci (S2 Fig). In addition, MTAG analysis revealed 20 genome-wide significant loci for background noise problems, 25 for hearing aid use, and 20 for tinnitus (S4–S6 Tables). Most of these loci overlap the 31 loci for hearing difficulty. The p-values of the lead SNPs at most loci are strengthened in the joint analysis as compared to the individual analysis (S7 Table). In our subsequent analyses, we utilized MTAG summary statistics for hearing difficulty.
Next, we sought to replicate these findings in independent cohorts. The small sizes of available hearing-related cohorts outside UK Biobank[4–8] make them underpowered for a standard replication analysis of individual risk loci. This issue is not unique to hearing-related traits, and it has become common to report findings from biobank-scale GWAS without standard replication[20,21]. Nonetheless, these earlier cohorts provide valuable information, especially those that utilized more precise measures of hearing thresholds, which are likely to more accurately reflect hearing function than the self-reported traits in the UK Biobank. We took two approaches to replication.
First, we compared the 31 risk loci described above to the previous analysis of the UK Biobank data by Wells et al.[9]. 23 of the 31 risk loci in our analysis overlapped loci from the Wells et al. analysis. The remaining eight loci are not in LD with risk loci from Wells et al., nor with previously reported risk locus from independent cohorts. These novel risk loci are described in S3 Table, and the lead SNPs at these loci are rs2941580 (chr 2:54862003, p = 1.53 x 10−8), rs3915060 (chr 3:121712980, p = 5.84 x 10−9), rs117583072 (chr 10: 73418873, p = 1.54 x 10−8), rs61890355 (chr 11: 51422105, p = 2.39 x 10–8), rs118176061 (chr 11:54830428, p = 3.96 x 10–8), rs78417468 (chr 16:55492795, p = 1.33 x 10–8), rs118174674 (chr 18:44137400, p = 2.76 x 10−8), rs11881070 (chr 19:2389140, p = 3.85 x 10−9). Thus, meta-analysis across hearing-related traits revealed several loci that were not detected by GWAS of a more narrowly defined hearing trait.
Second, we analyzed 59 SNPs reported at genome-wide or suggestive significance levels in earlier GWAS of hearing-related traits[4–8,22] to determine whether these associations are replicated in the UK Biobank sample. Eight of these 59 SNPs showed nominally significant associations with hearing difficulty in the UK Biobank (p < 0.05; S8 Table), including both loci that reached genome-wide significance in the largest previous GWAS of ARHI[4]: rs4932196, 54 kb 3' of ISG20 (p = 2.6x10-5 in the UK Biobank); and rs5750477, in an intron of TRIOBP (p = 1.3x10-6). We note that other SNPs at the TRIOBP locus reached genome-wide significance in the UK Biobank. Also replicated in our analysis (notably, at genome-wide significance in the UK Biobank) were two SNPs previously reported at a suggestive significance level, in or near genes that cause Mendelian forms of hearing loss: rs9493627, a missense SNP in EYA4[4] (p = 7.7x10-10); rs2877561, a synonymous variant in ILDR1[4] (p = 1.1x10-8). In addition, we found a nominal level of support for rs11928865, in an intron of GRM7, previously reported at a suggestive significance level in multiple cohorts with ARHI[6,8] (p = 2.2x10-2). Therefore, hearing difficulty risk variants in the UK Biobank overlap and expand upon previously discovered risk variants.
Third, we tested whether the combined, polygenic effects of hearing difficulty-associated SNPs from the UK Biobank predicted hearing ability in an independent cohort. We obtained genotypes and phenotypes from a cohort of 1,472 Belgian individuals[22] whose hearing ability was assessed with binaural thresholds for detection of low-, medium-, and high-frequency sounds. These thresholds were then summarized by principal component analysis, with PC1 giving an overall measure of hearing capabilities across all frequencies (S9 Table). A polygenic risk score[23] derived from the UK Biobank MTAG hearing difficulty summary statistics explained 1.3% of variance in PC1 from the Belgian sample (p = 7.4x10-5; S3 Fig). The best prediction was obtained using all SNPs (GWAS p-value threshold = 1), and PRS scores using a wide range of GWAS p-value thresholds predicted a significant proportion of the variance. These results demonstrate shared genetic architecture for self-reported hearing difficulty and a more quantitative measure of hearing ability in an independent cohort.
Heritability for hearing difficulty is enriched near Mendelian deafness genes and genes expressed in cochlear cell types
Next, we sought biological insights into hearing difficulty through gene set enrichment analyses. We performed gene-based analyses of the MTAG summary statistics using MAGMA[24] and identified 104 genes reaching a genome-wide significance threshold, p < 2.5x10-6, correcting for 20,000 tests (S10 Table). Of these 104 genes, 40 overlap with the 31 risk loci, while the remaining genes are located at additional loci where no individual SNP reached genome-wide significance. We performed a series of hypothesis-based and exploratory gene set enrichment analyses.
It has been proposed that age-related hearing loss involves low penetrance variants in genes that are also associated with monogenic deafness disorders[4,25]. To test this hypothesis, we studied common-variant associations near 110 Mendelian deafness genes from the Online Mendelian Inheritance in Man (OMIM) database. These Mendelian deafness genes were enriched at hearing difficulty risk loci (p = 1.19x10-6; S11 Table). We detected gene-based associations with hearing difficulty at a nominal level of significance (p-values < 0.01) for 15 of these 110 genes, with the strongest associations at TRIOBP (MAGMA: p = 1.2x10-10), ILDR1 (p = 2.5x10-8), and MYO7A (p = 8.5x10-5). These findings support the hypothesis that Mendelian hearing loss genes contribute to age-related hearing difficulty, but also suggest that many risk loci for hearing difficulty involve genes that have not previously been implicated in hearing loss.
A more general hypothesis is that hearing difficulty risk is enriched in genes expressed in the cochlea. We generated mRNA-seq from FACS-sorted cochlear epithelial cells, cochlear mesenchymal cells, cochlear neurons, and cochlear vascular endothelial cells from mice at postnatal day 2. We calculated the median expression of each gene in each of these cell types, as well as in subtypes of sensory epithelial hair cells and supporting cells derived from published RNA-seq[26–28]. For comparison, we considered the expression of each gene in 5,674 cell types from single-cell RNA-seq experiments of diverse mammalian tissues (S12 Table), as well as 53 extracochlear human tissues and cell types from the Genotype-Tissue Expression consortium (GTEx)[29]. Using MAGMA gene property analysis, we tested for associations of tissue-specific expression levels with genetic risk for hearing difficulty. Risk for hearing difficulty was enriched in genes expressed in cochlear epithelial cells (mostly supporting cells and hair cells; p = 5.8e-6), as well as in a pure population of cochlear hair cells (p = 1.4e-5; Fig 2; S13 and S14 Tables). Notably, enrichments of hearing difficulty risk near genes expressed in these cochlear cell types were far stronger than enrichments for brain-expressed genes, which had been reported in a previous analysis that did not include cochlear cell types9, and for any other non-cochlear cell type (S14 Table). Therefore, our results suggest that many of the risk loci are explained by genes that are expressed in the cochlear sensory epithelium.
To identify additional functional categories enriched for hearing difficulty risk, we performed an exploratory analysis of 5,917 gene sets from Gene Ontology (GO). This analysis revealed a single significant GO term after correction for multiple testing: sensory perception of mechanical stimulus (150 genes in this set; p = 8.62x10-9) (S15 Table). This finding is consistent with results reported by Wells et. al9. Taken together, these results support the relevance of hearing difficulty risk loci to the auditory system, including many genes that have not previously been associated with hearing loss.
Heritable risk for hearing difficulty is enriched in open chromatin regions from cochlear epithelial cells
Many studies have demonstrated that GWAS associations are enriched in gene regulatory regions such as enhancers and promoters, especially in regulatory elements that are active in disease-relevant tissues and cell types[30,31]. Consequently, we hypothesized that SNPs influencing risk for hearing difficulty are enriched in gene regulatory regions active in the cochlea. To identify gene regulatory regions in the cochlea, we FACS-sorted epithelial cells (CD326+; including hair cells and supporting cells) and non-epithelial cells (CD326-; predominantly mesenchymal cells) from mouse cochlea at postnatal day 2 (Fig 3A), and performed ATAC-seq, on biological duplicates, to identify open chromatin regions in each cell type. Four considerations justify the use of mouse cochlea (rather than human) for this experiment: (i) mice have previously been used to successfully identify new deafness related genes and regulators of inner ear development[32–34]; (ii) at least 25% of non-coding gene regulatory elements are evolutionarily conserved between mouse and human[35]; (iii) unlike other tissues, human cochleae are not readily available for biopsies and are only rarely removed surgically; and (iv) the human cochlea (even in aborted fetuses) is already sufficiently mature to make tissue dissociation to single cells very challenging[36]. We identified 228,781 open chromatin regions in epithelial cells and 433,516 in non-epithelial cells, of which 113,733 regions were unique to epithelial cells (2.83% of the mouse genome), 320,871 unique to non-epithelial cells (4.47% of the mouse genome), and 120,919 overlapping (Fig 3B; S16 and S17 Tables). We validated these open chromatin regions through comparison to 15 experimentally validated enhancers from the VISTA Enhancer Database with activity in the ear[37] and found that ATAC-sensitive regions from both epithelial and non-epithelial cells overlapped significantly with known enhancers (epithelial cells: 3.1-fold enriched, p < 1.0x10-4; non-epithelial cells: 2.9-fold enriched, p < 1.0x10-4 based on 10,000 permutations). Examination of known cell type-specific genes suggested that chromatin accessibility in epithelial versus non-epithelial cells was correlated with cell type-specific gene expression (Fig 3C–3E). For instance, we detected open chromatin specific to epithelial cells near Epcam and Sox2, which are expressed specifically in cochlear epithelial cells[34,38]; and open chromatin specific to non-epithelial cells around Pou3f4, a marker for non-epithelial cells[39].
Next, we asked whether these putative regulatory regions in the cochlea are enriched for SNPs associated with hearing difficulty. Using the UCSC LiftOver tool[40], we mapped ATAC-sensitive regions from each cochlear tissue type to the human genome to identify homologous genomic regions. 55.5% of the mouse epithelial regions and 50.2% of non-epithelial regions mapped to the human genome. To validate that these lifted-over regions are still capturing regulatory regions, we tested their overlap with ChromHMM-derived enhancers and promoters in 111 extracochlear tissues and cell types from the ROADMAP Epigenome Mapping Consortium[41]. Lifted-over regions derived from both epithelial and non-epithelial cells were substantially enriched in human promoter and enhancer regions (>10-fold and ~5-fold, respectively) but depleted in quiescent regions, heterochromatin, and gene bodies (S4 Fig, S18 and S19 Tables).
We tested for enrichment of hearing difficulty risk in these conserved regions homologous to cochlear gene regulatory regions using stratified LD score regression[42]. This model tests for heritability in cochlea-specific regions after accounting for a baseline model consisting of 24 non-cell type-specific genomic annotations, including evolutionarily conserved regions and regions that are open chromatin across many tissues. Heritable risk for hearing difficulty was enriched 9-fold in epithelial open chromatin regions (Fig 3F). 2.1% of all SNPs are in the annotated regions, and these SNPs capture 19.5% of the total SNP heritability (p = 5.2x10-8). Heritability was less strongly–though still significantly—enriched in open chromatin regions from non-epithelial cells (4.6-fold enriched; 3.0% of all SNPs are in the annotated regions, and the SNPs capture 14.2% of the total heritability; p = 0.001). For comparison, we performed similar analyses using open chromatin regions from 147 DNase-seq experiments in 42 mouse tissues and cell types, generated by the ENCODE project[35] (S20 Table). The significance of the heritability enrichment in cochlear epithelial cells was greater than for any of the non-cochlear tissues. These results suggest that heritable risk for hearing difficulty is enriched specifically in evolutionarily conserved gene regulatory regions active in cochlear epithelial cells.
To confirm the robustness of these results, we performed comparable analyses using the original UK Biobank GWAS summary statistics for hearing difficulty as generated by the Neale lab (prior to MTAG meta-analysis); the hearing difficulty summary statistics from Wells et al.[9]; and the MTAG summary statistics for hearing aid use. In all of these analyses, we confirmed strong enrichments of heritable risk in the human genomic regions homologous to epithelial open chromatin regions, as well as more modest enrichments in the human genomic regions homologous to non-epithelial open chromatin regions (S20 Table).
Statistical and epigenomic fine-mapping supports functional consequences to 50 genes at hearing difficulty risk loci
Next, we sought to predict causal variants and target genes at each of the 31 hearing difficulty risk loci, considering both protein-coding and putative gene regulatory consequences of each variant. To begin this analysis, we identified 613 SNPs and short indels that are in strong LD–r2 > 0.9 in European samples from the 1000 Genomes project Phase 3 dataset[43]–with a genome-wide significant lead SNP at one of the 31 risk loci (S21 Table). 534 of these SNPs were included in the GWAS analysis, all of which had p-values < 2.6e-6. The remaining 79 SNPs are correlated tag SNPs for which associations were not tested directly.
We scanned these 613 risk variants for non-synonymous and stopgain SNPs, frameshift and non-frameshift indels, and effects on splice donor and acceptor sites, focusing on those variants that are predicted to be deleterious with a CADD Phred score > 10. We identified nine such protein-coding variants, including missense SNPs in CLRN2, CRIP3, EYA4, CHMP4C, TYR, TRIOBP (2x), BAIAP2L2, and KLHDC7B (Table 1). Notably, six of these nine SNPs are LD-independent lead SNPs at their respective loci, increasing the statistical likelihood that these variants are causal for hearing difficulty risk. The missense SNPs in TRIOBP and BAIAP2L2 are annotated to the same risk locus at 22q13.1 (S6 Fig). The two TRIOBP variants are in strong LD (r2 = 0.97), so their effects may be additive or synergistic. By contrast, the BAIAP2L2 variant is not in LD with either of the TRIOBP variants, suggesting an independent effect.
Causal variants on risk haplotypes that do not contain protein-coding variants may alter gene regulation. To elucidate these gene regulatory consequences, we annotated the 613 risk-associated SNPs in the context of the local two-dimensional and three-dimensional chromatin architecture. For the former, we utilized our ATAC-seq data from cochlear cells. For the latter, we used publicly available Hi-C data from 20 non-cochlear tissues and cell types[44]. Since data for chromatin architecture in the cochlea is unavailable, we considered all cell types in aggregate and set a stringent chromatin interaction significance threshold (p-value < 1x10-25) to focus on the strongest chromatin loops. 126 of the 613 SNPs had potential gene regulatory functions in cochlea, based on homology to open chromatin in cochlear epithelial and non-epithelial cells from neonatal mice (S18 Table). 57 of these 126 SNPs were located proximal (<10 kb) to the transcription start sites of 17 potential target genes. In addition, 100 of the 126 SNPs could be assigned to 72 distal target genes based on long-distance chromatin loops that connect the regions containing risk-associated SNPs to these genes’ transcription start sites located up to 3 Mb away (S22 Table).
We integrated the coding and non-coding functional annotations to prioritize the most likely causal genes at each locus. The union of functional annotations supported 84 genes (S23 Table). We prioritized 50 of these genes, as follows: (i) if one or more genes at a locus contained risk-associated protein-coding variants, we selected those genes; (ii) if no coding variants were identified at a locus, we selected the proximal target gene(s) of non-coding SNPs with predicted regulatory functions; (iii) if no proximal genes were identified, we considered distal target genes. This analysis identified putative risk genes at 19 of the 31 risk loci, including 10 loci at which a single gene appears most likely to be causal (S24 Table).
We sought independent support for roles of these genes in hearing loss or cochlear function based on prior evidence from genetic studies in humans and mice. Rare mutations in five of the 50 genes have been shown previously to cause Mendelian forms of deafness or hearing loss: TRIOBP, EYA4, FTO, SOX2, and LMX1A[4,45–53]. Genetic studies in mice have demonstrated hearing loss or cochlear development phenotypes for an additional seven of the 50 genes: SYNJ2, TYR, PTGDR, MMP2, RPGRIP1L, RBL2, and BAIAPL2[54–60]. Notably, three of the five genes with independent support from human rare variants—FTO, SOX2 (Fig 4A), and LMX1A (Fig 4B)—are located distal to risk-associated SNPs and were predicted as target genes based on long-distance chromatin interactions, validating this approach for predicting causal mechanisms. We note that there may be additional causal genes for which the functional variants are missed by our analysis. For instance, at the 3q13.3 risk locus our approach excludes a strong positional candidate, ILDR1, in which loss-of-function variants cause a recessive hearing loss disorder[47], since none of the risk variants at this locus were predicted to alter ILDR1 function.
Hearing difficulty risk genes are expressed in diverse cochlear cell types
To better understand the potential functions of the 50 putative risk genes in the cochlea, we investigated their expression patterns in cochlear cell types. We sequenced the transcriptomes of 3,411 single-cells from the mouse cochlea (postnatal day 2) using 10x Genomics Chromium technology. Cells were sequenced to a mean depth of 107,590 reads, which mapped to a median of 1,986 genes per cell. After quality control (Methods), we analyzed data from 3,314 cells. Louvain modularity clustering implemented with Seurat[61] revealed 12 major clusters of cells (Fig 5A, S5 Fig). Based on the expression of known marker genes (S25 Table), we assigned these cell clusters to the following cell types: three clusters of epithelial cells (Epcam+; n = 419, 101, and 24 cells per cluster), three clusters of mesenchymal cells (Pou3f4+; n = 887, 701, and 76 cells per cluster, of which the smallest cluster are 2810417H13Rik+ cells undergoing cell division), 324 glial cells (Mbp+), 391 medial interdental cells (Otoa+), 59 Oc90+ cells, 79 vascular cells (Cd34+), 161 sensory epithelium supporting cells (Sox2+), and 91 sensory hair cells (Pou4f3+).
We tested cell type specific expression for each hearing difficulty risk gene. 39 of the 50 risk genes were expressed highly enough in these cochlear cells to be included in this analysis. By far the largest number of genes, 14 out of 39, were expressed selectively in sensory hair cells (Fig 5F; S26 Table). These hair cell-specific risk genes included known hearing loss genes such as Triobp (p = 9.5x10-42) and Eya4 (p = 2.3x10-21), as well as genes that have not previously been implicated in human hearing loss; e.g., Baiap2l2 (p = 2.6x10-192; Fig 5E), Arhgef28 (p = 9.8x10-31; Fig 5C), Gnao1 (p = 4.1x10-29), Rpgrip1l (p = 6.3x10-24), and Crip3 (p = 9.1x10-21). We also found risk genes that were expressed selectively in other cell types, including sensory epithelium supporting cells (Sox2, p = 9.5x10-151; Fig 5D), other cochlear epithelia Oc90 cells (e.g., Lmx1a, p = 7.4x10-131; Fig 5B), medial interdental cells (Lpcat2, p = 1.8x10-139), and mesenchymal cells (Mmp2, p = 6.0x10-120).
We sought to corroborate the expression pattern of risk genes in hair cells using published expression profiles from hair cells isolated by three other methods: (i) transcriptome profiling of FACS-purified hair cells versus surrounding cells[62]; (ii) translatome profiling of RiboTag-purified hair cells versus surrounding cells[63]; and (iii) single-cell RNA-seq of sensory epithelial cells[64]. Meta-analysis of these three datasets confirmed selective expression in hair cells (FDR < 0.1) for 12 of the 14 genes above (S27 Table). This analysis also revealed low but highly specific expression in hair cells for two other risk genes, Clrn2 and Klhdc7b. Thus, in total, we find that 16 of the 50 putative hearing difficulty risk genes identified by GWAS are expressed selectively in sensory hair cells.
Discussion
Here, we have described a well-powered GWAS of hearing difficulty, leveraging data from >300,000 participants in the UK Biobank and high-throughput association analyses from the Neale lab. We interpreted these genetic associations in the context of multi-omic data from the mouse cochlea. We identified 31 risk loci for hearing difficulty, of which eight have not been reported previously in the peer-reviewed literature. Heritable risk for hearing difficulty was enriched in genes and gene regulatory regions expressed in cochlear epithelial cells, as well as for common variants near Mendelian hearing loss genes. We identified 50 putative risk genes at these loci, many of which were expressed selectively in sensory hair cells and other disease-relevant cell types.
Genetic correlations, meta-analyses, and heritability enrichment analyses all suggest that many loci have shared effects on self-reported hearing difficulty, background noise problems, tinnitus, and hearing aid use. This result suggests that each of the survey questions captures a similar underlying trait related to ARHI, and our MTAG meta-analysis is designed to emphasize these shared effects. However, the traits are not identical: the genetic correlations among the four traits are considerably below 1, and some distinctions were apparent in polygenic analyses. In particular, since many individuals with hearing loss do not use hearing aids, genetic factors contributing to the use of hearing aids may influence both hearing difficulty and behavioral/motivational states. In our analysis, we found a positive genetic correlation between increased risk for hearing difficulty and increased risk for major depression disorder (rg = 0.20, p = 6.0e-4), there was no significant genetic correlation between risk for hearing aid use and risk for depression (in fact, the trend is toward a negative genetic correlation; rg = -0.10, p > 0.05). ARHI is a well-known risk factor for depression in older adults. It is possible that the subset of hearing-impaired individuals who use hearing aids are genetically predisposed toward resilience or that hearing aid use ameliorates these adverse impacts. This interpretation is speculative, but the potential to detect effects of this kind motivates the continued development of large genetic cohorts for ARHI, including deeper phenotyping and questions into behavioral choices, comorbid conditions, and quality of life.
A prominent hypothesis has been that age-related hearing impairment involves lower-penetrance genetic variation in genes that cause Mendelian forms of hearing loss[4,25]. In support of this view, we found that heritability for hearing difficulty was enriched near Mendelian hearing loss genes, including genome-wide significant risk loci that overlapped three Mendelian hearing loss genes, TRIOBP, EYA4, and ILDR1. However, these signals represent a small fraction of the heritable risk. Indeed, our results better support a highly polygenic genetic architecture for hearing difficulty, spanning many genes that had not been known to influence hearing. Evidence for polygenicity includes the inflation of χ2 statistics (i.e., low p-values) across many thousands of SNPs and the enrichment of heritability across thousands of genes and putative gene regulatory regions expressed in the cochlear sensory epithelium. It is likely that the 31 risk loci identified here represent merely the tip of a larger genetic iceberg. Thus, as with other common traits[65], it is likely that additional risk loci and risk genes will be discovered as sample sizes for GWAS continue to grow larger.
Our results suggest that genetic risk factors for hearing difficulty act most frequently–but not exclusively—through mechanisms within sensory hair cells. The primacy of hair cells is supported by heritability enrichments for genes expressed in hair cells and for regions of open chromatin in the cochlear sensory epithelium, as well as by the fact that 16 of the 50 fine-mapped risk genes were expressed selectively in hair cells. These results strongly support the relevance of our findings to auditory function, since damage to hair cells is the most common pathophysiology in AHRI. Of the 16 hair cell-specific risk genes identified in our analysis, loss-of-function mutations in two are known to cause Mendelian hearing loss: TRIOBP[4,52,53] and EYA4[49,50]. An additional five genes have not previously been associated with human hearing loss but are known to cause hearing loss or cochlear development when mutated in mice: SYNJ2[54], RPGRIP1L[58], BAIAP2L2[60], TUB[66], and RBL2[59]. To our knowledge, this is the first report of a hearing loss phenotype for the remaining seven hair cell-specific risk genes: ANKRA2, ARHGEF28, CRIP3, CCDC68, EXOC6, GNAO1, IQCB1, and KLHDC7B. Hearing difficulty risk haplotypes contained protein-coding variants in 6 of these genes, while the others were supported by non-coding variants with predicted gene regulatory functions. These genes have diverse biological functions, ranging from transcriptional regulation to intracellular signaling to metabolic enzymes to structural components of synapses and stereocilia. Taken together, these results suggest that functional variants impacting a wide range of hair cell-specific genes contribute to risk for hearing loss.
Other risk genes suggest plausible mechanisms for hearing loss involving components of the stria vascularis. The lead SNP at a risk locus on chr11q14.3 is a deleterious missense SNP in TYR. TYR encodes tyrosinase, which catalyzes the production of melanin. In the cochlea, TYR is expressed specifically in melanin-producing intermediate cells of the stria vascularis, and TYR mutant mice have strial albinism, accompanied by age-associated marginal cell loss and endocochlear potential decline[55]. Chromosomal contacts at chr16q12.2 suggest that MMP2 is targeted by two distinct risk loci. MMP2 encodes matrix metalloproteinase-2, which serves an essential role in the cochlear response to acoustic trauma by regulating the functional integrity of the blood-labyrinth barrier[57]. Risk-associated variants in these genes may contribute to strial atrophy, a common non-sensory cause of age-related hearing impairment[67].
Perhaps more surprisingly, several of the risk genes identified in our analysis are best known for their functions in cochlear development. These include two well-characterized transcription factors: SOX2 and LMX1A. Loss-of-function mutations in each of these genes cause deformations of the cochlea and hearing loss in humans[46,48], while our analyses of adult hearing difficulty revealed non-coding genetic variation in putative distal enhancers. SOX2 is required for the formation of pro-sensory domains that give rise to hair cells and supporting cells[68]. LMX1A maintains proper neurogenic, sensory, and non-sensory domains in the mammalian inner ear, in part by restricting and sharpening SOX2 expression[69]. In addition, two genes supported by putative gene regulatory interactions at the chr10q23.3 risk locus, CYP26A1 and CYP26C1, metabolize retinoic acid and are involved in the specification of the otic anterior-posterior axis[70]. Changes in cochlear development may cause vulnerabilities to hearing loss later in life. Alternatively, there may be as yet undescribed roles for these genes in the adult cochlea contemporaneous with hearing loss.
Several limitations of this study should be noted. Our analysis is based on self-reported hearing difficulty, which is likely less accurate than a clinical diagnosis of ARHI or a quantitative hearing assessment. Many of the loci have not yet been replicated, which will require very large independent cohorts. The 50 genes prioritized by epigenomic fine-mapping remain data-driven hypotheses, which will need to be functionally validated in the future. Further resolving the gene regulatory interactions at these risk loci will also benefit from additional epigenomic data, including single-cell epigenomics to better resolve cell type-specificity and profiling of long-distance, ‘three-dimensional” chromatin interactions through techniques such as Hi-C.
In summary, we report the first systems genetics study of common genetic variation underlying risk for adult hearing difficulty. Our multi-trait and multi-omic analyses provide novel insights into genetic architecture and molecular mechanisms, prioritizing 50 putative risk genes. Functional studies of these risk genes are warranted, especially for several hair cell-specific genes that had not previously been implicated in hearing loss. In addition, our findings support a polygenic genetic architecture for hearing difficulty, suggesting that more risk genes will be discovered as genetic data become available from additional biobank-scale cohorts.
Methods
Ethics statement
All procedures involving animals were carried out in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals and were approved by the Animal Care Committee at the University of Maryland (protocol numbers 0915006 and 1015003).
Cell sorting by FACS followed by mRNA-seq and ATAC-seq
CD-1 timed-pregnant females were purchased from Charles River (Maryland). At postnatal day 2, the mice were euthanized and their temporal bone removed. Cochlear ducts from 20 mice were harvested, pooled and processed for Fluorescence-activated cell sorting (FACS) as described[34], for ATAC-seq. To generate cell population for mRNA-seq, single-cell suspensions were obtained from inner ears of postnatal day 0 newborn ICR mice and incubated with anti-CD326, anti-CD49f, and anti-CD34 antibodies to detect epithelial, neuronal, mesenchymal, and vascular endothelial cells. For ATAC-seq, a simplified protocol was utilized to distinguish epithelial from non-epithelial cells (primarily mesenchyme) based on labeling with anti-CD326. Cells were sorted by FACS using a BD FACS Aria II Cell Sorter (BD Biosciences) at the Flow Cytometry Facility, Center for Innovative Biomedical Resources (University of Maryland School of Medicine) (Fig 3A).
For mRNA-seq, libraries derived from total RNA from sorted cells were sequenced on an Illumina sequencer at the Institute for Genome Sciences (IGS) of the University of Maryland, School of Medicine. For ATAC-seq, fifty thousand cells and one hundred thousand cells from each sample were further processed as described [71] with the following modification: following the transposition reaction and purification step, a right side size selection (ratio 0.6) using SPRIselect (Beckman-Coulter, Indiana) was added before proceeding to the PCR amplification. This extra step resulted in the selection of DNA fragments between 150 bp to 700 bp. The following primers from [71] were used for library preparations:
Ad1_noMX 5’-AATGATACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTCAGATGTG-3’; Ad2.1_TAAGGCGA 5’-CAAGCAGAAGACGGCATACGAGATTCGCCTTAGTCTCGTGGGCTCGGAGATGT-3’; Ad2.2_CGTACTAG 5’-CAAGCAGAAGACGGCATACGAGATCTAGTACGGTCTCGTGGGCTCGGAGATGT-3’; Ad2.3_AGGCAGAA 5’-CAAGCAGAAGACGGCATACGAGATTTCTGCCTGTCTCGTGGGCTCGGAGATGT-3’; Ad2.4_TCCTGAGC 5’-CAAGCAGAAGACGGCATACGAGATGCTCAGGAGTCTCGTGGGCTCGGAGATGT-3’.After completion of the libraries, whole genome sequencing, paired-end and a depth of 66 million reads, was performed on an Illumina HiSeq 4000 at IGS.
Single-cell RNA sequencing of mouse cochlea
At postnatal day 2, 3 pups from a CD-1 timed-pregnant female were euthanized and their temporal bone removed. Cochlear ducts were harvested and pooled into Thermolysin (Sigma-Aldrich) for 20 min at 37°C. The Thermolysin was then replaced with Accutase (Sigma-Aldrich) and the tissue incubated for 3 min at 37°C followed by mechanical dissociation, repeating this step 3 times. After inactivation of the Accutase with 5% fetal bovine serum, the cell suspension was filter through a 35μm nylon mesh to remove cell clumps. The cell suspension was then processed for single-cell RNAseq.
Droplet-based molecular barcoding and single-cell sequencing were performed at the Institute for Genome Sciences (IGS) of the University of Maryland, School of Medicine. Approximately 10,000 dissociated cochlear cells were loaded into a Chromium Controller (10x Genomics) for droplet-based molecular barcoding of RNA from single cells. A sequencing library was produced using the 10x Single Cell Gene Expression Solution. Libraries from two cochlear samples were sequenced across three lanes of an Illumina HiSeq4000 sequencer to produce paired-end 75 bp reads.
Description of traits and genome-wide association studies
We started by manually identifying hearing-related traits (self-reported or ICD-10 codes) in the UK Biobank Data Showcase (https://www.ukbiobank.ac.uk/data-showcase/) for which GWAS summary statistics were available in the September 20, 2017, GWAS results data release from the Neale lab (http://www.nealelab.is/blog/2017/7/19/rapid-gwas-of-thousands-of-phenotypes-for-337000-samples-in-the-uk-biobank). We used publicly available LDSC heritability estimates for these traits produced by the Neale lab to select a smaller number for analysis. Details of the GWAS and heritability analyses are described on the Neale lab website (http://www.nealelab.is/blog/2017/9/11/details-and-considerations-of-the-uk-biobank-gwas). We identified 31 traits, shown in S1 Table. Four of these traits had significant heritability and were selected for further analysis. All of these were self-reported traits collected via ACE touchscreen questions, as follows:
Hearing difficulty/problems (2247; http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=2247). Participants were asked, “Do you have any difficulty with your hearing?” Possible answers were “yes”, “no”, “do not know”, “prefer not to answer”, and “I am completely deaf”. The Neale lab GWAS was performed using N = 323,978 unrelated individuals of European ancestry and compared individuals who answered yes (N = 84,839) to all other individuals (N = 239,139). We note that this analysis appears to misclassify N = 78 individuals who indicated they were completely deaf, who might better have been excluded from the analysis or treated as cases. Also, in this and other questions, it is debatable whether one should include those who answered “do not know” or “prefer not to answer.” The Neale lab did not indicate how many participants included in their analysis gave these answers, but they represent 4.7% of all UK Biobank participants shown in the Data Showcase (23,333 / 498,706). Despite these potential concerns, the LDSC heritability analysis of the Neale lab GWAS indicated strong heritability (https://nealelab.github.io/UKBB_ldsc/h2_summary_2247_1.html). As the individual-level data were not available to us, we decided to use the GWAS results as provided by the Neale lab.
Hearing difficulty / problems with background noise (2257; http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=2257). Participants were asked, “Do you find it difficult to follow a conversation if there is background noise (such as TV, radio, children playing)?” Possible answers were “yes”, “no”, “do not know”, or “prefer not to answer”. Participants who indicated they were previously deaf in trait 2247 were not asked this question or other subsequent hearing-related questions. The Neale lab analysis included N = 330,759 individuals and compared individuals who answered “yes” (N = 125,089) to all other individuals (N = 205,670; https://nealelab.github.io/UKBB_ldsc/h2_summary_2257.html).
Hearing aid user (3393; http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=3393). Participants were asked, “Do you use a hearing aid most of the time?” Possible answers were “yes”,”no”, or “prefer not to answer”. The Neale lab analysis included N = 204,240 individuals, with 10,322 answering “yes”, compared to 193,918 who answered”no” or “prefer not to answer” (https://nealelab.github.io/UKBB_ldsc/h2_summary_3393.html).
Tinnitus, most or all of the time (4803; http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=4803). Participants were asked, “Do you get or have you had noises (such as ringing or buzzing) in your head or in one or both ears that lasts for more than five minutes at a time?” Possible answers included “Yes, now most or all of the time”, “Yes, now a lot of the time”, “Yes, now some of the time”, “Yes, but not now, but have in the past”, “No, never”, “Do not know”, or “Prefer not to answer.” The Neale lab performed GWAS of each of the “yes” categories, separately, in N = 109,411 individuals. The GWAS with the strongest heritability compared those indicating they experienced tinnitus most or all of the time (N = 7,214) to those who gave any other response (N = 102,197; https://nealelab.github.io/UKBB_ldsc/h2_summary_4803_11.html).
We downloaded the summary statistics from these four GWAS from the Neale lab website and used them as the starting point for subsequent analyses.
Genetic correlations of hearing-related and non-hearing-related traits
Genetic correlations among the four hearing-related traits were calculated using the GWAS summary statistics from the Neale lab using LDSC v1.0.0. We utilized LD Scores derived from 1000 Genomes European genomes, downloaded from the LDSC website (https://data.broadinstitute.org/alkesgroup/LDSCORE/eur_w_ld_chr.tar.bz2). Genetic correlations of hearing-related traits with 234 additional traits was assessed using results from European ancestry GWAS (non-UK Biobank) of these traits available via LDHub v1.9.0 (http://ldsc.broadinstitute.org/ldhub/)[16].
Meta-analysis of hearing-related traits in the UK Biobank
Meta-analysis of the four hearing-related traits was performed with Multi-Trait Analysis of GWAS (MTAG)[19], using a version of the MTAG software downloaded on May 22, 2018 (https://github.com/omeed-maghzian/mtag). MTAG is explicitly designed for joint analysis of summary statistics from biobank-scale GWAS of genetically correlated traits in overlapping samples. As above, this analysis used European LD Scores from LDSC and was performed using default parameters.
Overlap of the loci identified in this study with those from the Wells et al study
The Wells et al summary statistics were downloaded from: https://zenodo.org/record/3490750#.XaXevEZKhPa. FUMA[72] v1.3.1 was used to perform functional annotation of the summary statistics. Overlap was checked by determining whether each SNP from our study was in linkage disequilibrium with any of the independent significant SNPs from the Wells et al study.
Polygenic risk score analysis
Polygenic risk score analysis was performed to test whether hearing difficulty-associated SNPs from the UK Biobank predict hearing difficulty in an independent sample. Imputed genotypes and binaural hearing threshold phenotypes of Belgian individuals from Fransen et al.[22] were downloaded from the TGen website (https://www.tgen.org/supplementary-data/gwas_polygenic_arhi_fransen_et_al/GWAS_POLYGENIC_ARHI_Fransen_et_al.tar.gz). Initial QC was performed using PLINK v1.9. Individuals with >10% missing genotypes were filtered. We also removed SNPs that were missing from more than 10% of individuals, had a minor allele frequency of less than 5%, or were not in Hardy-Weinberg equilibrium (p<10−6). After filtering, 1,472 individuals and 907,726 SNPs were included in this analysis. Risk scores were calculated in these samples using the R package PRSice-2[23]. Risk scores were based on the weighted sum of risk-associated SNPs from the UK Biobank hearing difficulty MTAG summary statistics, using the following cutoffs for selection of SNPs included in the risk score: 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, and 1.
Replication of specific risk loci in independent cohorts
We tested for replication of previously reported risk loci for ARHI and other hearing-related traits through lookups in the MTAG hearing difficulty summary statistics. We started with 62 previously-reported SNPs, derived from top-level results reported by Hoffmann et al. (2016)[4], Vuckovic et al. (2015)[5], and from several earlier studies as reported in S1 Table from Ref. [4]. Summary statistics for 59 of these 62 SNPs were available in the UK Biobank sample.
Gene set enrichment analysis
Gene set enrichment analyses were performed using MAGMA[24] v1.06 implemented within FUMA[72] v1.3.1, as well as via a standalone installation. Genotyped and imputed SNPs were annotated to ENSEMBL v92 gene models in FUMA. Annotations were limited to protein-coding genes, excluding the major histocompatibility (MHC) region of extended linkage disequilibrium (a common source of false positive results), and with SNPs mapping to a gene if they were located between the gene’s start and end position. MAGMA was then used to calculate a p-value for each gene, based on the mean association among the SNPs annotated to each gene. Gene-based p-values were used to perform the following gene-level analyses:
i) Gene set enrichment analysis with Mendelian deafness genes extracted from the Online Mendelian Inheritance in Man (OMIM) database (https://omim.org/). This analysis was implemented using a local installation of MAGMA.
ii) Gene property analyses to assess covariance of MAGMA gene p-values with gene expression in six cochlear cell types, 53 non-cochlear tissues, and 5,674 cell types from single-cell RNA-seq experiments of diverse mammalian tissues. For cochlear cell types, we computed the median transcripts per million (TPM) expression level for each gene in each cell type in RNA-seq of FACS-sorted cells from GSE64543[33] and GSE60019[26]. For non-cochlear cell types, we downloaded median TPM values for 53 tissues from the GTEx v7 portal (gtexportal.org). We identified the set of genes quantified in all datasets and performed a quantile normalization across log-transformed TPM values from all cell types. Using these normalized expression levels, we performed a one-sided MAGMA gene property analysis, conditioning on the median expression level of each gene across all cell types, as well as standard covariate due to gene length, the number of SNP annotated to each gene, and correlations among nearby genes due to LD. These analyses were performed using a local installation of MAGMA. Equivalent gene property analyses of single-cell RNA-seq experiments were performed using FUMA, as described at https://fuma.ctglab.nl/tutorial#celltype.
iii) Gene set enrichment analyses for Gene Ontology terms, utilizing 6,166 gene sets from the c5.bp, c5.cc and c5.mf databases from MSigDB v5.2. This analysis was implemented in FUMA.
ATAC-seq data processing
Four ATAC-seq fastq files (two epithelial and 2 non-epithelial samples from P1 mouse cochlea) from each tissue type were aligned to mm10 genome using BWA aligner bwa mem method (https://github.com/lh3/bwa). Sorted BAM files from each of the four samples were filtered to mapped reads only using samtools, converted to BED format using bedtools, and analyzed for open chromatin signal enrichment using F-Seq[73] https://umgear.org/p?l=3a70e6e7. The two BED files for each tissue type were merged using bedtools intersect, to identify regions common to both samples, requiring at least a 1 base pair overlap. We removed blacklist regions computed by the ENCODE project (ENCFF547MET), which show high non-specific signal across many assays.
Determining enrichment of ATAC-seq peaks for known tissue specific enhancers
We examined overlap between open chromatin regions from ATAC-seq experiments and tissue-specific enhancers from VISTA (https://enhancer.lbl.gov/) using the Genomic Association Tester (GAT; https://github.com/AndreasHeger/gat). GAT determines the significance of overlap between genomic annotations though re-sampling within a genomic workspace defined as the mm10 genome, excluding ENCODE blacklist regions and regions of low mappability (ENCODE accession: ENCFF547MET).
Enrichment of hearing difficulty heritability in open chromatin regions
Enrichment of hearing difficulty heritability in tissue-specific open chromatin regions from our cochlear ATAC-seq experiments, as well as from ENCODE DNase-seq experiments, was examined using stratified LDSC. 1000 Genomes Phase 3 baseline model LD scores (non tissue-specific annotations) described by Finucane, Bulik-Sullivan et al. (2015)[42] were downloaded from http://data.broadinstitute.org/alkesgroup/LDSCORE/. Open chromatin regions from DNase-seq of mouse tissues and cell types were downloaded from encodeportal.org; accession identifiers for the specific files are shown in S14 Table. Regions of the mouse genome identified in these cochlear and non-cochlear open chromatin experiments were mapped to the human genome with the UCSC Genome Browser liftOver tool, using the mm10toHg19 UCSC chain file, requiring a minimum of 50% of base pairs identical between the two genomes.
Statistical fine mapping and functional annotation of GWAS risk loci
Fine-mapping was performed using a combination of standard annotations and analyses performed with the SNP2GENE function in FUMA v1.3.1 (http://fuma.ctglab.nl)[72], as well as additional cochlea-specific annotations downstream data integration, as described below. The architecture of risk loci was determined based on LD structure in the 1000 Genomes Phase 3 European-American sample, calculated with PLINK. LD-independent lead SNPs at each locus had p-values < 5e-8. We defined risk loci using a minimum pairwise r2 > 0.6 between lead SNPs and other SNPs. In addition, we set a minimum minor allele frequency of 0.01, and the maximum distance between LD blocks to merge into interval was 250. Subsequently, we selected 613 SNPs for deeper annotation, using a pairwise r2 threshold > 0.9 with an LD-independent lead SNP.
Annotations of protein-coding variants were performed in FUMA, using ANNOVAR[74] with ENSEMBL v92 gene models. Deleteriousness of variants was predicted using CADD v1.3[75], and we selected variants with a CADD Phred score threshold > = 10.
Regulatory functions were predicted for non-coding variants based on overlap with open chromatin in mouse cochlear epithelial and non-epithelial cells, followed by prediction of target genes based on proximity to transcription start sites (TSS) and chromatin interactions from Hi-C experiments. First, we selected a subset of the 613 risk-associated SNPs that were located +/-500bp of regions homologous to open chromatin in mouse cochlear epithelial and non-epithelial cells. SNPs were annotated to proximal target genes if they were located within 20kb of the TSS from an ENSEMBL v92 gene model.
SNPs were annotated to distal target genes based on chromatin interactions from Hi-C of 20 human tissues and cell types[44]. Hi-C data were processed with FUMA, using Fit-Hi-C[76] to compute the significance of interactions between 40kb chromosomal segments. Using these data, we identified chromosomal interactions that connect the genomic segment containing each risk-associated, open chromatin-overlapping SNP to distal chromosomal segments. We annotated genes whose transcription start sites were located within these distal segments. We considered chromosomal loops identified in each of the 20 tissues and cell types. As chromosomal contacts differ from tissue to tissue and Hi-C data are inherently noisy, aggregating loops from multiple tissues can lead to false positive signals. To mitigate this risk, we selected a strict p-value threshold for the significance of loops, p < 1e-25, manually determined by inspection of the data to capture one or a few of the strongest loops at each locus.
Single cell RNA-seq data analysis
Genomic alignment, de-multiplexing, and mapping of unique molecular identifiers (UMIs) mapping to each gene was performed using cellranger. Downstream analyses were performed with the Seurat R package[61]. We filtered cells with <50 or >20,000 UMIs and with >20% of UMIs coming from mitochondrial genes. Counts of UMIs per gene were log normalized. Highly variable genes were identified using the FindVariableGenes function with the following parameters: dispersion formula = LogVMR, minimum = 0.0123, maximum = 3, y cutoff = 0.5. Counts from variable genes were then scaled. We regressed out effects of cell cycle, percent of mitochondrial genes, and number of unique molecular identifiers. The list of cell cycle genes was obtained from the Seurat website: https://satijalab.org/seurat/cell_cycle_vignette.html. We constructed a shared-nearest neighbors graph based on the first 10 principal components of variation in the scaled and normalized expression patterns of variable genes. Cell clusters were identified from the nearest-neighbors group based on Louvain modularity, using the FindClusters() function, with a resolution of 0.6. Clusters were visualized by t-distributed stochastic neighbor embedding (t-SNE) of the first 10 principal components and annotated to known cochlear cell types based on the expression of all marker genes. Cell type specificity for the 50 risk genes was calculated using FindAllMarkers() function, using Wilcoxon tests to compare counts in each cell type to counts of all other cell types in aggregate.
Meta-analysis of hair cell-specific gene expression for hearing difficulty risk genes
Processed data from GSE60019, GSE71982, and GSE116703 were downloaded and imported into R using the GEOquery R package. log-transformed transcripts per million were fit to linear models, using the lmFit, contrasts.fit, and eBayes functions in the limma R package. The main effect of cell type (hair cells versus all other cells) was calculated in each dataset, separately, controlling for covariates due to age. We then computed a combined meta-analytic p-value for each gene across the three datasets, using Stouffer’s z-score method with equal weights.
Supporting information
S1 Fig [tif]
Quantile-quantile plots of the four hearing-related traits with statistically significant heritability in the UK Biobank.
S2 Fig [pdf]
Region plots for the 31 hearing difficulty risk loci (PDF document).
S3 Fig [prs]
Heritability of binaural hearing thresholds explained by hearing difficulty polygenic risk scores in an independent sample.
S4 Fig [a]
Human genomic regions homologous to open chromatin in mouse cochlea are enriched in known promoters and enhancers.
S5 Fig [tsne]
Expression patterns of marker genes used to identify cell types in single-cell RNA-seq of mouse cochlea.
S6 Fig [tif]
Region plot of risk-associated SNPs at the chr22q13.1 risk locus.
S1 Table [xlsx]
Heritability of hearing-related traits.
S2 Table [rg]
Genetic correlations between hearing related traits and non-hearing related traits.
S3 Table [xlsx]
Genomic risk loci for hearing difficulty.
S4 Table [xlsx]
Genomic risk loci for background noise problems.
S5 Table [xlsx]
Genomic risk loci for hearing aid use.
S6 Table [xlsx]
Genomic risk loci for tinnitus.
S7 Table [xlsx]
Associations of independently significant SNPs at the 31 hearing difficulty risk loci with other hearing-related traits in the UK Biobank.
S8 Table [xlsx]
Replication in the UK Biobank for risk-associated SNPs from previous GWAS of hearing difficulty.
S9 Table [r2]
Polygenic risk prediction in a cohort of 1,472 Belgian individuals.
S10 Table [nsnps]
MAGMA gene-based p-values.
S11 Table [nsnps]
MAGMA gene set enrichments for Mendelian deafness genes and genes expressed in cochlear cell types.
S12 Table [xlsx]
MAGMA gene set enrichments for 5,674 cell types from single-cell RNA-seq experiments of diverse mammalian tissues.
S13 Table [tpm]
MAGMA cell specific expression.
S14 Table [xlsx]
MAGMA gene set enrichments for genes expressed in tissues from GTEx.
S15 Table [xlsx]
GO Term enrichment of hearing difficulty risk loci.
S16 Table [xlsx]
Open chromatin regions in cochlear epithelial cells.
S17 Table [xlsx]
Open chromatin regions in cochlear non-epithelial cells.
S18 Table [csv]
Overlap of conserved cochlear epithelial open chromatin regions and other cell types from ROADMAP.
S19 Table [csv]
Overlap of conserved cochlear non- epithelial open chromatin regions and other cell types from ROADMAP.
S20 Table [xlsx]
Enrichments of hearing difficulty risk in open chromatin regions from cochlear and non-cochlear cell types.
S21 Table [xlsx]
Functional annotations of 613 SNPs in LD with LD-independent genome-wide significant SNPs at hearing difficulty risk loci.
S22 Table [xlsx]
Chromatin Interactions with hearing difficulty SNPs.
S23 Table [rsid]
List of likely causal genes from integrating coding and non-coding functional annotations.
S24 Table [xlsx]
Annotation of 50 fine-mapped risk genes at hearing difficulty risk loci.
S25 Table [xlsx]
Marker genes for cochlear cell types in P2 scRNA-seq data.
S26 Table [xlsx]
Cell type-specificity analysis for hearing difficulty risk genes.
S27 Table [xlsx]
Meta-analysis of risk gene expression in hair cells vs. other cochlear cell types.
Zdroje
1. Yamasoba T, Lin FR, Someya S, Kashio A, Sakamoto T, Kondo K. Current concepts in age-related hearing loss: epidemiology and mechanistic pathways. Hear Res. 2013;303: 30–38. doi: 10.1016/j.heares.2013.01.021 23422312
2. Momi SK, Wolber LE, Fabiane SM, MacGregor AJ, Williams FMK. Genetic and Environmental Factors in Age-Related Hearing Impairment. Twin Res Hum Genet. 2015;18: 383–392. doi: 10.1017/thg.2015.35 26081266
3. Van Camp G, Smith R. Hereditary Hearing Loss Homepage. 2019. https://hereditaryhearingloss.org/
4. Hoffmann TJ, Keats BJ, Yoshikawa N, Schaefer C, Risch N, Lustig LR. A Large Genome-Wide Association Study of Age-Related Hearing Impairment Using Electronic Health Records. PLoS Genet. 2016;12: e1006371. doi: 10.1371/journal.pgen.1006371 27764096
5. Vuckovic D, Dawson S, Scheffer DI, Rantanen T, Morgan A, Di Stazio M, et al. Genome-wide association analysis on normal hearing function identifies PCDH20 and SLC28A3 as candidates for hearing function and loss. Hum Mol Genet. 2015;24: 5655–64. doi: 10.1093/hmg/ddv279 26188009
6. Van Laer L, Huyghe JR, Hannula S, Van Eyken E, Stephan DA, Maki-Torkko E, et al. A genome-wide association study for age-related hearing impairment in the Saami. Eur J Hum Genet. 2010;18: 685–693. doi: 10.1038/ejhg.2009.234 20068591
7. Girotto G, Pirastu N, Sorice R, Biino G, Campbell H, d’Adamo AP, et al. Hearing function and thresholds: a genome-wide association study in European isolated populations identifies new loci and pathways. J Med Genet. 2011;48: 369–374. doi: 10.1136/jmg.2010.088310 21493956
8. Friedman RA, Van Laer L, Huentelman MJ, Sheth SS, Van Eyken E, Corneveaux JJ, et al. GRM7 variants confer susceptibility to age-related hearing impairment. Hum Mol Genet. 2009;18: 785–796. doi: 10.1093/hmg/ddn402 19047183
9. Wells HRR, Freidin MB, Zainul Abidin FN, Payton A, Dawes P, Munro KJ, et al. GWAS Identifies 44 Independent Associated Genomic Loci for Self-Reported Adult Hearing Difficulty in UK Biobank. Am J Hum Genet. 2019;105: 788–802. doi: 10.1016/j.ajhg.2019.09.008 31564434
10. Fromer M, Roussos P, Sieberts SK, Johnson JS, Kavanagh DH, Perumal TM, et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat Neurosci. 2016;19: 1442–1453. doi: 10.1038/nn.4399 27668389
11. Gusev A, Lee SH, Trynka G, Finucane H, Vilhjálmsson BJ, Xu H, et al. Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. Am J Hum Genet. 2014. doi: 10.1016/j.ajhg.2014.10.004 25439723
12. Claussnitzer M, Dankel SN, Kim K-H, Quon G, Meuleman W, Haugen C, et al. FTO Obesity Variant Circuitry and Adipocyte Browning in Humans. N Engl J Med. 2015; 150819140043007. doi: 10.1056/NEJMoa1502214 26287746
13. Maurano MT, Humbert R, Rynes E, Thurman RE, Haugen E, Wang H, et al. Systematic localization of common disease-associated variation in regulatory DNA. Science. 2012;337: 1190–5. doi: 10.1126/science.1222794 22955828
14. Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLoS Med. 2015. doi: 10.1371/journal.pmed.1001779 25826379
15. Bulik-Sullivan BK, Loh P-R, Finucane HK, Ripke S, Yang J, Patterson N, et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47: 291–295. doi: 10.1038/ng.3211 25642630
16. Zheng J, Erzurumluoglu AM, Elsworth BL, Kemp JP, Howe L, Haycock PC, et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics. 2016/09/22. 2017;33: 272–279. doi: 10.1093/bioinformatics/btw613 27663502
17. Rutherford BR, Brewster K, Golub JS, Kim AH, Roose SP. Sensation and Psychiatry: Linking Age-Related Hearing Loss to Late-Life Depression and Cognitive Decline. Am J Psychiatry. 2018;175: 215–224. doi: 10.1176/appi.ajp.2017.17040423 29202654
18. Dhanda N, Taheri S. A narrative review of obesity and hearing loss. Int J Obes (Lond). 2017;41: 1066–1073. doi: 10.1038/ijo.2017.32 28163314
19. Turley P, Walters RK, Maghzian O, Okbay A, Lee JJ, Fontana MA, et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat Genet. 2018;50: 229–237. doi: 10.1038/s41588-017-0009-4 29292387
20. Verweij N, van de Vegte YJ, van der Harst P. Genetic study links components of the autonomous nervous system to heart-rate profile during exercise. Nat Commun. 2018;9: 898. doi: 10.1038/s41467-018-03395-6 29497042
21. Huffman JE. Examining the current standards for genetic discovery and replication in the era of mega-biobanks. Nat Commun. 2018;9: 5054. doi: 10.1038/s41467-018-07348-x 30498205
22. Fransen E, Bonneux S, Corneveaux JJ, Schrauwen I, Di Berardino F, White CH, et al. Genome-wide association analysis demonstrates the highly polygenic character of age-related hearing impairment. Eur J Hum Genet. 2015;23: 110–115. doi: 10.1038/ejhg.2014.56 24939585
23. Euesden J, Lewis CM, O’Reilly PF. PRSice: Polygenic Risk Score software. Bioinformatics. 2015;31: 1466–8. doi: 10.1093/bioinformatics/btu848 25550326
24. de Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol. 2015;11: e1004219. doi: 10.1371/journal.pcbi.1004219 25885710
25. Mchugh RK, Friedman RA. Genetics of hearing loss: Allelism and modifier genes produce a phenotypic continuum. Anat Rec Part A Discov Mol Cell Evol Biol. 2006;288A: 370–381. doi: 10.1002/ar.a.20297 16550584
26. Scheffer DI, Shen J, Corey DP, Chen Z-Y. Gene Expression by Mouse Inner Ear Hair Cells during Development. J Neurosci. 2015;35: 6366–6380. doi: 10.1523/JNEUROSCI.5126-14.2015 25904789
27. Li Y, Liu H, Giffen KP, Chen L, Beisel KW, He DZZ. Transcriptomes of cochlear inner and outer hair cells from adult mice. Sci data. 2018;5: 180199. doi: 10.1038/sdata.2018.199 30277483
28. Liu H, Chen L, Giffen KP, Stringham ST, Li Y, Judge PD, et al. Cell-Specific Transcriptome Analysis Shows That Adult Pillar and Deiters’ Cells Express Genes Encoding Machinery for Specializations of Cochlear Hair Cells. Front Mol Neurosci. 2018;11: 356. doi: 10.3389/fnmol.2018.00356 30327589
29. Aguet F, Ardlie KG, Cummings BB, Gelfand ET, Getz G, Hadley K, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550: 204–213. doi: 10.1038/nature24277 29022597
30. de la Torre-Ubieta L, Stein JL, Won H, Opland CK, Liang D, Lu D, et al. The Dynamic Landscape of Open Chromatin during Human Cortical Neurogenesis. Cell. 2018;172: 289–304.e18. doi: 10.1016/j.cell.2017.12.014 29307494
31. Huang K-L, Marcora E, Pimenova AA, Di Narzo AF, Kapoor M, Jin SC, et al. A common haplotype lowers PU.1 expression in myeloid cells and delays onset of Alzheimer’s disease. Nat Neurosci. 2017;20: 1052–1061. doi: 10.1038/nn.4587 28628103
32. Friedman LM, Dror AA, Avraham KB. Mouse models to study inner ear development and hereditary hearing loss. Int J Dev Biol. 2007;51: 609–631. doi: 10.1387/ijdb.072365lf 17891721
33. Elkon R, Milon B, Morrison L, Shah M, Vijayakumar S, Racherla M, et al. RFX transcription factors are essential for hearing in mice. Nat Commun. 2015;6: 8549. doi: 10.1038/ncomms9549 26469318
34. Hertzano R, Elkon R, Kurima K, Morrisson A, Chan S-L, Sallin M, et al. Cell type-specific transcriptome analysis reveals a major role for Zeb1 and miR-200b in mouse inner ear morphogenesis. PLoS Genet. 2011;7: e1002309. doi: 10.1371/journal.pgen.1002309 21980309
35. Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T, et al. A comparative encyclopedia of DNA elements in the mouse genome. Nature. 2014;515: 355–64. doi: 10.1038/nature13992 25409824
36. Roccio M, Perny M, Ealy M, Widmer HR, Heller S, Senn P. Molecular characterization and prospective isolation of human fetal cochlear hair cell progenitors. Nat Commun. 2018;9: 4027. doi: 10.1038/s41467-018-06334-7 30279445
37. Visel A, Minovitsky S, Dubchak I, Pennacchio LA. VISTA Enhancer Browser—a database of tissue-specific human enhancers. Nucleic Acids Res. 2007;35: D88–D92. doi: 10.1093/nar/gkl822 17130149
38. Hume CR, Bratt DL, Oesterle EC. Expression of LHX3 and SOX2 during mouse inner ear development. Gene Expr Patterns. 2007;7: 798–807. doi: 10.1016/j.modgep.2007.05.002 17604700
39. Ahn KJ, Passero FJ, Crenshaw EB 3rd. Otic mesenchyme expression of Cre recombinase directed by the inner ear enhancer of the Brn4/Pou3f4 gene. Genesis. 2009;47: 137–141. doi: 10.1002/dvg.20454 19217071
40. Hinrichs AS, Karolchik D, Baertsch R, Barber GP, Bejerano G, Clawson H, et al. The UCSC Genome Browser Database: update 2006. Nucleic Acids Res. 2006;34: D590–8. doi: 10.1093/nar/gkj144 16381938
41. Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518: 317–30. doi: 10.1038/nature14248 25693563
42. Finucane HK, Bulik-Sullivan B, Gusev A, Trynka G, Reshef Y, Loh P-R, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat Genet. 2015;47: 1228–1235. doi: 10.1038/ng.3404 26414678
43. Consortium T 1000 GP. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491: 56–65. doi: 10.1038/nature11632 23128226
44. Schmitt AD, Hu M, Jung I, Xu Z, Qiu Y, Tan CL, et al. A Compendium of Chromatin Contact Maps Reveals Spatially Active Regions in the Human Genome. Cell Rep. 2016;17: 2042–2059. doi: 10.1016/j.celrep.2016.10.061 27851967
45. Schrauwen I, Chakchouk I, Liaqat K, Jan A, Nasir A, Hussain S, et al. A variant in LMX1A causes autosomal recessive severe-to-profound hearing impairment. Hum Genet. 2018;137: 471–478. doi: 10.1007/s00439-018-1899-7 29971487
46. Wesdorp M, de Koning Gans PAM, Schraders M, Oostrik J, Huynen MA, Venselaar H, et al. Heterozygous missense variants of LMX1A lead to nonsyndromic hearing impairment and vestibular dysfunction. Hum Genet. 2018. doi: 10.1007/s00439-018-1880-5 29754270
47. Borck G, Ur Rehman A, Lee K, Pogoda H-M, Kakar N, von Ameln S, et al. Loss-of-function mutations of ILDR1 cause autosomal-recessive hearing impairment DFNB42. Am J Hum Genet. 2011;88: 127–137. doi: 10.1016/j.ajhg.2010.12.011 21255762
48. Zhang Y, Zhang X, Long R, Yu L. A novel deletion mutation of the SOX2 gene in a child of Chinese origin with congenital bilateral anophthalmia and sensorineural hearing loss. J Genet. 2018;97: 1007–1011. 30262714
49. Wayne S, Robertson NG, DeClau F, Chen N, Verhoeven K, Prasad S, et al. Mutations in the transcriptional activator EYA4 cause late-onset deafness at the DFNA10 locus. Hum Mol Genet. 2001;10: 195–200. doi: 10.1093/hmg/10.3.195 11159937
50. Schonberger J, Wang L, Shin JT, Kim S Do, Depreux FFS, Zhu H, et al. Mutation in the transcriptional coactivator EYA4 causes dilated cardiomyopathy and sensorineural hearing loss. Nat Genet. 2005;37: 418–422. doi: 10.1038/ng1527 15735644
51. Daoud H, Zhang D, McMurray F, Yu A, Luco SM, Vanstone J, et al. Identification of a pathogenic FTO mutation by next-generation sequencing in a newborn with growth retardation and developmental delay. J Med Genet. 2016;53: 200–207. doi: 10.1136/jmedgenet-2015-103399 26378117
52. Pollak A, Lechowicz U, Murcia Pienkowski VA, Stawinski P, Kosinska J, Skarzynski H, et al. Whole exome sequencing identifies TRIOBP pathogenic variants as a cause of post-lingual bilateral moderate-to-severe sensorineural hearing loss. BMC Med Genet. 2017;18: 142. doi: 10.1186/s12881-017-0499-z 29197352
53. Shahin H, Walsh T, Sobe T, Abu Sa’ed J, Abu Rayan A, Lynch ED, et al. Mutations in a novel isoform of TRIOBP that encodes a filamentous-actin binding protein are responsible for DFNB28 recessive nonsyndromic hearing loss. Am J Hum Genet. 2006;78: 144–152. doi: 10.1086/499495 16385458
54. Manji SSM, Williams LH, Miller KA, Ooms LM, Bahlo M, Mitchell CA, et al. A mutation in synaptojanin 2 causes progressive hearing loss in the ENU-mutagenised mouse strain mozart. PLoS One. 2011. doi: 10.1371/journal.pone.0017607 21423608
55. Ohlemiller KK, Rybak Rice ME, Lett JM, Gagnon PM. Absence of strial melanin coincides with age-associated marginal cell loss and endocochlear potential decline. Hear Res. 2009. doi: 10.1016/j.heares.2008.12.005 19141317
56. Eguchi M, Kariya S, Okano M, Higaki T, Makihara S, Fujiwara T, et al. Lipopolysaccharide induces proinflammatory cytokines and chemokines in experimental otitis media through the prostaglandin D2 receptor (DP)-dependent pathway. Clin Exp Immunol. 2011;163: 260–269. doi: 10.1111/j.1365-2249.2010.04292.x 21166666
57. Wu J, Han W, Chen X, Guo W, Liu K, Wang R, et al. Matrix metalloproteinase-2 and -9 contribute to functional integrity and noise-induced damage to the blood-labyrinth-barrier. Mol Med Rep. 2017;16: 1731–1738. doi: 10.3892/mmr.2017.6784 28627704
58. Mahuzier A, Gaude H-M, Grampa V, Anselme I, Silbermann F, Leroux-Berger M, et al. Dishevelled stabilization by the ciliopathy protein Rpgrip1l is essential for planar cell polarity. J Cell Biol. 2012;198: 927–940. doi: 10.1083/jcb.201111009 22927466
59. Rocha-Sanchez SM, Scheetz LR, Contreras M, Weston MD, Korte M, McGee J, et al. Mature mice lacking Rbl2/p130 gene have supernumerary inner ear hair cells and supporting cells. J Neurosci. 2011;31: 8883–8893. doi: 10.1523/JNEUROSCI.5821-10.2011 21677172
60. Bowl MR, Simon MM, Ingham NJ, Greenaway S, Santos L, Cater H, et al. A large scale hearing loss screen reveals an extensive unexplored genetic landscape for auditory dysfunction. Nat Commun. 2017;8: 886. doi: 10.1038/s41467-017-00595-4 29026089
61. Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol. 2018;36: 411. Available: doi: 10.1038/nbt.4096 29608179
62. Shen J, Scheffer DI, Kwan KY, Corey DP. SHIELD: an integrative gene expression database for inner ear research. Database (Oxford). 2015;2015: bav071. doi: 10.1093/database/bav071 26209310
63. Chessum L, Matern MS, Kelly MC, Johnson SL, Ogawa Y, Milon B, et al. Helios is a key transcriptional regulator of outer hair cell maturation. Nature. 2018;563: 696–700. doi: 10.1038/s41586-018-0728-4 30464345
64. Burns JC, Kelly MC, Hoa M, Morell RJ, Kelley MW. Single-cell RNA-Seq resolves cellular complexity in sensory organs from the neonatal inner ear. Nat Commun. 2015;6: 8557. doi: 10.1038/ncomms9557 26469390
65. Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017;169: 1177–1186. doi: 10.1016/j.cell.2017.05.038 28622505
66. Ohlemiller KK, Hughes RM, Lett JM, Ogilvie JM, Speck JD, Wright JS, et al. Progression of cochlear and retinal degeneration in the tubby (rd5) mouse. Audiol Neurootol. 1997;2: 175–85. doi: 10.1159/000259242 9390831
67. Schuknecht HF, Watanuki K, Takahashi T, Belal AA, Kimura RS, Jones DD, et al. Atrophy of the stria vascularis, a common cause for hearing loss. Laryngoscope. 1974;84: 1777–821. doi: 10.1288/00005537-197410000-00012 4138750
68. Puligilla C, Kelley MW. Dual role for Sox2 in specification of sensory competence and regulation of Atoh1 function. Dev Neurobiol. 2017;77: 3–13. doi: 10.1002/dneu.22401 27203669
69. Koo SK, Hill JK, Hwang CH, Lin ZS, Millen KJ, Wu DK. Lmx1a maintains proper neurogenic, sensory, and non-sensory domains in the mammalian inner ear. Dev Biol. 2009;333: 14–25. doi: 10.1016/j.ydbio.2009.06.016 19540218
70. Bok J, Raft S, Kong K-A, Koo SK, Dräger UC, Wu DK. Transient retinoic acid signaling confers anterior-posterior polarity to the inner ear. Proc Natl Acad Sci U S A. 2011;108: 161–6. doi: 10.1073/pnas.1010547108 21173260
71. Buenrostro JD, Wu B, Chang HY, Greenleaf WJ. ATAC-seq: A Method for Assaying Chromatin Accessibility Genome-Wide. Curr Protoc Mol Biol. 2015;109: 21.29.1–9. doi: 10.1002/0471142727.mb2129s109 25559105
72. Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8: 1826. doi: 10.1038/s41467-017-01261-5 29184056
73. Boyle AP, Guinney J, Crawford GE, Furey TS. F-Seq: a feature density estimator for high-throughput sequence tags. Bioinformatics. 2008;24: 2537–2538. doi: 10.1093/bioinformatics/btn480 18784119
74. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38: e164–e164. doi: 10.1093/nar/gkq603 20601685
75. Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019;47: D886–D894. doi: 10.1093/nar/gky1016 30371827
76. Ay F, Bailey TL, Noble WS. Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts. Genome Res. 2014;24: 999–1011. doi: 10.1101/gr.160374.113 24501021
77. Kircher M, Witten DM, Jain P, O’Roak BJ, Cooper GM, Shendure J. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. 2014;46: 310–5. doi: 10.1038/ng.2892 24487276
Článek vyšel v časopise
PLOS Genetics
2020 Číslo 9
- Antibiotika na nachlazení nezabírají! Jak můžeme zpomalit šíření rezistence?
- FDA varuje před selfmonitoringem cukru pomocí chytrých hodinek. Jak je to v Česku?
- Prof. Jan Škrha: Metformin je bezpečný, ale je třeba jej bezpečně užívat a léčbu kontrolovat
- Ibuprofen jako alternativa antibiotik při léčbě infekcí močových cest
- Jak a kdy u celiakie začíná reakce na lepek? Možnou odpověď poodkryla čerstvá kanadská studie
Nejčtenější v tomto čísle
- Alleviating chronic ER stress by p38-Ire1-Xbp1 pathway and insulin-associated autophagy in C. elegans neurons
- Cocoonase is indispensable for Lepidoptera insects breaking the sealed cocoon
- A mega-analysis of expression quantitative trait loci in retinal tissue
- Adiponectin GWAS loci harboring extensive allelic heterogeneity exhibit distinct molecular consequences