Multiple Loci Are Associated with White Blood Cell Phenotypes
White blood cell (WBC) count is a common clinical measure from complete blood count assays, and it varies widely among healthy individuals. Total WBC count and its constituent subtypes have been shown to be moderately heritable, with the heritability estimates varying across cell types. We studied 19,509 subjects from seven cohorts in a discovery analysis, and 11,823 subjects from ten cohorts for replication analyses, to determine genetic factors influencing variability within the normal hematological range for total WBC count and five WBC subtype measures. Cohort specific data was supplied by the CHARGE, HeamGen, and INGI consortia, as well as independent collaborative studies. We identified and replicated ten associations with total WBC count and five WBC subtypes at seven different genomic loci (total WBC count—6p21 in the HLA region, 17q21 near ORMDL3, and CSF3; neutrophil count—17q21; basophil count- 3p21 near RPN1 and C3orf27; lymphocyte count—6p21, 19p13 at EPS15L1; monocyte count—2q31 at ITGA4, 3q21, 8q24 an intergenic region, 9q31 near EDG2), including three previously reported associations and seven novel associations. To investigate functional relationships among variants contributing to variability in the six WBC traits, we utilized gene expression- and pathways-based analyses. We implemented gene-clustering algorithms to evaluate functional connectivity among implicated loci and showed functional relationships across cell types. Gene expression data from whole blood was utilized to show that significant biological consequences can be extracted from our genome-wide analyses, with effect estimates for significant loci from the meta-analyses being highly corellated with the proximal gene expression. In addition, collaborative efforts between the groups contributing to this study and related studies conducted by the COGENT and RIKEN groups allowed for the examination of effect homogeneity for genome-wide significant associations across populations of diverse ancestral backgrounds.
Published in the journal:
. PLoS Genet 7(6): e32767. doi:10.1371/journal.pgen.1002113
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pgen.1002113
Summary
White blood cell (WBC) count is a common clinical measure from complete blood count assays, and it varies widely among healthy individuals. Total WBC count and its constituent subtypes have been shown to be moderately heritable, with the heritability estimates varying across cell types. We studied 19,509 subjects from seven cohorts in a discovery analysis, and 11,823 subjects from ten cohorts for replication analyses, to determine genetic factors influencing variability within the normal hematological range for total WBC count and five WBC subtype measures. Cohort specific data was supplied by the CHARGE, HeamGen, and INGI consortia, as well as independent collaborative studies. We identified and replicated ten associations with total WBC count and five WBC subtypes at seven different genomic loci (total WBC count—6p21 in the HLA region, 17q21 near ORMDL3, and CSF3; neutrophil count—17q21; basophil count- 3p21 near RPN1 and C3orf27; lymphocyte count—6p21, 19p13 at EPS15L1; monocyte count—2q31 at ITGA4, 3q21, 8q24 an intergenic region, 9q31 near EDG2), including three previously reported associations and seven novel associations. To investigate functional relationships among variants contributing to variability in the six WBC traits, we utilized gene expression- and pathways-based analyses. We implemented gene-clustering algorithms to evaluate functional connectivity among implicated loci and showed functional relationships across cell types. Gene expression data from whole blood was utilized to show that significant biological consequences can be extracted from our genome-wide analyses, with effect estimates for significant loci from the meta-analyses being highly corellated with the proximal gene expression. In addition, collaborative efforts between the groups contributing to this study and related studies conducted by the COGENT and RIKEN groups allowed for the examination of effect homogeneity for genome-wide significant associations across populations of diverse ancestral backgrounds.
Introduction
The WBC count, a classic marker of immune or inflammatory response, varies substantially among healthy individuals. The counts of constituent cell subtypes comprising the WBC count measure are assayed as part of a standard clinical WBC differential test. While the WBC count and WBC differential count are often obtained to assess for evidence of infection or underlying inflammation, prospective epidemiologic studies have consistently linked higher WBC counts, within the clinically designated normal range, along with other inflammatory markers, to increased risk of coronary artery disease, cancer, and total mortality [1], [2], [3], [4], [5], [6], [7], [8], [9], [10]. Studies are often not consistent on the specific WBC subpopulations involved, but granulocytes, in general, and neutrophils, in particular, are most often implicated in these observations [11], [12]. In the general population, the total WBC count is also directly associated with many cardiovascular disease risk factors, such as higher blood pressure, cigarette smoking, adiposity, lower socioeconomic status, and higher levels of plasma inflammatory markers [13].
WBC counts are also moderately heritable [14], with heritability estimates varing from approximately 0.14 to 0.4 across total leukocyte count and cell subtypes, as assessed in a Sardinian population, with the highest heritability estimates for monocyte counts [14]. In addition, the substantially lower neutrophil count and total WBC count in African Americans compared to European-ancestry individuals seems to be at least partially explained by a regulatory variant in the Duffy Antigen Receptor for Chemokine (DARC) gene, which accounts for ∼20% of total variation in the measures [15], [16]. Recent studies have sought to investigate the common genetic variants associated with several blood count traits in European-ancestry and Japanese individuals, but have not focused specifically on the multiple cell types comprising the total WBC count measurement [17], [18], [19], [20].
In the current study, we sought to identify and replicate common genetic variants that influence normal variation in six WBC phenotypes commonly measured in the clinical setting and in population studies that comprise the CHARGE Consortium [21], the HaemGen Consortium [18] and independent collaborative studies. We utilized genome-wide association study (GWAS) data and meta-analytic techniques to identify 10 genome-wide significant loci in a study of over 31,000 individuals (Table 1), examining variants possibly associated with total WBC count, three granulocyte phenotypes (neutrophil, basophil and eosinophil counts), and two non-granulocyte phenotypes (lymphocytes and monocytes). We also investigated the shared functional connectivity of identified loci across phenotypes with regards to both known biological pathways and nearby gene expression effects. As previous research has shown strong selective pressures at the locus identified to affect WBC counts in African American populaftions, we examined the possibility of recent selective effects at significantly associated loci in European ancestry individuals [15], [16]. Additional efforts were made in collaboration with RIKEN and COGENT investigators to identify homogenous associations across populations of diverse ancestral backgrounds.
Results
In the discovery meta-analysis of 19,509 subjects from seven cohorts, we identified 11 genome-wide significant associations with six white cell phenotypes (total WBC, neutrophil, basophil, eosinophil, lymphocyte and monocyte counts, see Table 1, Table S1, Figure 1). Further, we found strong evidence for replication of 10 of the 11 trait-locus associations in 11,823 independent samples from 10 GWAS cohorts who contributed summary statistics for SNPs of interest. The discovery analysis results (Figure 1 and Figures S1, S2, S3, S4, S5, S6, S7, S8, S9, S10 for details), and the results of replication testing for the 10 replicated trait-loci are summarized in Table 2. These results are presented in greater detail for all genome-wide significant SNPs in Table S2.
Total WBC count was associated with two independent loci in the discovery phase of analyses; the first locus was on chromosome 6p21 encompassing a region from 31,131,127 bp to 31,161,846 bp near HLA and PSORS1 gene families, and the second locus on chromosome 17q21 from 35,345,186 bp to 35,470,048 bp near candidate genes ORMDL3 and CSF3. Both loci showed independent replication (Table 2). Neutrophil count was associated with a 196,381 bp region on chromosme 17q21 containing 46 genome-wide significant SNPs from the discovery phase. This region overlaps the locus on chromosome 17q21 identified for the total WBC count phenotype and showed positive association in replication testing at all but 2 SNPs. Basophil count was associated with one SNP, rs4328821 on chromosome 3q21 near RPN1 and C3orf27. This SNP showed a significant positive association between basophil count and minor allele dosage (minor allele frequency 0.110, p-value in discovery phase 2.58E-08, p-value in replication phase 8.40E-06). No regions showed genome-wide significance for association with eosinophil count.
Lymphocyte count was associated with two loci, with one locus on chromosome 6p21 overlapping with the chromosome 6p21 total WBC count locus. The second locus associated with lymphocyte counts is on chromosome 19p13, from 16,32,871 bp to 16,429,197 bp at EPS15L1 and was successfully replicated. Monocyte count was associated with the largest number of independent hits for any of the traits, with five loci identified in the discovery analysis, four of which showed significant associations in replication testing. These four loci include two intergenic regions (>100 kb to nearest known genes) on chromosome 8q24 (from 130,672,817 to 130,693,287) and chromosome 9q31 (112,917,232 to 113,073,157). We also identified a novel association on chromosome 2q31 (182,027,546 to 182,036,459) near ITGA4, and a single genome-wide significant SNP on chromosome 3q21, which is located 18,866 bp from the SNP significantly associated with basophil count (r2 = 0.076, D′ = 0.841). The monocyte- associated locus on chromosome 1q22 contained only one genome-wide significant SNP which failed to replicate and is not included in Table 2 (denoted by rs17131683, which exhibited a replication p-value of 0.770 but a consistent negative effect associated with the A allele).
Many of the loci described in this report contain genome-wide significant SNPs spread throughout much of each locus suggestive of either extensive LD or multiple association signals at each locus. For loci described in Table S2 (except those containing less than 3 genome-wide significant SNPs), conditional analyses were conducted using the allele dosage of the most significant SNP per locus as a covariate in a subset of discovery cohorts (AGES, ARIC, BLSA, Health ABC and InChianti). Statistical models were identical to those used in the discovery analyses except for the additional SNP covariates. No SNPs analyzed remained significant after correcting for 147 tests, showing that only one signal of association exists per locus. The complete results for these analyses are evident in Table S3. As each locus accounts for only one unique signal per trait, adjusted r2 estimates were calculated for each trait across loci within this subset of cohorts, and may be found in Table 2.
To assess the independence of associated SNPs from the total WBC count measurement, all genome-wide significant SNP associations for white cell subtypes (Table S2) were re-analyzed as per the discovery phase analysis methods adjusting for total WBC count as an additional covariate in a subset of discovery cohorts (AGES, ARIC, BLSA, Health ABC and InChianti). After Bonferroni correction for 97 independent tests at least one SNP per locus remained significant, suggesting some independence from the total WBC measure in the SNP associations. The extended results of this analysis and a table of r2 values for the phenotypes of interest based on the same subset of discovery cohorts may be found in Table S4 and Table S5.
These results demonstrate a high degree of relatedness across traits, with individual loci affecting multiple WBC traits, that may be pleiotropic to some degree or due to the biological relatedness of the traits. Neutrophils are the most abundant WBC subtype, and the locus on chromosome 17q21 associated with both total WBC count and neutrophil count independently based on conditional analyses described above. In this region, 38 SNPs were common to both traits in the discovery analysis, and these SNPs showed identical directions of effects across both traits. In the chromosome 3q21 between C3orf27 and RPN1, rs9880192 was associated with monocyte count and rs4328821 was associated with basophil count, suggesting pleiotropy in this region. The region on chromosome 6p21 near the PSORS1 family of genes as well as HLA-C and HLA-B contains associated SNPs with both total WBC count and lymphocyte count, although, interestingly, spatially overlapping SNPs failed to replicate in the other trait, further suggesting independence of effects seen in conditional analyses of all loci.
To further examine the functional connectivity of these loci across white blood cell phenotypes, the Gene Relationships Among Implicated Loci package (GRAIL, http://www.broad.mit.edu/mpg/grail/) was utilized to mine PubMed archives using textual analyses of known functional associations to identify concurrent effects and relationships across phenotypes [22]. In brief, GRAIL incorporates functional annotations from text mining related to specific genomic loci, usually genome-wide association study results, to assess the functional inter-relatedness of genes in linkage disequilibrium with the regions of interest and construct networks of related genes sharing biological function. In our analyses, we utilized GRAIL to survey the functional relatedness of all regions containing significant results passing Bonferroni correction in both the discovery and replication phases. We identified four clusters of related genes with false-discovery rate adjusted p-values<0.05 out of the 49 gene clusters generated by the GRAIL package, which are described in Figure 2 and Table S6. All four clusters show significant interconnectivity between genes proximal to loci on chromosome 17q21 associated with total WBC and neutrophil counts and the chromosome 19p13 locus associated with lymphocyte count. The two most significant clusters also show relationships between genes proximal to the previously mentioned loci and candidate genes at the chromosome 8q24 region associated with monocyte counts. Genes at the chromosome 17q21 locus associated with both total WBC and neutrophil counts appeared in all significant pathways identified in the GRAIL analyses, suggesting biological connectivity across both granulocyte and non-granulocyte cell lineages. Candidate genes from the gasdermin (GSDML) and mediator complex subunit (MED) families were highly enriched in the significant gene clusters and comprised 37.5% of genes in these clusters. These results suggest shared biological pathways between these genes and cell proliferation among WBC subtypes associated with these genomic regions, although as emphasized in conditional analyses, these effects at these loci remain to some degree independent of the total WBC measure.
We also examined possible functional consequences of individual SNP associations by analyzing whole blood genome-wide gene expression data from the InChianti study to identify associations between SNPs found to be significant in the meta-analysis and cis changes in gene expression. All SNPs significant in both the discovery and replication phases for all phenotypes were used in the expression analysis. Each SNP in this dataset which was within 500 kb of an expression probe was treated as a possible expression quantitative trait locus (eQTL). For 85 SNPs in our subset of significant meta-analysis results, we tested at total 741 candidate eQTL associations using multivariate linear regression. This analysis tested each SNP in the subset for an association with each expression probe within 500 kb. After Bonferroni correction for the 741 tests conducted, 36 SNPs in the chromosome 17q21 locus associated with total WBC count and neutrophil count in the meta-analysis were also significantly associated with cis-expression levels. In total, these 36 SNPs were associated with three expression probes, 2 probes tagging transcripts in the GSDML gene (probes ILMN_2347193 and ILMN_1666206) and another probe tagging a single probe in ORMD3L (probe ILMN_1662174), for a total of 103 signficicant eQTLs (Figure 3). Both probes in GSDML were highly correlated (r2 = 0.582), although neither GSDML probe was strongly corellated with the probe of interest in the ORMD3L gene (r2<0.200). With each SNP's minor allele as the point of reference, all effect directions for significant meta-analysis and eQTL associations were concordant, showing a strong correlation between the effect sizes in the meta-analysis and eQTL analysis, suggesting that the effect of the identified SNPs on the corresponding WBC trait may be transcriptionally mediated. For example, the correlation of effect estimates between the eQTL and meta-analysis for neutrophil associated SNPs also associated with the ILMN_1666206 probe highly significant, with an r2 of 0.898. For details of all eQTLs tested, please refer to Table S7 and the Methods and Materials section.
Previous studies of WBC genetic variation in African American populations have shown evidence of WBC associated loci in a region of high selective pressures, with the putative functional SNP rs2814778 being almost fixed in frequency in sub-Saharan African populations where malaria is endemic [15]. We therefore undertook an investigation of recent selective pressures acting upon SNPs identified associated with WBC phenotypes in European-ancestry populations. Evidence for selection for all 10 significant trait-loci from the meta-analysis was evaluated by mining HapMap2 CEU data. Integrated Haplotype Scores (iHS) were used to quantify selection at each locus based on homozygosity decay in extended haplotypes and are availible for download from the Haplotter website (http://hg-wen.uchicago.edu/selection) [23]. Selection was quantified by an absolute value of iHS>2 indicating strong selective pressures, and absolute iHS values ≤2 and ≥1.635 were interpreted as indicating recent-moderate selection, positive or negative iHS values indicated the direction of selection [24]. All replicated SNPs were evaluated for signatures of selection. One locus showed evidence of selection, and this was the lymphocyte associated locus on chromosome 19p13, from 16,336,388–16,429,197 bp. This locus on chromosome 19p13 showed evidence of selection for all SNPs analyzed in the replication phase. All SNPs in this locus appear to be under some degree of negative selection with 2 of the 11 SNPs being under strong negative selection and the rest under recent moderate selection. However, when testing the correlation between effect estimates and iHS at this locus using the ancestral allele as a reference for effect direction, no clear correlation between iHS and effect size was detected (r2 = 0.226, p-value = 0.140). Full iHS annotation for replicated SNPs are shown in in Table S8.
As part of collaborative efforts with the COGENT and RIKEN groups, a coordinated exchange of summary statistics for SNP identified in Table 2 was organized from the parallel studies conducted by these groups. Random-effects meta-analysis techniques were utilized to identify effects that were consistent across studies of diverese ethnic backgrounds. While all joint effect estimates remained in consistent directions with those described in Table 2, heterogeneity of effects across the 3 ancestral populatons severely attenuated the strength of the associations for all but 2 of the genome-wide significant associations identified in this report. The associations at rs4794822 (WBC count) and rs11878602 (lymphocyte count) remained genome-wide significant. Consistent robust effects across ancestral populations at rs11878602 may lend some support to the recent-moderate negative selection at this SNP (iHS = −1.924) being associated with an increase in frequency of the derived A allele associated with decreasing monocyte counts. These results suggest that these SNPs may be very close to the functional variants associated with these effects, as well as exhibiting relatively consistent effects across multiple population ancestries with differing LD structures. The results of these analyses are detailed in Table 3.
Discussion
This meta-analysis has identified ten genome-wide significant trait-locus associations with WBC phenotypes spread across seven genomic loci. Of these trait-locus associations, seven associations represent novel findings, and three associations, across two genomic loci, confirm previously identified associations. The association of chromosome 17q21 near ORMDL3 and CSF3 associated in our study with total WBC count and neutrophil count, and the 9q31 locus associated with monocyte count have been previously demonstrated in European-ancestry and Japanese populations [25], [26]. The chromosome 3q21 locus near RPN1 and C3orf27 has been previously shown to be associated with eosinophil count, and is instead associated with related granulocyte cell measures of monocyte and basophil counts in this study, although we do identify suggestive p-values at ∼1.00E-4 to 1.00E-8 and the same direction of effect for the additional loci identified in Gubjartsson et al., 2010 [19], [27]. In addition, a number of previous GWAS have implicated the monocyte count associated locus at chromosome 8q24.21 as affecting height, renal function, serum protein levels, multiple sclerosis, glioma, leukemia and a number of other cancers [28]–[59]. Through conditional analyses and an analysis of functional relatedness, we have shown correlation among related traits and possible pleiotropic connectivity of these loci across phenotypes that represent measures of biologically related cellular lineages, as well as identified loci showing a direct association between allelic gene expression differences and variation in phenotypic measures.
The associations identified in this study are robust, and several have been previously identified in GWAS studies of immunologically relevant phenotypes, such as the association of celiac disease with the chromosome 2q31.1 locus containing ITGA4. ITGA4 encodes an alpha integrin subunit present on monocytes, lymphocytes, endothelial cells and erythrocytes that serves as an adhesion molecular receptor for VCAM-1, fibronectin. VCAM-1 is expressed at high levels on the vasculature of the bone marrow, and therefore alpha4integrin receptors may play a role in homing and recruitment of certain cell types during hematopoiesis [60], [61].
The overlapping loci on chromosome 17q21 associated with both total WBC and neutrophil counts constitute a single plausible candidate locus contributing to general variability in total WBC count and neutrophil count via hematopoietic mechanisms. These measures are highly correlated, and the estimated SNP effects are also likely correlated for this reason. From a functional perspective, the role of G-CSF, the CSF3 gene product, has been well-described in the biology of myeloid progenitor production and differentiation. This locus previously reached genome-wide significance in a joint analysis of discovery and replication cohorts of total WBC count in individuals of European ancestry [18]. The same locus, containing the genes PSMD3 and CSF3, was associated with neutrophil count in a cohort of Japanese participants [27]. This study of Japanese subjects also reported a significant genome-wide association with neutrophil count for three SNPs in a locus at chromosome 20, containing the gene PLCB4. Due to the lower minor allele frequency in European ancestry individuals of these three correlated SNPs (minor allele frequency = 0.076 in HapMap CEU samples versus 0.289 in HapMap JPT samples for rs2072910) and the marginal effect size detected in the original report, statistical power to detect this association was limited in our analysis [27].
Our data suggest that the locus on chromosome 17q21 has functional connectivity across white cell subtypes. Multiple genes at this locus appeared in all significant pathways identified in the GRAIL analyses showing a functional connectivity across both granulocyte and non-granulocyte cell lineages. Gene clusters detailed in the GRAIL analyses show significant functional clusters including genomic regions that are separately associated with granulocyte and non-granulocyte traits within the same cluster. However, the results of the GRAIL analyses may be influenced by both funding avenues and publication bias, as the classifications are based only on PubMed searchable published results.
Cis-eQTL effects on transcripts in the ORMDL3 and GSDML were shown to be highly correlated with allelic effects associated with neutrophil and total WBC count measures quantified in the meta-analysis. This suggests that variation in total WBC count and neutrophil counts are at least in part due to polymorphism-based regulation of gene expression. This regulation of gene expression by SNPs in this region may be related to some form of systemic immunological function, as variants in this region were also shown to be associated with expression of ORMDL3 in childhood asthma [62]. Interestingly, Okada et al. also showed significant eQTL associations between SNPs and transcripts at this locus, although their associations implicate regulation of PSMD3 as affecting neutrophil variation, rather than the exact transcript associations identified in this report [27]. This slight difference may be attributable to the larger sample size in our study, differences in population ancestry of the two studies, and the use of expression data derived from lymphoblastoid cell lines instead of whole blood. The CSF3 gene at this locus is a possible candidate contributing to effects across multiple cell subtypes at this locus, as functional studies have demonstrated that this gene creates a protein integral to the differentiation and functionality of granulocyte cells [63], [64], [65], [66]. One important caveat to our eQTL analysis is the lack of representation of the CSF3 gene on the expression array used.
The loci on chromosome 6p21 associated with total WBC and lymphocyte counts appear to be independent of each other as the lymphocyte association persists after adjustment for total WBC. While the closest replicated SNP associations are roughly 200 kb from each other, and a shared functional connectivity between these regions was not elucidated in the GRAIL analysis. At this locus, rs2524079 associated with lymphocyte count is in moderate LD with a number of SNPs in the periphery of the total WBC count locus, including rs2844619, a SNP significant in the total WBC count discovery phase analysis (D′ = 0.762, r2 = 0.305, from HapMap Phase II CEU samples). The finding of multiple independent effects at a single locus has occurred in prior studies and include examples such as the finding of two independent signals within the PLAG1 locus associated with human height, suggesting a locus specific effect, in the current example affecting leucopoiesis or leukocyte homeostasis [48]. Our chromosome 6p21 WBC and lymphocyte loci, both harbor candidate genes that have been previously implicated as associated with phenotypes closely related to immunological function. The locus associated with total WBC count on chromosome 6p21 contains genes associated with follicular lymphoma (CHCG22), progression of HIV-1 infection (CDSN, PSORS1C1 and PSORS1C2) and psoriasis (CCHCR1, PSORS1C1 and PSORS1C2) [67], [68], [69]. This gene rich region includes HLA family genes, particularly HLA-B and HLA-C that are candidates within the lymphocyte associated locus, and actually overlap with the psoriasis candidate locus identified previously via linkage mapping studies and includes PSORS1C1 and PSORS1C2, showing the relatedness of these two loci on chromosome 6p21 [69], although conditional analyses adjusting for total white blood cell count validate these as primarily independent effects. The lymphocyte associated regions containing HLA-B and HLA-C, harbors two genes that have been implicated in multiple GWAS as modifiers of immunological responses, associated with IL-18 levels, HIV-1 control, vitiligo, multiple clerosis, and psoriasis [49], [68], [70], [71], [72], [73], [74], [75], [76].
The region associated with basophil and monocyte counts on chromosome 3q21 proximal to the GATA2 gene was previously described as associated with variation in eosinophil count in European and Asian ancestry populations [19]. This association with eosinophil count was not identified as genome-wide significant in our analyses, with multiple SNPs in the GATA2 region (GATA2+/−250 kb) approaching genome-wide significance, exhibiting p-values in the discovery meta-analysis including a regional minimum of 6.73E-07 (rs4328821 ). This is likely due to our decreased sample size for analyses of eosinophil count compared to the original report. However, our data show a significant association between this genomic region and basophil count, and basophils and eosinophils are both granulocytic cells with common lineage in WBC differentiation. GATA2 is a well-known transcription factor involved in maintenance of early hematopoietic cell pools and proximal hematopoietic pathways. In addition, this region proximal to GATA2 is also significantly associated with monocyte counts, showing overlapping associations across both granulocyte and non-granulocyte cell lineages and supporting the previously described functional role of GATA2 more proximally in the WBC differentiation process [77], [78]. The independence across granulocyte and non-granulocyte lineages is evident as both associations showed independent signals of association after adjustment for total WBC.
Our analyses have identified genomic loci associated with total WBC and constituent white cell subtypes in European ancestry cohorts. Our findings differ from the results of similar studies of African American and Asian ancestry populations. Population variation at previously identified total WBC count associated loci of DARC in African American cohorts and PLCB4 in Asian ancestry samples motivated our investigation of possible selection at the loci identified in this report, as allele frequencies of SNPs in DARC and PLCB4 differ across populations. This may be suggestive of recent selection. Our analyses of iHS statistics for genome-wide significant SNPs yielded only one locus under selection, with all SNPs investigated in this region being under negative selection in European ancestry populations. These SNPs on chromosome 19p13 associated with lymphocyte counts are proximal to candidate genes such as CHREP, which function in calcium homeostasis in lymphocytes, and mutations in the coding region of CALR3 are associated with familial hypertrophic cardiomyopathy [79], [80]. A thorough search of literature did not reveal any known selective factors associated with this locus. In addition, the fact that this locus remained genome-wide significant in random-effects modeling across diverse ancestral populations suggests a highly generalizable effect at this locus that may or may not be related to selective factors.
In conclusion, we have identified and replicated a set of 10 independent trait-locus associations influencing multiple related WBC traits, of which seven are novel associations. Integrative analyses of our association data and gene expression analyses support pleiotropic effects that will require further functional testing to clarify.
Materials and Methods
Ethics Statement
All participating studies conducted their research in accordance with their respective institutional scientific and ethical review boards. All human participants provided informed consent and all clinical investigation was conducted in accordance with the Declaration of Helsinki.
Study Methods
WBC counts were measured in 19,509 subjects in 7 discovery cohorts (The Rotterdam Study (RS), Framingham Heart Study(FHS), the NHLBI's Atherosclerosis Risk in Communities (ARIC) Study, the Age, Gene/Environment Susceptibility – Reykjavik Study (AGES) Study, Health Aging and Body Composition Study (Health ABC), the Baltimore Longitudinal Study of Aging (BLSA), and the Invecchiare in Chianti Study (InChianti)) and 11,823 subjects in 10 replication cohorts (the Sorbs, the Twins UK cohort (TwinsUK), Kooperative Gesundheitsforschung in der Region Augsburg (KORAF3 & KORAF4) and UK Blood Services Donor Panel 1 (UKBS1) studies, three of the Italian Network on Genetic Isolates (INGI) studies (Carlantino, Val Borbera and Friuli Venezia Giulia), the Rotterdam Study II (RSII) and the Heart and Vascular Health Study (HVH)). In order to study genetic factors affecting variation of these traits within normal ranges, each study excluded all participants with any WBC measure (total WBC or one of the 5 cell subtypes) outside of +/−2 standard deviations from the mean value for that trait.
WBC phenotypes were derived from data provided by fluorescence activated cell sorting technologies commonly employed in clinical and epidemiological studies to interrogate common hematological elements found in peripheral blood. Total WBC count was reported in thousands of cells per ml, and sub-type specific cell counts were calculated by multiplying the proportion of the WBC count comprised by each cell type by the total WBC measure. Any subject with a trait value greater than 2 SD from the corresponding mean of that trait in each cohort or missing data for any assayed phenotype were excluded from all analyses. Shapiro-Wilk tests of normality were implemented in the smallest discovery cohorts as the data was available at the time of study design (the InChianti study and the Baltimore Longitudinal Study of Aging, BLSA) to evaluate normality of the phenotypes for analyses. Raw values, natural log transformed and square-root transformed values for each phenotype of interest in these two studies were compared with regard to deviations from normality based assessment of the Shapiro-Wilk test statistic in the two studies. Based upon these reviews, a uniform analysis plan was established for conducting each study-level analyses, analysis, using either log transformation (total WBC count, neutrophil count, and monocyte count) or square-root transformation (basophil count, eosinophil count and lymphocyte count) in order to normalize the distributions of the phenotypic data.
At the study level, GWAS analyses were conducted on unrelated participants (except for FHS) of confirmed European ancestry based on either multi-dimensional scaling or principal components analyses, concordance between genotypic and self-reported gender, and successfully genotyped at >95% of attempted SNPs. SNPs were filtered based on criteria of minor allele frequency >0.01 (MAF), missingness per SNP<5% and Hardy-Weinberg equilibrium p-value>1.00E-7 (HWE, used to exclude poorly clustered genotypes). Participants and SNPs passing basic quality control were imputed to >2.4 million SNPs based on HapMapII haplotype data. All studies utilized multivariate linear regression to generate study level summary statistics for each phenotype, with allelic dosages at each SNP used as the independent variable and primary covariates of age at hematology assay, current smoking status and sex. Detailed descriptions of participating studies, their quality control practices and study-level analyses which may differ slightly from those described above are provided in Text S1.
To conduct meta-analyses, all studies submitted summary statistics from the study-level linear regression analyses for each phenotype. Meta-analyses were conducted using inverse-variance weighted fixed-effects models to combine beta coefficients and standard errors from study level regression results for each SNP to derive a combined p-value. Prior to discovery meta-analyses, SNPs were excluded if imputation quality metrics (equivalent to the squared correlation between proximal imputed and genotyped SNPs) were less than 0.30. Study level results were also corrected for genomic inflation factors (λGC) by incorporating study specific λGC estimates into the scaling of the standard errors (SE) of the regression coefficients by multiplying the SE by the square-root of the genomic inflation factor (see Table S1 for study and phenotype specific genomic inflation factors) [81]. Study specific genomic inflation factor estimates for all discovery cohorts were all <1.05 except for 1.12 in the Health ABC analysis of basophil count and 1.09 in the analysis of total white blood cell count in AGES (Table S1). No definitive cause of this inflation could be identified, and of particular interest, the genomic inflation factors for related traits in these two studies were within the normally accepted range. Meta-analyses were implemented using METAL and independently re-analyzed using R to confirm results [82].
We chose a priori to carry over all results from discovery meta-analyses at p-values<5.00E-08 to replication meta-analyses, excluding any SNPs with Cochran's Q test of heterogeneity p-values<0.01 or missing in more than 2 studies. These conservative exclusion criteria caused the exclusion of 6 of 167 genome-wide significant SNPs from replication analyses, and these SNPs did not constitute any new loci of interest. The final number of SNPs for replication analyses was then reduced to 161 candidate SNPs across all phenotypes. For replication meta-analyses of individual SNPs, each phenotype was analyzed separately using similar inverse-variance weighted meta-analyses as in the discovery stage analyses, although no genomic control was used. P-values for significant associations in the replication stage were corrected for the number of SNPs tested per phenotype using the standard Bonferroni correction for multiple testing (total WBC count corrected for 63 SNPs, with a significance threshold of p-value≤7.94E-4; neutrophil count corrected for 46 SNPs, with a significance threshold of p-value≤1.09E-3; basophil count corrected for 1 SNP, with a significance threshold of p-value≤0.05; lymphocyte count corrected for 14 SNPs, with a significance threshold of p-value≤3.57E-3; and monocyte count corrected for 37 SNPs, with a significance threshold of p-value≤1.35E-3). Of the 161 candidate SNPs included in the replication phase, 152 SNPs passed the trait-specific replication p-value thresholds. Ony one genome-wide significant locus failed to replicate, and this locus on chromosome 1q22 contained only one genome-wide significant SNP associated with monocyte count in the discovery phase.
Of the 152 successfully replicated associations, 109 SNPs were unique, since some SNPs were significant across multiple phenotypes. These replicated SNPs were analyzed in GRAIL to infer a possible biological connection between significant meta-analysis loci. GRAIL was used to mine textual data based on PubMed keywords to examine functional relatedness across phenotypes based on inferred biological interconectivity between genes proximal to meta-analysis results. SNP (rs) identifiers for these associated SNPs were input into the GRAIL webserver as a means of specifying genomic regions of interest in constructing query and seed regions to be analyzed. Genes for text mining of the functional datasource were identified using the LD structure of HapMap2 Release 22 CEU samples, gathering gene identifiers to search indexed abstracts from PubMed last curated on May 2010. Genes in regions of interest were clustered based on keyword similarity. These genes and clusters were then scored based on ranked similarity, adjusting for gene size, to generate p-values evaluating the strength of the functional interconnectivity of genes in the regions of interest. P-values for these functional clusters were then false-discovery rate adjusted (FDR) to correct for multiple testing, with the FDR adjusted p-value of 0.05 considered the threshold for significance.
For the eQTL analysis, 501 participants with complete genotyping data from the InChianti study were also successfully assayed on Illumina HT12v.3 genome-wide expression arrays using RNA isolated from whole blood. Quality control of the genome-wide expression data included the exclusion of probes with detection p-values>0.01 with missing data for greater than 5% of participants. Samples must have been assayed with at least 95% of the filtered probe sets passing quality control in order to be included in analyses.
5094 probes passed quality control and were subsequently cubic-spline normalized prior to analysis. In the investigation of possible cis-eQTL associations at regions of interest identified in the meta-analysis, all probes within 500 kb of successfully replicated SNPs from the meta-analysis were identified based on annotations from ReMOAT (http://www.compbio.group.cam.ac.uk/Resources/Annotation/) [83]. Thus, we tested 741 possible cis-eQTLs. Multivariate linear regression was implemented using PLINKv.1.07 [84], testing the dosage of minor alleles as a predictor of gene expression level for each probe. These linear regression models were adjusted for hybridization batch, amplification batches, sex, smoking, study site and age at baseline of study. The p-values generated by each analysis was corrected for the number of tests, with a minimum threshold of significance at p-value≤6.75E-05.
Supporting Information
Zdroje
1. DaneshJCollinsRApplebyPPetoR 1998 Association of fibrinogen, C-reactive protein, albumin, or leukocyte count with coronary heart disease: meta-analyses of prospective studies. JAMA 279 1477 1482
2. MadjidMAwanIWillersonJTCasscellsSW 2004 Leukocyte count and coronary heart disease: implications for risk assessment. J Am Coll Cardiol 44 1945 1956
3. ShankarAWangJJRochtchinaEYuMCKeffordR 2006 Association between circulating white blood cell count and cancer mortality: a population-based cohort study. Arch Intern Med 166 188 194
4. RuggieroCMetterEJCherubiniAMaggioMSenR 2007 White blood cell count and mortality in the Baltimore Longitudinal Study of Aging. J Am Coll Cardiol 49 1841 1850
5. Lloyd-JonesDMCamargoCAAllenLAGiuglianoRPO'DonnellCJ 2003 Predictors of long-term mortality after hospitalization for primary unstable angina pectoris and non-ST-elevation myocardial infarction. Am J Cardiol 92 1155 1159
6. LeeCDFolsomARNietoFJChamblessLEShaharE 2001 White blood cell count and incidence of coronary heart disease and ischemic stroke and mortality from cardiovascular disease in African-American and White men and women: atherosclerosis risk in communities study. Am J Epidemiol 154 758 764
7. GrimmRHJrNeatonJDLudwigW 1985 Prognostic importance of the white blood cell count for coronary, cancer, and all-cause mortality. JAMA 254 1932 1937
8. GillumRFMussolinoMEMadansJH 2005 Counts of neutrophils, lymphocytes, and monocytes, cause-specific mortality and coronary heart disease: the NHANES-I epidemiologic follow-up study. Ann Epidemiol 15 266 271
9. FriedmanGDKlatskyALSiegelaubAB 1974 The leukocyte count as a predictor of myocardial infarction. N Engl J Med 290 1275 1278
10. BrownDWGilesWHCroftJB 2001 White blood cell count: an independent predictor of coronary heart disease mortality among a national cohort. J Clin Epidemiol 54 316 322
11. WheelerJGMussolinoMEGillumRFDaneshJ 2004 Associations between differential leucocyte count and incident coronary heart disease: 1764 incident cases from seven prospective studies of 30,374 individuals. Eur Heart J 25 1287 1292
12. RanaJSBoekholdtSMRidkerPMJukemaJWLubenR 2007 Differential leucocyte count and the risk of future coronary artery disease in healthy men and women: the EPIC-Norfolk Prospective Population Study. J Intern Med 262 678 689
13. NietoFJSzkloMFolsomARRockRMercuriM 1992 Leukocyte count correlates in middle-aged adults: the Atherosclerosis Risk in Communities (ARIC) Study. Am J Epidemiol 136 525 537
14. PiliaGChenWMScuteriAOrruMAlbaiG 2006 Heritability of cardiovascular and personality traits in 6,148 Sardinians. PLoS Genet 2 e132 doi:10.1371/journal.pgen.0020132
15. ReichDNallsMAKaoWHAkylbekovaELTandonA 2009 Reduced neutrophil count in people of African descent is due to a regulatory variant in the Duffy antigen receptor for chemokines gene. PLoS Genet 5 e1000360 doi:10.1371/journal.pgen.1000360
16. NallsMAWilsonJGPattersonNJTandonAZmudaJM 2008 Admixture mapping of white cell count: genetic locus responsible for lower white blood cell count in the Health ABC and Jackson Heart studies. Am J Hum Genet 82 81 87
17. GaneshSKZakaiNAvan RooijFJSoranzoNSmithAV 2009 Multiple loci influence erythrocyte phenotypes in the CHARGE Consortium. Nat Genet 41 1191 1198
18. SoranzoNSpectorTDManginoMKuhnelBRendonA 2009 A genome-wide meta-analysis identifies 22 loci associated with eight hematological parameters in the HaemGen consortium. Nat Genet 41 1182 1190
19. GudbjartssonDFBjornsdottirUSHalapiEHelgadottirASulemP 2009 Sequence variants affecting eosinophil numbers associate with asthma and myocardial infarction. Nat Genet 41 342 347
20. KamataniYMatsudaKOkadaYKuboMHosonoN Genome-wide association study of hematological and biochemical traits in a Japanese population. Nat Genet 42 210 215
21. PsatyBMO'DonnellCJGudnasonVLunettaKLFolsomAR 2009 Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium: Design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ Cardiovasc Genet 2 73 80
22. RaychaudhuriSPlengeRMRossinEJNgACPurcellSM 2009 Identifying relationships among genomic disease regions: predicting genes at pathogenic SNP associations and rare deletions. PLoS Genet 5 e1000534 doi:10.1371/journal.pgen.1000534
23. VoightBFKudaravalliSWenXPritchardJK 2006 A map of recent positive selection in the human genome. PLoS Biol 4 e72 doi:10.1371/journal.pbio.0040072
24. HindorffLASethupathyPJunkinsHARamosEMMehtaJP 2009 Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A 106 9362 9367
25. KamataniYMatsudaKOkadaYKuboMHosonoN 2010 Genome-wide association study of hematological and biochemical traits in a Japanese population. Nat Genet 42 210 215
26. FerreiraMAHottengaJJWarringtonNMMedlandSEWillemsenG 2009 Sequence variants in three loci influence monocyte counts and erythrocyte volume. Am J Hum Genet 85 745 749
27. OkadaYKamataniYTakahashiAMatsudaKHosonoN 2010 Common variations in PSMD3-CSF3 and PLCB4 are associated with neutrophil count. Hum Mol Genet 19 2079 2085
28. 2009 Genome-wide association study identifies new multiple sclerosis susceptibility loci on chromosomes 12 and 20. Nat Genet 41 824 828
29. BeatyTHMurrayJCMarazitaMLMungerRGRuczinskiI A genome-wide association study of cleft lip with and without cleft palate identifies risk variants near MAFB and ABCA4. Nat Genet 42 525 529
30. BirnbaumSLudwigKUReutterHHermsSSteffensM 2009 Key susceptibility locus for nonsyndromic cleft lip with or without cleft palate on chromosome 8q24. Nat Genet 41 473 477
31. Crowther-SwanepoelDBroderickPDi BernardoMCDobbinsSETorresM Common variants at 2q37.3, 8q24.21, 15q21.3 and 16q24.1 influence chronic lymphocytic leukemia risk. Nat Genet 42 132 136
32. CuiROkadaYJangSGKuJLParkJG Common variant in 6q26-q27 is associated with distal colon cancer in an Asian population. Gut
33. DuboisPCTrynkaGFrankeLHuntKARomanosJ Multiple common variants for celiac disease influencing immune gene expression. Nat Genet 42 295 302
34. EastonDFPooleyKADunningAMPharoahPDThompsonD 2007 Genome-wide association study identifies novel breast cancer susceptibility loci. Nature 447 1087 1093
35. EelesRAKote-JaraiZAl OlamaAAGilesGGGuyM 2009 Identification of seven new prostate cancer susceptibility loci through a genome-wide association study. Nat Genet 41 1116 1121
36. EelesRAKote-JaraiZGilesGGOlamaAAGuyM 2008 Multiple newly identified loci associated with prostate cancer susceptibility. Nat Genet 40 316 321
37. Enciso-MoraVBroderickPMaYJarrettRFHjalgrimH A genome-wide association study of Hodgkin's lymphoma identifies new susceptibility loci at 2p16.1 (REL), 8q24.21 and 10p14 (GATA3). Nat Genet 42 1126 1130
38. FerreiraRCPan-HammarstromQGrahamRRGatevaVFontanG Association of IFIH1 and other autoimmunity risk alleles with selective IgA deficiency. Nat Genet 42 777 780
39. FletcherOJohnsonNOrrNHoskingFJGibsonLJ Novel breast cancer susceptibility locus at 9q31.2: results of a genome-wide association study. J Natl Cancer Inst 103 425 435
40. FrankeAMcGovernDPBarrettJCWangKRadford-SmithGL Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci. Nat Genet 42 1118 1125
41. GoodeELChenevix-TrenchGSongHRamusSJNotaridouM A genome-wide association study identifies susceptibility loci for ovarian cancer at 2q31 and 8q24. Nat Genet 42 874 879
42. GrantSFWangKZhangHGlabersonWAnnaiahK 2009 A genome-wide association study identifies a locus for nonsyndromic cleft lip with or without cleft palate on 8q24. J Pediatr 155 909 913
43. GudmundssonJSulemPGudbjartssonDFBlondalTGylfasonA 2009 Genome-wide association and replication studies identify four variants associated with prostate cancer susceptibility. Nat Genet 41 1122 1126
44. GudmundssonJSulemPManolescuAAmundadottirLTGudbjartssonD 2007 Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24. Nat Genet 39 631 637
45. HansonRLCraigDWMillisMPYeattsKAKobesS 2007 Identification of PVT1 as a candidate gene for end-stage renal disease in type 2 diabetes using a pooling-based genome-wide single nucleotide polymorphism association study. Diabetes 56 975 983
46. KiemeneyLASulemPBesenbacherSVermeulenSHSigurdssonA A sequence variant at 4p16.3 confers susceptibility to urinary bladder cancer. Nat Genet 42 415 419
47. KiemeneyLAThorlaciusSSulemPGellerFAbenKK 2008 Sequence variant on 8q24 confers susceptibility to urinary bladder cancer. Nat Genet 40 1307 1312
48. Lango AllenHEstradaKLettreGBerndtSIWeedonMN Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467 832 838
49. MelzerDPerryJRHernandezDCorsiAMStevensK 2008 A genome-wide association study identifies protein quantitative trait loci (pQTLs). PLoS Genet 4 e1000072 doi:10.1371/journal.pgen.1000072
50. RothmanNGarcia-ClosasMChatterjeeNMalatsNWuX A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci. Nat Genet 42 978 984
51. SheteSHoskingFJRobertsonLBDobbinsSESansonM 2009 Genome-wide association study identifies five susceptibility loci for glioma. Nat Genet 41 899 904
52. TakataRAkamatsuSKuboMTakahashiAHosonoN Genome-wide association study identifies five new susceptibility loci for prostate cancer in the Japanese population. Nat Genet 42 751 754
53. TenesaAFarringtonSMPrendergastJGPorteousMEWalkerM 2008 Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21. Nat Genet 40 631 637
54. ThomasGJacobsKBYeagerMKraftPWacholderS 2008 Multiple loci identified in a genome-wide association study of prostate cancer. Nat Genet 40 310 315
55. TomlinsonIWebbECarvajal-CarmonaLBroderickPKempZ 2007 A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21. Nat Genet 39 984 988
56. TomlinsonIPWebbECarvajal-CarmonaLBroderickPHowarthK 2008 A genome-wide association study identifies colorectal cancer susceptibility loci on chromosomes 10p14 and 8q23.3. Nat Genet 40 623 630
57. TurnbullCAhmedSMorrisonJPernetDRenwickA Genome-wide association study identifies five new breast cancer susceptibility loci. Nat Genet 42 504 507
58. YeagerMOrrNHayesRBJacobsKBKraftP 2007 Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat Genet 39 645 649
59. ZankeBWGreenwoodCMRangrejJKustraRTenesaA 2007 Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24. Nat Genet 39 989 994
60. SimmonsPJMasinovskyBLongeneckerBMBerensonRTorok-StorbB 1992 Vascular cell adhesion molecule-1 expressed by bone marrow stromal cells mediates the binding of hematopoietic progenitor cells. Blood 80 388 395
61. AvecillaSTHattoriKHeissigBTejadaRLiaoF 2004 Chemokine-mediated interaction of hematopoietic progenitors with the bone marrow vascular niche is required for thrombopoiesis. Nat Med 10 64 71
62. MoffattMFKabeschMLiangLDixonALStrachanD 2007 Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma. Nature 448 470 473
63. O'MahonyDSPhamUIyerRHawnTRLilesWC 2008 Differential constitutive and cytokine-modulated expression of human Toll-like receptors in primary neutrophils, monocytes, and macrophages. Int J Med Sci 5 1 8
64. ValenteJFAlexanderJWLiBGNoelJGCusterDA 2002 Effect of in vivo infusion of granulocyte colony-stimulating factor on immune function. Shock 17 23 29
65. NagataSTsuchiyaMAsanoSKaziroYYamazakiT 1986 Molecular cloning and expression of cDNA for human granulocyte colony-stimulating factor. Nature 319 415 418
66. SaitoTUsuiNAsaiODobashiNYanoS 2007 Elevated serum levels of human matrix metalloproteinase-9 (MMP-9) during the induction of peripheral blood stem cell mobilization by granulocyte colony-stimulating factor (G-CSF). J Infect Chemother 13 426 428
67. SkibolaCFBracciPMHalperinECondeLCraigDW 2009 Genetic variants at 6p21.33 are associated with susceptibility to follicular lymphoma. Nat Genet 41 873 875
68. FellayJGeDShiannaKVColomboSLedergerberB 2009 Common genetic variation and the control of HIV-1 in humans. PLoS Genet 5 e1000791 doi:10.1371/journal.pgen.1000791
69. ValdimarssonH 2007 The genetic basis of psoriasis. Clin Dermatol 25 563 567
70. CaponFBijlmakersMJWolfNQuarantaMHuffmeierU 2008 Identification of ZNF313/RNF114 as a novel psoriasis susceptibility gene. Hum Mol Genet 17 1938 1945
71. De JagerPLJiaXWangJde BakkerPIOttoboniL 2009 Meta-analysis of genome scans and replication identify CD6, IRF8 and TNFRSF1A as new multiple sclerosis susceptibility loci. Nat Genet 41 776 782
72. LimouSLe ClercSCoulongesCCarpentierWDinaC 2009 Genomewide association study of an AIDS-nonprogression cohort emphasizes the role played by HLA genes (ANRS Genomewide Association Study 02). J Infect Dis 199 419 426
73. LiuYHelmsCLiaoWZabaLCDuanS 2008 A genome-wide association study of psoriasis and psoriatic arthritis identifies new disease loci. PLoS Genet 4 e1000041 doi:10.1371/journal.pgen.1000041
74. NairRPDuffinKCHelmsCDingJStuartPE 2009 Genome-wide scan reveals association of psoriasis with IL-23 and NF-kappaB pathways. Nat Genet 41 199 204
75. PelakKGoldsteinDBWalleyNMFellayJGeD 2010 Host determinants of HIV-1 control in African Americans. J Infect Dis 201 1141 1149
76. QuanCRenYQXiangLHSunLDXuAE 2010 Genome-wide association study for vitiligo identifies susceptibility loci at 6q27 and the MHC. Nat Genet 42 614 618
77. TsaiFYOrkinSH 1997 Transcription factor GATA-2 is required for proliferation/survival of early hematopoietic cells and mast cell formation, but not for erythroid and myeloid terminal differentiation. Blood 89 3636 3643
78. LabbayeCValtieriMBarberiTMecciaEMasellaB 1995 Differential expression and functional role of GATA-2, NF-E2, and GATA-1 in normal adult hematopoiesis. J Clin Invest 95 2346 2358
79. O'RourkeFALaPlanteJMFeinsteinMB 2003 Antisense-mediated loss of calcium homoeostasis endoplasmic reticulum protein (CHERP; ERPROT213-21) impairs Ca2+ mobilization, nuclear factor of activated T-cells (NFAT) activation and cell proliferation in Jurkat T-lymphocytes. Biochem J 373 133 143
80. ChiuCTeboMInglesJYeatesLArthurJW 2007 Genetic screening of calcium regulation genes in familial hypertrophic cardiomyopathy. J Mol Cell Cardiol 43 337 343
81. de BakkerPIFerreiraMAJiaXNealeBMRaychaudhuriS 2008 Practical aspects of imputation-driven meta-analysis of genome-wide association studies. Hum Mol Genet 17 R122 128
82. WillerCJLiYAbecasisGR 2010 METAL: Fast and efficient meta-analysis of genomewide association scans. Bioinformatics
83. Barbosa-MoraisNLDunningMJSamarajiwaSADarotJFRitchieME 2010 A re-annotation pipeline for Illumina BeadArrays: improving the interpretation of gene expression data. Nucleic Acids Res 38 e17
84. PurcellSNealeBTodd-BrownKThomasLFerreiraMA 2007 PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81 559 575
Štítky
Genetika Reprodukční medicínaČlánek vyšel v časopise
PLOS Genetics
2011 Číslo 6
Nejčtenější v tomto čísle
- Statistical Inference on the Mechanisms of Genome Evolution
- Recurrent Chromosome 16p13.1 Duplications Are a Risk Factor for Aortic Dissections
- Chromosomal Macrodomains and Associated Proteins: Implications for DNA Organization and Replication in Gram Negative Bacteria
- Maps of Open Chromatin Guide the Functional Follow-Up of Genome-Wide Association Signals: Application to Hematological Traits