Advertisement
Research Article

Identification, Replication, and Fine-Mapping of Loci Associated with Adult Height in Individuals of African Ancestry

  • Amidou N'Diaye equal contributor,

    equal contributor Contributed equally to this work with: Amidou N'Diaye, Gary K. Chen, Cameron D. Palmer

    Affiliation: Montreal Heart Institute, Montréal, Canada

    X
  • Gary K. Chen equal contributor,

    equal contributor Contributed equally to this work with: Amidou N'Diaye, Gary K. Chen, Cameron D. Palmer

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Cameron D. Palmer equal contributor,

    equal contributor Contributed equally to this work with: Amidou N'Diaye, Gary K. Chen, Cameron D. Palmer

    Affiliations: Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America, Divisions of Genetics and Endocrinology and Program in Genomics, Children's Hospital Boston, Boston, Massachusetts, United States of America

    X
  • Bing Ge,

    Affiliation: Department of Human Genetics, McGill University and Genome Quebec Innovation Centre, Montreal, Canada

    X
  • Bamidele Tayo,

    Affiliation: Department of Preventive Medicine and Epidemiology, Loyola University Chicago Stritch School of Medicine, Maywood, Illinois, United States of America

    X
  • Rasika A. Mathias,

    Affiliation: Department of Medicine, The Johns Hopkins GeneSTAR Research Program, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America

    X
  • Jingzhong Ding,

    Affiliation: Sticht Center on Aging, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America

    X
  • Michael A. Nalls,

    Affiliation: Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Adebowale Adeyemo,

    Affiliation: NIH Intramural Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Véronique Adoue,

    Affiliation: Department of Human Genetics, McGill University and Genome Quebec Innovation Centre, Montreal, Canada

    X
  • Christine B. Ambrosone,

    Affiliation: Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, New York, United States of America

    X
  • Larry Atwood,

    Affiliation: Department of Neurology, Boston University School of Medicine, Boston, Massachusetts, United States of America

    X
  • Elisa V. Bandera,

    Affiliation: The Cancer Institute of New Jersey, New Brunswick, New Jersey, United States of America

    X
  • Lewis C. Becker,

    Affiliation: Department of Medicine, The Johns Hopkins GeneSTAR Research Program, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America

    X
  • Sonja I. Berndt,

    Affiliation: Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Leslie Bernstein,

    Affiliation: Division of Cancer Etiology, Department of Population Science, Beckman Research Institute, City of Hope, Duarte, California, United States of America

    X
  • William J. Blot,

    Affiliations: International Epidemiology Institute, Rockville, Maryland, United States of America, Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, Tennessee, United States of America, Vanderbilt–Ingram Cancer Center, Nashville, Tennessee, United States of America

    X
  • Eric Boerwinkle,

    Affiliation: Human Genetics Center and Institute of Molecular Medicine and Division of Epidemiology, University of Texas Health Science Center, Houston, Texas, United States of America

    X
  • Angela Britton,

    Affiliation: Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Graham Casey,

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Stephen J. Chanock,

    Affiliation: Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Ellen Demerath,

    Affiliation: Division of Epidemiology and Community Health, University of Minnesota School of Public Health, Minneapolis, Minnesota, United States of America

    X
  • Sandra L. Deming,

    Affiliations: Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, Tennessee, United States of America, Vanderbilt–Ingram Cancer Center, Nashville, Tennessee, United States of America

    X
  • W. Ryan Diver,

    Affiliation: Epidemiology Research Program, American Cancer Society, Atlanta, Georgia, United States of America

    X
  • Caroline Fox,

    Affiliation: Department of Medicine, University of Mississippi Medical Center, Jackson, Mississippi, United States of America

    X
  • Tamara B. Harris,

    Affiliation: Laboratory of Epidemiology, Demography, and Biometry, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Dena G. Hernandez,

    Affiliation: Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Jennifer J. Hu,

    Affiliation: Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, Florida, United States of America

    X
  • Sue A. Ingles,

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Esther M. John,

    Affiliations: Cancer Prevention Institute of California, Fremont, California, United States of America, School of Medicine and Stanford Cancer Center, Stanford University, Stanford, California, United States of America

    X
  • Craig Johnson,

    Affiliation: Department of Biostatistics, University of Washington, Seattle, Washington, United States of America

    X
  • Brendan Keating,

    Affiliation: Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America

    X
  • Rick A. Kittles,

    Affiliation: Department of Medicine, University of Illinois at Chicago, Chicago, Illinois, United States of America

    X
  • Laurence N. Kolonel,

    Affiliation: Epidemiology Program, Cancer Research Center, University of Hawaii, Honolulu, Hawaii, United States of America

    X
  • Stephen B. Kritchevsky,

    Affiliation: Sticht Center on Aging, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America

    X
  • Loic Le Marchand,

    Affiliation: Epidemiology Program, Cancer Research Center, University of Hawaii, Honolulu, Hawaii, United States of America

    X
  • Kurt Lohman,

    Affiliation: Sticht Center on Aging, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America

    X
  • Jiankang Liu,

    Affiliation: University of Mississippi Medical Center, Jackson, Mississippi, United States of America

    X
  • Robert C. Millikan,

    Affiliation: Department of Epidemiology, Gillings School of Global Public Health and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, United States of America

    X
  • Adam Murphy,

    Affiliation: Department of Urology, Northwestern University, Chicago, Illinois, United States of America

    X
  • Solomon Musani,

    Affiliation: Jackson Heart Study, Department of Medicine, Division of Cardiovascular Disease, University of Mississippi Medical Center, Jackson, Mississippi, United States of America

    X
  • Christine Neslund-Dudas,

    Affiliation: Department of Biostatistics and Research Epidemiology, Henry Ford Hospital, Detroit, Michigan, United States of America

    X
  • Kari E. North,

    Affiliations: Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina, United States of America, Carolina Center for Genome Sciences, University of North Carolina, Chapel Hill, North Carolina, United States of America

    X
  • Sarah Nyante,

    Affiliation: Department of Epidemiology, Gillings School of Global Public Health and Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, United States of America

    X
  • Adesola Ogunniyi,

    Affiliation: Department of Medicine, University of Ibadan, Ibadan, Nigeria

    X
  • Elaine A. Ostrander,

    Affiliation: Cancer Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • George Papanicolaou,

    Affiliation: National Heart, Lung, and Blood Institute (NHLBI), Division of Cardiovascular Sciences, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Sanjay Patel,

    Affiliation: Division of Sleep Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States of America

    X
  • Curtis A. Pettaway,

    Affiliation: Department of Urology, The University of Texas M. D. Anderson Cancer Center, Houston, Texas, United States of America

    X
  • Michael F. Press,

    Affiliation: Department of Pathology, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Susan Redline,

    Affiliation: Division of Sleep Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States of America

    X
  • Jorge L. Rodriguez-Gil,

    Affiliation: Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, Florida, United States of America

    X
  • Charles Rotimi,

    Affiliation: NIH Intramural Center for Research on Genomics and Global Health, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Benjamin A. Rybicki,

    Affiliation: Department of Biostatistics and Research Epidemiology, Henry Ford Hospital, Detroit, Michigan, United States of America

    X
  • Babatunde Salako,

    Affiliation: Department of Medicine, University of Ibadan, Ibadan, Nigeria

    X
  • Pamela J. Schreiner,

    Affiliation: Division of Epidemiology and Community Health, University of Minnesota, Minneapolis, Minnesota, United States of America

    X
  • Lisa B. Signorello,

    Affiliations: International Epidemiology Institute, Rockville, Maryland, United States of America, Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, Tennessee, United States of America, Vanderbilt–Ingram Cancer Center, Nashville, Tennessee, United States of America

    X
  • Andrew B. Singleton,

    Affiliation: Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Janet L. Stanford,

    Affiliation: Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America

    X
  • Alex H. Stram,

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Daniel O. Stram,

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Sara S. Strom,

    Affiliation: Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, Texas, United States of America

    X
  • Bhoom Suktitipat,

    Affiliations: Department of Medicine, The Johns Hopkins GeneSTAR Research Program, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America, The Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, United States of America

    X
  • Michael J. Thun,

    Affiliation: Epidemiology Research Program, American Cancer Society, Atlanta, Georgia, United States of America

    X
  • John S. Witte,

    Affiliation: Institute for Human Genetics, Departments of Epidemiology and Biostatistics and Urology, University of California San Francisco, San Francisco, California, United States of America

    X
  • Lisa R. Yanek,

    Affiliation: Department of Medicine, The Johns Hopkins GeneSTAR Research Program, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America

    X
  • Regina G. Ziegler,

    Affiliation: Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, United States of America

    X
  • Wei Zheng,

    Affiliations: Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, Tennessee, United States of America, Vanderbilt–Ingram Cancer Center, Nashville, Tennessee, United States of America

    X
  • Xiaofeng Zhu,

    Affiliation: Department of Biostatistics and Epidemiology, Case Western Reserve University, Cleveland, Ohio, United States of America

    X
  • Joseph M. Zmuda,

    Affiliation: Department of Epidemiology, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America

    X
  • Alan B. Zonderman,

    Affiliation: Laboratory of Personality and Cognition, National Institute on Aging, National Institutes of Health, Baltimore, Maryland, United States of America

    X
  • Michele K. Evans,

    Affiliation: Health Disparities Research Section, Clinical Research Branch, National Institute on Aging, National Institutes of Health, Baltimore, Maryland, United States of America

    X
  • Yongmei Liu,

    Affiliation: Sticht Center on Aging, Wake Forest University School of Medicine, Winston-Salem, North Carolina, United States of America

    X
  • Diane M. Becker,

    Affiliation: Department of Medicine, The Johns Hopkins GeneSTAR Research Program, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America

    X
  • Richard S. Cooper,

    Affiliation: Department of Preventive Medicine and Epidemiology, Loyola University Chicago Stritch School of Medicine, Maywood, Illinois, United States of America

    X
  • Tomi Pastinen,

    Affiliation: Department of Human Genetics, McGill University and Genome Quebec Innovation Centre, Montreal, Canada

    X
  • Brian E. Henderson,

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Joel N. Hirschhorn mail,

    joelh@broadinstitute.org (JN Hirschhorn); guillaume.lettre@umontreal.ca (G Lettre); haiman@usc.edu (CA Haiman)

    Affiliations: Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America, Divisions of Genetics and Endocrinology and Program in Genomics, Children's Hospital Boston, Boston, Massachusetts, United States of America, Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America

    X
  • Guillaume Lettre mail,

    joelh@broadinstitute.org (JN Hirschhorn); guillaume.lettre@umontreal.ca (G Lettre); haiman@usc.edu (CA Haiman)

    Affiliations: Montreal Heart Institute, Montréal, Canada, Département de Médecine, Université de Montréal, Montréal, Canada

    X
  • Christopher A. Haiman mail

    joelh@broadinstitute.org (JN Hirschhorn); guillaume.lettre@umontreal.ca (G Lettre); haiman@usc.edu (CA Haiman)

    Affiliation: Department of Preventive Medicine, Keck School of Medicine and Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, United States of America

    X
  • Published: October 06, 2011
  • DOI: 10.1371/journal.pgen.1002298
Corrections

21 Nov 2011: N'Diaye A, Chen GK, Palmer CD, Ge B, Tayo B, et al. (2011) Correction: Identification, Replication, and Fine-Mapping of Loci Associated with Adult Height in Individuals of African Ancestry. PLoS Genet 7(11): 10.1371/annotation/58c67154-3f10-4155-9085-dcd6e3689008. doi: 10.1371/annotation/58c67154-3f10-4155-9085-dcd6e3689008 | View correction

Abstract

Adult height is a classic polygenic trait of high heritability (h2 ~0.8). More than 180 single nucleotide polymorphisms (SNPs), identified mostly in populations of European descent, are associated with height. These variants convey modest effects and explain ~10% of the variance in height. Discovery efforts in other populations, while limited, have revealed loci for height not previously implicated in individuals of European ancestry. Here, we performed a meta-analysis of genome-wide association (GWA) results for adult height in 20,427 individuals of African ancestry with replication in up to 16,436 African Americans. We found two novel height loci (Xp22-rs12393627, P = 3.4×10−12 and 2p14-rs4315565, P = 1.2×10−8). As a group, height associations discovered in European-ancestry samples replicate in individuals of African ancestry (P = 1.7×10−4 for overall replication). Fine-mapping of the European height loci in African-ancestry individuals showed an enrichment of SNPs that are associated with expression of nearby genes when compared to the index European height SNPs (P<0.01). Our results highlight the utility of genetic studies in non-European populations to understand the etiology of complex human diseases and traits.

Author Summary

Adult height is an ideal phenotype to improve our understanding of the genetic architecture of complex diseases and traits: it is easily measured and usually available in large cohorts, relatively stable, and mostly influenced by genetics (narrow-sense heritability of height h2~0.8). Genome-wide association (GWA) studies in individuals of European ancestry have identified >180 single nucleotide polymorphisms (SNPs) associated with height. In the current study, we continued to use height as a model polygenic trait and explored the genetic influence in populations of African ancestry through a meta-analysis of GWA height results from 20,809 individuals of African descent. We identified two novel height loci not previously found in Europeans. We also replicated the European height signals, suggesting that many of the genetic variants that are associated with height are shared between individuals of European and African descent. Finally, in fine-mapping the European height loci in African-ancestry individuals, we found SNPs more likely to be associated with the expression of nearby genes than the SNPs originally found in Europeans. Thus, our results support the utility of performing genetic studies in non-European populations to gain insights into complex human diseases and traits.

Introduction

Adult height is a classic polygenic trait of high heritability (h2~0.8) [1], [2]. A recent large meta-analysis of genome-wide association (GWA) results for height, which included data from >180,000 individuals of European descent, identified 180 loci that associate with variation in height [3]. The most significantly associated variants at these loci explain approximately 10% of the variance, consistent with the hypothesis put forward in 1918 by Fisher on the “cumulative Mendelian factors”, which suggested that the segregation of a large number of genetic variants, each of small effect, is sufficient to explain the variation in height observed in humans [4].

In parallel to the work in European-ancestry populations, GWA studies for adult height in other ethnic groups, including Koreans, Japanese, Africans, and African Americans have also been performed [5][10]. The GWA scans in East Asians replicated several of the height loci already identified in individuals of European descent, and also found evidence for new height loci not previously implicated in individuals of European ancestry [6], [7]. The studies in Africans and African Americans were modest in size and, although they replicated nominally some of the associations previously found in European populations, were not well-powered to find new population-specific height loci [8], [9].

To search for novel loci for height in populations of African ancestry, and to explore systematically the replication of previously validated height loci, we combined GWA results for height from nine studies totaling 20,427 individuals of African descent. We identified two novel height loci and observed significant evidence for the replication of European height signals in African-derived populations. In fine-mapping of the European height loci we also identified variants that better define the association in individuals of African ancestry and control local gene expression in cis (cis-eQTLs), suggesting that they are likely to be better surrogates of the biologically functional alleles.

Results/Discussion

The meta-analysis included results from nine studies: four population-based African-American studies (ARIC (N = 2,740), CARDIA (N = 699), JHS (N = 2,119), and MESA (N = 1,646)), one family-based African-American study (CFS (N = 386)), African-American GWA study consortia of breast (AABC (N = 5,380)) and prostate cancer (AAPC (N = 5,526)) and two case-control studies of obesity (Maywood (N = 743)) and hypertension (Nigeria (N = 1,188)) (Materials and Methods, Text S1 and Table S1). We tested associations between 3,310,998 genotyped or imputed SNPs and sex-, age-, and disease status-adjusted height Z-scores under an additive genetic model, correcting for global admixture using principal components (PCs) as covariates, and modeling family structure when appropriate (Text S1). Height results for each study were scaled using genomic control, and then combined using the inverse-variance meta-analytic method (Text S1).

The quantile-quantile (QQ) plot suggested little departure from the null expectation, except at the right end tail of the distribution (Figure 1). The associations that deviate most strongly from the null correspond to loci previously associated with height in European populations, providing a strong validation of our approach (Table 1). The overall inflation factor in the meta-analysis was λGC = 1.064 and results were again scaled using genomic control, a slightly conservative approach [11].

thumbnail

Figure 1. Quantile–quantile (QQ) plot of the meta-analysis with up to 3.3 M SNPs across 9 studies (N = 20,427).

Each black circle represents an observed statistic for genotyped SNPs only (defined as the −log10P) against the corresponding expected statistic. The grey area corresponds to the 90% confidence intervals calculated empirically using permutations. The individual studies' inflation factors, as well as the inflation factor of the meta-analysis, were corrected using genomic control. The inflation factor of the meta-analysis is λGC = 1.064.

doi:10.1371/journal.pgen.1002298.g001
thumbnail

Table 1. SNPs that reached genome-wide significance (P<5×10−8) in the combined analysis.

doi:10.1371/journal.pgen.1002298.t001

Two genomic loci (LCORL on chromosome 4 and PPARD on chromosome 6), previously implicated in height in European populations [3], reached genome-wide significance in the discovery meta-analysis (P<5×10−8; Table 1, Figure S1 and Table S2). We prioritized 153 SNPs with P<1×10−5 from our meta-analysis for in silico replication in up to 16,436 African Americans from five additional studies (Text S1). After combining the data in a joint analysis, 40 SNPs from 11 different chromosomal regions reached genome-wide significance (Table 1 and Table S2), including two SNPs not previously implicated in the regulation of height: rs12393627 on the X-chromosome and rs4315565 on chromosome 2 (Table 1).

rs12393627 is located 3.2 kb upstream of the arylsulfatase E (ARSE) gene on chromosome Xp22 (Figure 2a). Mutations in the ARSE gene cause X-linked brachytelephalangic chondrodysplasia punctata (CDPX1; OMIM #302950), a congenital disorder of bone and cartilage development also characterized by short stature [12]. The co-localization of human growth syndrome genes with SNPs associated with adult height has been reported in European-ancestry samples [3], [13], [14]. rs12393627 reached a P = 1.4×10−6 in the initial meta-analysis (N = 8,333; the SNP was not on the genotyping arrays and/or could not be imputed for AABC, AAPC, Maywood, and Nigeria), and was strongly replicated for association with height in 13,153 African Americans (replication P = 2.6×10−7; combined P = 5.7×10−12) (Table 1). When considering the number of independent markers in a 1 Mb window we found no secondary independent signals in the region conditioning on genotype at rs12393627. We also found no significant evidence of heterogeneity at rs12393627 between men and women (P = 0.26).

thumbnail

Figure 2. SNPs are plotted using LocusZoom [22] by position on the chromosome against association with adult height (−log10 P).

The SNP name shown on the plot was the most significant SNP after the discovery meta-analysis. Estimated recombination rates (from HapMap) are plotted in cyan to reflect the local LD structure. The SNPs surrounding the most significant SNP are color coded to reflect their LD with this SNP (taken from pairwise r2 values from the ARIC African Americans Affymetrix6.0 dataset for rs1239627 (A) and from the HapMap YRI data for rs4315565 (B)). The size of the points on the plots is proportional to the number of individuals with available genotype for any given SNP. Genes, the position of exons and the direction of transcription from the UCSC genome browser are noted. Hashmarks represent SNP positions available in the meta-analysis.

doi:10.1371/journal.pgen.1002298.g002

The derived A-allele (i.e. non-ancestral allele based on the chimp genome) at rs12393627 is monomorphic in the HapMap CEU individuals and has a frequency of 54% in the HapMap YRI participants. We also investigated the association of rs12393627 with height in 3,487 Japanese Americans and 2,979 Latinos from the Multiethnic Cohort (MEC) (Text S1). Whereas the marker was monomorphic in Japanese Americans, the association between height and rs12393627 was replicated in Latinos with a comparable effect size (A-allele frequency = 97%, standardized effect size = −0.177±0.088, P = 0.044). The frequency of this allele is consistent with previous estimates of ~5–10% African ancestry among Latinos in the MEC [15]. Measures of local ancestry (the number of European-derived chromosomes (0, 1, or 2) in each individual) were not available for the X-chromosome, but since the marker is polymorphic only in African-derived populations (according to HapMap phase 3 data [16]), the height association signal defined by rs12393627 on Xp22 is likely to be specific to these populations.

SNP rs4315565 on 2p14 (discovery P = 1.5×10−7; combined P = 1.2×10−8) is located in intron 3 of the anthrax toxin receptor 1 (ANTXR1) gene, and 189 kb upstream of the bone morphogenetic protein 10 (BMP10) gene (Figure 2b), a member of the TGF-β signaling pathway. This pathway is important in normal skeletal growth [17] and implicated in previous GWA studies of height [3]. We observed no evidence of heterogeneity by sex (P = 0.34) and no independent signals when conditioning on rs4315565 within a 1 Mb window.

The allele frequency of rs4315565 differs strongly between the HapMap CEU and YRI samples: the derived A-allele, which is associated with decreased height, has a frequency of 85% in CEU and 2% in YRI, respectively (Fst = 0.701). This allele frequency difference is consistent with recent weak positive selection acting in individuals of European ancestry (iHS = −1.668) [18], and could indicate an association with local ancestry. In a conditional analysis where we controlled for global ancestry using PCs as covariates, we did observe a significant association between height and local ancestry at the ANTXR1 locus, with an increase in the number of European chromosomes associated with a decrease in height (P = 1.6×10−6; N = 18,495 samples available for this analysis) [19]. Still controlling for global ancestry with PCs, genotypes at rs4315565 could account for the association between local ancestry and height (P = 0.22 for local ancestry conditional on rs4315565), while the association of rs4315565 with height diminished but remained significant in the same model (P = 4.6×10−8 and P = 0.0044, before and after conditioning on local ancestry; N = 18,495).

To investigate the relationship between rs4135565 and local ancestry further, we considered the background on which the rs4135565 variants were present in different individuals. In analyses stratified by the number of African/European chromosomes in the region, rs4315565 was nominally associated with height in African Americans that are homozygous (P = 0.038) or heterozygous (P = 0.043) for African chromosomes (with effect size stronger in African chromosome homozygotes) (Table 2). In 1,188 Nigerians from the discovery phase, a similar trend between height and rs4315565 was observed (P = 0.075). rs4315565 was not significantly associated with height in African Americans that are homozygous for European chromosomes at the locus (P = 0.91), although the sample size of this sub-group is small (N = 943) (Table 2). More strikingly, this variant is not associated with height in populations of European ancestry in the GIANT Consortium (N = 133,653, P = 0.66) [3]. Together, these results suggest that 2p14 harbors at least one novel height-associated variant that is strongly associated with African ancestry and is correlated with rs4315565 in African- but not European-derived chromosomes. Our results also indicate that rs4315565 is a better marker of the functional variant(s) than is local ancestry or any other SNPs represented in HapMap.

thumbnail

Table 2. Stratified analysis of the association between adult height and rs4315565 based on ancestry at the locus.

doi:10.1371/journal.pgen.1002298.t002

We then considered the previously known height loci. Of the 180 SNPs previously reported by the GIANT Consortium to be associated with height in populations of European ancestry, the effect estimates for 38 SNPs were in the same direction as the initial report and nominally associated (P<0.05) with height in the African-derived height meta-analysis. This number is however a lower-bound estimate of the number of known European height loci that replicate in individuals of African ancestry because it does not take into account different LD relationships in European and African chromosomes: since any of the SNPs in LD in European-ancestry individuals with the GIANT height SNPs could be causal, this entire set of SNPs need to be evaluated, both in terms of statistical significance and direction of effect, for replication in the African height meta-analysis. To address this issue, we utilized a rigorous framework, described in the Materials and Methods section and graphically summarized in Figure S2, to test systematically for replication at the previously known European height loci in the African meta-analysis. We started with 161 of the 180 height SNPs identified by the GIANT Consortium (19 SNPs could not be tested because linkage disequilibrium (LD) information in HapMap was not available) [3], and generated 5,819 sets of 161 SNPs matched on minor allele frequency using the HapMap2+3 CEU dataset. We then counted the number of SNPs (also considering LD proxies) in the African height meta-analysis with directionally consistent (one-tailed) P≤0.05 for the set of 161 height-associated SNPs and the simulated sets. We found one simulation with a count of nominal associations equal to or higher than what we observed for the 161 height-associated SNPs (P = 1.7×10−4; 171 nominal associations for the GIANT height SNPs (and their proxies); median number of nominal associations to height in the matched sets of SNPs = 28 (range = 8–172)). Therefore, we found strong overall evidence of replication in our large meta-analysis of 20,427 individuals of African ancestry for SNPs previously associated with adult height in individuals of European ancestry, indicating a substantial shared genetic basis for height in populations separated since the out-of-Africa event.

The replication procedure described above also allowed us to identify, for each of the 161 European height loci that we assessed using data from our African meta-analysis, the best candidate height index SNP (Table 3 and Table S3). For instance in population of European ancestry at the LCORL locus on chromosome 4, the GIANT height SNP (rs6449353) and the SNP identified by fine-mapping in the African height meta-analysis (rs7663818) are both strongly associated with height (P<1×10−25) and in strong LD (r2>0.8) with each other (Figure 3a). However, in African-derived populations, LD is weaker between the two SNPs (r2<0.6) and the association with height is stronger for rs7663818 (P = 2.9×10−7) than for rs6449353 (P = 0.0025) (Figure 3b). When we consider SNPs in strong LD (r2>0.8) with rs7663818 in HapMap CEU and YRI populations, they define genomic intervals of 250 kb and 80 kb, respectively (light blue boxes in Figure 3). Finally, in lymphoblastoid cell lines derived from YRI individuals (Materials and Methods), rs7663818, but not rs6449353, is associated with LCORL gene expression levels (LCORL eQTL P = 0.0026 and P = 0.13 for rs7663818 and rs6449353, respectively). Thus, the LCORL locus illustrates a clear example of the utility of fine-mapping association signals in other ethnic groups, both in terms of narrowing the genomic interval of interest and highlighting potential functional variants (cis-eQTL).

thumbnail

Figure 3. Height association results.

In Europeans from the GIANT Consortium (A) [3] and in individuals of African ancestry (B) (this study) at the LCORL locus on chromosome 4. The GIANT Consortium originally reported SNP rs6449353, whereas rs7663818 was fine-mapped in the African height meta-analysis. For each panel, the light blue box corresponds to the chromosomal interval flanked by the leftmost and rightmost SNPs with a r2≥0.8 with rs7663818 in HapMap CEU (A) and YRI (B) participants: these intervals are 250 kb and 80 kb wide in CEU and YRI, respectively.

doi:10.1371/journal.pgen.1002298.g003
thumbnail

Table 3. Fine-mapping results for SNPs associated with height in individuals of European descent [3].

doi:10.1371/journal.pgen.1002298.t003

For 40 loci, the index SNPs from our fine-mapping list was nominally associated with height (P<0.05) in the African height meta-analysis, whereas the corresponding index European height SNPs was not. To test whether this result reflects an enrichment of surrogates for functional variants identified by fine-mapping, we designed an experiment using allelic gene expression phenotypes in the HapMap YRI cell lines as functional readouts. We hypothesized that if our trans-ethnic fine-mapping strategy was successful, a larger fraction of variants in the list of fine-mapped height SNPs should be associated with phenotypes (in this case gene expression) than of variants in the list of European index height SNPs. In other words, the list of SNPs from our fine-mapping experiment should contain more cis-eQTLs than the GIANT list of height SNPs in cell lines derived from Africans. We retrieved allelic expression mapping datasets from the HapMap YRI cell lines (Materials and Methods) and observed that 4.7% of the GIANT index height SNPs and 8.6% of the best candidate height SNPs obtained by trans-ethnic fine-mapping, were both nominally associated with height (P<0.05) in our meta-analysis and with allelic expression phenotypes (P<0.01). When we used simulations to assess the significance of these results, we found no simulated set with a cis-eQTL enrichment equal or above that observed in the data (P<0.01, obtained from 100 simulations (Text S1)). Therefore, fine-mapping European height loci in African-ancestry individuals generated a list of markers more likely to control gene expression, potentially improving mechanistic insights into the biology of height. Although we did not see an enrichment when compared to the list of GIANT index height SNPs, we also found that 17 missense SNPs are in strong LD (r2≥0.8 based on HapMap phase II YRI) with the fine-mapped height SNPs (Table S4).

In conclusion, our study shows the benefit of performing large-scale genetic studies in non-European populations to discover new biology (we identified two novel height loci), and to gain functional insights at the loci previously found in European-derived individuals (in this case, by enrichment of cis-eQTL signals). The strong replication of most of the European height loci in African-ancestry populations suggest that many of the published association signals with common variants from GWA studies – for height and perhaps other complex diseases and traits – are relevant across different populations and caused by shared genetic factors that predate the out-of-Africa event.

Materials and Methods

Ethics statement

All participants gave informed written consent. The project has been approved by the local ethics committees and/or institutional review boards.

Studies

Five discovery studies/consortia (AABC, AAPC, CARe, Maywood, and Nigeria) and five replication studies (GeneSTAR, HANDLS, Health ABC, WHI, and MEC) contributed height association results to this project. There were eight population-based cohorts (ARIC (N = 2,740), CARDIA (N = 699), JHS (N = 2119), MESA (N = 1,646), HANDLS (N = 993), HABC (N = 1,139), WHI (N = 8,149) and MEC (N = 11,569)), two family-based cohorts (CFS (N = 386) and GeneSTAR (N = 1,148)) two case-control studies (Maywood (obesity, N = 743) and Nigeria (hypertension, N = 1,188)), and two cancer consortia comprised of case-control studies that were population-based or nested within prospective cohorts (AABC (breast cancer, N = 5380), AAPC (prostate cancer, N = 5,526). All cohorts with genome-wide genotyping data available were genotyped on the Affymetrix 6.0 array, except AABC, AAPC, HANDLS, HABC and GeneSTAR, that were genotyped on the Illumina 1M-duo or 1Mv1_c chip. The studies, including genotyping and quality control steps, are described in detail in Text S1. The statistics (height and age) are summarized in Table S1. Genotype imputation was performed as previously described [20] and is summarized in Text S1.

Statistical analysis

Height measures were corrected for sex, age, disease status, and other appropriate covariates (e.g. recruitment centers), and were normalized into Z-scores (Text S1). Association analysis was performed using linear regression for studies of unrelated individuals and a linear mixed effect model for family-based studies, testing an additive model and including the 4–10 first principal components. Results were combined using the inverse variance meta-analysis method. Local ancestry was estimated using the HAPMIX software using default parameters [19]. Conditional analyses were performed by including SNP genotypes or local ancestry estimates in the linear models.

Replication of European height loci in African Americans

The list of European height loci from the largest study to date was used as a source of known European loci for fine-mapping [3]. The procedure is graphically summarized in Figure S2. Of the 180 SNPs from this list, 19 were filtered for lack of available LD data (we combined data from HapMap2 haplotype release 22 (Aug 2007), HapMap3 haplotype release 2 (Jul 2009), and HapMap2+3 LD data release 27 (Apr 2009); conflicting data, as is the case for these 19 SNPs, were excluded). LD estimates (r2) from CEU HapMap 2+3 were used to generate the set of common SNPs (proxies) tagging the remaining putative loci (r2≥0.8). These sets were then binned using YRI HapMap 2+3 LD as follows: the whole list of proxies was randomized, to remove any bias towards significance in the representative P-values; the first SNP was removed and set as an “index” SNP; then all SNPs not yet binned were filtered based on LD (r2≥0.3) with the index SNP. This procedure was repeated until all SNPs were binned. The metric for replication of a European signal was the number of SNP bins nominally significant (P≤0.05), and replication of the entire list of known SNPs was the number of significant bins across all loci. Each SNP bin was represented by the index SNP used to generate it. Because the SNPs are in LD with known European signals, there is a strong prediction as to which index SNP allele should be increasing height: it should be the allele in LD with the height-increasing allele in Europeans. Therefore, all index SNP P-values were made one-tailed (set to P/2 or 1-P/2) based on the hypothesis that the height-increasing allele should be the one predicted by the European SNP, based on the phased HapMap CEU data.

The LD thresholds used for proxy determination in European ancestry and binning in African ancestry were arbitrary and likely do not fully encompass the LD structure of the populations in this meta-analysis. To control for artifacts introduced by these thresholds and the HapMap data, 5,819 sets of 161 SNPs, matched to the European known loci on HapMap 2+3 CEU minor allele frequency, were generated. Since the European SNP list contains independent loci, each simulated list was designed to contain relatively independent SNPs (CEU r2≥0.2); changing this threshold did not alter the results. The same procedure of proxy generation and SNP binning (see Figure S2 for a graphical description of the binning strategy) was performed on each of the 5,819 sets to generate a null distribution of significant bins.

To generate the list of “best” SNP for each locus (fine-mapped list), the binning procedure was repeated for the known SNPs, except each iteration selected an index SNP from the list of remaining SNPs, sorted on P-value, not randomized. Note that the best SNPs at each locus are not perfectly concordant between Table 1 and Table 3 because our fine-mapping approach did not consider the in silico replication data and required that the SNPs are available in the HapMap phased haplotypes. We note that our fine-mapping approach focuses on SNPs with low P-values and is thus more likely to identify markers with fewer missing genotypes, that is markers for which we have more statistical power.

Analysis of cis-acting eQTLs

To assess whether European SNPs replicated for height (at nominal P<0.05) in African-ancestry populations would also be more likely to show links to functional variation in samples of African ancestry, we applied a sensitive technique for mapping cis-regulatory allelic expression SNPs [21] in lymphoblastoid cell lines (LCLs) derived from 56 unrelated Yoruba HapMap participants. A detailed description of the protocols and statistical methods used is available in the Text S1.

Supporting Information

Figure S1.

Manhattan plot of the height meta-analysis (3,310,998 SNPs in up to 20,809 participants from 9 studies). The dashed line highlights the genome-wide significance threshold used in this study (P<5×10−8). In the discovery phase of the project, SNPs at 4 loci reached genome-wide significance: LCORL on chromosome 4, PPARD on chromosome 6, SULF1 on chromosome 8, and ACAN on chromosome 15. The association between height and SNPs near SULF1 did not replicate. The 3 remaining loci – LCORL, PPARD, and ACAN – are loci previously associated with height in Europeans. Genomic-control P-values are displayed.

doi:10.1371/journal.pgen.1002298.s001

(TIF)

Figure S2.

On the left, an example analysis for the European SNP rs12470505 (CCDC108). Top: rs12470505 (square) and proxies (circles; r2≥0.8 in HapMap2+3 CEU), plotted with their P-values in the GIANT European analysis. Bottom: the same SNPs, plotted with African-American meta-analysis P-values, converted to one-tailed P-values based on predicted direction of effect from the European result and phased HapMap2 CEU data. Colors segregate SNPs into 6 randomly seeded “independent” clusters (r2≥0.3) using HapMap2+3 YRI linkage disequilibrium estimates. Right: simulation results for the fine-mapping analysis. Simulations were matched to the European SNP list by minor allele frequency; SNPs in each simulation were independent of each other at r2≥0.2 in HapMap2+3 CEU. The result for each simulation is significant bins/total bins. Red line indicates observed proportion of significant bins for true European SNP replication (P = 8.6×10−6).

doi:10.1371/journal.pgen.1002298.s002

(TIF)

Table S1.

Baseline characteristics of cohorts involved in the study.

doi:10.1371/journal.pgen.1002298.s003

(DOC)

Table S2.

Association results for the top 153 SNPs in the discovery meta-analysis. Positions are on NCBI build 36.1 (hg18) and the alleles are on the forward strand. Beta (effect size) and SE (standard error) are in standardized ‘Z-score’ units.

doi:10.1371/journal.pgen.1002298.s004

(DOC)

Table S3.

Fine-mapping results for SNPs associated with height in Caucasians [49]. We could not fine-map 19 of the 180 SNPs reported by the GIANT Consortium because they were not available in the HapMap phased datasets. In the left-handed side of the table, we present the association results in the African-American height meta-analysis for SNPs associated with height in Caucasians. In the right-handed side of the table, we present results from our fine-mapping experiment using data from our African-American height meta-analysis. For intergenic SNPs, we provide the closest gene and the physical distance between them.

doi:10.1371/journal.pgen.1002298.s005

(DOC)

Text S1.

Supplementary information.

doi:10.1371/journal.pgen.1002298.s006

(DOC)

Acknowledgments

The authors wish to acknowledge the contributions of the research institutions, study investigators, field staff, and study participants.

Author Contributions

Conceived and designed the experiments: JN Hirschhorn, G Lettre, CA Haiman. Performed the experiments: A N'Diaye, GK Chen, CD Palmer, B Ge, V Adoue, AB Singleton, T Pastinen, JN Hirschhorn, G Lettre, CA Haiman. Analyzed the data: A N'Diaye, GK Chen, CD Palmer, B Ge, V Adoue, AB Singleton, T Pastinen, JN Hirschhorn, G Lettre, CA Haiman. Contributed reagents/materials/analysis tools: A N'Diaye, GK Chen, CD Palmer, B Ge, B Tayo, CB Ambrosone, RA Mathias, J Ding, MA Nalls, A Adeyemo, V Adoue, L Atwood, EV Bandera, LC Becker, SI Berndt, L Bernstein, WJ Blot, E Boerwinkle, A Britton, G Casey, SJ Chanock, E Demerath, SL Deming, WR Diver, C Fox, TB Harris, DG Hernandez, JJ Hu, SA Ingles, EM John, C Johnson, B Keating, RA Kittles, LN Kolonel, SB Kritchevsky, L Le Marchand, K Lohman, J Liu, RC Millikan, A Murphy, S Musani, C Neslund-Dudas, KE North, S Nyante, A Ogunniyi, EA Ostrander, G Papanicolaou, S Patel, CA Pettaway, MF Press, S Redline, JL Rodriguez-Gil, C Rotimi, BA Rybicki, B Salako, PJ Schreiner, LB Signorello, AB Singleton, JL Stanford, AH Stram, DO Stram, SS Strom, B Suktitipat, MJ Thun, JS Witte, LR Yanek, RG Zieger, W Zheng, X Zhu, JM Zmuda, AB Zonderman, MK Evans, Y Liu, DM Becker, RS Cooper, T Pastinen, BE Henderson, JN Hirschhorn, G Lettre, CA Haiman. Wrote the paper: A N'Diaye, GK Chen, CD Palmer, T Pastinen, JN Hirschhorn, G Lettre, CA Haiman.

References

  1. 1. Perola M, Sammalisto S, Hiekkalinna T, Martin NG, Visscher PM, et al. (2007) Combined genome scans for body stature in 6,602 European twins: evidence for common Caucasian loci. PLoS Genet 3: e97. doi:10.1371/journal.pgen.0030097.
  2. 2. Visscher PM, Medland SE, Ferreira MA, Morley KI, Zhu G, et al. (2006) Assumption-Free Estimation of Heritability from Genome-Wide Identity-by-Descent Sharing between Full Siblings. PLoS Genet 2: e41. doi:10.1371/journal.pgen.0020041.
  3. 3. Lango Allen H, Estrada K, Lettre G, Berndt SI, Weedon MN, et al. (2010) Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467: 832–838.
  4. 4. Fisher RA (1918) The Correlation Between Relatives on the Supposition of Mendelian Inheritance. Transactions of the Royal Society of Edinburgh 52: 399–433.
  5. 5. Cho YS, Go MJ, Kim YJ, Heo JY, Oh JH, et al. (2009) A large-scale genome-wide association study of Asian populations uncovers genetic factors influencing eight quantitative traits. Nat Genet 41: 527–534.
  6. 6. Okada Y, Kamatani Y, Takahashi A, Matsuda K, Hosono N, et al. (2010) A genome-wide association study in 19 633 Japanese subjects identified LHX3-QSOX2 and IGF1 as adult height loci. Hum Mol Genet 19: 2303–2312.
  7. 7. Kim JJ, Lee HI, Park T, Kim K, Lee JE, et al. (2010) Identification of 15 loci influencing height in a Korean population. J Hum Genet 55: 27–31.
  8. 8. Kang SJ, Chiang CW, Palmer CD, Tayo BO, Lettre G, et al. (2010) Genome-wide association of anthropometric traits in African- and African-derived populations. Hum Mol Genet 19: 2725–2738.
  9. 9. Shriner D, Adeyemo A, Gerry NP, Herbert A, Chen G, et al. (2009) Transferability and fine-mapping of genome-wide associated loci for adult height across human populations. PLoS ONE 4: e8398. doi:10.1371/journal.pone.0008398.
  10. 10. Lanktree MB, Guo Y, Murtaza M, Glessner JT, Bailey SD, et al. (2011) Meta-analysis of Dense Genecentric Association Studies Reveals Common and Uncommon Variants Associated with Height. Am J Hum Genet 88: 6–18.
  11. 11. Yang J, Weedon MN, Purcell S, Lettre G, Estrada K, et al. (2011) Genomic inflation factors under polygenic inheritance. Eur J Hum Genet.
  12. 12. Casarin A, Rusalen F, Doimo M, Trevisson E, Carraro S, et al. (2009) X-linked brachytelephalangic chondrodysplasia punctata: a simple trait that is not so simple. Am J Med Genet A 149A: 2464–2468.
  13. 13. Weedon MN, Lettre G, Freathy RM, Lindgren CM, Voight BF, et al. (2007) A common variant of HMGA2 is associated with adult and childhood height in the general population. Nat Genet 39: 1245–1250.
  14. 14. Sanna S, Jackson AU, Nagaraja R, Willer CJ, Chen WM, et al. (2008) Common variants in the GDF5-UQCC region are associated with variation in human height. Nat Genet 40: 198–203.
  15. 15. Price AL, Patterson N, Yu F, Cox DR, Waliszewska A, et al. (2007) A genomewide admixture map for Latino populations. Am J Hum Genet 80: 1024–1036.
  16. 16. Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, et al. (2010) Integrating common and rare genetic variation in diverse human populations. Nature 467: 52–58.
  17. 17. Neptune ER, Frischmeyer PA, Arking DE, Myers L, Bunton TE, et al. (2003) Dysregulation of TGF-beta activation contributes to pathogenesis in Marfan syndrome. Nat Genet 33: 407–411.
  18. 18. Voight BF, Kudaravalli S, Wen X, Pritchard JK (2006) A map of recent positive selection in the human genome. PLoS Biol 4: e72. doi:10.1371/journal.pbio.0040072.
  19. 19. Price AL, Tandon A, Patterson N, Barnes KC, Rafaels N, et al. (2009) Sensitive detection of chromosomal segments of distinct ancestry in admixed populations. PLoS Genet 5: e1000519. doi:10.1371/journal.pgen.1000519.
  20. 20. Lettre G, Palmer CD, Young T, Ejebe KG, Allayee H, et al. (2011) Genome-Wide Association Study of Coronary Heart Disease and Its Risk Factors in 8,090 African Americans: The NHLBI CARe Project. PLoS Genet 7: e1001300. doi:10.1371/journal.pgen.1001300.
  21. 21. Ge B, Pokholok DK, Kwan T, Grundberg E, Morcos L, et al. (2009) Global patterns of cis variation in human cells revealed by high-density allelic expression analysis. Nat Genet 41: 1216–1222.
  22. 22. Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, et al. (2010) LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 26: 2336–2337.