A SNP upstream of the INSIG2 gene, rs7566605, was recently found to be associated with obesity as measured by body mass index (BMI) by Herbert and colleagues. The association between increased BMI and homozygosity for the minor allele was first observed in data from a genome-wide association scan of 86,604 SNPs in 923 related individuals from the Framingham Heart Study offspring cohort. The association was reproduced in four additional cohorts, but was not seen in a fifth cohort. To further assess the general reproducibility of this association, we genotyped rs7566605 in nine large cohorts from eight populations across multiple ethnicities (total n = 16,969). We tested this variant for association with BMI in each sample under a recessive model using family-based, population-based, and case-control designs. We observed a significant (p < 0.05) association in five cohorts but saw no association in three other cohorts. There was variability in the strength of association evidence across examination cycles in longitudinal data from unrelated individuals in the Framingham Heart Study Offspring cohort. A combined analysis revealed significant independent validation of this association in both unrelated (p = 0.046) and family-based (p = 0.004) samples. The estimated risk conferred by this allele is small, and could easily be masked by small sample size, population stratification, or other confounders. These validation studies suggest that the original association is less likely to be spurious, but the failure to observe an association in every data set suggests that the effect of SNP rs7566605 on BMI may be heterogeneous across population samples.
Obesity is an epidemic in the United States of America and developing world, portending an epidemic of related diseases such as diabetes and heart disease. While diet and lifestyle contribute to obesity, half of the population variation in body mass index, a common measure of obesity, is determined by inherited factors. Many studies have reported that common sequence variants in genes are associated with an increased risk for obesity, yet most of these are not reproducible in other study cohorts, suggesting that some are false. Recently, Herbert et al. reported a slightly increased risk of obesity for people carrying two copies of the minor allele at a common variant near INSIG2. We present our attempts to further evaluate this potential association with obesity in additional populations. We find evidence of increased risk of obesity for people carrying two copies of the minor allele in five out of nine cohorts tested, using both family- and population-based testing. We indicate possible reasons for the varied results, with the hope of encouraging a combined analysis across study cohorts to more precisely define the effect of this INSIG2 gene variant.
Citation: Lyon HN, Emilsson V, Hinney A, Heid IM, Lasky-Su J, et al. (2007) The Association of a SNP Upstream of INSIG2 with Body Mass Index is Reproduced in Several but Not All Cohorts. PLoS Genet 3(4): e61. doi:10.1371/journal.pgen.0030061
Editor: Gonçalo Abecasis, University of Michigan, United States of America
Received: December 5, 2006; Accepted: March 6, 2007; Published: April 27, 2007
This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
Funding: Essen: The study was funded by the German National Genome Research Network 2 (NGFN2) and European Union (FP6 LSHMCT-2003–503041). KORA: The Kooperative Gesundheitsforschung in der Region Augsburg (KORA; Cooperative Research in the Region of Augsburg) research platform was initiated and financed by the GSF-National Research Centre for Environment and Health, which is funded by the German Federal Ministry of Education and Research and of the State of Bavaria. H-E Wichmann, H. Löwel, C Meisinger, T Illig, R Holle, J John, and their coworkers are responsible for the design and conduct of the KORA studies. This work was supported by the Munich Center of Health Sciences of the Ludwig-Maximilians-University. This genetic association study was funded by the German Federal Ministry of Education and Research (BMBF) in the context of the German National Genome Research Network (NGFN) with grants to HEW and to TM (01GR0103). FHS: This work was supported by the National Heart, Lung and Blood Institute's Framingham Heart Study (Contract Number N01-HC-25195). Costa Rica: The Costa Rican study was funded by grants HL066289 and HL04370 from the National Institutes of Health. Scandinavia: LG, TT, and the Botnia Study are principally supported by the Sigrid Juselius Foundation, the Academy of Finland, the Finnish Diabetes Research Foundation, the Folkhalsan Research Foundation, the Swedish Medical Research Council, and the Novo Nordisk Foundation. Maywood: This work was supported by National Institutes of Health grants R01 HL54485 and R01 HL074166 to RSC and XZ. Genotyping was funded by the Richard and Susan Smith Family/American Diabetes Association Pinnacle Program Project awarded to JNH and colleagues. HNL is supported by the National Intitute of Diabetes and Digestive and Kidney Diseases grant number K23 DK067288.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: BMI, body mass index; CI, confidence interval; FHS, Framingham Heart Study; NHS, Nurses Health Study; OR, odds ratio; SNP, single nucleotide polymorphism
Body mass index (BMI) is a heritable measure of obesity that is routinely obtained in large cohorts, is correlated with other measures of obesity, and predicts morbidity and mortality from obesity-related diseases [1–4]. Thus, BMI is a readily accessible trait that can be used to screen for genetic variants that increase an individual's risk for obesity and its complications. There have been more than one hundred publications reporting association between common genetic variants and BMI, but few of the associations have been reproducible in multiple populations . Genotyping of variants has increased exponentially in scale over the past few years, and much more comprehensive screens of common genetic variation for association with obesity are now possible. The poor rate of reproducible findings in association studies in general and obesity in particular are likely due to a combination of false-positive results, underpowered attempts to reproduce associations with modest effects, systematic bias due to technical artifacts or population stratification, and perhaps true heterogeneity in effect across populations due to differences in genetic or environmental modifiers [6,7]. Thus, new reports of association require rapid, well-powered studies to validate true associations or identify false positives that could otherwise trigger unwarranted investigation of spurious findings.
Recently, Herbert and colleagues, including several of the authors of this study, reported a novel association between homozygosity for the minor allele of a single nucleotide polymorphism (SNP), rs7566605, and increased BMI . The SNP has no known function, and the closest gene codes for the insulin signaling protein type 2 (INSIG2), a hijacking protein in the endoplasmic reticulum that, in response to changes in lipid levels, impedes the movement of sterol regulatory element binding proteins to the Golgi apparatus for processing and ultimately its release to act as a nuclear transcription factor and regulator of lipid biosynthesis [9–11]. Animal data suggests a role for INSIG2 in increasing triglyceride level in rats , as well as linkage to obesity phenotypes in mice .
The association of SNP rs7566605 with obesity was initially found in a set of related individuals from the Framingham Heart Study (FHS) offspring cohort . The SNP was genotyped in five additional cohorts, and the association was observed again in four of these, including population-based studies, case-control samples, and family-based cohorts. However, no significant association was found in a fifth cohort (the Nurses Health Study [NHS]), where a slight trend in the opposite direction was seen. Approximately 10% of individuals were homozygous for the minor allele (C/C), and in a meta-analysis of the case-control samples (including the NHS cohort and excluding the FHS discovery cohort), these individuals had a 22% increased risk of obesity (defined as BMI ≥ 30 kg/m2). In the NHS cohort alone, the 95% confidence interval (CI) for the odds ratio (OR) for obesity was 0.58–1.13. Subsequently, two further groups reported no evidence of association in large cohorts, and a third found association only for people on the overweight end of their population [14–17].
We considered several possible explanations for observing an association in four cohorts but not in the fifth. The failure to observe association in the NHS sample could be due to more modest effects in this cohort and therefore inadequate sample size, population stratification, ascertainment bias, other unmeasured confounders, or any combination of these. It is also possible that evidence in the four cohorts was falsely positive, for any of a combination of reasons that could include hidden population substructure, technical artifacts, or statistical fluctuations causing false positives. However, because of the consistency across multiple cohorts, including studies with family-based design, we felt that these explanations were less likely. Finally, it is also possible that the association is heterogeneous across populations, either due to differences in ascertainment, or differences in genetic or environmental modifiers. Of these possibilities, it is most critical to assess first whether the original associations were spurious, so as to avoid further efforts expended on a false finding. Our primary objective was therefore to test additional large populations to evaluate further the validity and generalizability of this association. By studying these additional populations, including a sample with longitudinal data, we hoped to better assess the strength and consistency of the association between increased BMI and the risk genotype at rs7566605, and perhaps generate hypotheses about any inconsistencies in this association.
Descriptions of the cohorts used in this study are presented in Table 1, Table S1, and in the Methods. These nine cohorts are drawn from eight different populations and include a total of almost 17,000 individuals. The cohorts were not ascertained for BMI, except for the Essen study cohort, which was selected from the upper (BMI ≥ 30 kg/m2) and lower (BMI < 20 kg/m2) ends of the BMI distribution of their population and a portion of the African-American sample that was enriched for obese individuals. We tested for association with obese (BMI ≥ 30 kg/m2) versus non-obese (BMI <30 kg/m2) and also with BMI as a continuous trait, to mimic the association tests performed in the initial publication. All analyses were performed under a recessive model, with the prior hypothesis that C/C homozygotes would have a higher BMI than individuals in the other two genotype classes.
Eight Populations (n = 16,969) Used in Association Testing of rs7566605 and Obesity/BMIdoi:10.1371/journal.pgen.0030061.t001
The frequency of C/C homozygotes was increased in obese individuals compared to non-obese control individuals in several cohorts (Table 2). Nominally significant (two-tailed p < 0.05) associations between obesity (BMI ≥ 30 kg/m2) and the C/C were present in three samples: the Iceland cohort (OR = 1.29, 95% CI = 1.06–1.57, p = 0.0064), the Essen cohort (OR = 1.75, 95% CI = 1.15–2.68, p = 0.008), and in one of six exam cycles within the longitudinal data from the FHS cohorts (Table 2). In the Iceland cohort, the homozygote C/C genotype was associated with a 0.69 kg/m2 increment in BMI, which is in good agreement with the effect observed by Herbert et al. .
Association Studies of rs7566605 C/C Genotype and Obesity (BMI ≥ 30) and BMI as a Continuous Trait in Each of the Individual Unrelated Samplesdoi:10.1371/journal.pgen.0030061.t002
The KORA S3, Maywood, and Scandinavian cohorts, and five of six exam cycles in the FHS cohort, did not show nominally significant associations under a recessive model. The Scandinavian, FHS, and Maywood samples may have been too small to achieve statistical significance with an association of the magnitude estimated by Herbert et al. (OR = 1.22). The Scandinavian cohort had an estimated OR (1.25) similar to the original report, but a p value of 0.46 and a wide 95% CI around the estimated OR (0.69–2.24). In particular, this cohort had only 120 people with BMI > 30 kg/m2, and the power to achieve nominal significance for an OR of 1.22 (as estimated in the original report) is only 15%. The estimated OR in the Maywood cohort was 0.88 but the CIs were also wide (p = 0.68, 95% CI = 0.49–1.59), which suggests that the sample was also underpowered to find this modest association and/or that the effect in this sample is smaller than in the original report.
The KORA S3 sample was much larger (851 obese and 3,233 non-obese), but had an OR of 0.90, with a 95% CI of 0.71–1.16, suggesting that the association is either more modest or absent in this cohort, limited to a particular subgroup of this population (see Discussion), and/or that when several samples are tested, some statistical fluctuation either away from or toward the null is expected. Association tests in the FHS cohort between the C/C genotype and obesity showed some apparent variability, achieving significance in some but not all of the six exams, with p values ranging from 0.003–0.51(Table 2); correcting the best p value for having tested six exams suggests that the totality of these findings are consistent with a replication (corrected p value = 0.018). There was no formal evidence of heterogeneity across the six exams (p = 0.47), and the 95% CIs for all exams include an OR of 1.22 (Table 2).
We also analyzed the five population-based samples—Maywood, Iceland, KORA S3, Scandinavia, and FHS (see Methods for details)—for association with BMI as a continuous trait, again under a recessive model controlling for age and gender. We saw similar results to those observed for the dichotomous analysis, with nominally significant associations between C/C homozygotes and increased BMI observed in the Iceland and FHS cohorts but not in KORA S3, Maywood, or Scandinavia (Table 2). When we analyzed association with BMI at each exam cycle from FHS separately, there was no significant evidence of association in a recessive model. The effect estimates trended in the same direction (exam 3, two-tailed p value = 0.096) (Table 2) as did estimates in the analysis using z-scores for BMI (see Methods) and mean z-score over six exams (unpublished data).
Finally, we tested SNP rs7566605 for association with increased BMI in three family-based samples, using PBAT . Two of the three cohorts showed an association between SNP rs7566605 and BMI as a continuous trait under a recessive model (Table 3). (A dichotomous analysis was not done in these cohorts, because the definition of obesity we used for the remainder of the samples [BMI > 30 kg/m2] was not applicable to the children that made up a substantial part of each cohort.) The family-based portion of the Scandinavian cohort was composed of adults, but the incidence of obesity was only 13% (n = 66), limiting the power of a dichotomous analysis. Because BMI changes rapidly during childhood, we compared the results for the pediatric cohorts using three different measured outcomes: BMI, BMI adjusted for age and gender, and BMI-for-age percentile (Centers for Disease Control and Prevention 2000 National Center of Health Statistics); the p values for the corresponding FBAT statistics were essentially identical in each cohort (unpublished data).
Association Studies of rs7566605 C/C Genotype Body Mass Index in Family Cohortsdoi:10.1371/journal.pgen.0030061.t003
To estimate the overall significance and effect size in the samples we studied, we performed a pooled analysis for both the unrelated and family-based cohorts. These combined analyses, which included both cohorts that showed association and those that did not, yielded independent, statistically significant associations for both the unrelated samples (Table 4) and the family-based samples (Table 3). Combining the p values of the family-based studies using Fisher's method provided evidence of replication (Fisher's combined p = 0.004; Table 3). For the unrelated samples (Table 4), we compared obese and non-obese people, and performed a combined analysis using each exam cycle of the FHS cohort in turn. Since the Essen cohort was ascertained as a severe obesity cohort with non-age matched controls, we tested for heterogeneity between studies using a modified Breslow-Day test [19,20]. There was evidence for heterogeneity when including the Essen cohort (p values for homogeneity = 0.007–0.08) so this cohort was excluded from the combined analyses. Mantel-Haenszel two-tailed p values ranged from 0.011 using FHS exam 3 to 0.054 using FHS exam 6 (Table 4). In these combined analyses, the estimated OR for obesity (BMI > 30 kg/m2) associated with the C/C homozygous genotype ranged from 1.13 to 1.18, somewhat lower than the effect size estimated by the original report . There was also modest evidence of heterogeneity; p values for homogeneity ranged from 0.03 to 0.20, depending on which exam from FHS was included in the combined analysis (Table 4), suggesting that there might be some real variability in effect size across the samples in this study.
Association testing in these nine cohorts shows further evidence that individuals homozygous for the C/C genotype at SNP rs7566605 have a higher BMI and a higher risk of obesity. The association is detectable in diverse cohorts, in children as well as in adults, and in both family-based and population-based samples. The association is not likely to be due to stratification because it was seen in family-based samples such as Costa Rica and CAMP, which are immune to stratification, and because the original publication also described associations in family-based testing .
The effect of ascertainment on these analyses could potentially provide confounding of the association in four of these studies. Because index children in family-based studies in CAMP and Costa Rica were ascertained on the basis of asthma, a spurious association between SNP rs7566605 and BMI could be found if the SNP of interest was directly associated with asthma. However, none of the other cohorts were ascertained in this manner, lessening concerns about this source of bias as a potential cause of false-positive associations. In addition, the Scandinavian sample was ascertained as control subjects for a diabetes case control study (similar to the NHS in the original report). A further bias could potentially have been introduced by the selection of non-obese people in the Essen cohort who have a younger mean age than the obese people from this cohort (Table 1). The lean controls (mean BMI = 18.2 kg/m2) are less likely to be obese later in life, but a small portion of them could be misclassified as non-obese, which would tend to bias the estimate toward the null. Of note, the combined analysis remains significant even if we include this study (unpublished data).
The longitudinal nature of the FHS data may provide a clue to a possible cause for inconsistency in the association between SNP rs7566605 and obesity. In this cohort, a stronger effect on BMI was seen in the data from the first three exams than in the last three exams. The individuals at each exam are largely overlapping, making confounders less likely to explain a positive association in the early exam data and a lack of association in later data. Assuming that the association in this cohort is not a false positive due to statistical fluctuation, then the passage of time is the most likely explanation for the diminution of the association in this cohort. The decreasing evidence of association in theory could be due to an interaction with age, namely decreasing effect size with increasing age. Alternatively, a change in the environment could have diminished the strength of the association over time; this would be, in theory, consistent with a well documented “secular trend” of increased obesity over the relevant time period [21,22]. A preliminary and post hoc examination of the FHS data suggests that age may play an important role in modifying the strength of the association (unpublished data). This hypothesis would also be consistent with stronger effects in controls matched for early-onset disease (such as asthma) than in controls matched for later-onset diseases (such as diabetes). Finally, an additional post hoc analysis of the KORA S3 data suggests a stronger association in the most severely obese individuals (OR for BMI ≥ 38 kg/m2 was 1.78, 95% CI, = 0.99–3.21, p = 0.054), who perhaps became obese at an earlier age. Although these hypotheses are speculative at this time, they and other possibilities could and should be tested by a formal meta-analysis of our data, recent studies showing no association [14–16], and additional data that are likely to emerge. We (I.M.H. and colleagues) are in the process of organizing a meta-analysis to reexamine the INSIG2 association in light of these hypotheses to better understand the relationship of this gene to obesity in the population.
In summary, the association of SNP rs7566605 with higher BMI is found in diverse populations. The number of studies in which a nominal association has been observed (five out of the nine cohorts reported here) appears more frequently than expected by chance. However, a more precise assessment of this apparent excess of associations will depend on the availability of a complete set of studies of this polymorphism. Large sample sizes were required to observe the association, but even some large samples have not demonstrated an association with this allele, possibly due to modification by age or other issues related to ascertainment. A combined analysis of both positive and negative studies presented here suggests that the association is valid but also suggests the possibility of heterogeneity across populations. Additional data, both positive and negative, ideally from large samples with good information regarding potential confounders and in a format suitable for meta-analysis, would be required to confirm the existence of heterogeneity and to further refine the estimate of the effect of this SNP on BMI in different populations. However, the evidence to date suggests that this variant has a detectable influence on BMI in a diverse range of populations.
Materials and Methods
DNA samples were obtained from a large group of 5,187 Icelanders. The study group was composed of individuals who participated in studies of the genetic etiology of cardiovascular and metabolic diseases and the majority of these subjects were recruited as unaffected relatives of probands or as controls and did not have any history of metabolic or cardiovascular diseases. All participants in the study signed informed consent. All personal identifiers associated with tissue samples, clinical information, and genealogy were encrypted by the Icelandic Data Protection Authority, using a third-party encryption system in which the Data Protection Authority maintains the code . Association testing was done according to that of the KORA S4 study design described in Herbert et al . OR of genotype G1 (C/C) compared to genotype G0 (G/C + G/G) was calculated by [n(G1)/m(G1)]/[n(G0)/m(G0)], where n and m denote genotype counts in obese and non-obese individuals, respectively. The genotyping procedure has been previously described . Genotype call rate was 97.3%. p value and CI were adjusted for relatedness of the individuals using simulations as previously described . In each simulation, genotypes for the SNP are simulated through the Icelandic genealogy and the association test repeated treating those genotypes as real genotypes. By repeating this procedure 50,000 times we get the standard deviation of log(OR) under the null hypothesis of no association, which is used to calculate both the p value and the CI. We regressed the log transformed values for BMI on C/C carrier status by adjusting for age and sex in the multiple regressions as shown in Table 2.
KORA S3 cohort.
In the Southern German region of Augsburg, which includes the city of Augsburg and the two surrounding counties, population-based surveys of the 25–74-y-old population were implemented in 1984 as part of the World Health Organization's Multinational Monitoring of Trends and Determinants in Cardiovascular Disease [MONICA]) project and continued since 1996 within the German Kooperative Gesundheitsforschung in the Region Augsburg (KORA) platform. The third survey, KORA S3, which was the study used in our analysis, was conducted in 1994–1995. Subjects (4,856) were recruited via registry according to the same protocol as the fourth survey (KORA S4) performed in 1999–2001, which was part of the initial replication samples in Herbert et al. The KORA surveys were described previously [22,26]. Genotyping was performed using a MALDI-TOF mass spectometry system (MassEXTEND; Sequenom, http://www.sequenom.com) and the call rate was 99.3%.
DNA samples were obtained from 1,515 unrelated people from the offspring generation of the FHS . We considered the possibility of overlap between the “unrelated plate” of the offspring cohort used here and with the family-based panel, approximately half of which was used in the analysis in the Herbert et al. report. There were 283 people who overlap between the “unrelated plate” and the full family-based panel, so these 283 people were excluded from the analyses reported here. The samples were genotyped using allele-specific primer extension of amplified products with detection by MALDI-TOF mass spectroscopy using a Sequenom platform as previously described [28–30]. Genotype call rate was 99.1% with no discordancies among replicate samples. Association testing was done with linear regression using BMI log transformed and adjusted for age and gender at all six exams.
DNA samples were obtained from 874 unrelated people, self-described as African-Americans, from the same cohort as was described in the original association report . Unrelated people were selected from this population for genotyping. In 270 families, the most obese sibling was chosen to enrich the sample for obese people in the case-control comparison. These were not included in the quantitative trait analysis as described below in Statistical Analysis. Samples were genotyped as previously described [8,29]. Genotype call rate was 97.9% with no discordancies among replicate samples. Association testing was done with linear regression modeling of using log BMI corrected for age and gender with genotype in a recessive and additive model.
DNA samples were obtained from 1,381 adults from Marburg, of which 990 were obese cases (BMI ≥ 30 kg/m2; mean BMI 36.02 ± 5.38 kg/m2) and 391 were lean controls (BMI ≤ 20 kg/m2, mean BMI 18.17 ± 1.00 kg/m2 . Genotyping was carried out by PCR-RFLP with Bsp143I (digests the C-allele) (primers: 5′-TGAAGTTGATCTAATGTTCTCTCTCC-3′ and 5′-AAACCAAGGGAATCGAGAGC-3′). Association analysis under the recessive model, by χ2 testing.
Nuclear families (415) of children with asthma in the Central Valley of Costa-Rica, a relative genetic isolate of predominantly Spanish and Amerindian ancestry [32,33]. Children and their families were enrolled as described previously  and anthropometric measurements of all probands included weight and height. However, this population was not ascertained based on morphometric phenotypes. Genotyping was performed using the Illumina BeadStation 500G system (http://www.illumina.com). Genotyping completion rate was >99.8% with no discordances among replicate genotypes. Of the 415 families with genotypic data, 408 had complete phenotypic data and were included in the analysis.
Childhood Asthma Management Program.
The Childhood Asthma Management Program (CAMP) is a multicentered North American clinical trial designed to investigate the long-term effects of inhaled antiinflammatory medications in children with mild to moderate asthma . Children ages 5 through 12 were eligible for inclusion in the study if they had a diagnosis of asthma and no other clinically significant conditions. Height and weight measurements were collected on these children during the prerandomization period. Of the 1,041 children originally enrolled, 968 children and 1,518 parents contributed DNA samples for genetic studies. Complete nuclear families (408) of self-described non-Hispanic white race with baseline BMI measurements are included here. Genotyping was performed using the Sequenom genotyping platform.
The unrelated sample consisted of individuals from the Botnia Study chosen as control subjects from two cohorts to study diabetes. The first group were controls from a Scandinavian sample of 471 case-control pairs individually matched for gender, age, BMI, and geographic region in Sweden and Finland. The second group were from a Swedish sample of 514 case-control pairs who were individually matched for gender, age and BMI. Subjects were characterized as unaffected for diabetes by glucose tolerance testing as previously described . The family cohort was comprised of 512 unaffected siblings from a Scandinavian sample of 1,189 siblings with and without diabetes, as previously described [36,37]. The samples were genotyped using by an allele-specific primer extension of amplified products with detection by MALDI-TOF mass spectroscopy using a Sequenom platform as previously described [28,29]. Genotype call rate was 96.5% with one Mendel error in one family and no discordancies among replicate samples.
The genotype data in each population was tested for deviation from Hardy-Weinberg and found to be consistent (p value > 0.01). Tests for association of rs7566605 with obesity were performed for the five population-based cohorts under a recessive model, classifying non-obese people as BMI < 30 kg/m2 and obese as BMI ≥ 30 kg/m2. Significance was assessed using a χ2 test with one degree of freedom and two-tailed p values were reported. The Mantel-Haenszel method was used for the combined analysis, and testing for heterogeneity was performed using the Breslow-Day test, as described previously [7,19,20].
For the four samples that had population-based components, an association analysis was performed using BMI as a continuous trait, adjusting for age and gender. A second analysis of the FHS cohort was done to make use of longitudinal data collected across six exams, approximately 4 y apart spanning 26 years from 1971–1997. For each exam, Z scores were calculated by the following process: within each decade of life and gender, log BMI was regressed against age. A Z score was then calculated for these age-adjusted BMIs based on the mean and standard deviation within each decade and gender for each exam. These were then analyzed using standard regression methods (implemented in SAS) for each exam individually, and also for the mean of all available Z scores across the six exams. For the KORA S3, Maywood, and Scandinavian cohort analyses we used standard linear regression with log transformed BMI and adjusted for age and gender. The linear regression analysis in the Maywood cohort excluded 270 people, who had been selected as the most obese person in their family, to avoid possible bias. The Iceland analysis was done with log transformed BMI as a continuous trait under a recessive model, adjusting for age and sex in the multiple regression (sex + age + sex × age).
Association testing of rs7566605 in the family-based cohorts was performed using the FBAT-approach as implemented in PBAT [18,38], with BMI as a quantitative (continuous) trait adjusted for age and gender by Z score under a recessive model. For the Costa Rica and CAMP populations, tests were also done for the outcome BMI adjusted for age and gender, and BMI-for-age percentile (Centers for Disease Control and Prevention 2000 National Center of Health Statistics). Because these studies were similarly sized, a combined analysis was performed using Fisher's method for combining p values, in which twice the negative sum of the natural log of k one-tailed p values is distributed as a χ2 distribution with 2k degrees of freedom . In this method, a one-tailed p value for an effect in the opposite direction is first corrected by subtracting the p value from one; as all the effects in our studies were in the same direction, this correction was not necessary.
Table S1. Six Populations Divided into Non-obese (BMI<30 kg/m2) and Obese (BMI≥30 kg/m2) with Mean Age in years.
(66 KB DOC)
The National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov) accession numbers for the gene and gene product discussed in this paper are INSIG2 (NM_016133) and INSIG2 (NP_057217).
Institutional Review Board approval was obtained for all cohorts through their institutions. Iceland: Ethical approval for the present study was granted by the National Bioethics Committee (NBC ref. no. 01–033) and the Icelandic Data Protection Authority. Essen: This study was approved by the Ethics Committees of the Universities of Duisburg-Essen and Marburg, Germany. All individuals (in case of minors their parents) who participated in the study gave informed consent. KORA: All study participants gave informed written consent according to the ethics committee of the Bavarian Medical Association and every attempt was made to ensure anonymity of the participants. FHS: From the Framingham Heart Study of the National Heart Lung and Blood Institute of the National Institutes of Health and Boston University School of Medicine. Costa Rica: The Costa Rican study was approved by the Institutional Review Boards of Brigham and Women's Hospital and the Hospital Nacional de Niños in San José (Costa Rica). CAMP: The Institutional Review Board of the Brigham and Women's Hospital, as well as those of the other CAMP study centers, approved this study. Informed assent and consent were obtained from the study participants and their parents to collect DNA for genetic studies. Scandinavia: Approval for the Botnia Study was obtained from the Ethical Committee of the Helsinki University Central Hospital. Boston. TT is a Research Fellow at the Academy of Finland. Maywood: Study approval was obtained from the Institutional Review Boards of Children's Hospital Boston and Loyola University Medical Center.
We thank the members of the Altshuler, Hirschhorn, Daly, and labs for helpful discussions. We gratefully thank the participants of the studied cohorts for the contribution of their DNA samples.
JCC, HEW, JH, KS, CL, and JNH contributed equally to this paper.
This work represents a broad collaboration among many groups. HNL and JNH produced the original draft, HNL, CL, and JNH performed meta-analyses, JLB conducted genotyping and data management. VE, GT, and AK performed analysis of data from Iceland cohort; the cohort is directed by UT, JG, and KS. SG, GBW, and UT conducted genotyping of the Icelandic cohort. IMH performed analysis of data from the KORA cohort. The KORA group consists of HEW, and his coworkers, who are responsible for the design and conduct of the KORA studies. The cohort is directed by HEW; CV conducted genotyping. AH and GB performed analysis of data from the Essen cohort; the cohort is directed by JH. TM supervised genotyping for the Essen and KORA cohorts. JL-S, BAR, NL, XD and CL performed analysis of data from the CAMP cohort. BAR conducted the CAMP genotyping; the cohort is directed by STW. JL-S and CL performed analysis of data from Costa Rica cohort; the cohort is directed by JCC. HNL and JNH performed analysis of data from the FHS cohort, with collaboration from CSF and CJO. HNL and JNH performed analysis of data from the Maywood cohort; the cohort is directed by XZ and RSC. HNL and JNH performed analysis of data from the Scandanavia cohort; the cohort is directed by LG. All authors provided text and interpretation.
- 1. Barsh GS, Farooqi IS, O'Rahilly S (2000) Genetics of body-weight regulation. Nature 404: 644–651.
- 2. Kopelman PG (2000) Obesity as a medical problem. Nature 404: 635–643.
- 3. Allison DB, Faith MS, Nathan JS (1996) Risch's lambda values for human obesity [comment]. Int J Obes Relat Metab Disord 20: 990–999.
- 4. Allison DB, Kaprio J, Korkeila M, Koskenvuo M, Neale MC, et al. (1996) The heritability of body mass index among an international sample of monozygotic twins reared apart. Int J Obes Relat Metab Disord 20: 501–506.
- 5. Rankinen T, Zuberi A, Chagnon YC, Weisnagel SJ, Argyropoulos G, et al. (2006) The human obesity gene map: The 2005 update. Obesity (Silver Spring) 14: 529–644.
- 6. Clayton DG, Walker NM, Smyth DJ, Pask R, Cooper JD, et al. (2005) Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet 37: 1243–1246.
- 7. Lohmueller KE, Pearce CL, Pike M, Lander ES, Hirschhorn JN (2003) Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease. Nat Genet 33: 177–182.
- 8. Herbert A, Gerry NP, McQueen MB, Heid IM, Pfeufer A, et al. (2006) A common genetic variant is associated with adult and childhood obesity. Science 312: 279–283.
- 9. Gong Y, Lee JN, Brown MS, Goldstein JL, Ye J (2006) Juxtamembranous aspartic acid in Insig-1 and Insig-2 is required for cholesterol homeostasis. Proc Natl Acad Sci U S A 103: 6154–6159.
- 10. Engelking LJ, Liang G, Hammer RE, Takaishi K, Kuriyama H, et al. (2005) Schoenheimer effect explained–feedback regulation of cholesterol synthesis in mice mediated by Insig proteins. J Clin Invest 115: 2489–2498.
- 11. Yabe D, Brown MS, Goldstein JL (2002) Insig-2, a second endoplasmic reticulum protein that binds SCAP and blocks export of sterol regulatory element-binding proteins. Proc Natl Acad Sci U S A 99: 12753–12758.
- 12. Takaishi K, Duplomb L, Wang MY, Li J, Unger RH (2004) Hepatic insig-1 or -2 overexpression reduces lipogenesis in obese Zucker diabetic fatty rats and in fasted/refed normal rats. Proc Natl Acad Sci U S A 101: 7106–7111.
- 13. Deng HW, Deng H, Liu YJ, Liu YZ, Xu FH, et al. (2002) A genomewide linkage scan for quantitative-trait loci for obesity phenotypes. Am J Hum Genet 70: 1138–1151.
- 14. Dina C, Meyre D, Samson C, Tichet J, Marre M, et al. (2007) Comment on “A common genetic variant is associated with adult and childhood obesity.”. Science 315: 187. author reply 187.
- 15. Loos RJ, Barroso I, O'Rahilly S, Wareham NJ (2007) Comment on “A common genetic variant is associated with adult and childhood obesity.”. Science 315: 187. author reply 187.
- 16. Rosskopf D, Bornhorst A, Rimmbach C, Schwahn C, Kayser A, et al. (2007) Comment on “A common genetic variant is associated with adult and childhood obesity.”. Science 315: 187. author reply 187.
- 17. Hall DH, Rahman T, Avery PJ, Keavney B (2006) INSIG-2 promoter polymorphism and obesity related phenotypes: Association study in 1428 members of 248 families. BMC Med Genet 7: 83.
- 18. Lange C, DeMeo D, Silverman EK, Weiss ST, Laird NM (2004) PBAT: Tools for family-based association studies. Am J Hum Genet 74: 367–369.
- 19. Breslow NE, Day NE (1980) Statistical methods in cancer research. Volume I - The analysis of case-control studies. IARC Sci Publ. pp. 5–338.
- 20. Tarone RE (1985) On heterogeneity tests based on efficient scores. Biometrika 72: 91–95.
- 21. Flegal KM, Carroll MD, Ogden CL, Johnson CL (2002) Prevalence and trends in obesity among US adults, 1999–2000. JAMA 288: 1723–1727.
- 22. Heid IM, Vollmert C, Hinney A, Doring A, Geller F, et al. (2005) Association of the 103I MC4R allele with decreased body mass in 7937 participants of two population based surveys. J Med Genet. 42.
- 23. Gulcher JR, Stefansson K (2000) The Icelandic Healthcare Database and informed consent. N Engl J Med 342: 1827–1830.
- 24. Grant SF, Thorleifsson G, Reynisdottir I, Benediktsson R, Manolescu A, et al. (2006) Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes. Nat Genet 38: 320–323.
- 25. Stefansson H, Helgason A, Thorleifsson G, Steinthorsdottir V, Masson G, et al. (2005) A common inversion under selection in Europeans. Nat Genet 37: 129–137.
- 26. Wichmann HE, Gieger C, Illig T (2005) KORA-gen–resource for population genetics, controls and a broad spectrum of disease phenotypes. Gesundheitswesen 67(Suppl 1): S26–S30.
- 27. Kannel WB, Feinleib M, McNamara PM, Garrison RJ, Castelli WP (1979) An investigation of coronary heart disease in families. The Framingham offspring study. Am J Epidemiol 110: 281–290.
- 28. Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, et al. (2002) The structure of haplotype blocks in the human genome. Science 296: 2225–2229.
- 29. Lyon HN, Florez JC, Bersaglieri T, Saxena R, Winckler W, et al. (2006) Common variants in the ENPP1 gene are not reproducibly associated with diabetes or obesity. Diabetes. 55.
- 30. Tang K, Fu D, Kotter S, Cotter RJ, Cantor CR, et al. (1995) Matrix-assisted laser desorption/ionization mass spectrometry of immobilized duplex DNA probes. Nucleic Acids Res 23: 3126–3131.
- 31. Hinney A, Bettecken T, Tarnow P, Brumm H, Reichwald K, et al. (2006) Prevalence, spectrum, and functional characterization of melanocortin-4 receptor gene mutations in a representative population-based sample and obese adults from Germany. J Clin Endocrinol Metab 91: 1761–1769.
- 32. Carvajal-Carmona LG, Ophoff R, Service S, Hartiala J, Molina J, et al. (2003) Genetic demography of Antioquia (Colombia) and the Central Valley of Costa Rica. Hum Genet 112: 534–541.
- 33. Service SK, Ophoff RA, Freimer NB (2001) The genome-wide distribution of background linkage disequilibrium in a population isolate. Hum Mol Genet 10: 545–551.
- 34. Hunninghake GM, Soto-Quiros ME, Avila L, Ly NP, Liang C, et al. (2006) Sensitization to Ascaris and Severity of Childhood Asthma in Costa Rica. J Allergy Clin Immunol 119: 654–61.
- 35. CAMP GR (1999) The Childhood Asthma Management Program (CAMP): Design, rationale, and methods. Childhood Asthma Management Program Research Group. Control Clin Trials 20: 91–120.
- 36. Winckler W, Graham RR, de Bakker PI, Sun M, Almgren P, et al. (2005) Association testing of variants in the hepatocyte nuclear factor 4alpha gene with risk of type 2 diabetes in 7,883 people. Diabetes 54: 886–892.
- 37. Altshuler D, Hirschhorn JN, Klannemark M, Lindgren CM, Vohl MC, et al. (2000) The common PPARγ Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. Nat Genet 26: 76–80.
- 38. Laird NM, Horvath S, Xu X (2000) Implementing a unified approach to family-based tests of association. Genet Epidemiol 19(Suppl 1): S36–S42.
- 39. Fisher RA (1925) Statistical methods for research workers. Oa L, editor. Edinburgh: Oliver and Boyd. 239 p.