Search
Advanced Search
Metrics info
Average Rating (0 User Ratings)
    • Currently 0/5 Stars.
    See all categories
      • Currently 0/5 Stars.
      • Currently 0/5 Stars.
      • Currently 0/5 Stars.
    Rate This Article
Share this Article info
  • Bookmark: StumbleUpon Facebook Connotea CiteULike Bibliography

Open Access

Research Article

Genome-Wide Patterns of Nucleotide Polymorphism in Domesticated Rice

Author Summary<p>Domesticated Asian rice is one of the oldest and most important crops in the world. Two main rice evolutionary lineages have been identified, and are thought to have been independently domesticated in Asia. We have examined patterns of DNA sequence variation in the genomes of rice and its wild ancestor to make inferences about the origin of domesticated rice. Population bottlenecks (a reduction in the size of the founding population) in the evolutionary transition from wild to cultivated species has long been thought to be the dominant force shaping patterns of molecular evolution during domestication. We find that the nucleotide variation patterns in rice are inconsistent with a simple bottleneck model. Rice genetic variation, however, can be explained by either a model that incorporates both a bottleneck and migration among rice variety groups, or a model that incorporates a bottleneck and multiple rounds of artificial selection on rice. Selection by humans is believed to have played an important role during crop domestication, and these results may suggest that strong, recurrent selection can leave a signal that can be observed throughout the genomes of domesticated species.</p></sec></div> <span property="dc:date" content="2007-09-28" datatype="xsd:date" rel="dc:identifier" href="http://dx.doi.org/10.1371/journal.pgen.0030163"></span> <span property="dc:subject" content="Evolutionary Biology"></span> <form action=""> <input type="hidden" name="journalDisplayName" id="journalDisplayName" value="PLoS Genetics" /> <input type="hidden" name="crossRefPageURL" id="crossRefPageURL" value="/article/crossref/info%3Adoi%2F10.1371%2Fjournal.pgen.0030163" /> <input type="hidden" name="metricsTabURL" id="metricsTabURL" value="/article/metrics/info%3Adoi%2F10.1371%2Fjournal.pgen.0030163" /> <input type="hidden" name="doi" id="doi" value="info:doi/10.1371/journal.pgen.0030163" /> <input type="hidden" name="articleTitleUnformatted" id="articleTitleUnformatted" value="Genome-Wide%20Patterns%20of%20Nucleotide%20Polymorphism%20in%20Domesticated%20Rice" /> <input type="hidden" name="articlePubDate" id="articlePubDate" value="1190962800000" /> </form> <div class="horizontalTabs" xpathLocation="noSelect"> <ul id="tabsContainer"> <li id="article" class="active"><a href="/article/info%3Adoi%2F10.1371%2Fjournal.pgen.0030163" class="tab" title="Article">Article</a></li> <li id="metrics"><a href="/article/metrics/info%3Adoi%2F10.1371%2Fjournal.pgen.0030163" class="tab" title="Metrics">Metrics</a></li> <li id="related"><a href="/article/related/info%3Adoi%2F10.1371%2Fjournal.pgen.0030163" class="tab" title="Related Content">Related Content</a></li> <li id="comments"><a href="/article/comments/info%3Adoi%2F10.1371%2Fjournal.pgen.0030163" class="tab" title="Comments">Comments: 0</a></li> </ul> </div> <div id="retractionHtmlId" class="retractionHtmlId" style="display:none;" xpathLocation="noSelect"> <div id="retractionlist"></div> </div> <div id="fch" class="fch" style="display:none;" xpathLocation="noSelect"> <p class="fch"><strong> Formal Correction:</strong> This article has been <em>formally corrected</em> to address the following errors.</p> <ol id="fclist" class="fclist"></ol> </div> <div id="articleMenu" xpathLocation="noSelect"> <div class="wrap"> <ul> <li class="annotation icon">To <strong>add a note</strong>, highlight some text. <a href="#" onclick="toggleAnnotation(this, 'public'); return false;" title="Click to turn notes on/off">Hide notes</a></li> <li class="discuss icon"> <a href="/user/secure/secureRedirect.action?goTo=%2Farticle%2Finfo%3Adoi%2F10.1371%2Fjournal.pgen.0030163">Make a general comment</a> </li> </ul> <div id="sectionNavTopBox" style="display:none;"> <p><strong>Jump to</strong></p> <div id="sectionNavTop" class="tools"></div> </div> </div> </div> <p xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="authors" xpathLocation="noSelect"><span property="dc:creator">Ana L. Caicedo</span><sup><a href="#aff1">1</a></sup><sup><a href="#equal-contrib">#</a></sup><sup><a href="#n105">¤a</a></sup>, <span property="dc:creator">Scott H. Williamson</span><sup><a href="#aff2">2</a></sup><sup><a href="#equal-contrib">#</a></sup>, <span property="dc:creator">Ryan D. Hernandez</span><sup><a href="#aff2">2</a></sup>, <span property="dc:creator">Adam Boyko</span><sup><a href="#aff2">2</a></sup>, <span property="dc:creator">Adi Fledel-Alon</span><sup><a href="#aff2">2</a></sup><sup><a href="#n106">¤b</a></sup>, <span property="dc:creator">Thomas L. York</span><sup><a href="#aff2">2</a></sup>, <span property="dc:creator">Nicholas R. Polato</span><sup><a href="#aff3">3</a></sup>, <span property="dc:creator">Kenneth M. Olsen</span><sup><a href="#aff1">1</a></sup><sup><a href="#n107">¤c</a></sup>, <span property="dc:creator">Rasmus Nielsen</span><sup><a href="#aff2">2</a></sup><sup><a href="#n108">¤d</a></sup>, <span property="dc:creator">Susan R. McCouch</span><sup><a href="#aff3">3</a></sup>, <span property="dc:creator">Carlos D. Bustamante</span><sup><a href="#aff2">2</a></sup><sup><a href="#cor1" class="fnoteref">*</a></sup>, <span property="dc:creator">Michael D. Purugganan</span><sup><a href="#aff1">1</a></sup><sup>,</sup><sup><a href="#aff4">4</a></sup><sup>,</sup><sup><a href="#aff5">5</a></sup><sup><a href="#cor1" class="fnoteref">*</a></sup></p><p xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="affiliations" xpathLocation="noSelect"><a name="aff1" id="aff1"></a><strong>1</strong> Department of Genetics, North Carolina State University, Raleigh, North Carolina, United States of America, <a name="aff2" id="aff2"></a><strong>2</strong> Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America, <a name="aff3" id="aff3"></a><strong>3</strong> Department of Plant Breeding and Genetics, Cornell University, Ithaca, New York, United States of America, <a name="aff4" id="aff4"></a><strong>4</strong> Department of Biology, New York University, New York, New York, United States of America, <a name="aff5" id="aff5"></a><strong>5</strong> Center for Comparative Functional Genomics, New York University, New York, New York, United States of America</p><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="abstract" xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[1]"><a id="abstract0" name="abstract0" toc="abstract0" title="Abstract"></a><h2 xpathLocation="noSelect">Abstract <a href="#top">Top</a></h2><p xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[1]/p[1]">Domesticated Asian rice (<span class="genus-species">Oryza sativa</span>) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, <i>O</i>. <i>rufipogon</i>, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in <i>O</i>. <i>sativa</i> varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i) selectively neutral population bottleneck model, (ii) bottleneck plus migration model, (iii) multiple selective sweeps model, and (iv) bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation.</p> </div><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="abstract" xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[2]"><a id="abstract1" name="abstract1" toc="abstract1" title="Author Summary"></a> <h2 xpathLocation="noSelect">Author Summary <a href="#top">Top</a></h2> <p xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[2]/sec[1]/p[1]">Domesticated Asian rice is one of the oldest and most important crops in the world. Two main rice evolutionary lineages have been identified, and are thought to have been independently domesticated in Asia. We have examined patterns of DNA sequence variation in the genomes of rice and its wild ancestor to make inferences about the origin of domesticated rice. Population bottlenecks (a reduction in the size of the founding population) in the evolutionary transition from wild to cultivated species has long been thought to be the dominant force shaping patterns of molecular evolution during domestication. We find that the nucleotide variation patterns in rice are inconsistent with a simple bottleneck model. Rice genetic variation, however, can be explained by either a model that incorporates both a bottleneck and migration among rice variety groups, or a model that incorporates a bottleneck and multiple rounds of artificial selection on rice. Selection by humans is believed to have played an important role during crop domestication, and these results may suggest that strong, recurrent selection can leave a signal that can be observed throughout the genomes of domesticated species.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="articleinfo" xpathLocation="noSelect"><p><strong>Citation: </strong>Caicedo AL, Williamson SH, Hernandez RD, Boyko A, Fledel-Alon A, et al. (2007) Genome-Wide Patterns of Nucleotide Polymorphism in Domesticated Rice. PLoS Genet 3(9): e163. doi:10.1371/journal.pgen.0030163</p><p><strong>Editor: </strong>Gil McVean, University of Oxford, United Kingdom</p><p></p><p><strong>Received:</strong> February 20, 2007; <strong>Accepted:</strong> August 6, 2007; <strong>Published:</strong> September 28, 2007</p><p><strong>Copyright:</strong> © 2007 Caicedo et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.</p><p><strong>Funding:</strong> This work was funded by the US National Science Foundation Plant Genome Research Program..</p><p><strong>Competing interests:</strong> The authors have declared that no competing interests exist.</p><p><strong>Abbreviations: </strong>AIC, Akaike information criterion; GOF, goodness-of-fit; SNP, single nucleotide polymorphism; STS, sequence-tagged site(s)</p><p><a name="cor1"></a>* To whom correspondence should be addressed. E-mail: <a href="mailto:cdb28@cornell.edu">cdb28@cornell.edu</a> (CDB); <a href="mailto:mp132@nyu.edu">mp132@nyu.edu</a> (MDP)</p><p><a name="equal-contrib"></a># These authors contributed equally to this work. </p><p><a name="n105"></a><span class="capture-id"> ¤a Current address: Department of Biology, University of Massachusetts, Amherst, Massachusetts, United States of America</span></p></div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section1" xpathLocation="/article[1]/body[1]/sec[1]"><a id="s1" name="s1" toc="s1" title="Introduction"></a><h3 xpathLocation="noSelect">Introduction <a href="#top">Top</a></h3><p xpathLocation="/article[1]/body[1]/sec[1]/p[1]">Domestication is a complex, cumulative evolutionary process in which human use of organisms leads to morphological and/or behavioral changes distinguishing domesticated species from their wild ancestors [<a href="#pgen-0030163-b001">1</a>,<a href="#pgen-0030163-b002">2</a>]. Beginning with Charles Darwin [<a href="#pgen-0030163-b003">3</a>,<a href="#pgen-0030163-b004">4</a>], there has been strong interest in the study of domestication of crop species as a means of understanding the nature of selection. Moreover, domestication and the development of agriculture are arguably the most important technological innovations in human history [<a href="#pgen-0030163-b005">5</a>]. Crop plant domestication was the linchpin of the Neolithic Revolution 10,000–12,000 years ago, in which hunter-gatherer groups transitioned into sedentary agricultural societies that gave rise to current human cultures [<a href="#pgen-0030163-b006">6</a>]. With domestication came the availability of food surpluses, and this agricultural development led to craft specializations, art, religious and social hierarchies, writing, urbanization, and the origin of the state [<a href="#pgen-0030163-b005">5</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[2]">One of the earliest domesticated crop species is cultivated Asian rice, <span class="genus-species">Oryza sativa</span> L., which has become the world's most widely grown crop and has also assumed the stature of a key model system in plant biology. Rice consumption constitutes about 20% of the world's caloric intake, and in Asian countries, where over half of the world's population lives, rice often represents over 50% of the calories consumed [<a href="#pgen-0030163-b007">7</a>]. Because of its small genome size, rice has been the first crop plant to have its whole genome sequenced [<a href="#pgen-0030163-b008">8</a>–<a href="#pgen-0030163-b010">10</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[3]">A wealth of morphological, physiological, and ecological variation exists within cultivated Asian rice, reflected in the large number of recognized cultivars or strains [<a href="#pgen-0030163-b011">11</a>,<a href="#pgen-0030163-b012">12</a>]. Two main rice varietal groups, <i>O</i>. <i>sativa indica</i> and <i>O</i>. <i>sativa japonica</i>, have been recognized since ancient China [<a href="#pgen-0030163-b013">13</a>]. Although phenotypic distinctions between these groups is not always straightforward, <i>indica</i> varieties tend to be found throughout the tropical regions of Asia and are primarily grown in lowland conditions, while <i>japonica</i> types are differentiated into <i>tropical japonica</i>, distributed in upland tropical regions, and <i>temperate japonica</i>, a recently derived group cultivated in temperate regions [<a href="#pgen-0030163-b011">11</a>,<a href="#pgen-0030163-b013">13</a>,<a href="#pgen-0030163-b014">14</a>]. Additional variety groups include <i>aus</i>, drought-tolerant rice from Bangladesh and West Bengal, and <i>aromatic</i>, fragrant rice from the Himalayan range [<a href="#pgen-0030163-b014">14</a>,<a href="#pgen-0030163-b015">15</a>]. All rice varieties have a predominantly self-fertilizing mating system [<a href="#pgen-0030163-b013">13</a>]. Both morphological and isozyme data have established that <i>O</i>. <i>rufipogon</i> Griff., a partially outcrossing species native to southern Asia, is the wild ancestor of domesticated rice [<a href="#pgen-0030163-b013">13</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[4]">In this paper, we describe the levels and patterns of DNA sequence polymorphism across the rice genome and that of its wild ancestor, <i>O</i>. <i>rufipogon</i>. To our knowledge this is the first genome-wide characterization of sequence variation in domesticated Asian rice, and we show that rice contains a unique pattern of excess high-frequency derived single nucleotide polymorphisms (SNPs) that has not been reported in other species. We develop four models to explain patterns of genetic variation in <i>O</i>. <i>sativa</i> and <i>O</i>. <i>rufipogon</i>, including a simple selectively neutral bottleneck model that has been previously thought to be the dominant demographic force shaping levels of nucleotide variation in crop species. We demonstrate that this simple bottleneck model is inadequate to explain the origin of domesticated rice. We conclude that either positive selection has made a significant impact on genomic polymorphism patterns, or that domestication involved an extremely severe bottleneck (~99.5% reduction) coupled with gene flow among modern varieties and between domesticated rice and its wild ancestor.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section2" xpathLocation="/article[1]/body[1]/sec[2]"><a id="s2" name="s2" toc="s2" title="Results/Discussion"></a><h3 xpathLocation="noSelect">Results/Discussion <a href="#top">Top</a></h3> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/title[1]">Nucleotide Variation in the Rice Genome</h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/p[1]">To assess levels and patterns of polymorphism in the rice genome, we sequenced one hundred eleven randomly chosen gene fragments (sequence-tagged sites or STS) in a diverse panel of <i>Oryza</i> accessions, including 72 from <i>O</i>. <i>sativa</i> and 21 from <i>O</i>. <i>rufipogon</i> (<a href="#pgen-0030163-st001">Tables S1</a> and <a href="#pgen-0030163-st002">S2</a>). Average silent (synonymous and noncoding) site nucleotide diversity (θ<sub>π</sub>) across all sampled loci in <i>O</i>. <i>sativa</i> is approximately 3.20 × 10<sup>−3</sup> (<a href="#pgen-0030163-t001">Table 1</a>). Levels of polymorphism in the wild ancestral species, <i>O</i>. <i>rufipogon</i>, are predictably higher than rice, with a mean silent θ<sub>π</sub> of 5.19 × 10<sup>−3</sup> (<a href="#pgen-0030163-t001">Table 1</a>). These levels of polymorphism are lower than those observed for maize, a domesticated outcrossing species [<a href="#pgen-0030163-b016">16</a>], and <span class="genus-species">Arabidopsis thaliana</span>, a selfing, wild species [<a href="#pgen-0030163-b017">17</a>,<a href="#pgen-0030163-b018">18</a>].</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/table-wrap[1]"><a name="pgen-0030163-t001" id="pgen-0030163-t001" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.t001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.t001&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/table-wrap[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.t001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/table-wrap[1]/label[1]">Table 1. </span></a></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/table-wrap[1]/caption[1]/p[1]">Average Diversity Measures in <span class="genus-species">Oryza</span> spp. across 111 STS Regions</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.t001</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/p[2]">To determine if any genetic differentiation due to population structure among rice groups is evident in these STS sequences, we used the Bayesian clustering program STRUCTURE [<a href="#pgen-0030163-b019">19</a>]. The highest likelihood obtained was with a model specifying <i>K</i> = 7 groups (<a href="#pgen-0030163-g001">Figure 1</a>; <a href="#pgen-0030163-st001">Table S1</a>). Five groups occur within <i>O</i>. <i>sativa</i> and correspond to the traditional variety designations, as described previously [<a href="#pgen-0030163-b014">14</a>]. Evidence of some limited geographical population structure is also observed in <i>O</i>. <i>rufipogon</i> (<a href="#pgen-0030163-g001">Figure 1</a>; <a href="#pgen-0030163-st001">Table S1</a>). Neighbor-joining analysis of the concatenated STS sequences (<a href="#pgen-0030163-sg001">Figure S1</a>) revealed two distinct clusters within cultivated rice; one comprises a <i>tropical japonica</i>, <i>temperate japonica</i>, and <i>aromatic</i> rice lineage, and another consists of <i>aus</i> and <i>indica</i> rice. The apparent monophyly of these major groups is consistent with at least two domestication events in rice [<a href="#pgen-0030163-b014">14</a>,<a href="#pgen-0030163-b020">20</a>–<a href="#pgen-0030163-b024">24</a>]. The nesting of the <i>aromatic</i> and the <i>temperate japonica</i> variety groups within <i>tropical japonica</i> suggests the first two groups originated from secondary divergence events from the latter, although the lack of support for <i>tropical japonica</i> branches does not exclude other possible divergence scenarios (<a href="#pgen-0030163-sg001">Figure S1</a>). <i>Indica</i> and <i>aus</i> relationships, on the other hand, are consistent with rapid divergence after domestication or separate domestication events from the same ancestral gene pool. Within-group SNP levels of cultivated rice are lower than those of the whole species (<a href="#pgen-0030163-t001">Table 1</a>), with subpopulations harboring between 19% (<i>temperate japonica</i>) and 43% (<i>indica</i>) of the polymorphism of <i>O</i>. <i>rufipogon</i>. Assuming separate domestication events, the <i>japonica</i> clade contains 42% and the <i>indica</i> clade contains 48% of the diversity levels found in <i>O</i>. <i>rufipogon</i>.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/fig[1]"><a name="pgen-0030163-g001" id="pgen-0030163-g001" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.g001&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/fig[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/fig[1]/label[1]">Figure 1. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/fig[1]/caption[1]/title[1]">Estimated Population Structure for 97 Accessions of <i>O</i>. <i>sativa</i> and <i>O</i>. <i>rufipogon</i> from 111 STS Loci</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[1]/fig[1]/caption[1]/p[1]">Vertical bars along the horizontal axis represent each <i>Oryza</i> accession; for all accessions, the proportion of ancestry under <i>K</i> = 7 clusters that can be attributed to each cluster is given by the length of each colored segment in a bar.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.g001</span><div class="clearer"></div></div> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/title[1]">The Derived Site-Frequency Spectrum is U-Shaped in <i>O</i>. <i>sativa</i></h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/p[1]">Because of the strong population structure evident in our rice sample, it is necessary to assess patterns of variation separately for each group when making inferences about the evolutionary dynamics of domestication. <i>Indica</i> and <i>tropical japonica</i> represent the most widely grown cultivars for each of the separate domestication events, and we limited our characterization of polymorphism patterns to these two groups. We examined the frequency spectrum of segregating sites within loci using Tajima's D [<a href="#pgen-0030163-b025">25</a>], and found that <i>O</i>. <i>rufipogon</i> and the two main rice subspecies show an excess of rare alleles, as evidenced by the biased distribution of Tajima's D toward negative values (<a href="#pgen-0030163-sg002">Figure S2</a>; <a href="#pgen-0030163-t001">Table 1</a>). Crops are expected to have gone through a population bottleneck during domestication, as only a limited number of founding individuals were brought into cultivation. The distribution of Tajima's D in the domesticated rice varieties is inconsistent with a recent bottleneck, however, as these should reduce levels of low-frequency variants and bias measures of Tajima's D toward positive values. It is possible that subsequent population expansion, due to the spread of rice agriculture, could be responsible for the over-representation of rare alleles segregating in domesticated rice varieties, or selection may have played a role.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/p[2]">We further examined the derived site-frequency spectrum across SNPs (i.e., the fraction of derived polymorphisms present at various frequencies within a group) in <i>indica</i> and <i>tropical japonica</i>. To infer ancestral alleles for each SNP, we used as an outgroup <i>O</i>. <i>meridionalis</i>, a species believed to have diverged from <i>O</i>. <i>sativa</i> ~2 million years ago [<a href="#pgen-0030163-b021">21</a>]. In each <i>O</i>. <i>sativa</i> variety we observed a large number of high-frequency derived mutations (i.e., derived SNPs above 70% frequency in the population) leading to a U-shaped frequency distribution (<a href="#pgen-0030163-g002">Figure 2</a>); this type of pattern has not been reported at the genomic level in any other species.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/fig[1]"><a name="pgen-0030163-g002" id="pgen-0030163-g002" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.g002&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/fig[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/fig[1]/label[1]">Figure 2. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/fig[1]/caption[1]/title[1]">The Observed Marginal Derived Site-Frequency Spectra of Noncoding and Synonymous SNPs for Two Population Pairs: <i>indica</i> and <i>O</i>. <i>rufipogon</i> and <i>tropical japonica</i> and <i>O</i>. <i>rufipogon</i></span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/fig[1]/caption[1]/p[1]">To accommodate SNPs with missing data, all spectra are plotted as the expected site frequency spectrum in a subsample of the data of size <i>n</i> = 16.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.g002</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/p[3]">Possible explanations for the excess of high-frequency derived SNPs in <i>O</i>. <i>sativa</i> include the misidentification of ancestral states due to shared polymorphism with <i>O</i>. <i>meridionalis</i>, or the occurrence of multiple mutations at given sites since divergence from <i>O</i>. <i>meridionalis</i>. However, both misidentification of derived alleles and multiple hits would be expected to also affect the site-frequency spectrum of <i>O</i>. <i>rufipogon</i>, which is not observed (<a href="#pgen-0030163-g002">Figure 2</a>). This suggests that the <i>O</i>. <i>sativa</i> derived site-frequency distribution is a result of the domestication process. Furthermore, derived alleles at high frequency in the <i>O</i>. <i>sativa</i> varieties occur primarily at low to intermediate frequency in <i>O</i>. <i>rufipogon</i>, suggesting that such alleles have only recently increased in frequency (<a href="#pgen-0030163-sg003">Figure S3</a>).</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/p[4]">We also checked the ancestral state calls in <i>O</i>. <i>sativa</i> using the African wild rice <i>O</i>. <i>barthii</i>. Although <i>O</i>. <i>barthii</i> is more closely related to <i>O</i>. <i>sativa</i> than is <i>O</i>. <i>meridionalis</i>, if we assume that both wild species share ancestral polymorphisms with domesticated rice, the possibility that we always identified the same alternative allele as derived in our sample should be low. Using this approach, we find that 88% of our ancestral SNP calls in <i>indica</i> and 86% in <i>tropical japonica</i> matched in <i>O</i>. <i>barthii</i> and <i>O</i>. <i>meridionalis</i>. Even when using only the matched calls (which is a very conservative criterion, since it does not take into account drift and/or fixation processes in <i>O</i>. <i>barthii</i>), the site frequency spectrum in <i>O</i>. <i>sativa</i> varieties remains U-shaped.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[2]/p[5]">An excess of high-frequency derived SNPs is often interpreted as a result of genetic hitchhiking during recent selective sweeps [<a href="#pgen-0030163-b026">26</a>]. Because the site-frequency spectrum in rice varieties is observed from randomly selected loci, and the loci contributing high-frequency derived SNPs are distributed across the genome (<a href="#pgen-0030163-sg004">Figure S4</a>), this pattern suggests that strong linkage to positively selected mutations occurred within most of the genome. However, demographic forces may have also played a role in shaping the rice genomes. We developed several demographic models and a multiple selective sweeps model to test which evolutionary processes may best explain the observed patterns of polymorphism in rice.</p> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/title[1]">Demographic Models for Rice Domestication: A Neutral Population Bottleneck Model</h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/p[1]">The most widely accepted demographic model for crop domestication is a neutral bottleneck model [<a href="#pgen-0030163-b027">27</a>–<a href="#pgen-0030163-b029">29</a>]. In this model, rice domestication is assumed to be a result of recent population divergence, with one of the two daughter populations experiencing a reduction in population size at divergence associated with the founder effect at the time of domestication, followed by population growth as cultivation of the crop increases. To fit this model to our data, we used a diffusion-based approach [<a href="#pgen-0030163-b030">30</a>–<a href="#pgen-0030163-b032">32</a>] to predict the pattern of allele frequencies in domestic and ancestral populations under selective neutrality.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/p[2]">Details of the inference procedure can be found in the <a href="#s3">Materials and Methods</a> section. The composite-likelihood function we employed uses the reduction in diversity observed in either of the domesticated rice subspecies and the shift in allele frequency distribution to estimate four parameters: the time back until the start of domestication (τ<sub>1</sub>), duration of the bottleneck (τ<sub>2</sub>), ratio of current population to ancestral population size (ν<sub>2</sub>), and relative size of the bottleneck population to the ancestral population (ν<sub>b</sub>). The duration of the bottleneck was assumed to be 25% of the time back until domestication (τ<sub>2</sub> = 0.25 × τ<sub>1</sub>), which is consistent with archeological data suggesting it took ~3,000 y from the time of initial cultivation (~12,000 y ago) until the appearance of domesticated rice grains [<a href="#pgen-0030163-b033">33</a>,<a href="#pgen-0030163-b034">34</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/p[3]">Bottleneck parameter estimates for <i>indica</i> and <i>tropical japonica</i> are broadly comparable, with a slightly more severe bottleneck in <i>tropical japonica</i> (<a href="#pgen-0030163-t002">Table 2</a>). Assuming the time back to the beginning of domestication for both variety groups was ~12,000 y [<a href="#pgen-0030163-b035">35</a>], we can independently derive estimates of the current <i>O</i>. <i>rufipogon</i> effective population size, <i>N</i><sub>rufi</sub>, using the relationship τ<sub>1</sub> × 2<i>N</i><sub>rufi</sub> = 12,000 (because τ<sub>1</sub> is scaled by 2<i>N</i><sub>rufi</sub>). From the <i>indica</i> analyses, <i>N</i><sub>rufi</sub> is equal to 12,000/(2 × 0.1044) = 57,471, and from the <i>tropical japonica</i> analyses is equal to 12,000/(2 × 0.0508) = 118,110 (this exact value of <i>N</i><sub>rufi</sub> is important in scaling all of the estimated parameters into years and number of individuals). The <i>indica</i>-derived <i>N</i><sub>rufi</sub> estimate implies bottleneck and current estimated population size (<i>N</i><sub>e</sub>) for <i>indica</i> of (ν<sub>b</sub> × <i>N<sub>r</sub></i><sub>ufi</sub>) = 1,413 and (ν<sub>2</sub> × <i>N</i><sub>rufi</sub>) = 40,229 respectively. The second estimate suggests a bottleneck and current <i>N</i><sub>e</sub> sizes for <i>tropical japonica</i> of (ν<sub>b</sub> × <i>N</i><sub>rufi</sub>) = 1,334 and (ν<sub>2</sub> × <i>N</i><sub>rufi</sub>) = 46,889, respectively.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/table-wrap[1]"><a name="pgen-0030163-t002" id="pgen-0030163-t002" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.t002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.t002&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/table-wrap[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.t002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/table-wrap[1]/label[1]">Table 2. </span></a></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/table-wrap[1]/caption[1]/p[1]">Maximum Likelihood Estimates for Demographic Parameters of the Bottleneck and Bottleneck plus Migration Models in <i>indica</i>, <i>tropical japonica</i>, and <span class="genus-species">O. rufipogon</span></p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.t002</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/sec[3]/p[4]">The differences in estimates of <i>N</i><sub>rufi</sub> from each analysis could be attributable to differences in the founding population of each variety group or differences in the timing of each domestication event. We note, however, that a bottleneck model conditioned on coincident domestication for <i>indica</i> and <i>tropical japonica</i> (equal τ<sub>1</sub> values) differs only by 1.8 log likelihood units (unpublished data), suggesting that equal timing of domestication is likely to have occurred. An independent estimate of <i>N</i><sub>rufi</sub> can be found by using the estimated scaled population silent mutation rates (θ<sub>W</sub> = 4<i>N</i><sub>rufi</sub> μ = 5.42 × 10<sup>−3</sup> per bp; <a href="#pgen-0030163-t001">Table 1</a>) and the observation that the <i>O</i>. <i>rufipogon</i> site-frequency spectrum is consistent with that of a population of long-term constant size (<a href="#pgen-0030163-g002">Figure 2</a>). Assuming a neutral mutation rate of 10<sup>−8</sup> per bp, yields a point estimate of <i>N</i><sub>rufi</sub> = 135,500, which is slightly higher, but close to the estimates found by conditioning on the start of domestication.</p> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/title[1]">Demographic Models for Rice Domestication: A Complex Model Incorporating Subdivision, Bottlenecks, and Migration</h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/p[1]">It is important to note that population bottlenecks alone would not generate the strong excess of high-frequency derived alleles and strong U-shaped site-frequency spectrum observed in <i>O</i>. <i>sativa</i> (<a href="#pgen-0030163-g002">Figure 2</a>) [<a href="#pgen-0030163-b036">36</a>]. In order to explain this aspect of the data, we considered several demographic models that included ancient subdivision in the ancestor of rice, a bottleneck at the time of domestication for each domesticated varietal group, and limited gene flow between the independently domesticated rice groups <i>indica</i> and <i>tropical japonica</i>. Ancient, strong subdivision is not evident in our <i>O</i>. <i>rufipogon</i> sample (<a href="#pgen-0030163-g001">Figure 1</a>); <i>F</i><sub>st</sub> between Chinese and non-Chinese <i>O</i>. <i>rufipogon</i> is low, about 0.16, and no interior modes are evident in the site-frequency spectrum of <i>O</i>. <i>rufipogon</i>, as expected under subdivision. However, it is possible for limited gene flow in <i>O</i>. <i>rufipogon</i> to lead to some differentiation of allele frequency between groups, but not so much that it would have a strong effect on a combined <i>O</i>. <i>rufipogon</i> sample. Furthermore, the population bottlenecks induced by independent domestication events could amplify any allele frequency differentiation between <i>indica</i> and <i>tropical japonica</i>, and limited gene flow between these two groups could introduce ancestral alleles into each population, causing mutations previously fixed in one group to be observed as high-frequency derived alleles in the other.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/p[2]">To test the effect of ancestral population substructure within <i>O</i>. <i>rufipogon</i> prior to the domestication of the two <i>O</i>. <i>sativa</i> groups, we fit the parameters of a complex demographic model to our data using a composite likelihood technique (see <a href="#s3">Materials and Methods</a>). We began by exploring a model with seven demographic parameters, which consists of <i>O</i>. <i>rufipogon</i> being subdivided into two demes of equal size, sharing on average <i>M</i><sub>R</sub> migrants per generation. Current-day <i>indica</i> varieties are descended from one of these demes, while <i>tropical japonica</i> varieties descend from the other. During the domestication process, each population underwent a bottleneck that began τ<sub>1</sub> generations ago (in units 2<i>N</i><sub>rufi</sub>) and had severity ν<sub>b</sub> (the ratio of the reduced population size to the ancestral size). After τ<sub>2</sub> = 0.25 × τ<sub>1</sub> generations (~3,000 y), both <i>indica</i> and <i>tropical japonica</i> partially recovered, instantaneously reaching a fraction υ<sub>I</sub> and υ<sub>J</sub> of the ancestral size, respectively. Contemporary gene flow (since domestication) between <i>tropical japonica</i>, <span class="genus-species">O. rufipogon</span>, and <i>indica</i> is captured by the last parameter, the average number of migrants per generation between these demes (<i>M</i><sub>S</sub>). This model was conceived because it incorporates key demographic features of rice or crop domestication (e.g., bottlenecks, two domestication events) and could conceptually generate the observed derived SNP site frequency spectrum.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/p[3]">In preliminary analyses, we found that the migration rate (<i>M</i><sub>R</sub>) between the two ancestral <i>O</i>. <i>rufipogon</i> demes was very large, with the marginal likelihood surface for this parameter near its maximum value whenever <i>M</i><sub>R</sub> > 7. This is consistent with our observations of limited population structure in <i>O</i>. <i>rufipogon</i> (above), and we therefore discarded ancestral population structure as a main contributor to the patterns observed in our dataset, and simplified the demographic model to consider only a single ancestral population from with both <i>indica</i> and <i>tropical japonica</i> derive (with migration rates among the three remaining demes, <i>M</i><sub>S</sub> = 4<i>N</i><sub>rufi</sub><i>m</i>). This assumption reduced the computational complexity, so that the remaining parameters could be estimated via a grid search using an initial size of over 2,000 points with 1,000,000 coalescent simulations per point. The resulting model (which we refer to as the bottleneck plus migration model) has five free parameters with composite maximum likelihood estimates of <i>M</i><sub>S</sub> = 7.0 (migration between demes), ν<sub>b</sub> = 0.0055 (domestication bottleneck size), ν<sub>I</sub> = 0.27 (ratio of <i>indica</i> to <i>O</i>. <i>rufipogon N</i><sub>e</sub>), ν<sub>J</sub> = 0.12 (ratio of <i>tropical japonica</i> to <i>O</i>. <i>rufipogon N</i><sub>e</sub>), and τ<sub>1</sub> = 0.04 (start of domestication in units of 2<i>N</i><sub>rufi</sub>) (<a href="#pgen-0030163-t002">Table 2</a>). It is important to note that coalescent simulations scale the migration based on population size, so the number of migrants entering into the <i>tropical japonica</i> population is smaller (0.5 × <i>M</i> × ν<sub>J</sub> = 0.42), than into <i>indica</i> (0.5 × <i>M</i> × ν<sub>I</sub> = 0.945), and <span class="genus-species">O. rufipogon</span> (0.5 × <i>M</i> = 3.5).</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/p[4]">In <a href="#pgen-0030163-g003">Figure 3</a>, we report the profile composite-likelihood contours for the three key demographic parameters in the bottleneck plus migration model: migration rate, start of the bottleneck, and severity. The figure is constructed by holding two parameters fixed at a given point in the (<i>x</i>,<i>y</i>) plane, optimizing over the third parameter, and reporting the maximum likelihood attained for the (<i>x</i>,<i>y</i>) point (due to computational limitations the figure was constructed holding the ratio of current-day <i>indica</i> and <i>tropical japonica</i> populations at their maximum composite-likelihood estimates). We note that the three parameters are moderately to strongly correlated, but only a restricted set of values in high dimensional space is consistent with the data. These solutions all include: a very strong bottleneck (>99% reduction), high rates of migration within and between domesticated and wild populations of Asian rice (<i>M</i> > 5), and current-day effective population sizes for cultivated rice that are substantially smaller than those seen in the ancestral population. We also note that the model solutions show a positive correlation between size of bottleneck population and timing of the bottleneck, a negative correlation between size of the bottleneck and migration, and a negative correlation between migration and timing (consistent with the ~2-fold difference in the estimated time of the bottleneck between the model with migration and the model without).</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[1]"><a name="pgen-0030163-g003" id="pgen-0030163-g003" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g003" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.g003&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g003" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[1]/label[1]">Figure 3. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[1]/caption[1]/title[1]">Contours of Composite Profile Log-Likelihood Surface under the Bottleneck and Migration (i.e., “Complex Demography”) Model for Three Key Demographic Parameters</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[1]/caption[1]/p[1]">Parameters include bottleneck severity, migration rate among demes (4<i>Nm</i>), and τ<sub>1</sub> (time back until start of domestication scaled in units of 2<i>N</i><sub>rufi</sub>). The maximum composite-likelihood estimate of the parameters is denoted by a red filled circle.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.g003</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/p[5]">As can be seen in <a href="#pgen-0030163-g004">Figure 4</a>, the expected site-frequency spectrum under the best fitting bottleneck plus migration model matches the observed frequency distributions fairly well for both <i>O</i>. <i>rufipogon</i> as well as <i>indica</i>, but not as well for <i>tropical japonica</i>. As expected, the total number of SNPs in each of the three populations is predicted quite well by the model. We quantified the fit of the model to the observed data using a modified Pearson Chi-square goodness-of-fit (GOF) statistic, and found that the best-fitting complex demographic model is an excellent fit to the marginal <i>indica</i> (GOF<sub>I</sub> = 20.26, <i>p</i> = 0.72) and <span class="genus-species">O. rufipogon</span> site-frequency spectra (GOF<sub>R</sub> = 7.57; <i>p</i> = 0.99), and an adequate fit to the <i>tropical japonica</i> site-frequency spectrum (GOF<sub>T</sub> = 37.83, <i>p</i> = 0.22). One interesting observation is that the demographic model underpredicts the excess of high-frequency derived alleles observed in <i>tropical japonica</i>—a potential indication of recent positive selection. Given that artificial selection was probably quite strong and frequent during and after domestication, we further explored models that incorporate selection during the domestication process of <i>O</i>. <i>sativa</i>.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[2]"><a name="pgen-0030163-g004" id="pgen-0030163-g004" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g004" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.g004&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[2]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g004" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[2]/label[1]">Figure 4. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[2]/caption[1]/title[1]">Observed and Expected Derived Site-Frequency Spectra under Various Models</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[4]/fig[2]/caption[1]/p[1]">The observed derived site-frequency spectrum for (A) <i>indica</i> and (B) <i>tropical japonica</i>, along with the expected site-frequency spectrum under the simple bottleneck, bottleneck plus migration demography, and bottleneck plus sweeps models. (C) Observed site-frequency spectrum for <i>O</i>. <i>rufipogon</i> and expected frequencies using a standard neutral model and a bottleneck plus migration model.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.g004</span><div class="clearer"></div></div> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/title[1]">Selection Models for Rice Domestication</h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/p[1]">Since strong selection is known to accompany crop domestication, we developed two alternative models incorporating multiple selective sweeps to explain the unusual polymorphism patterns in <i>indica</i> and <i>tropical japonica</i>. In a neutral locus linked to a single, recent selective sweep, let <i>f</i><sub>i</sub> be the probability of observing a neutral mutation segregating at frequency <i>i</i> in a sample of size <i>n</i>, conditional on the locus being variable. An expression for <i>f</i><sub>i</sub> has been derived [<a href="#pgen-0030163-b026">26</a>] and further extended [<a href="#pgen-0030163-b037">37</a>,<a href="#pgen-0030163-b038">38</a>], and includes the genomic distance <i>d</i> (measured in bp) between neutral and selected loci, a compound parameter α, which represents the combined contributions of recombination, selection, and population size, and the “background” allele frequency distribution (i.e., the expected site-frequency spectrum for loci unlinked to a selected site).</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/p[2]">These results for a single sweep can be used to predict the site-frequency spectrum at randomly chosen loci if multiple sweeps have recently occurred. Assuming that selective sweeps occur at random positions in the genome at a density of κ sweeps per bp, the distance between a random neutral locus and the nearest sweep will be approximately exponentially distributed with mean 1/(2κ). Define the function φ<i><sub>i</sub></i>(<i>d</i>, α, κ) to be the probability of observing <i>i</i> copies of a neutral mutation in a sample of <i>n</i> chromosomes, given that a sweep occurred at a distance <i>d</i> bp away with compound parameter α [<a href="#pgen-0030163-b038">38</a>], and background site-frequency spectrum <b>q</b>. By integrating over the distance between the sampled locus and the unknown target of the sweep, the marginal probability, <i>P</i><sub>i</sub>, of observing a randomly chosen SNP at frequency <i>i</i> in a sample of <i>n</i> chromosomes is a function of κ, α, and <b>q</b> [<a href="#pgen-0030163-b038">38</a>]: <br><a name="pgen-0030163-e001" id="pgen-0030163-e001"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e001&representation=PNG"></span><br>This probability can be used to calculate the composite likelihood of the data and estimate the parameters κ and α (see <a href="#s3">Materials and Methods</a>). It should be noted that this equation assumes that the neutral locus is affected only by the nearest selective sweep. </p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/p[3]">We considered two distinct models. The first is a model in which strong selection is the only force that has acted in domesticated rice populations, and uses the normalized <i>O</i>. <i>rufipogon</i> site-frequency spectrum as the background frequency distribution. The second, a bottleneck plus sweeps model, allows multiple selective sweeps to affect patterns of variation immediately following a population size change. The background site-frequency spectrum in the latter case can be approximated using the predictions of a simplified neutral bottleneck model. The bottleneck plus sweeps model incorporates the sweep density κ, the compound parameter α (the combined contributions of recombination, selection, and population size), and a bottleneck severity parameter ν.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/p[4]">The likelihood surfaces for both the pure selection and the bottleneck plus sweeps model in rice each contains a long ridge where different parameter combinations have almost equally high likelihoods, implying that a model with high sweep density and relatively weak selection is just as likely as a model with low sweep density and strong selection (<a href="#pgen-0030163-g005">Figure 5</a>). For both models, the ridge of maximum likelihood is shifted to the right in <i>tropical japonica</i>, indicating that for a given value of the selection severity parameter α, the sweep density in <i>tropical japonica</i> is estimated to be twice that in <i>indica</i>.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/fig[1]"><a name="pgen-0030163-g005" id="pgen-0030163-g005" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g005" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.g005&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/fig[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0030163&imageURI=info:doi/10.1371/journal.pgen.0030163.g005" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/fig[1]/label[1]">Figure 5. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/fig[1]/caption[1]/title[1]">Composite Likelihood Surfaces in <i>indica</i> and <i>tropical japonica</i> under Models Incorporating Selection</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/fig[1]/caption[1]/p[1]">A density plot of the marginal composite log-likelihood surface of the parameters α and κ, with the bottleneck severity υ fixed to its estimate, under the bottleneck plus sweeps model for (A) <i>indica</i> and (B) <i>tropical japonica</i>. The composite log-likelihood surface of the parameters α and κ under the pure selection model for (C) <i>indica</i> and (D) <i>tropical japonica</i>. The composite log-likelihood is represented as a deviation from the maximum log-likelihood, with lighter values representing higher composite likelihoods. Numbers above (A) and (B) indicate the total number of sweeps in the rice genome corresponding to each value of κ, and numbers to the right of (B) and (D) represent the selection coefficient, <i>s</i>, corresponding to each value of α, substituting an effective recombination rate of <i>r</i> = 10<sup>−12</sup> and ln(2<i>N</i>) = 10 into the expression: α ≈ <i>rs</i><sup>−1</sup> ln(2N), then solving for <i>s</i>.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0030163.g005</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/sec[5]/p[5]">Sweep density is confounded with selection strength due to the effect of a mating system change on recombination rate. In domesticated rice, the transition to selfing likely occurred simultaneously with the sweeps, making it difficult to disentangle the recombination rate and selfing parameters. Under a recent selective sweep in a randomly mating population, the compound parameter α ≈ <i>rs</i><sup>−1</sup> ln(2<i>N</i>), where <i>r</i> is the per-basepair recombination rate, <i>s</i> is the selection coefficient and <i>N</i> is the population size [<a href="#pgen-0030163-b039">39</a>]. In a partially selfing population such as domesticated rice, however, both effective recombination rate and population size are affected by selfing rate. While the rate of coalescence (and hence the effective population size) is at most doubled by the rate of selfing, the rate of recombination can be radically altered. An expression for effective recombination rate is <i>r</i>(1 − σ/[2 − σ]), where σ is the selfing rate [<a href="#pgen-0030163-b040">40</a>]. For domesticated rice, estimates of selfing rates are typically ~0.99 [<a href="#pgen-0030163-b013">13</a>], resulting in a reduced recombination rate by approximately 10<sup>−3</sup>. If we assume 400 selective sweeps occurred in the rice genome since domestication (κ = 10<sup>−6</sup>), we estimate that α = 2 × 10<sup>−12</sup> for <i>indica</i>. With <i>r</i> = 10<sup>−9</sup> recombination events per generation per base pair and ln(2<i>N</i>) ≈ 10, this estimate of α corresponds to an unreasonably high estimate of a 5,000-fold fitness advantage. Substituting an effective recombination rate of 10<sup>−12</sup> (corresponding to a reduced effective rate due to selfing), we find more reasonable values for the strength of selection for the selective sweeps, with <i>s</i> ≈ 5. This example illustrates how high selfing rates can amplify the signal of selection and contribute to the pattern of polymorphism in the rice genome.</p> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[6]/title[1]">Comparing Models to Explain Patterns of Nucleotide Polymorphism in Rice</h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[6]/p[1]">Visually, it appears both the bottleneck plus sweeps model and the bottleneck plus migration model predict the site-frequency spectrum of domesticated rice better than the bottleneck model alone (<a href="#pgen-0030163-g004">Figure 4</a>) or the pure selection model (unpublished data). To compare likelihoods and determine which model best fits the data, we used the Akaike information criterion (AIC) [<a href="#pgen-0030163-b041">41</a>]. Since SNPs in our dataset are linked, we used a composite likelihood function and simulations to assign <i>p</i>-values to the observed AIC statistic (see <a href="#s3">Materials and Methods</a>).</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[6]/p[2]">For <i>indica</i>, the bottleneck plus sweeps model is significantly better than the neutral bottleneck model (Λ = −17.18, <i>p</i> < 0.05) as is the bottleneck plus migration model (Λ = −14.19, <i>p</i> < 0.05). For <i>tropical japonica</i>, we also reject the neutral bottleneck model in favor of both the bottleneck plus sweeps model (Λ = −56.88, <i>p</i> < 0.01) and the bottleneck plus migration model (Λ = −53.60, <i>p</i> < 0.01). For both rice variety groups, the AIC for the bottleneck plus sweeps model was slightly lower than for the bottleneck plus migration models (Λ = −2.26, <i>indica</i>; Λ = −3.28, <i>japonica</i>), but this difference is likely not statistically meaningful given the various assumptions made. A separate (but not independent) assessment is comparing the fit of the predictions of each model to the data. The bottleneck plus sweeps model fits the marginal site-frequency spectrum of <i>indica</i> quite well (GOF = 13.86; <i>p</i> = 0.92), and does a slightly better job explaining the site-frequency spectrum of <i>tropical japonica</i> than does the complex demographic model incorporating bottlenecks plus migration (GOF<sub>sweeps + bottleneck</sub> = 31.21, <i>p</i> = 0.33; GOF<sub>bottlenecks + migration</sub> = 37.83; <i>p</i> = 0.22). These results underscore the importance of jointly modeling demographic and selective effects when considering the evolution of domesticated crop species.</p> <h4 xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/title[1]">Domestication and the Shaping of Genome-Wide Polymorphism Patterns in Rice</h4> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/p[1]">Population bottlenecks are believed to be the primary demographic event associated with crop species origins, and are the accepted mechanism to explain observed genome-wide polymorphism levels among these taxa. There have been concerted efforts to model the impact of population bottlenecks on domesticated species genomes [<a href="#pgen-0030163-b027">27</a>–<a href="#pgen-0030163-b029">29</a>,<a href="#pgen-0030163-b042">42</a>–<a href="#pgen-0030163-b044">44</a>]. It appears from our results, however, that a population bottleneck alone is inadequate to explain the observed nucleotide polymorphism patterns in rice, one of the oldest and the most predominant food crop species in the world.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/p[2]">A more complex demographic scenario involving very strong bottlenecks that led to the fixation of alternate alleles during the two rice domestication events, with concurrent gene flow between variety groups, can explain the site-frequency spectrum of <i>indica</i> and <i>O</i>. <i>rufipogon</i>. However, this pure demography model requires a bottleneck 4-fold stronger in <i>indica</i> and twice as strong in <i>tropical japonica</i> relative to the model that incorporates selection (<a href="#pgen-0030163-g005">Figure 5</a>; <a href="#pgen-0030163-t002">Table 2</a>), and a relatively high migration rate between domesticated rice and wild <i>O</i>. <i>rufipogon</i> populations. It is also important to note that the model is a poor fit to the observed frequency distribution of alleles in <i>tropical japonica</i>.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/p[3]">Domestication, however, is characterized by strong directional selection on a suite of traits that lead to the establishment of cultivated species as distinct entities from their wild progenitors within agricultural settings. We show that, in contrast to the complex demographic model, a simple bottleneck with sweeps model fits data from both <i>tropical japonica</i> and <i>indica</i> well without requiring an extremely strong domestication bottleneck. Since domesticated Asian rice has been subject to artificial selection, the selection plus demography model is a very plausible explanation for the observed strong excess of high-frequency derived alleles in domesticated rice varieties, and is consistent with recent reports about domestication genes in rice [<a href="#pgen-0030163-b045">45</a>,<a href="#pgen-0030163-b046">46</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/p[4]">Positive selection on specific genes results in reductions in variation within a genome through selective sweeps [<a href="#pgen-0030163-b047">47</a>,<a href="#pgen-0030163-b048">48</a>]. Unlike bottlenecks, however, selection is thought to have largely localized effects on genome variation. Our results suggest that a model that incorporates selection can explain patterns of nucleotide variation in a set of genome-wide markers. We suggest two reasons why selective sweeps during domestication could cause a genome-wide effect in <i>O</i>. <i>sativa</i> and not in other cereal crop species such as maize. First, the origin of domesticated Asian rice is associated with a transition to self-fertilization, which results in a low effective recombination rate and greatly increases the genomic distance affected by selection. Second, <i>O</i>. <i>sativa</i> possesses such a small genome (<400 Mb) that it is likely that a few dozen to hundreds of selective sweeps could leave a genome-wide imprint.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/p[5]">Interestingly, under the bottleneck plus selective sweeps model, the dynamics of domestication appear to differ in significant ways between <i>indica</i> and <i>tropical japonica</i>. Despite the fact that these two variety groups were domesticated from the same species and both have contributed significantly to Asian agriculture, it appears that the number of selective events and/or the bottleneck severity differs between them. It is possible that the two subspecies would diverge from each other in the demographic patterns associated with domestication, given that they were established by different cultures. If this is correct, then <i>tropical japonica</i> appears to have undergone a more severe bottleneck associated with domestication. Alternatively, it may be that the establishment of <i>tropical japonica</i>, which includes landraces that expanded to upland growing areas, may be associated with stronger selection pressures on a larger number of traits.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/sec[7]/p[6]">The process of domestication is one of recent, rapid species evolution, and studies on the dynamics of this process inform our understanding of the origins and diversification of new species. Simple demographic scenarios that have been employed in the past may not fully capture the domestication process of some crop species such as Asian rice. Our models indicate that selection and population bottlenecks together, or more complex scenarios that invoke very strong bottlenecks and current gene flow, could be responsible for determining genome-wide variation in the rice genome, a finding that has not been described in other domesticated species. Domesticated crop species are particularly suitable subjects in which to study the interaction between demographic events and selection in shaping species characteristics, and exploring the relative contributions of these forces require developing predictions for patterns of DNA polymorphism using models that allow selection to vary in timing (i.e., both during and after population bottlenecks) and strength. Nevertheless, our findings do underscore the possible role that selection may play in shaping genomic variation in domesticated species, reinforcing our appreciation of the foresight showed by Charles Darwin nearly a century-and-a-half ago [<a href="#pgen-0030163-b003">3</a>] when he sought to illustrate the power of selection by drawing on the lessons learned from the evolution of domesticated species.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section3" xpathLocation="/article[1]/body[1]/sec[3]"><a id="s3" name="s3" toc="s3" title="Materials and Methods"></a><h3 xpathLocation="noSelect">Materials and Methods <a href="#top">Top</a></h3> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[1]/title[1]">Samples.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[1]/p[1]">A panel of 72 <i>O</i>. <i>sativa</i> accessions was chosen to represent the diversity found within the species. These include representatives of five major subpopulations identified in a previous study [<a href="#pgen-0030163-b014">14</a>], including 21 <i>indica</i>, 18 <i>tropical japonica</i>, 21 <i>temperate japonica</i>, six <i>aus</i>, and six <i>aromatic</i> accessions (<a href="#pgen-0030163-st001">Table S1</a>). Most accessions are landraces, but five accessions studied correspond to modern cultivars. Also included in the panel were 21 accessions of the wild progenitor of rice, <i>O</i>. <i>rufipogon</i>, along with one sample each of <i>O</i>. <i>nivara</i> (a close relative of <i>O</i>. <i>rufipogon</i> not believed to have contributed to the ancestry of cultivated rice) and the outgroup species <i>O</i>. <i>barthii</i> and <i>O</i>. <i>meridionalis</i> (<a href="#pgen-0030163-st001">Table S1</a>).</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[1]/p[2]">DNA was extracted from single plants as described in [<a href="#pgen-0030163-b049">49</a>] with minor modifications. All <i>O</i>. <i>sativa</i> and one <i>O</i>. <i>rufipogon</i> accession (International Rice Germplasm Collection [<a href="http://www.irri.org/grc/">http://www.irri.org/grc/</a>] #105491) were self-fertilized for two generations prior to initiating the study. Seeds from <i>O</i>. <i>rufipogon</i> from Nepal were collected in the field by H. J. Koh and colleagues (Seoul National University); all other seeds were obtained from germplasm repositories as summarized in <a href="#pgen-0030163-st001">Table S1</a>.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/title[1]">PCR and DNA sequencing.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/p[1]">A total of 121 approximately 400–600 bp gene regions across the rice genome were chosen at random for sequencing from a set of 6,591 ESTs [<a href="#pgen-0030163-b050">50</a>]. Four fragments were also selected from genes coding for well-known allozymes, including: catalase, acid phosphatase, <i>pgi-a</i>, and <i>Adh</i>. Primers were designed from the Nipponbare genomic sequence available from Gramene using Primer3 [<a href="#pgen-0030163-b051">51</a>]. Primers were designed in exons, and attempts were made to include both exon and intron sequence within each fragment. DNA sequencing was carried out in Genaissance's sequencing facilities (New Haven, Connecticut, United States) as described in [<a href="#pgen-0030163-b052">52</a>]. Amplification and sequencing were successful for 111 fragments referred to as STS (<a href="#pgen-0030163-st002">Table S2</a>). Approximately 54 kbp per accession were sequenced, composed of, on average, 55% coding and 45% noncoding sequence.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/p[2]">Base-pair calls, quality score assignment, and construction of contigs were carried out using the Phred and Phrap programs (Codon Code). Sequence alignment and editing were carried out with BioLign Version 2.09.1 (Tom Hall, North Carolina State University, Raleigh, North Carolina, United States). Heterozygous sites were identified with Polyphred (Deborah Nickerson, University of Washington, Seattle, Washington, United States) and by visually inspecting chromatograms for double peaks. Heterozygous sites were rare for <i>O</i>. <i>sativa</i>. For heterozygous <i>O</i>. <i>sativa</i> and <i>O</i>. <i>rufipogon</i> sequences, heterozygous sites were labeled with ambiguity codes. For all analyses, the published sequence of Nipponbare was included.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/p[3]">To assess the sequencing error rate, 18 randomly chosen STS fragments were resequenced in a single direction for four <i>Oryza</i> accessions. Only three discordant base pairs within a single individual in a single fragment sequence were observed. This corresponds to three errors in 33,193 resequenced bp, or a sequencing error rate of less than 0.01%.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[3]/title[1]">Diversity analyses.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[3]/p[1]">Population structure among <i>O</i>. <i>sativa</i> and <i>O</i>. <i>rufipogon</i> accessions was evaluated with STRUCTURE 2.1 [<a href="#pgen-0030163-b019">19</a>] using an admixture model with no linkage. To limit the effect of correlation between SNPs due to linkage, one SNP per fragment (the SNP with the highest minor allele frequency across the entire accession set) was used in the analysis. <i>O</i>. <i>sativa</i> is primarily selfing, and most accessions exist as homozygotes; thus, SNP data were considered haploid for this species. <i>O</i>. <i>rufipogon</i> is partially outcrossing, a condition that cannot be adequately represented by considering each locus as diploid; thus, SNP data for <i>O</i>. <i>rufipogon</i> were also considered haploid. Because alternate alleles could occur at a given site in heterozygous <i>O</i>. <i>rufipogon</i> accessions, ten datasets were created with randomly chosen alternative base pairs in heterozygous individuals. Analyses were carried out for all ten datasets. All analyses had a burn-in length of 50,000 iterations and a run length of 100,000 iterations. Three replicates at each value of <i>K</i> (population number) were carried out. Simulations were run with uncorrelated allele frequencies. Results were entirely consistent among replicate runs within datasets and among datasets; the results from one run are presented in <a href="#pgen-0030163-g001">Figure 1</a> and <a href="#pgen-0030163-st001">Table S1</a>.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[3]/p[2]">To assess relationships among <i>Oryza</i> accessions, all STS fragment alignments were concatenated to form a single dataset. Relationships were estimated with a neighbor-joining analysis as implemented in PAUP* version 4.0 b3 [<a href="#pgen-0030163-b053">53</a>]. Distances were calculated using the Kimura two-parameter model. Branch bootstrap estimates were obtained from 1,000 replicates.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[3]/p[3]">Perl scripts were written to assess levels of nucleotide variation (θ<sub>W</sub>) and nucleotide diversity (θ<sub>π</sub>) and Tajima's D across rice groups for all STS fragments, and to calculate the frequency distributions of derived SNPs across the genome. For <i>O</i>. <i>sativa</i> accessions, where heterozygotes were rare, all measures were calculated considering each accession as contributing a single haplotype; for <i>O</i>. <i>rufipogon</i> population measures, each accession was considered to contribute two haplotypes, except for one accession (International Rice Germplasm Collection [<a href="http://www.irri.org/grc/">http://www.irri.org/grc/</a>] #105491) from Malaysia, which had been selfed for several generations prior to this study.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[4]/title[1]">Analysis of the neutral bottleneck model.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[4]/p[1]">Under a neutral bottleneck model, the history of rice domestication is represented by recent population divergence, with one of the two daughter populations experiencing a size bottleneck at divergence associated with the founder effect at the time of domestication. We use the sample frequencies of variable noncoding and synonymous nucleotides in the STS alignments (i.e., the site-frequency spectrum of putatively neutral SNPs) to infer the parameters of the bottleneck model. Our analytical approach makes use of standard Wright-Fisher population genetic theory within a Poisson random field setting [<a href="#pgen-0030163-b054">54</a>–<a href="#pgen-0030163-b057">57</a>]. The assumptions of this model include independence among SNPs, no selection, an underlying Poisson process governing mutations, and a piecewise constant population of large size amenable to modeling using diffusion approximations.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[4]/p[2]">The model we employ is an extension of Williamson et al. [<a href="#pgen-0030163-b058">58</a>], where we present the relevant population and statistical inference theory for modeling a population experiencing a recent size change. The key addition to our previous model is a second size change event, corresponding to the post-bottleneck growth phase. This amounts to modeling the components of the site-frequency spectrum (<i>X</i><sub>1</sub>, <i>X</i><sub>2</sub>, . . ., <i>X</i><sub>n</sub>) as independent Poisson random variables with mean: <br><a name="pgen-0030163-e002" id="pgen-0030163-e002"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e002&representation=PNG"></span><br>where θ is the genome-wide mutation rate, <i>x</i> represents the (unknown) population frequencies of mutations, and <i>f</i>(<i>x</i>;Θ) is the distribution of mutation frequencies given demographic history parameters Θ = {ν,τ<sub>1</sub>,τ<sub>2</sub>}. These parameters are: the time back until the start of domestication (τ<sub>1</sub>), duration of the bottleneck (τ<sub>2</sub>), ratio of current population to ancestral population size (ν<sub>2</sub>), and relative size of the bottleneck population to the ancestral population (ν<sub>b</sub>). The duration of the bottleneck was assumed to be 25% of the time back until domestication (τ<sub>2</sub> = 0.25 × τ<sub>1</sub>), which is consistent with archaeological data suggesting domestication took 3,000 y and began 12,000 y ago. The mutation rate, θ, was estimated from the number of synonymous and noncoding segregating SNPs assuming <i>O</i>. <i>rufipogon</i> represented a population of constant size. This assumption is quite reasonable given the excellent concordance between the <i>O</i>. <i>rufipogon</i> and the predictions of the standard neutral model (<a href="#pgen-0030163-g004">Figure 4</a>), and is equivalent to using Watterson's (1975) estimator of θ. In order to account for missing data, we fitted the population bottleneck model using the projected site-frequency spectrum for a sample of <i>n</i> = 16 chromosomes. </p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[5]/title[1]">Alternative demographic scenarios for rice domestication.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[5]/p[1]">We considered alternative demographic scenarios, in which ancestral population subdivision, followed by gene flow between <i>rufipogon</i>, <i>indica</i>, and <i>tropical japonica</i>, led to an excess of high-frequency derived alleles in domesticated rice groups, as well as a simpler model that has no ancestral substructure. For these models, the composite likelihood function was based on the marginal site-frequency spectrum of each of the three groups analyzed. For ease of notation, let <i>S<sub>ind</sub></i>, <i>S<sub>jap</sub></i>, and <i>S<sub>ruf</sub></i> be the number of SNPs for which we could distinguish ancestral from derived alleles using the outgroup (223, 172, and 636, respectively). Let <b><i>y</i></b> denote the set of derived allele counts for each SNP, with <b><i>y</i></b><i><sub>•</sub></i><sup>ind</sup>, <b><i>y</i></b><i><sub>•</sub></i><i><sup>jap</sup></i>, and <b><i>y</i></b><i><sub>•</sub><sup>ruf</sup></i> referring to set of SNPs for <i>indica</i>, <i>tropical japonica</i>, and <i>O</i>. <i>rufipogon</i> (with lengths <i>S<sub>ind</sub></i>, <i>S<sub>jap</sub></i>, and <i>S<sub>ruf</sub></i> , respectively). To account for missing data, let <b><i>n</i></b> refer to the number of chromosomes sequenced at each SNP, with <b><i>n</i></b><sub>•</sub><i><sup>ind</sup></i>, <b><i>n</i></b><sub>•</sub><i><sup>jap</sup></i>, and <b><i>n</i></b><sub>•</sub><i><sup>ruf</sup></i> the vector for each group (again with lengths <i>S<sub>ind</sub></i>, <i>S<sub>jap</sub></i>, and <i>S<sub>ruf</sub></i>, respectively). For a given demographic model discussed above (the parameters of which we collectively denote Θ), the composite likelihood function is written as <br><a name="pgen-0030163-e003" id="pgen-0030163-e003"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e003&representation=PNG"></span><br>where Pr(<i>S</i><sub>•</sub>|Θ) is assumed to follow a Poisson probability of observing <i>S</i><sub>•</sub> SNPs in a given population under the demographic model Θ assuming the population scaled mutation rate θ = 148.6 (estimated using the observed number of SNPs in <i>O</i>. <i>rufipogon</i>), and <span class="capture-id" id="pgen-0030163-ex001"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.ex001&representation=PNG" border="0"></span> is the probability of observing a SNP configuration in a given population under the demographic model. It is important to note that the inference scheme assumes the allele frequency distributions, conditional on the observed number of segregating sites and demographic parameters, are independent among populations. This composite-likelihood function (like all composite-likelihood functions) must, therefore, be taken as an approximation of the true likelihood function since it ignores dependencies among SNPs due to linkage and among populations due to shared variation. To account for missing data at an arbitrary SNP <i>k</i> in population <i>x</i>, we set <br><a name="pgen-0030163-e004" id="pgen-0030163-e004"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e004&representation=PNG"></span><br>where <i>P<sub>z</sub></i>(Θ,<i>N<sub>x</sub></i>) is the expected proportion of SNPs at a frequency <i>z</i> in a sample of <i>N</i><sub>x</sub> chromosomes under the demographic model Θ, and the fraction within the summation represents the hypergeometric probability of sampling <span class="capture-id" id="pgen-0030163-ex002"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.ex002&representation=PNG" border="0"></span> derived alleles in a subsample of <span class="capture-id" id="pgen-0030163-ex003"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.ex003&representation=PNG" border="0"></span> chromosomes if the unknown frequency of the SNP were <i>j</i> out of <i>N<sub>x</sub></i> (summed over all possible underlying SNP frequencies, <i>j</i>). Details on calculating the expected number of SNPs in each population as well as <i>P<sub>z</sub></i>(Θ,<i>N<sub>x</sub></i>) are described below. </p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/title[1]">Optimizing complex neutral demographic models.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/p[1]">For a given set of parameters, Θ, we determine the expected site-frequency spectra for all three populations (<i>O</i>. <i>rufipogon</i>, <i>indica</i>, and <i>tropical japonica</i>) using 100,000 iterations of the coalescent simulation program <i>ms</i> [<a href="#pgen-0030163-b001">1</a>] conditional on the observed genome-wide estimate of θ for <i>O</i>. <i>rufipogon</i>. To generate data under this model, we used the following code:</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/p[2]"> ms 80 200000 –t 148.6487 –r 148.6487 111 –I 3 21 18 41 <i>M</i> –en 0.5*0.75*τ<sub>1</sub> 1 ν<sub>B</sub> –en 0.5*0.75*τ<sub>1</sub>2ν<sub>B</sub> –ej 0.5*τ<sub>1</sub> 1 3 –ej τ<sub>1</sub> 2 3 −em 0.5*τ<sub>1</sub> 3 1 0 –em 0.5*τ<sub>1</sub> 3 2 0 −n 1 ν<sub>I</sub> –n 2 ν<sub>J</sub>.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/p[3]">Note that the factor 0.75 enters from the assumption that the bottleneck lasted 3,000 y of the 12,000 y time since domestication began, and 0.5 enters since <i>ms</i> scales time in units of 4<i>N</i> generations.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/p[4]">To optimize the three- and five-dimensional likelihood surface, we used an iterative technique, whereby a very coarse grid is initially chosen for each parameter, followed by successively tighter intervals containing the previous iteration's maximum likelihood estimates. Because we were pooling data across 111 STS loci, we generated our expected site-frequency spectrum accordingly. Although recombination within or between STS loci will not affect the expected number of segregating sites or the expected site frequency spectrum under a neutral demographic model, it does impact the rate at which simulations will approach them. We therefore assumed 111 mostly independent loci of equal size when generating our expectations.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[7]/title[1]">Modified Pearson Goodness-of-Fit test.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[7]/p[1]">In order to compare the fit of the demographic model to the observed data accounting for missing genotypes and partial selfing, we considered a projection of the observed and predicted site-frequency spectra into a sample of size <i>n</i> = 16 chromosomes from each of the three populations using the hypergeometric distribution. The “observed” data can be thought of as the predicted SFS in a subsample of <i>n</i> =16 based on the actual SNP data assuming each of the <i>O</i>. <i>sativa</i> accessions contributes one chromosome to the observed allele frequency spectrum, and each of the <i>O</i>. <i>rufipogon</i> accessions contributes two, with the exception of one accession that was known to have been purified. The “expected” data are the predicted marginal site-frequency spectrum at the maximum composite-likelihood estimates of the parameters from the complex demographic model that includes bottlenecks in the two domesticated populations, migration within domesticated populations, and migration between domesticated and ancestral populations. There were 45 observed data points (15 segregating site-frequency spectrum components multiplied by three populations), and the GOF statistic for a given population was tabulated as <span class="capture-id" id="pgen-0030163-ex004"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.ex004&representation=PNG" border="0"></span> . In order to assign a <i>p</i>-value, we simulated 10,000 datasets each containing 111 independent loci with no recombination within loci under the best-fitting demographic model. For each dataset, we then calculated the GOF test statistic using the expected site-frequency spectrum from <a href="#pgen-0030163-g004">Figure 4</a> scaled to the observed number of segregating sites within each of the subpopulations. Ideally, one would re-estimate the demographic parameters in order to fully mimic the inference procedure we used. Unfortunately, estimation of the demographic parameters was extremely computationally intensive for each dataset; the single observed STS data point analyzed here, for example, took over a week of computer time on a dedicated 100-node computing cluster. </p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[8]/title[1]">Composite likelihood under multiple sweeps models.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[8]/p[1]">Conditioning on the observed number of segregating sites in the dataset, the site-frequency spectrum is multinomially distributed with frequency probabilities according to <a href="#pgen-0030163-e001">Equation 1</a>. For the pure selection model, the composite likelihood is: <br><a name="pgen-0030163-e005" id="pgen-0030163-e005"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e005&representation=PNG"></span><br>where <b>q</b><sub>r</sub> is the normalized site-frequency spectrum of <i>O</i>. <i>rufipogon</i>. For the bottleneck plus multiple sweeps model, the composite likelihood is: <br><a name="pgen-0030163-e006" id="pgen-0030163-e006"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e006&representation=PNG"></span><br>where <b>q</b><sub>ν</sub> is the predicted spectrum from a neutral bottleneck model with severity ν. <a href="#pgen-0030163-e005">Equations 5 </a>and <a href="#pgen-0030163-e006">6</a> can be maximized to quantify the number and strength of selective sweeps in domestic rice, and the optimization of <a href="#pgen-0030163-e005">Equation 5</a> provides an estimate of the severity of the population bottleneck that preceded the selective sweep. </p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[9]/title[1]">The background site-frequency spectrum for the bottleneck plus multiple sweeps model.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[9]/p[1]">The bottleneck plus sweeps model assumes that a short bottleneck (representing to the founding of domestic populations) precedes the selective sweeps. To calculate the background site-frequency spectrum at the end of the bottleneck and the beginning of the selective sweeps, we again used numerical methods to solve the one-population diffusion equation with population size changes: <br><a name="pgen-0030163-e007" id="pgen-0030163-e007"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e007&representation=PNG"></span><br> </p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[9]/p[2]">In this case, the recovery time, τ<sub>1</sub>, was set to 0, corresponding to the assumption that new mutations since the bottleneck do not make a strong contribution to the observed SFS. Because the bottleneck duration, τ<sub>2</sub>, and the severity, ν, are confounded parameters, we set τ<sub>2</sub> = 0.01 and allow ν to vary. With <i>f</i>(<i>q</i>,τ<sub>2</sub>) as the numerical solution to <a href="#pgen-0030163-e007">Equation 7</a> evaluated at time τ<sub>2</sub>, we calculate the background site-frequency spectrum <i>q</i>ν as: <br><a name="pgen-0030163-e008" id="pgen-0030163-e008"></a><span class="equation"><img src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0030163.e008&representation=PNG"></span><br> </p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[10]/title[1]">AIC as a test statistic for comparing non-nested models.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[10]/p[1]">To properly interpret differences in AIC between models, we simulated 10,000 datasets of 111 nonrecombining loci under the null hypothesis of the best-fitting neutral bottleneck model using the ms coalescent simulation program [<a href="#pgen-0030163-b059">59</a>]. Because we did not allow recombination within loci, these simulations conservatively account for the effects of linkage. For each simulated dataset, we found the maximum composite likelihoods under each model (bottleneck, bottleneck plus migration, multiple sweeps, and bottleneck plus sweeps) and calculated the AIC value. The AIC statistic of model <i>i</i> is defined as: AIC<i><sub>i</sub></i> = −2(lmax<i><sub>i</sub></i> − k<i><sub>i</sub></i>) where lmax<i><sub>i</sub></i> is the maximum likelihood under model <i>i</i> and k<i><sub>i</sub></i> is the number of free parameters in model <i>i</i>. We used Λ = AIC<i><sub>1</sub></i> − AIC<i><sub>2</sub></i> as a test statistic for comparing the bottleneck and alternative models using a one-tailed test: the <i>p</i>-value was estimated as the proportion of simulations under the null distribution with Λ > Λ<i><sub>obs</sub></i>.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section4" xpathLocation="/article[1]/body[1]/sec[4]"><a id="s4" name="s4" toc="s4" title="Supporting Information"></a><h3 xpathLocation="noSelect">Supporting Information <a href="#top">Top</a></h3><a name="pgen-0030163-sg001" id="pgen-0030163-sg001"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0030163.sg001">Figure S1. </a>Clustering of <i>Oryza</i> Accessions Based on Neighbor-Joining Analysis of Concatenated STS Sequences</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[1]/caption[1]/p[1]">Numbers by branches are bootstraps of 1,000 replicates. Only branches with a bootstrap value higher than 60% for major clades (five or more accessions) are labeled. The monophyly of each rice variety group is well supported, with the exception of <i>tropical japonica</i>. The accession M202-new is an elite <i>temperate japonica</i> line that has been subjected to possible crosses with other groups, perhaps explaining its inclusion within the <i>tropical japonica</i>.</p> <p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[1]/caption[1]/p[2]">(409 KB EPS)</p> <a name="pgen-0030163-sg002" id="pgen-0030163-sg002"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0030163.sg002">Figure S2. </a>Frequency Distribution of Tajima's D Values for All STS Sampled in (A) <i>indica</i>, (B) <i>tropical japonica</i>, and (C) <i>O</i>. <i>rufipogon</i></strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[2]/caption[1]/p[1]">(373 KB EPS)</p> <a name="pgen-0030163-sg003" id="pgen-0030163-sg003"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0030163.sg003">Figure S3. </a>The Distribution of Allele Frequency in <i>O</i>. <i>rufipogon</i> for Derived Alleles That Are at High Frequency in <i>indica</i> or <i>tropical japonica</i></strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[3]/caption[1]/p[1]">Notably, most alleles are at low to intermediate frequency in <i>O</i>. <i>rufipogon</i>, consistent with multiple selective sweeps in <i>O</i>. <i>sativa</i>, and discounting the possibility of misidentification of ancestral alleles or interspecific introgression being responsible for the pattern observed in rice. HFD, high-frequency derived SNPs.</p> <p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[3]/caption[1]/p[2]">(390 KB EPS)</p> <a name="pgen-0030163-sg004" id="pgen-0030163-sg004"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0030163.sg004">Figure S4. </a>The Genomic Distribution of STS Fragments Contributing High-Frequency Derived SNPs in <i>indica</i> and <i>tropical japonica</i></strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[4]/caption[1]/p[1]">In each group, high-frequency derived SNPs occur in ten of 12 rice chromosomes. Fragments containing high-frequency derived SNPs comprise a large portion of fragments containing any variation at all in each <i>O</i>. <i>sativa</i> group. In both rice groups, the sample of STS fragments used to construct the site-frequency spectrum is slightly lower than 111 due to missing data in <i>O</i>. <i>meridionalis</i>.</p> <p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[4]/caption[1]/p[2]">(433 KB EPS)</p> <a name="pgen-0030163-st001" id="pgen-0030163-st001"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0030163.st001">Table S1. </a><i>Oryza</i> Accessions Used in the Study and Inferred Ancestry Coefficients</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[5]/caption[1]/p[1]">(45 KB XLS)</p> <a name="pgen-0030163-st002" id="pgen-0030163-st002"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0030163.st002">Table S2. </a>STS Fragment Information and Silent Sites Diversity Measures for the Various <i>Oryza</i> Groups</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[6]/caption[1]/p[1]">(133 KB XLS)</p> <h4 xpathLocation="/article[1]/body[1]/sec[4]/sec[1]/title[1]">Accession Numbers</h4> <p xpathLocation="/article[1]/body[1]/sec[4]/sec[1]/p[1]">The National Center for Biotechnology Information GenBank (<a href="http://www.ncbi.nlm.nih.gov/Genbank">http://www.ncbi.nlm.nih.gov/Genbank</a>) ID numbers for the sequences and alignments discussed in this article are EF000002–EF010509.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" xpathLocation="noSelect"><a id="ack" name="ack" toc="ack" title="Acknowledgments"></a><h3 xpathLocation="noSelect">Acknowledgments <a href="#top">Top</a></h3> <p xpathLocation="/article[1]/back[1]/ack[1]/p[1]">We are grateful to two anonymous reviewers for suggestions that much improved the manuscript.</p> </div><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="contributions"><a id="authcontrib" name="authcontrib" toc="authcontrib" title="Author Contributions"></a><h3 xpathLocation="noSelect">Author Contributions <a href="#top">Top</a></h3><p xpathLocation="noSelect"><span class="capture-id">RN, SRM, CDB, and MDP conceived the experiments. ALC, KMO, and MDP designed the experiments. ALC collected the data. ALC, SHW, RDH, and CDB analyzed the data. AB, AFA, NRP, TLY, and SRM and contributed materials/analysis tools. ALC, SHW, CDB, and MDP wrote the paper.</span></p></div><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" xpathLocation="noSelect"><a id="references" name="references" toc="references" title="References"></a><h3 xpathLocation="noSelect">References <a href="#top">Top</a></h3><ol class="references" xpathLocation="noSelect"><li xpathLocation="noSelect"><a name="pgen-0030163-b001" id="pgen-0030163-b001"></a><span class="authors">Hancock JF</span> (2004) Plant evolution and the origin of crop species. Cambridge (Massachusetts): CABI Publishing. 313 p. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b002" id="pgen-0030163-b002"></a><span class="authors">Mannion AM</span> (1999) Domestication and the origins of agriculture: an appraisal. Prog Phys Geogr 23: 37–56. <a class="find" href="/article/findArticle.action?author=Mannion&title=Domestication and the origins of agriculture: an appraisal."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b003" id="pgen-0030163-b003"></a><span class="authors">Darwin C</span> (1859) On the origin of species. London: John Murray. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b004" id="pgen-0030163-b004"></a><span class="authors">Darwin C</span> (1868) The variation of animals and plants under domestication. London: John Murray. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b005" id="pgen-0030163-b005"></a><span class="authors">Armelagos GJ, Harper KN</span> (2005) Genomics at the origins of agriculture, part one. Evol Anthropol 14: 68–77. <a class="find" href="/article/findArticle.action?author=Armelagos&title=Genomics at the origins of agriculture, part one."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b006" id="pgen-0030163-b006"></a><span class="authors">Diamond J</span> (2002) Evolution, consequences and future of plant and animal domestication. Nature 418: 700–707. <a class="find" href="/article/findArticle.action?author=Diamond&title=Evolution, consequences and future of plant and animal domestication."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b007" id="pgen-0030163-b007"></a><span class="authors">FAO</span> (2005) FAOSTAT data, in last update 2003. <a href="http://faostat.fao.org/site/346/DesktopDefault.aspx?PageID=346">http://faostat.fao.org/site/346/DesktopD​efault.aspx?PageID=346 </a>. Accessed 1 August 2007. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b008" id="pgen-0030163-b008"></a><span class="authors">Yu J, Hu SN, Wang J, Wong GKS, Li SG, et al. </span> (2002) A draft sequence of the rice genome (Oryza sativa L. ssp indica). Science 296: 79–92. <a class="find" href="/article/findArticle.action?author=Yu&title=A draft sequence of the rice genome (Oryza sativa L. ssp indica)."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b009" id="pgen-0030163-b009"></a><span class="authors">Goff SA, Ricke D, Lan TH, Presting G, Wang RL, et al. </span> (2002) A draft sequence of the rice genome (Oryza sativa L. ssp japonica). Science 296: 92–100. <a class="find" href="/article/findArticle.action?author=Goff&title=A draft sequence of the rice genome (Oryza sativa L. ssp japonica)."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b010" id="pgen-0030163-b010"></a><span class="authors">IRGSP</span> (2005) The map-based sequence of the rice genome. Nature 436: 793–800. <a class="find" href="/article/findArticle.action?author=&title=The map-based sequence of the rice genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b011" id="pgen-0030163-b011"></a><span class="authors">Takahashi N, Hamamura K, Tsunoda S, Sakamoto S, Sato Y</span> (1997) Differentiation of ecotypes in cultivated rice. In: Matsuo T, Futsuhara Y, Kikuchi F, Yamaguchi H, editors. Science of the rice plant. Tokyo: Food and agriculture policy research center. pp. 112–160. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b012" id="pgen-0030163-b012"></a><span class="authors">Jackson MT</span> (1997) Conservation of rice genetic resources: the role of the International Rice Genebank at IRRI. Plant Mol Biol 35: 61–67. <a class="find" href="/article/findArticle.action?author=Jackson&title=Conservation of rice genetic resources: the role of the International Rice Genebank at IRRI."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b013" id="pgen-0030163-b013"></a><span class="authors">Oka HI</span> (1988) Origin of cultivated rice. Tokyo: Japan Scientific Societies Press and Elsevier Science Publishers. 254 p. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b014" id="pgen-0030163-b014"></a><span class="authors">Garris AJ, Tai TH, Coburn J, Kresovich S, McCouch S</span> (2005) Genetic structure and diversity in Oryza sativa L. Genetics 169: 1631–1638. <a class="find" href="/article/findArticle.action?author=Garris&title=Genetic structure and diversity in Oryza sativa L."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b015" id="pgen-0030163-b015"></a><span class="authors">Glaszmann JC</span> (1987) Isozymes and classification of Asian rice varieties. Theor Appl Genet 74: 21–30. <a class="find" href="/article/findArticle.action?author=Glaszmann&title=Isozymes and classification of Asian rice varieties."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b016" id="pgen-0030163-b016"></a><span class="authors">Tenaillon MI, Sawkins MC, Anderson LK, Stack SM, Doebley J, et al. </span> (2002) Patterns of diversity and recombination along chromosome 1 of maize (Zea mays ssp mays L.). Genetics 162: 1401–1413. <a class="find" href="/article/findArticle.action?author=Tenaillon&title=Patterns of diversity and recombination along chromosome 1 of maize (Zea mays ssp mays L.)."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b017" id="pgen-0030163-b017"></a><span class="authors">Schmid KJ, Ramos-Onsins S, Ringys-Beckstein H, Weisshaar B, Mitchell-Olds T</span> (2005) A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism. Genetics 169: 1601–1615. <a class="find" href="/article/findArticle.action?author=Schmid&title=A multilocus sequence survey in Arabidopsis thaliana reveals a genome-wide departure from a neutral model of DNA sequence polymorphism."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b018" id="pgen-0030163-b018"></a><span class="authors">Nordborg M, Hu TT, Ishino Y, Jhaveri J, Toomajian C, et al. </span> (2005) The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol 3: 1289–1299. <a class="find" href="/article/findArticle.action?author=Nordborg&title=The pattern of polymorphism in Arabidopsis thaliana."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b019" id="pgen-0030163-b019"></a><span class="authors">Pritchard JK, Stephens M, Donnelly P</span> (2000) Inference of population structure using multilocus genotype data. Genetics 155: 945–959. <a class="find" href="/article/findArticle.action?author=Pritchard&title=Inference of population structure using multilocus genotype data."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b020" id="pgen-0030163-b020"></a><span class="authors">Cheng CY, Motohashi R, Tsuchimoto S, Fukuta Y, Ohtsubo H, et al. </span> (2003) Polyphyletic origin of cultivated rice: based on the interspersion pattern of SINEs. Mol Biol Evol 20: 67–75. <a class="find" href="/article/findArticle.action?author=Cheng&title=Polyphyletic origin of cultivated rice: based on the interspersion pattern of SINEs."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b021" id="pgen-0030163-b021"></a><span class="authors">Zhu QH, Ge S</span> (2005) Phylogenetic relationships among A-genome species of the genus <i>Oryza</i> revealed by intron sequences of four nuclear genes. New Phytol 167: 249–265. <a class="find" href="/article/findArticle.action?author=Zhu&title=Phylogenetic relationships among A-genome species of the genus Oryza revealed by intron sequences of four nuclear genes."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b022" id="pgen-0030163-b022"></a><span class="authors">Vitte C, Ishii T, Lamy F, Brar D, Panaud O</span> (2004) Genomic paleontology provides evidence for two distinct origins of Asian rice (Oryza sativa L.). Mol Genet Genomics 272: 504–511. <a class="find" href="/article/findArticle.action?author=Vitte&title=Genomic paleontology provides evidence for two distinct origins of Asian rice (Oryza sativa L.)."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b023" id="pgen-0030163-b023"></a><span class="authors">Londo JP, Chiang YC, Hung KH, Chiang TY, Schaal BA</span> (2006) Phylogeography of Asian wild rice, Oryza rufipogon, reveals multiple independent domestications of cultivated rice, Oryza sativa. Proc Natl Acad Sci U S A 103: 9578–9583. <a class="find" href="/article/findArticle.action?author=Londo&title=Phylogeography of Asian wild rice, Oryza rufipogon, reveals multiple independent domestications of cultivated rice, Oryza sativa."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b024" id="pgen-0030163-b024"></a><span class="authors">Second G</span> (1982) Origin of the genic diversity of cultivated rice (Oryza-Spp)—study of the polymorphism scored at 40 isoenzyme loci. Jpn J Genet 57: 25–57. <a class="find" href="/article/findArticle.action?author=Second&title=Origin of the genic diversity of cultivated rice (Oryza-Spp)%E2%80%94study of the polymorphism scored at 40 isoenzyme loci."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b025" id="pgen-0030163-b025"></a><span class="authors">Tajima F</span> (1989) Statistical-method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123: 585–595. <a class="find" href="/article/findArticle.action?author=Tajima&title=Statistical-method for testing the neutral mutation hypothesis by DNA polymorphism."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b026" id="pgen-0030163-b026"></a><span class="authors">Fay JC, Wu CI</span> (2000) Hitchhiking under positive Darwinian selection. Genetics 155: 1405–1413. <a class="find" href="/article/findArticle.action?author=Fay&title=Hitchhiking under positive Darwinian selection."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b027" id="pgen-0030163-b027"></a><span class="authors">Wright SI</span> (2005) The effects of artificial selection on the maize genome. Science 310: 54–54. <a class="find" href="/article/findArticle.action?author=Wright&title=The effects of artificial selection on the maize genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b028" id="pgen-0030163-b028"></a><span class="authors">Tenaillon MI, U'Ren J, Tenaillon O, Gaut BS</span> (2004) Selection versus demography: a multilocus investigation of the domestication process in maize. Mol Biol Evol 21: 1214–1225. <a class="find" href="/article/findArticle.action?author=Tenaillon&title=Selection versus demography: a multilocus investigation of the domestication process in maize."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b029" id="pgen-0030163-b029"></a><span class="authors">Eyre-Walker A, Gaut RL, Hilton H, Feldman DL, Gaut BS</span> (1998) Investigation of the bottleneck leading to the domestication of maize. Proc Natl Acad Sci U S A 95: 4441–4446. <a class="find" href="/article/findArticle.action?author=Eyre-Walker&title=Investigation of the bottleneck leading to the domestication of maize."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b030" id="pgen-0030163-b030"></a><span class="authors">Takahata N</span> (1991) Genealogy of neutral genes and spreading of selected mutations in a geographically structured population. Genetics 129: 585–595. <a class="find" href="/article/findArticle.action?author=Takahata&title=Genealogy of neutral genes and spreading of selected mutations in a geographically structured population."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b031" id="pgen-0030163-b031"></a><span class="authors">Tachida H, Iizuka M</span> (1991) Fixation probability in spatially changing environments. Genet Res 58: 243–251. <a class="find" href="/article/findArticle.action?author=Tachida&title=Fixation probability in spatially changing environments."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b032" id="pgen-0030163-b032"></a><span class="authors">Maruyama T</span> (1970) On fixation probability of mutant genes in a subdivided population. Genet Res 15: 221. &. <a class="find" href="/article/findArticle.action?author=Maruyama&title=On fixation probability of mutant genes in a subdivided population."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b033" id="pgen-0030163-b033"></a><span class="authors">Lu HY, Liu ZX, Wu NQ, Berne S, Saito Y, et al. </span> (2002) Rice domestication and climatic change: phytolith evidence from East China. Boreas 31: 378–385. <a class="find" href="/article/findArticle.action?author=Lu&title=Rice domestication and climatic change: phytolith evidence from East China."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b034" id="pgen-0030163-b034"></a><span class="authors">Zhao ZJ</span> (1998) The middle Yangtze region in China is one place where rice was domesticated: phytolith evidence from the Diaotonghuan cave, northern Jiangxi. Antiquity 72: 885–897. <a class="find" href="/article/findArticle.action?author=Zhao&title=The middle Yangtze region in China is one place where rice was domesticated: phytolith evidence from the Diaotonghuan cave, northern Jiangxi."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b035" id="pgen-0030163-b035"></a><span class="authors">Normile D</span> (1997) Archaeology—Yangtze seen as earliest rice site. Science 275: 309–309. <a class="find" href="/article/findArticle.action?author=Normile&title=Archaeology%E2%80%94Yangtze seen as earliest rice site."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b036" id="pgen-0030163-b036"></a><span class="authors">Wakeley J, Hey J</span> (1997) Estimating ancestral population parameters. Genetics 145: 847–855. <a class="find" href="/article/findArticle.action?author=Wakeley&title=Estimating ancestral population parameters."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b037" id="pgen-0030163-b037"></a><span class="authors">Kim Y, Stephan W</span> (2003) Selective sweeps in the presence of interference among partially linked loci. Genetics 164: 389–398. <a class="find" href="/article/findArticle.action?author=Kim&title=Selective sweeps in the presence of interference among partially linked loci."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b038" id="pgen-0030163-b038"></a><span class="authors">Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, et al. </span> (2005) Genomic scans for selective sweeps using SNP data. Genome Res 15: 1566–1575. <a class="find" href="/article/findArticle.action?author=Nielsen&title=Genomic scans for selective sweeps using SNP data."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b039" id="pgen-0030163-b039"></a><span class="authors">Durrett R, Schweinsberg J</span> (2004) Approximating selective sweeps. Theor Popul Biol 66: 129–138. <a class="find" href="/article/findArticle.action?author=Durrett&title=Approximating selective sweeps."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b040" id="pgen-0030163-b040"></a><span class="authors">Nordborg M</span> (2000) Linkage disequilibrium, gene trees and selfing: an ancestral recombination graph with partial self-fertilization. Genetics 154: 923–929. <a class="find" href="/article/findArticle.action?author=Nordborg&title=Linkage disequilibrium, gene trees and selfing: an ancestral recombination graph with partial self-fertilization."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b041" id="pgen-0030163-b041"></a><span class="authors">Burnham KP, Anderson DR</span> (1998) Model selection and inference: a practical information-theoretic approach. New York: Springer-Verlag. 353 p. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b042" id="pgen-0030163-b042"></a><span class="authors">Vigouroux Y, Mitchell S, Matsuoka Y, Hamblin M, Kresovich S, et al. </span> (2005) An analysis of genetic diversity across the maize genome using microsatellites. Genetics 169: 1617–1630. <a class="find" href="/article/findArticle.action?author=Vigouroux&title=An analysis of genetic diversity across the maize genome using microsatellites."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b043" id="pgen-0030163-b043"></a><span class="authors">Thuillet AC, Bataillon T, Poirier S, Santoni S, David JL</span> (2005) Estimation of long-term effective population sizes through the history of durum wheat using microsatellite data. Genetics 169: 1589–1599. <a class="find" href="/article/findArticle.action?author=Thuillet&title=Estimation of long-term effective population sizes through the history of durum wheat using microsatellite data."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b044" id="pgen-0030163-b044"></a><span class="authors">Muller MH, Poncet C, Prosperi JM, Santoni S, Ronfort J</span> (2006) Domestication history in the Medicago sativa species complex: inferences from nuclear sequence polymorphism. Mol Ecol 15: 1589–1602. <a class="find" href="/article/findArticle.action?author=Muller&title=Domestication history in the Medicago sativa species complex: inferences from nuclear sequence polymorphism."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b045" id="pgen-0030163-b045"></a><span class="authors">Li C, Zhou A, Sang T</span> (2006) Rice domestication by reducing shattering. Science 311: 1936–1939. <a class="find" href="/article/findArticle.action?author=Li&title=Rice domestication by reducing shattering."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b046" id="pgen-0030163-b046"></a><span class="authors">Sweeney MT, Thomson MJ, Cho Y, Park YJ, Williamson SH</span> (2007) Global dissemination of a single mutation conferring white pericarp in rice. PLoS Genet 3: e133 doi:<a href="http://dx.doi.org/10.1371/journal.pgen.0030133">10.1371/journal.pgen.0030133</a>. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b047" id="pgen-0030163-b047"></a><span class="authors">Maynard Smith J, Haigh J</span> (2004) The hitch-hiking effect of a favourable gene. Genet Res 23: 23–35. <a class="find" href="/article/findArticle.action?author=Maynard Smith&title=The hitch-hiking effect of a favourable gene."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b048" id="pgen-0030163-b048"></a><span class="authors">Kaplan NL, Hudson RR, Langley CH</span> (1989) The hitchhiking effect revisited. Genetics 123: 887–899. <a class="find" href="/article/findArticle.action?author=Kaplan&title=The hitchhiking effect revisited."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b049" id="pgen-0030163-b049"></a><span class="authors">McCouch SR, Kochert G, Yu ZH, Wang ZY, Khush GS, et al. </span> (1988) Molecular mapping of rice chromosomes. Theor Appl Genet 76: 815–829. <a class="find" href="/article/findArticle.action?author=McCouch&title=Molecular mapping of rice chromosomes."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b050" id="pgen-0030163-b050"></a><span class="authors">Wu JZ, Maehara T, Shimokawa T, Yamamoto S, Harada C, et al. </span> (2002) A comprehensive rice transcript map containing 6591 expressed sequence tag sites. Plant Cell 14: 525–535. <a class="find" href="/article/findArticle.action?author=Wu&title=A comprehensive rice transcript map containing 6591 expressed sequence tag sites."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b051" id="pgen-0030163-b051"></a><span class="authors">Rozen S, Skaletsky HJ</span> (2000) Bioinformatics methods and protocols: methods in molecular biology. Krawetz S, Misener S, editors. Totowa (New Jersey): Humana Press. pp. 365–386. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b052" id="pgen-0030163-b052"></a><span class="authors">Olsen KM, Caicedo AL, Polato N, McClung A, McCouch S, et al. </span> (2006) Selection under domestication: evidence for a sweep in the rice Waxy genomic region. Genetics 173: 975–983. <a class="find" href="/article/findArticle.action?author=Olsen&title=Selection under domestication: evidence for a sweep in the rice Waxy genomic region."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b053" id="pgen-0030163-b053"></a><span class="authors">Swofford DL</span> (2000) PAUP* Phylogenetic analysis using parsimony (* and other methods). Sunderland, Massachusetts: Sinauer Associates. </li><li xpathLocation="noSelect"><a name="pgen-0030163-b054" id="pgen-0030163-b054"></a><span class="authors">Sawyer SA, Hartl DL</span> (1992) Population-genetics of polymorphism and divergence. Genetics 132: 1161–1176. <a class="find" href="/article/findArticle.action?author=Sawyer&title=Population-genetics of polymorphism and divergence."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b055" id="pgen-0030163-b055"></a><span class="authors">Bustamante CD, Wakeley J, Sawyer S, Hartl DL</span> (2001) Directional selection and the site-frequency spectrum. Genetics 159: 1779–1788. <a class="find" href="/article/findArticle.action?author=Bustamante&title=Directional selection and the site-frequency spectrum."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b056" id="pgen-0030163-b056"></a><span class="authors">Bustamante CD, Nielsen R, Hartl DL</span> (2003) Maximum likelihood and Bayesian methods for estimating the distribution of selective effects among classes of mutations using DNA polymorphism data. Theor Popul Biol 63: 91–103. <a class="find" href="/article/findArticle.action?author=Bustamante&title=Maximum likelihood and Bayesian methods for estimating the distribution of selective effects among classes of mutations using DNA polymorphism data."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b057" id="pgen-0030163-b057"></a><span class="authors">Williamson S, Fledel-Alon A, Bustamante CD</span> (2004) Population genetics of polymorphism and divergence for diploid selection models with arbitrary dominance. Genetics 168: 463–475. <a class="find" href="/article/findArticle.action?author=Williamson&title=Population genetics of polymorphism and divergence for diploid selection models with arbitrary dominance."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b058" id="pgen-0030163-b058"></a><span class="authors">Williamson S, Perry SM, Bustamante CD, Orive ME, Stearns MN, et al. </span> (2005) A statistical characterization of consistent patterns of human immunodeficiency virus evolution within infected patients. Mol Biol Evol 22: 456–468. <a class="find" href="/article/findArticle.action?author=Williamson&title=A statistical characterization of consistent patterns of human immunodeficiency virus evolution within infected patients."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0030163-b059" id="pgen-0030163-b059"></a><span class="authors">Hudson RR</span> (2002) Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics 18: 337–338. <a class="find" href="/article/findArticle.action?author=Hudson&title=Generating samples under a Wright-Fisher neutral model of genetic variation."> Find this article online </a></li></ol></div> </div> </div> <div style="display:none"> <div dojoType="ambra.widget.RegionalDialog" id="AnnotationDialog" style="padding:0;margin:0;"> <div class="dialog annotate"> <div class="tipu" id="dTipu"></div> <div class="comment"> <h5><span class="commentPrivate">Add Your Note (For Private Viewing)</span><span class="commentPublic">Post Your Note (For Public Viewing)</span></h5> <div class="posting pane"> <form name="createAnnotation" id="createAnnotation" method="post" action=""> <input type="hidden" name="target" value="info:doi/10.1371/journal.pgen.0030163" /> <input type="hidden" name="startPath" value="" /> <input type="hidden" name="startOffset" value="" /> <input type="hidden" name="endPath" value="" /> <input type="hidden" name="endOffset" value="" /> <input type="hidden" name="commentTitle" id="commentTitle" value="" /> <input type="hidden" name="comment" id="commentArea" value="" /> <input type="hidden" name="ciStatement" id="statementArea" value="" /> <input type="hidden" name="isCompetingInterest" id="isCompetingInterest" value="false" /> <input type="hidden" name="noteType" id="noteType" value="" /> <fieldset> <legend>Compose Your Note</legend> <span id="submitMsg" class="error" style="display:none;"></span> <table class="layout"> <tr> <td> <label for="cNoteType">This is a </label><select name="cNoteType" id="cNoteType"><option value="note">note</option><option value="correction">correction</option></select> <span id="cdls" style="visibility:hidden;margin-left:0.3em; white-space:nowrap;"><a href="/static/commentGuidelines.action?target=info%3Adoi%2F10.1371%2Fjournal.pgen.0030163#corrections">What are corrections?</a></span> <label for="cTitle" class="commentPublic"><span class="none">Enter your note title</span><!-- error message text <em>A title is required for all public notes</em>--></label> <input type="text" name="cTitle" id="cTitle" value="Enter your note title..." class="title commentPublic" alt="Enter your note title..." /> <label for="cArea"><span class="none">Enter your note</span><!-- error message text <em>Please enter your note</em>--></label> <textarea name="cArea" id="cArea" value="Enter your note..." alt="Enter your note...">Enter your note...</textarea> <input type="hidden" name="isPublic" value="true" /> </td> <td> </td> <td class="coi"> <fieldset> <legend>Declare any competing interests.</legend> <ul> <li><label><input id="isCompetingInterestNo" type="radio" checked="checked" name="competingInterest" value="false" /> No, I don't have any competing interests to declare.</label></li> <li><label><input id="isCompetingInterestYes" type="radio" name="competingInterest" value="true" /> Yes, I have competing interests to declare (enter below):</label></li> </ul> <textarea name="ciStatementArea" id="ciStatementArea" disabled value="Enter your competing interests..." alt="Enter your competing interests...">Enter your competing interests...</textarea> </fieldset> </td> </tr> <tr> <td colspan="3" class="buttons"> <input type="button" value="Cancel" title="Click to close and cancel" id="btn_cancel"/> <input type="button" value="Submit" title="Click to post your note publicly" id="btn_post" class="primary"/> </td> </tr> </table> </fieldset> </form> </div> </div> <div class="tip" id="dTip"></div> </div> </div><div dojoType="ambra.widget.ContextAction" id="ContextActionDialog" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="caTipu"></div> <div class="contextActionContent"> <h5><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> Please follow our <a href="/static/commentGuidelines.action">guidelines for notes and comments</a> and review our <a href="/static/competing.action">competing interests policy</a>. Comments that do not conform to our guidelines will be promptly removed and the user account disabled. The following must be avoided: <ul> <li>Remarks that could be interpreted as allegations of misconduct</li> <li>Unsupported assertions or statements</li> <li>Inflammatory or insulting language</li> </ul> <form name="contextActionForm" id="contextActionForm" class="clearfix buttons" method="post" action=""> <input type="button" name="Continue" value="Continue" id="ContextActionDialogContinueButton" onmouseup="ambra.displayAnnotationContext.startComment(event);" title="Add a note to this text" class="primary"/> <input type="button" name="Cancel" value="Cancel" id="ContextActionDialogCancelButton" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);" title="Close this Window"/> </form> </div> <div class="tip" id="caTip"></div> </div> </div> <div dojoType="ambra.widget.ContextAction" id="ContextActionDialogNotLogged" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="canlTipu"></div> <div class="contextActionContent"> <h5><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> You must be logged in to add a note to an article. You may log in by <a onmousedown="ambra.displayAnnotationContext.disconnect(event);" href="/user/secure/secureRedirect.action?goTo=%2Farticle%2Finfo%3Adoi%2F10.1371%2Fjournal.pgen.0030163">clicking here</a> or <a href="#" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);">cancel this note</a>. </div> <div class="tip" id="canlTip"></div> </div> </div> <div dojoType="ambra.widget.ContextAction" id="ContextActionDialogBadSelection" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="canBDTipu"></div> <div class="contextActionContent"> <h5 class="annotation icon"><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> You cannot annotate this area of the document. <a href="#" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);">Close</a> </div> <div class="tip" id="canBDTip"></div> </div> </div> <div dojoType="ambra.widget.ContextAction" id="ContextActionDialogBadRangeSelection" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="canbrTipu"></div> <div class="contextActionContent"> <h5><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> You cannot create an annotation that spans different sections of the document; please adjust your selection.<br/> <a href="#" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);">Close</a> </div> <div class="tip" id="canbrTip"></div> </div> </div> <div dojoType="ambra.widget.RegionalDialog" id="CommentDialog" style="padding:0;margin:0;"> <div class="dialog preview"> <div class="tipu" id="cTipu"></div> <div class="btn close" id="btn_close" title="Click to close"><a title="Click to close">Close</a></div> <div id="cmtContainer" class="comment"> <h6 id="viewCmtTitle"></h6> <div class="detail" id="viewCmtDetail"></div> <div class="contentwrap" id="viewComment"></div> <div class="contentwrap" id="viewCIStatement"></div> <div class="detail" id="viewLink"> <!--<a href="#" class="commentary icon" title="Click to view full thread and respond">View all responses</a> <a href="#" class="respond tooltip" title="Click to respond to this posting">Respond to this</a>--> </div> </div> <div class="tip" id="cTip"></div> </div> </div> <div dojoType="ambra.widget.RegionalDialog" id="CommentDialogMultiple" style="padding:0;margin:0;"> <div class="dialog multiple preview"> <div class="tipu" id="mTipu"></div> <div class="btn close" id="btn_close_multi" title="Click to close"><a title="Click to close">Close</a></div> <ol id="multilist"></ol> <br/> <div id="multidetail"></div> <div class="tip" id="mTip"></div> </div> </div> <div dojoType="dijit.Dialog" id="Rating"> <div class="dialog annotate"> <div class="tipu" id="dTipu"></div> <div class="comment"> <h5><span class="commentPublic">Rate This Article</span></h5> <div class="instructions">Please follow our <a href="/static/ratingGuidelines.action">guidelines for rating</a> and review our <a href="/static/competing.action">competing interests policy</a>. Comments that do not conform to our guidelines will be promptly removed and the user account disabled. The following must be avoided: <ol> <li>Remarks that could be interpreted as allegations of misconduct</li> <li>Unsupported assertions or statements</li> <li>Inflammatory or insulting language</li> </ol> </div> <div class="posting pane"> <form name="ratingForm" id="ratingForm" method="post" action=""> <input type="hidden" name="articleURI" value="info:doi/10.1371/journal.pgen.0030163" /> <input type="hidden" name="commentTitle" id="commentTitle" value="" /> <input type="hidden" name="comment" id="commentArea" value="" /> <input type="hidden" name="ciStatement" id="statementArea" value="" /> <input type="hidden" name="isCompetingInterest" id="isCompetingInterest" value="" /> <fieldset> <legend>Compose Your Annotation</legend> <span id="submitRatingMsg" class="error" style="display:none;"></span> <table class="layout"> <tr> <td rowspan="2"> <label for="insight">Insight</label> <ul class="star-rating rating edit" title="Rate insight" id="rateInsight"> <li class="current-rating pct0"></li> <li><a href="javascript:void(0);" title="Bland" class="one-star" onclick="ambra.rating.setRatingCategory(this, 'insight', 1);">1</a></li> <li><a href="javascript:void(0);" title="" class="two-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 2);">2</a></li> <li><a href="javascript:void(0);" title="" class="three-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 3);">3</a></li> <li><a href="javascript:void(0);" title="" class="four-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 4);">4</a></li> <li><a href="javascript:void(0);" title="Profound" class="five-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 5);">5</a></li> </ul> <input type="hidden" name="insight" title="insight" value="" /> <label for="reliability">Reliability</label> <ul class="star-rating rating edit" title="Rate reliability" id="rateReliability"> <li class="current-rating pct0"></li> <li><a href="javascript:void(0);" title="Tenuous" class="one-star" onclick="ambra.rating.setRatingCategory(this, 'reliability', 1);">1</a></li> <li><a href="javascript:void(0);" title="" class="two-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 2);">2</a></li> <li><a href="javascript:void(0);" title="" class="three-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 3);">3</a></li> <li><a href="javascript:void(0);" title="" class="four-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 4);">4</a></li> <li><a href="javascript:void(0);" title="Unassailable" class="five-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 5);">5</a></li> </ul> <input type="hidden" name="reliability" title="reliability" value="" /> <label for="style">Style</label> <ul class="star-rating rating edit" title="Rate style" id="rateStyle"> <li class="current-rating pct0"></li> <li><a href="javascript:void(0);" title="Crude" class="one-star" onclick="ambra.rating.setRatingCategory(this, 'style', 1);">1</a></li> <li><a href="javascript:void(0);" title="" class="two-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 2);">2</a></li> <li><a href="javascript:void(0);" title="" class="three-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 3);">3</a></li> <li><a href="javascript:void(0);" title="" class="four-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 4);">4</a></li> <li><a href="javascript:void(0);" title="Elegant" class="five-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 5);">5</a></li> </ul> <input type="hidden" name="style" title="style" value="" /> <label for="cTitle" class="commentPublic"><span class="none">Enter your comment title</span><!-- error message text <em>A title is required for all public annotations</em>--></label> <input type="text" name="cTitle" id="cTitle" value="Enter your comment title..." class="title commentPublic" alt="Enter your comment title..." /> <label for="cArea"><span class="none">Enter your comment</span><!-- error message text <em>Please enter your annotation</em>--></label> <textarea name="cArea" id="cArea" value="Enter your comment..." alt="Enter your comment...">Enter your comment...</textarea> </td> <td rowspan="2"> </td> <td class="coi"> <fieldset> <legend>Declare any competing interests.</legend> <ul> <li><label><input id="isCompetingInterestNo" type="radio" name="competingInterest" value="false" /> No, I don't have any competing interests to declare.</label></li> <li><label><input id="isCompetingInterestYes" type="radio" name="competingInterest" value="true" /> Yes, I have competing interests to declare (enter below):</label></li> </ul> <textarea name="ciStatementArea" id="ciStatementArea" disabled value="Enter your competing interests..." title="Enter your competing interests...">Enter your competing interests...</textarea> </fieldset> </td> </tr> <tr> <td class="buttons"> <input type="button" value="Cancel" title="Click to close and cancel" id="btn_cancel_rating"/> <input type="button" value="Submit" title="Click to post your annotation publicly" id="btn_post_rating" class="primary"/> </td> </tr> </table> </fieldset> </form> </div> </div> </div> </div> <div dojoType="ambra.widget.LoadingCycle" id="LoadingCycle" class="loadingCycler"> <img src="/images/loading.gif" width="58" height="58" title="Loading..." /> </div> </div> </div> <!-- end : main contents --> </div> <!-- end : container --> <!-- begin : footer --> <div id="ftr"> <p><span>All site content, except where otherwise noted, is licensed under a <a href="http://creativecommons.org/licenses/by/2.5/" title="Creative Commons Attribution License 2.5" tabindex="200">Creative Commons Attribution License</a>.</span></p> <ul> <li><a href="/static/privacy.action" title="PLoS Privacy Statement" tabindex="501">Privacy Statement</a></li> <li><a href="/static/terms.action" title="PLoS Terms of Use" tabindex="502">Terms of Use</a></li> <li><a href="http://www.plos.org/advertise/" title="Advertise With PLoS" tabindex="503">Advertise</a></li> <li><a href="http://www.plos.org/journals/embargopolicy.html" title="PLoS Embargo Policy" tabindex="504">Media Inquiries</a></li> <li><a href="http://www.plos.org/journals/print.html" title="PLoS in Print" tabindex="505">PLoS in Print</a></li> <li><a href="/static/sitemap.action" title="Site Map" tabindex="506">Site Map</a></li> <li><a href="http://www.plos.org" title="PLoS.org" tabindex="507">PLoS.org</a></li> </ul> <div class="powered"> <ul> <li><a href="/static/releaseNotes.action" title="Ambra | Release Notes">Ambra 0.9.4 beta</a></li> <li>Managed Colocation provided by <a href="http://www.unitedlayer.com/" title="UnitedLayer: Built on IP Services">UnitedLayer</a>.</li> </ul> </div> </div> <!-- end : footer --> <script type="text/javascript"> var _namespace=""; var loggedIn = false; var almHost = "http://alm.plos.org"; // Safari v3.1.1 "console.debug" issue (http://trac.dojotoolkit.org/ticket/6849) workaround if (/3[\.0-9]+ Safari/.test(navigator.appVersion)) { window.console = { origConsole: window.console, log: function(s){ this.origConsole.log(s); }, info: function(s){ this.origConsole.info(s); }, error: function(s){ this.origConsole.error(s); }, warn: function(s){ this.origConsole.warn(s); } }; } var djConfig = { // don't debug for IE - as dojo's firebug lite module is error prone in IE isDebug: false, parseOnLoad: true }; </script> <script type="text/javascript" src="/javascript/dojo/dojo/dojo.js"></script> <script type="text/javascript" src="/javascript/dojo/dojo/ambra.js"></script> <script type="text/javascript" src="/javascript/init_global.js"></script> <script type="text/javascript" src="/javascript/init_article.js"></script> <script type="text/javascript" src="/javascript/init_ratings.js"></script> <script type="text/javascript" src="/javascript/init_article_body.js"></script> <script type="text/javascript" src="/javascript/init_article_rhc.js"></script> <script type="text/javascript" src="/javascript/alm.js"></script> <script type="text/javascript" src="/javascript/reporting/articleViewsCumulative.js"></script> <script type="text/javascript"> var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www."); document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E")); </script> <script type="text/javascript"> var pageTracker = _gat._getTracker("UA-338393-1"); pageTracker._trackPageview(); pageTracker._setDomainName("www.plosgenetics.org"); </script> </body> </html>