Search
Advanced Search
Metrics info
Average Rating (0 User Ratings)
    • Currently 0/5 Stars.
    See all categories
      • Currently 0/5 Stars.
      • Currently 0/5 Stars.
      • Currently 0/5 Stars.
    Rate This Article
Share this Article info
  • Bookmark: StumbleUpon Facebook Connotea CiteULike Bibliography

Open Access

Research Article

Repetitive Element-Mediated Recombination as a Mechanism for New Gene Origination in Drosophila

Author Summary<p>In numerous organisms, many new genes have been found to originate through dispersed gene duplication and exon/domain shuffling. What recombination mechanisms were involved in the duplication and the shuffling processes? Lack of the intermediate products of recombination that share adequate sequence identity between homologous sequences, or the parental sequences from which the new genes were derived, often makes answering these questions difficult. We identified a number of young genes that originated in recently diverged branches in the evolutionary tree of the eight <i>Drosophila melanogaster</i> subgroup species, by using fluorescence in situ hybridization with polytene chromosomes. We analyzed the genomic regions surrounding 17 new dispersed duplicate genes and observed that most of these genes are flanked by repetitive elements (REs), including a large and diverged transposable element family, DNAREP1. Several copies of these REs are kept in both new and parental gene regions, and their degeneration is correlated with the increasing ages of the identified new genes. These data suggest that REs mediate the recombination responsible for the new gene origination.</p></sec></div> <span property="dc:date" content="2008-01-18" datatype="xsd:date" rel="dc:identifier" href="http://dx.doi.org/10.1371/journal.pgen.0040003"></span> <span property="dc:subject" content="Evolutionary Biology"></span> <form action=""> <input type="hidden" name="journalDisplayName" id="journalDisplayName" value="PLoS Genetics" /> <input type="hidden" name="crossRefPageURL" id="crossRefPageURL" value="/article/crossref/info%3Adoi%2F10.1371%2Fjournal.pgen.0040003" /> <input type="hidden" name="metricsTabURL" id="metricsTabURL" value="/article/metrics/info%3Adoi%2F10.1371%2Fjournal.pgen.0040003" /> <input type="hidden" name="doi" id="doi" value="info:doi/10.1371/journal.pgen.0040003" /> <input type="hidden" name="articleTitleUnformatted" id="articleTitleUnformatted" value="Repetitive%20Element-Mediated%20Recombination%20as%20a%20Mechanism%20for%20New%20Gene%20Origination%20in%20Drosophila" /> <input type="hidden" name="articlePubDate" id="articlePubDate" value="1200643200000" /> </form> <div class="horizontalTabs" xpathLocation="noSelect"> <ul id="tabsContainer"> <li id="article" class="active"><a href="/article/info%3Adoi%2F10.1371%2Fjournal.pgen.0040003" class="tab" title="Article">Article</a></li> <li id="metrics"><a href="/article/metrics/info%3Adoi%2F10.1371%2Fjournal.pgen.0040003" class="tab" title="Metrics">Metrics</a></li> <li id="related"><a href="/article/related/info%3Adoi%2F10.1371%2Fjournal.pgen.0040003" class="tab" title="Related Content">Related Content</a></li> <li id="comments"><a href="/article/comments/info%3Adoi%2F10.1371%2Fjournal.pgen.0040003" class="tab" title="Comments">Comments: 0</a></li> </ul> </div> <div id="retractionHtmlId" class="retractionHtmlId" style="display:none;" xpathLocation="noSelect"> <div id="retractionlist"></div> </div> <div id="fch" class="fch" style="display:none;" xpathLocation="noSelect"> <p class="fch"><strong> Formal Correction:</strong> This article has been <em>formally corrected</em> to address the following errors.</p> <ol id="fclist" class="fclist"></ol> </div> <div id="articleMenu" xpathLocation="noSelect"> <div class="wrap"> <ul> <li class="annotation icon">To <strong>add a note</strong>, highlight some text. <a href="#" onclick="toggleAnnotation(this, 'public'); return false;" title="Click to turn notes on/off">Hide notes</a></li> <li class="discuss icon"> <a href="/user/secure/secureRedirect.action?goTo=%2Farticle%2Finfo%3Adoi%2F10.1371%2Fjournal.pgen.0040003">Make a general comment</a> </li> </ul> <div id="sectionNavTopBox" style="display:none;"> <p><strong>Jump to</strong></p> <div id="sectionNavTop" class="tools"></div> </div> </div> </div> <p xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="authors" xpathLocation="noSelect"><span property="dc:creator">Shuang Yang</span><sup><a href="#aff1"> 1 </a></sup><sup>,</sup><sup><a href="#aff2">2</a></sup><sup><a href="#equal-contrib">#</a></sup>, <span property="dc:creator">J. Roman Arguello</span><sup><a href="#aff3"> 3 </a></sup><sup><a href="#equal-contrib">#</a></sup>, <span property="dc:creator">Xin Li</span><sup><a href="#aff1"> 1 </a></sup><sup>,</sup><sup><a href="#aff2">2</a></sup>, <span property="dc:creator">Yun Ding</span><sup><a href="#aff1"> 1 </a></sup><sup>,</sup><sup><a href="#aff2">2</a></sup>, <span property="dc:creator">Qi Zhou</span><sup><a href="#aff1"> 1 </a></sup><sup>,</sup><sup><a href="#aff2">2</a></sup>, <span property="dc:creator">Ying Chen</span><sup><a href="#aff4"> 4 </a></sup>, <span property="dc:creator">Yue Zhang</span><sup><a href="#aff1"> 1 </a></sup>, <span property="dc:creator">Ruoping Zhao</span><sup><a href="#aff1"> 1 </a></sup>, <span property="dc:creator">Frédéric Brunet</span><sup><a href="#aff3"> 3 </a></sup><sup><a href="#n105">¤</a></sup>, <span property="dc:creator">Lixin Peng</span><sup><a href="#aff1"> 1 </a></sup>, <span property="dc:creator">Manyuan Long</span><sup><a href="#aff3"> 3 </a></sup><sup>,</sup><sup><a href="#aff4">4</a></sup><sup><a href="#cor1" class="fnoteref">*</a></sup>, <span property="dc:creator">Wen Wang</span><sup><a href="#aff1"> 1 </a></sup><sup><a href="#cor1" class="fnoteref">*</a></sup></p><p xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="affiliations" xpathLocation="noSelect"><a name="aff1" id="aff1"></a><strong>1</strong> Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China, <a name="aff2" id="aff2"></a><strong>2</strong> Graduate School of Chinese Academy Sciences, Beijing, China, <a name="aff3" id="aff3"></a><strong>3</strong> Committee on Evolutionary Biology, The University of Chicago, Chicago, Illinois, United States of America, <a name="aff4" id="aff4"></a><strong>4</strong> Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, United States of America</p><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="abstract" xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[1]"><a id="abstract0" name="abstract0" toc="abstract0" title="Abstract"></a><h2 xpathLocation="noSelect">Abstract <a href="#top">Top</a></h2><p xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[1]/p[1]">Previous studies of repetitive elements (REs) have implicated a mechanistic role in generating new chimerical genes. Such examples are consistent with the classic model for exon shuffling, which relies on non-homologous recombination. However, recent data for chromosomal aberrations in model organisms suggest that ectopic homology-dependent recombination may also be important. Lack of a dataset comprising experimentally verified young duplicates has hampered an effective examination of these models as well as an investigation of sequence features that mediate the rearrangements. Here we use ~7,000 cDNA probes (~112,000 primary images) to screen eight species within the <span class="genus-species">Drosophila melanogaster</span> subgroup and identify 17 duplicates that were generated through ectopic recombination within the last 12 mys. Most of these are functional and have evolved divergent expression patterns and novel chimeric structures. Examination of their flanking sequences revealed an excess of repetitive sequences, with the majority belonging to the transposable element DNAREP1 family, associated with the new genes. Our dataset strongly suggests an important role for REs in the generation of chimeric genes within these species.</p> </div><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="abstract" xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[2]"><a id="abstract1" name="abstract1" toc="abstract1" title="Author Summary"></a> <h2 xpathLocation="noSelect">Author Summary <a href="#top">Top</a></h2> <p xpathLocation="/article[1]/front[1]/article-meta[1]/abstract[2]/sec[1]/p[1]">In numerous organisms, many new genes have been found to originate through dispersed gene duplication and exon/domain shuffling. What recombination mechanisms were involved in the duplication and the shuffling processes? Lack of the intermediate products of recombination that share adequate sequence identity between homologous sequences, or the parental sequences from which the new genes were derived, often makes answering these questions difficult. We identified a number of young genes that originated in recently diverged branches in the evolutionary tree of the eight <span class="genus-species">Drosophila melanogaster</span> subgroup species, by using fluorescence in situ hybridization with polytene chromosomes. We analyzed the genomic regions surrounding 17 new dispersed duplicate genes and observed that most of these genes are flanked by repetitive elements (REs), including a large and diverged transposable element family, DNAREP1. Several copies of these REs are kept in both new and parental gene regions, and their degeneration is correlated with the increasing ages of the identified new genes. These data suggest that REs mediate the recombination responsible for the new gene origination.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="articleinfo" xpathLocation="noSelect"><p><strong>Citation: </strong>Yang S, Arguello JR, Li X, Ding Y, Zhou Q, et al. (2008) Repetitive Element-Mediated Recombination as a Mechanism for New Gene Origination in <i>Drosophila</i>. PLoS Genet 4(1): e3. doi:10.1371/journal.pgen.0040003</p><p><strong>Editor: </strong>R. Scott Hawley, Stowers Institute for Medical Research, United States of America</p><p></p><p><strong>Received:</strong> August 22, 2007; <strong>Accepted:</strong> November 27, 2007; <strong>Published:</strong> January 18, 2008</p><p><strong>Copyright:</strong> © 2008 Yang et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.</p><p><strong>Funding:</strong> This work was supported by a CAS-Max Planck Society Fellowship, a National Natural Science Foundation of China (NSFC) award (number 30325016), a NSFC key grant (number 30430400), and a 973 Program (number 2007CB815703–5) to WW; a US National Science Foundation CAREER award (MCB0238168) and US National Institutes of Health R01 grants (R01GM065429-01A1 and 1R01GM078070-01A1) to ML at the University of Chicago; a Graduate Assistance in Areas of National Need (GAANN) genomics grant supports JRA.</p><p><strong>Competing interests:</strong> The authors have declared that no competing interests exist.</p><p><a name="cor1"></a>* To whom correspondence should be addressed. E-mail: <a href="mailto:mlong@uchicago.edu">mlong@uchicago.edu</a> (ML); <a href="mailto:wwang@mail.kiz.ac.cn">wwang@mail.kiz.ac.cn</a> (WW)</p><p><a name="equal-contrib"></a># These authors contributed equally to this work. </p><p><a name="n105"></a><span class="capture-id"> ¤ Current address: Ingénieur de Recherche en Bioinformatique Equipe Génomique Evolutive des Vertébrés, Institut de Génomique Fonctionnelle de Lyon (IGFL)—Ecole Normale Supérieure de Lyon (ENSL) 46, Lyon, France</span></p></div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section1" xpathLocation="/article[1]/body[1]/sec[1]"><a id="s1" name="s1" toc="s1" title="Introduction"></a><h3 xpathLocation="noSelect">Introduction <a href="#top">Top</a></h3><p xpathLocation="/article[1]/body[1]/sec[1]/p[1]">Gene duplication followed by the acquisition of novel molecular function is a fundamental process underlying biological diversity. It has been theoretically and empirically demonstrated that functionally distinct duplicates are capable of evolving through a neofunctionalization process in which there is an accumulation of mutations in a redundant copy of a preexisting gene [<a href="#pgen-0040003-b001">1</a>–<a href="#pgen-0040003-b003">3</a>]. In addition, there is mounting evidence for the rapid generation of new genes through the recombination of preexisting exons and functional domains. This latter process does not exclude, and in fact often relies on, the duplication of the loci involved [<a href="#pgen-0040003-b004">4</a>,<a href="#pgen-0040003-b005">5</a>]. Excluding chimeric genes formed through retroposition [<a href="#pgen-0040003-b006">6</a>–<a href="#pgen-0040003-b008">8</a>], more than three hundred gene families are believed to have originated through exon shuffling [<a href="#pgen-0040003-b009">9</a>]. Most of these gene families have introns, suggesting that DNA level recombination was involved (DLR; DLR as opposed to a retroposition event involving an RNA intermediate).</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[2]">Since its initial proposal [<a href="#pgen-0040003-b010">10</a>], the genetic mechanisms involved in the formation of chimeric genes through exon shuffling have largely remained a mystery. The classic model states that nonhomologous recombination (NHR) brings together exons or domains from ectopic positions [<a href="#pgen-0040003-b010">10</a>]. Experimental evidence for the role of NHR has been gained through transfection experiments [<a href="#pgen-0040003-b011">11</a>,<a href="#pgen-0040003-b012">12</a>] and through surveys of rearrangement hotspots which are often disease-associated [<a href="#pgen-0040003-b013">13</a>–<a href="#pgen-0040003-b015">15</a>]. Breakpoint analyses on these datasets revealed little or no sequence identity between the loci recombined, supporting a NHR model. While these experiments show such a model is possible for exon shuffling, it remains an open question how frequently such processes in non-artificial systems, and over evolutionary time, will contribute to the formation of fixed chimeric genes.</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[3]">Another potential NHR mechanism that can mediate nonhomologous recombination is through the activity of transposable elements (TEs). If a TE is capable of mobilizing adjacent sequence, novel junctions that share no sequence identity could be generated [<a href="#pgen-0040003-b016">16</a>]. The capacity for such events has been documented with the imprecise excision of well studied TEs such as P elements [<a href="#pgen-0040003-b017">17</a>] as well as in plant pack-MULE and Helitron TEs [<a href="#pgen-0040003-b018">18</a>–<a href="#pgen-0040003-b020">20</a>]. These investigations implicate a role for TEs in the generation of chimeric genes. Whether these shuffled products are under functional constraint remains an interesting question.</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[4]">Alternatively, non-allelic homologous recombination (NAHR) between ectopic sequences can lead to the formation of chimeric genes. Recently, a surge of evidence has begun to demonstrate the importance of NAHR to genomic architecture, especially in primates [<a href="#pgen-0040003-b021">21</a>–<a href="#pgen-0040003-b026">26</a>]. Intriguingly, several studies have reported on a limited number of chimeric gene structures, some of which appear functional and nondeleterious, but most remain putative [<a href="#pgen-0040003-b024">24</a>,<a href="#pgen-0040003-b027">27</a>]. Focus has primarily been placed on NAHR's role in human disease [<a href="#pgen-0040003-b026">26</a>]. However, given that NAHR appears to be a common mutational mechanism, a new hypothesis for exon shuffling has been motivated: Despite the frequently deleterious effects, NAHR is capable of making a contribution to the origin of new chimeric genes as an exon shuffling mechanism [<a href="#pgen-0040003-b024">24</a>,<a href="#pgen-0040003-b025">25</a>,<a href="#pgen-0040003-b028">28</a>,<a href="#pgen-0040003-b029">29</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[5]">A difficulty in investigating the relative contributions of these mechanisms to the formation of chimeric genes is that most of the available examples are evolutionarily ancient [<a href="#pgen-0040003-b009">9</a>]. These genes provide few clues for understanding the recombination mechanisms that generated their initial structures because the sequence features, especially those non-constrained sequence traits, that may have fostered their formations have likely been lost (the half life is 120 mys for mammals and 10 mys in <i>Drosophila</i> [<a href="#pgen-0040003-b030">30</a>]). While sequence analyses of ancient chimeric genes provide little mechanistic insight, a sample of young chimeric genes that potentially retain these sequence features may. A second difficulty arises from the limited number of young chimeric genes that are thought to have arisen by DLR. While several case studies exist, evolutionary analyses demonstrating that the new chimeras are functional are largely lacking [<a href="#pgen-0040003-b024">24</a>,<a href="#pgen-0040003-b027">27</a>,<a href="#pgen-0040003-b031">31</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[1]/p[6]">Here we report on a large-scale experimental genomic screen for young chimeric genes generated by DLR within the <span class="genus-species">D. melanogaster</span> subgroup. We utilized an integrated approach based on fluorescent <i>in situ</i> hybridizations (FISH), Southern hybridizations, expression and transcript experiments, BLAST queries, and evolutionary analyses. This approach allowed us to focus on dispersed duplication events, ignoring tandem duplications. Consequently, the total number of chimeric formations are likely larger than the total we report on here. Nonetheless, our results show that, rather than providing redundant copies, dispersed duplication events via DLR have generated new chimeric structures at a high frequency. Interestingly, none of these chimeric structures involved two or more genic sequences; all chimeric regions were formed from the fusion of the duplicated loci and intergenic sequences. Furthermore, we provide strong evidence that REs, in particular the TE family DNAREP1, are a major mediator of these events. Finally, using multiple well-established methods [<a href="#pgen-0040003-b006">6</a>,<a href="#pgen-0040003-b007">7</a>,<a href="#pgen-0040003-b032">32</a>–<a href="#pgen-0040003-b034">34</a>], we demonstrate that most of these new chimeric genes are functional.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section2" xpathLocation="/article[1]/body[1]/sec[2]"><a id="s2" name="s2" toc="s2" title="Results/Discussion"></a><h3 xpathLocation="noSelect">Results/Discussion <a href="#top">Top</a></h3><p xpathLocation="/article[1]/body[1]/sec[2]/p[1]">Two cDNA unigene libraries from <span class="genus-species">D. melanogaster</span> comprised of ~7,000 cDNA probes were used for cFISH experiments over all tested species. Each hybridization generated at least two images for each species. In total, our experiment produced ~112,000 primary images. Including those probes that gave weak or paradoxical signals, the <i>Drosophila</i> Gene Collection (DGC) library version 1.0 set resulted in 266 candidates. The unigene library included 1,000 cDNA probes, most of which were included in the DGC 1.0 library. From this set, 5 new genes, <i>jingwei</i> [<a href="#pgen-0040003-b033">33</a>], <i>Hun</i> [<a href="#pgen-0040003-b032">32</a>], <i>sphinx</i> [<a href="#pgen-0040003-b034">34</a>], <i>monkey-king</i> [<a href="#pgen-0040003-b035">35</a>], and <i>Dntf-2r</i> [<a href="#pgen-0040003-b036">36</a>] have previously been described.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[2]">To exclude false positives from the 266 candidates, we carried out Southern hybridizations and conducted BLAST searches against the available genome sequences of <span class="genus-species">D. simulans</span> (droSim1)<i>, D. yakuba</i> (droYak1)<i>, D. sechellia</i> (droSec1)<i>, D. melanogaster</i> (dm2) and <i>D. erecta</i> (droEre1) (<a href="http://genome.ucsc.edu">http://genome.ucsc.edu</a>) (<a href="#pgen-0040003-g001">Figure 1</a>). The Southern and BLAST analyses confirmed 17 young duplicates generated through DLR (<a href="#pgen-0040003-t001">Table 1</a>; <a href="#pgen-0040003-g002">Figure 2</a>). The genomic sequences of all 17 dispersed duplicates contain the intron(s) and/or non-coding flanking sequences that exist in their parental copies, suggesting that the new genes originated through DLR. In addition, we also identified ten new copies of retrogenes and 53 young copies of REs including retroelements and other repetitive sequences. In this report, we have focused on the 17 dispersed duplicates and investigate possible DLR mechanisms that generate dispersed duplications.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/fig[1]"><a name="pgen-0040003-g001" id="pgen-0040003-g001" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.g001&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/label[1]">Figure 1. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/caption[1]/title[1]">An Example Illustrating the Detection of New Genes</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/caption[1]/p[1]">(A) The probe LD47348 (CG10595) detected two signals in the clade of <i>D. yakuba-santomea-teissieri</i> while only detecting one signal in other species. The new additional signal suggests a new gene candidate.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/caption[1]/p[2]">(B) Southern hybridization results further confirm the extra copy in the <i>D. yakuba-santomea-teissieri</i> clade (M is 1-kb extension marker [Invitrogen]). Lanes 1–8 correspond to Xho I digested DNAs of <span class="genus-species">D. yakuba</span>, <span class="genus-species">D. teissieri</span>, <i>D. santomea</i>, <span class="genus-species">D. erecta</span>, <span class="genus-species">D. melanogaster</span>, <span class="genus-species">D. simulans</span>, <i>D. mauritiana</i>, and <i>D. sechellia</i>, respectively).</p> <p xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/caption[1]/p[3]">(C) Cartoon figure displaying the gene structures of the parental gene (d, or CG10595) and the new duplicate (d-r). The duplicated region is indicated by vertical dash lines. d-r recruited one upstream exon as indicated by yellow box.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/fig[1]/caption[1]/p[4]">(D) Expression patterns of the parental gene. (E) expression patterns of the new gene d-r revealed by one round of RT-PCR and a second round of nested PCR (M indicates DL2000 DNA molecular marker (Takara); E+, E−, L2+, L2−, L3+, L3−, P+, P−, A+, and A− correspond to positive and negative reactions for embryos, second instar larvae, third instar larvae, pupae, and adults, respectively). From these gels, it is clear that d-r is only expressed in the third instar larvae while the parental copy is expressed ubiquitously. All the bands in the negative control lanes are primer dimer bands. E+ and L3+ are weak but clearly visible.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.g001</span><div class="clearer"></div></div><div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[1]"><a name="pgen-0040003-t001" id="pgen-0040003-t001" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.t001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.t001&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[1]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.t001" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[1]/label[1]">Table 1. </span></a></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[1]/caption[1]/p[1]">List of the Young Duplicates Identified and their Parental Loci</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.t001</span><div class="clearer"></div></div><div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/fig[2]"><a name="pgen-0040003-g002" id="pgen-0040003-g002" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.g002&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/fig[2]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/fig[2]/label[1]">Figure 2. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/fig[2]/caption[1]/title[1]">The Phylogenetic Distribution of the 17 New DLR Duplicates Identified in This Study</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/fig[2]/caption[1]/p[1]">The species phylogeny and time scale are from [<a href="#pgen-0040003-b058">58</a>]. Different color bars show different gene families. The kep1 gene family has six new duplicates (indicated by red bars).</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.g002</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/p[3]">Interestingly, the <i>kep1</i> gene family has six new duplicates that have been dispersed to different chromosomal locations, while the other 11 gene families have only a single new duplicate (<a href="#pgen-0040003-t001">Table 1</a>). Thirteen of these duplications are intrachromosomal, and 4 are interchromosomal (<a href="#pgen-0040003-t001">Table 1</a>). Two putative pseudogenes exist in this list: CR33318 and CR9337. CR33318 is found only in <span class="genus-species">D. melanogaster</span>, however CR9337 has a disrupted reading frame in <span class="genus-species">D. melanogaster</span> but is intact in <span class="genus-species">D. sechellia</span> and <i>D. simulans</i>. Mapping these results onto the species tree reveal an age <8 mys for almost all these origination events except the 12-my-old CG5372 (<a href="#pgen-0040003-g002">Figure 2</a>).</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[4]">Excluding the two putative pseudogenes (CR33318 and CR9337) paralog-specific reverse transcriptase (RT-PCR) experiments detected transcripts for all paralogs. Twelve out of these 15 duplicates display differential expression patterns from their parental copies in development and/or sex (<a href="#pgen-0040003-st001">Table S1</a>). These observations indicate that most of the new genes have evolved divergent expression patterns, and that generally the patterns are more restricted.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[5]">To examine whether the new duplicates have evolved chimeric gene structures, we utilized previously reported cDNA sequences, RACE, or RT-PCR based on computationally predicted structures (Materials and Methods). Among the 17 new genes, 13 were found to have evolved chimeric gene sequences through the recruitment of flanking sequence near the insertion site or as the result of extensive deletions (CG5372, CG9902, CG4021, CG3875, CG3927, CR9337, CG7635-r, CG3101-r, CG3071-r, <i>d-r</i>, Dox-A3-r, <i>Hun</i>, and <i>klg-r</i>; <a href="#pgen-0040003-g003">Figure 3</a>). Among these chimeric genes, 11 can encode chimeric proteins (CG5372, CG9902, CG4021, CG3875, CG3927, CR9337, CG7635-r, CG3101-r, CG3071-r, <i>Hun</i> and <i>d-r</i>). For example, <i>d-r</i> and CG9902 have both recruited novel coding regions following their duplications, and possibly in conjunction with their deletions events that followed (<a href="#pgen-0040003-g003">Figure 3</a>). These observations reveal that the majority of young duplicated genes have evolved chimeric gene structures. In addition, it is notable that the chimeric genes that we have detected involve only the duplicated loci and integenic sequences. This suggests that for dispersed duplication events, the formation of chimeric genes by recombining two or more genic sequences may be relatively rare.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/fig[3]"><a name="pgen-0040003-g003" id="pgen-0040003-g003" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g003" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.g003&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/fig[3]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g003" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/fig[3]/label[1]">Figure 3. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/fig[3]/caption[1]/title[1]">The Gene Structures of 16 New Duplicates Mapped on the Species Phylogeny</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/fig[3]/caption[1]/p[1]">CR33318 is not shown because it is a truncated copy without detectable expression and has frame shift mutations. Duplicated regions are indicated with vertical dash lines. Horizontal dash lines in CG7635-r, CG3101-r, d-r, and klg-r indicate that we only obtained partial coding regions with RT-PCR and longer coding regions may exist outward. Boxes are exon regions and lines indicate introns. Yellow boxes indicate recruited chimeric regions, green boxes indicate parental loci UTRs, and blue boxes indicate duplicate loci UTRs. Positions of start and stop codons are marked.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.g003</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/p[6]">To test for functional constraint, we conducted substitution analyses by estimating the Ka/Ks ratio for both paralogous and orthologous comparisons. For the paralogous comparisons, our conservative null hypothesis was that the parental genes are under strong functional constraint with the new copy subject to no constraint (a pseudogene). These estimates suggest that most of the genes are under functional constraint: Ka/Ks values are lower than 0.5 for 8 genes, lower than 1 but higher than 0.5 for 5 genes, and ~1 for 2 genes (<a href="#pgen-0040003-st002">Table S2</a>). Furthermore, analyses of the functional domains for these genes (Materials and Methods), revealed that almost all genes have Ka/Ks ratios lower than or close to 0.5 (<a href="#pgen-0040003-st002">Table S2</a>). For orthologous comparisons, the null hypothesis was that the new copies are pseudogenes (Ka/Ks = 1). The results were similar, showing that Ka/Ks ratios are significantly less than 1 for most genes except CG3071-r (Ka/Ks = 2.3091) and CG8490-r (Ka/Ks = 1.2230), indicating the possibility that positive selection may be acting on these two (<a href="#pgen-0040003-st003">Table S3</a>). The statistical tests of the null hypothesis of neutrality [<a href="#pgen-0040003-b001">1</a>] in the paralogous and orthologous comparisons reveal that most of these new genes are under significant functional constraint over the tested coding sequences. These complementary analyses of expression, gene structure, and nucleotide substitution suggest that all 15 new genes are functional and that many of these have undergone neofunctionalization by evolving new gene structures with new expression patterns.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[7]">The classical models of gene duplication assume a completely redundant (in sequence and function) duplicate copy [<a href="#pgen-0040003-b001">1</a>,<a href="#pgen-0040003-b002">2</a>]. In these models the most likely outcome is that one copy will become non-functionalized, with a low probability that one or the other becomes neofunctionalized or subfunctionalized through subsequent mutations [<a href="#pgen-0040003-b037">37</a>,<a href="#pgen-0040003-b038">38</a>]. However, our results show that the majority of new duplicates generated through DLR in <i>Drosophila</i> are not structurally, and are thus unlikely to be functionally, identical to their parental copies. It is also a general result that DLR is an important mechanism for the generation of dispersed genes with novel functions, adding to other potential mechanisms [<a href="#pgen-0040003-b039">39</a>]. Interestingly, Katju and Lynch [<a href="#pgen-0040003-b040">40</a>] have recently found that many new duplicates in <span class="genus-species">C. elegans</span> have unique exons in one or both members of a duplicate pair. Consistent with our observations, these latter cases are also likely DLR-derived duplicates that have recruited new gene fragments and have evolved stable chimeric structures.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[8]">Having established that 15 of these new duplicates are likely functional, with many having chimeric structures, we then investigated the mutational mechanisms that generated them. Data, largely originating from detailed sequence analyses of human disease-related loci, have shown correlations between structural variation and REs, most notably <i>Alu</i> elements in primate genomes [<a href="#pgen-0040003-b013">13</a>,<a href="#pgen-0040003-b022">22</a>,<a href="#pgen-0040003-b023">23</a>,<a href="#pgen-0040003-b025">25</a>,<a href="#pgen-0040003-b028">28</a>,<a href="#pgen-0040003-b041">41</a>]. Though a causal relationship between the repetitive elements and segmental duplications is difficult to establish, several studies have argued for their causative role in genomic rearrangements through NAHR. Based in part on these findings, we were interested in whether there was evidence for repetitive sequence surrounding these duplicated regions.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[9]">We identified both 5′ and 3′ breakpoints for each young duplicate by comparing genomic sequences of each of these new gene duplicates with its parental copy (<a href="#pgen-0040003-t002">Table 2</a>). Interestingly, we observed REs at or near the breakpoints for 10 out of the 17 duplicates (including the 2 duplicates that are likely pseudogenes) (<a href="#pgen-0040003-t002">Table 2</a>; <a href="#pgen-0040003-sg001">Figure S1</a>). These REs consist of 7 TEs, 2 satellite sequences, and 1 simple repeat. They are associated with the new genes that are in different genomic locations, suggesting independent events. Furthermore, all TEs belong to the DNAREP1 family, the largest TE family in <i>Drosophila</i> which has very diverged members [<a href="#pgen-0040003-b042">42</a>,<a href="#pgen-0040003-b043">43</a>].</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[2]"><a name="pgen-0040003-t002" id="pgen-0040003-t002" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.t002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.t002&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[2]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.t002" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[2]/label[1]">Table 2. </span></a></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/table-wrap[2]/caption[1]/p[1]">Repetitive Elements at the Breakpoints of Duplicate Pairs</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.t002</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/p[10]">Among these 10 pairs associated with REs, 5 have shared repeats at or near the breakpoints of both the parental and the new duplicate copies (<a href="#pgen-0040003-t002">Table 2</a>). For these 5 paralog pairs, 4 (CG3875-CG3927, mkgr-mkgr2, CG3101-CG3101-r and CR9337-CR9337-r) maintain very high sequence identity over the flanking elements; the remaining CR9337-CR33318 pair, though both harboring DNAREP1 sequence at their 5′ ends, provides a weak alignment. The other five paralog pairs contain a repetitive element at the breakpoint of one copy (<a href="#pgen-0040003-t002">Table 2</a>; 2 examples with highly similar TEs shown in <a href="#pgen-0040003-g004">Figure 4</a>). In addition, <i>klg-r</i>, CG7635-r and CG8490-r (not included in the ten above) were found next to sequencing gaps in the genomic databases (<a href="#pgen-0040003-t002">Table 2</a>), and resequencing these regions resulted in sequence profiles characteristic of repetitive sequences (data not shown). If these are included, the majority of new duplicates (13/17, 76.5%) are associated with repetitive elements.</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/fig[4]"><a name="pgen-0040003-g004" id="pgen-0040003-g004" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g004" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.g004&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/fig[4]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g004" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/fig[4]/label[1]">Figure 4. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/fig[4]/caption[1]/title[1]">Two Examples of New Genes with Repetitive Sequences at the Breakpoints</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/fig[4]/caption[1]/p[1]">(A) Shows a satellite DNA sequence (SAR) located at the 5′ breakpoints of mkg-r2 and its parental gene mkg-r.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/fig[4]/caption[1]/p[2]">(B) Shows the existence of a transposon (DNAREP1-DM) at all the four breakpoints of a CR9337 duplicate pair.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.g004</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/p[11]">Four lines of evidence indicate that this association has not been observed by chance. The first is based on orthology assignments available from current genome databases, indicating that all ten in our set are euchromatic and not on the 4<sup>th</sup> chromosome. High-resolution analyses of <span class="genus-species">D. melanogaster</span> TEs have verified that the paracentromeric regions of the major chromosome arms and chromosome 4 harbor the highest densities of TEs [<a href="#pgen-0040003-b044">44</a>]. Second, simulations show that the probability that the number of genes flanked by TEs ≥7 given the sample size of seven genes (with 14 breakpoints) is low (<i>p</i> < 0.05) given a TE-free region (TFR) of ~15 kb or larger (<a href="#pgen-0040003-sg002">Figure S2</a>; Materials and Methods). Despite TE differences between species, 15 kb is less than half the mean TFR found in <span class="genus-species">D. melanogaster</span> [<a href="#pgen-0040003-b044">44</a>]. Given that the TEs in our dataset are comprised primarily of DNAREP1 family members, the distance is even greater. Furthermore, the probability that both paralogs contain the same TE sequence in their flanking regions, as three (and possibly four) do in our dataset, is much lower (<a href="#pgen-0040003-t002">Table 2</a>; <a href="#pgen-0040003-sg001">Figure S1</a>). Finally, our data reveal a gradation of degeneration in the TEs and other REs with the ages of the gene duplicates that the repeats flank (<a href="#pgen-0040003-g005">Figure 5</a>). This gradation is consistent with observed degeneration rate of functionless elements in <i>Drosophila</i> [<a href="#pgen-0040003-b030">30</a>], as well as any potential internal deletions that could be part of a self-regulation system as seen in <span class="genus-species">D. melanogaster</span> TEs [<a href="#pgen-0040003-b045">45</a>].</p> <div class="figure" xpathLocation="/article[1]/body[1]/sec[2]/fig[5]"><a name="pgen-0040003-g005" id="pgen-0040003-g005" title="Click for larger image " href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g005" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><img xpathLocation="noSelect" border="1" src="/article/fetchObject.action?uri=info:doi/10.1371/journal.pgen.0040003.g005&representation=PNG_S" align="left" alt="thumbnail" class="thumbnail"></a><p><strong xpathLocation="/article[1]/body[1]/sec[2]/fig[5]/label[1]"><a href="/article/slideshow.action?uri=info:doi/10.1371/journal.pgen.0040003&imageURI=info:doi/10.1371/journal.pgen.0040003.g005" onclick="window.open(this.href,'plosSlideshow','directories=no,location=no,menubar=no,resizable=yes,status=no,scrollbars=yes,toolbar=no,height=600,width=850');return false;"><span xpathLocation="/article[1]/body[1]/sec[2]/fig[5]/label[1]">Figure 5. </span></a> <span xpathLocation="/article[1]/body[1]/sec[2]/fig[5]/caption[1]/title[1]">A Simplified Schematic of the Repetitive Sequence Flanking New Genes and Their Distribution over the <span class="genus-species">D. melanogaster</span> Subgroup Phylogeny</span></strong></p><p xpathLocation="/article[1]/body[1]/sec[2]/fig[5]/caption[1]/p[1]">Left panel displays the varying degrees of identity and degeneration between flanking regions of paralogs, with the right panel displaying the branches in which they are found; 1: (CG2952:CG2952-r), (kep1: CG4021), (kep1:CG9337), (CG9902:CG7692), (kep1:CG3875); 2: (CG3875-CG3927); 3: (CG9337-CG9337-r); 4: (mkgr-mkgr2); 5: (CR9337-CR33318); 6: (CG3101-CG3101-r). The red blocks in the left panel indicate alignable regions of the TEs and other repeat sequences. The black boxes represent sequences of TEs and other repeats; fragmented black boxes represent RE fragments. The long boxes in various colors represent the identified new genes. See also <a href="#pgen-0040003-sg001">Figure S1</a>, for the alignments.</p> <span xpathLocation="noSelect">doi:10.1371/journal.pgen.0040003.g005</span><div class="clearer"></div></div><p xpathLocation="/article[1]/body[1]/sec[2]/p[12]">The striking association with REs provides evidence for the relationship between RE sequences and genomic rearrangements leading to novel functions. This relationship differs from previous reports of TE themselves becoming part of a novel transcript in <span class="genus-species">D. melanogaster</span> [<a href="#pgen-0040003-b046">46</a>,<a href="#pgen-0040003-b047">47</a>]. Instead, our dataset supports a model whereby REs are mediating the recombination of flanking sequences to form chimeric products that do not include RE sequence. The precise mechanism defining “RE-mediation” would likely be NAHR or the mobilization of flanking sequence through the activity of the DNAREP1 transposons. Recent studies of DNAREP1 elements suggest a burst of activity occurred just prior to or during the formation of the <span class="genus-species">D. melanogaster</span> subgroup, followed by nearly complete inactivation ~5–10 mya [<a href="#pgen-0040003-b042">42</a>]. Interestingly, there is evidence of a very recent revival of activity in the <span class="genus-species">D. yakuba</span> lineage [<a href="#pgen-0040003-b043">43</a>]. If these estimates on inactivity are correct, NAHR would be the most likely mechanism generating the rearrangement in our dataset. This possibility is also supported by the identified non-mobile repeat sequences that are associated with the new chimeric genes (<a href="#pgen-0040003-t002">Table 2</a>). However, if DNAREP1 has been active in the <span class="genus-species">D. melanogaster</span> subgroup for a longer period than reported, as implicated by the observation in <span class="genus-species">D. yakuba</span> [<a href="#pgen-0040003-b043">43</a>], and if this class of TEs does in fact mobilize flanking DNA, a combination of mechanisms is possible.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[13]">Alternatively, the REs flanking the new duplicates could be the result of larger duplications that included the REs (segmental duplication), rather than the REs mobilizing the region. However, we would expect that under this hypothesis we would see longer stretches of identity outside REs. Inspecting the flanking regions of our dataset indicate that identity is lost in close proximity with the repetitive sequences. A second alternative hypothesis is that the repetitive sequence presents a preferential site for strand breakage. Similar suggestions have been made for <i>Alu</i>, satellite repeats, and other sequence demonstrating fragility [<a href="#pgen-0040003-b023">23</a>,<a href="#pgen-0040003-b031">31</a>,<a href="#pgen-0040003-b048">48</a>]. If imperfect repair were to follow strand breakage, this too would be akin to a nonhomologous end-joining event and would support the classical view of exon-shuffling. Further experimental work is needed to address this possibility.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[14]">Our observation that there is an excess of repetitive elements around dispersed functional duplicates is of general importance in light of advancements in identifying copy number variation in other model organisms, and the increased recognition for the role of repetitive sequences in shaping chromosomal architecture [<a href="#pgen-0040003-b014">14</a>,<a href="#pgen-0040003-b022">22</a>–<a href="#pgen-0040003-b026">26</a>,<a href="#pgen-0040003-b031">31</a>,<a href="#pgen-0040003-b049">49</a>,<a href="#pgen-0040003-b050">50</a>]. Despite these advancements, little is known about the potential non-deleterious outcomes that such rearrangements may present. Our work helps fill this void by providing an extensive chimeric gene dataset that is supported by experiments that test for functionality.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[15]">Evidence from previous case studies has indicated that once a duplicate has been generated the recruitment of exons and/or flanking gDNA is a heterogeneous process [<a href="#pgen-0040003-b032">32</a>,<a href="#pgen-0040003-b051">51</a>]. Within our dataset, we also observe this. The first instance is the direct recruitment of genomic DNA flanking the insertion site of the new copy. Eight new genes, representing eight gene families (CG9902, CG5372, <i>Hun</i>, CG7635-r, CG3701-r, <i>d-r</i>, <i>Dox-A3-r</i> and <i>klg-r</i>), were created this way. The second involves dramatic mutations within the new duplicates. In the <i>kep1</i> family, numerous deletions in the duplicated 3′ regions have resulted in varying peptide sequences in the C terminal (<a href="#pgen-0040003-g003">Figure 3</a>). In CG8490-r, both the start and the stop codons have been shifted, resulting in different peptides at both N and C termini. Finally, CG3101-r has recruited part of its previous intron 3, which becomes the 5′ UTR and a short stretch of protein-coding sequence in the new duplicate gene.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[16]">We have used ~7,000 cDNA probes to screen new gene duplicate copies. The estimated number of genes in the genome is ~14,000. The total number of new gene duplicates can be estimated as 17/7,000 × 14,000 = 34, over an evolutionary time equal to ~20 mys (the sum of the branch lengths of the <span class="genus-species">D. melanogaster</span> subgroup). Thus, on average, the origination rate is 34/20 = 1.7 per mys per genome, or 0.121 × 10<sup>−9</sup> per year per gene. We note that, because our method ignores tandem duplicates, and because our FISH probes were all based on <span class="genus-species">D. melanogaster</span> sequence, this is an underestimate. However, this rate is an order of magnitude higher than the gene duplication rate estimated in yeast [<a href="#pgen-0040003-b052">52</a>] but still 30 times lower than a previous estimate that were based on the assumption of a molecular clock [<a href="#pgen-0040003-b053">53</a>]. Our estimate may not be inconsistent with previous estimates [<a href="#pgen-0040003-b053">53</a>] because our focus was much narrower, investigating DLR events only.</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[17]">Only two new duplicates (<i>d-r</i> and <i>Dox-A3-r</i>) in the <i>yakuba-santomea-teissieri</i> lineage (<i>yakuba</i> lineage) were observed, while 5 new duplicates were detected between the common ancestor of <i>melanogaster</i> and <i>yakuba</i> and the common ancestor of the <i>melanogaster</i> complex (<a href="#pgen-0040003-g002">Figure 2</a>). In addition, we did not detect any new duplicate in <span class="genus-species">D. erecta</span>. This may be a technical result attributable to the difficulty of hybridization with <span class="genus-species">D. erecta</span> polytene chromosomes, or sequence divergence relative to our probes. Alternatively, the putative inconsistent duplication rate may be associated with episodic activities of transposons or repetitive sequences. For example, the transposon DNAREP1 members were associated with 5 new duplicates in the <i>kep1</i> gene family and CG9902. As noted above, it has been suggested that there was an active episode of DNAREP1 before the <span class="genus-species">D. melanogaster</span> lineage separated from the <span class="genus-species">D. yakuba</span> lineage and then again within the <span class="genus-species">D. yakuba</span> lineage [<a href="#pgen-0040003-b042">42</a>,<a href="#pgen-0040003-b043">43</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[2]/p[18]">Previous investigations have revealed several important roles for REs in the generation of evolutionary novelties including the donation of their own sequences into protein coding regions [<a href="#pgen-0040003-b046">46</a>,<a href="#pgen-0040003-b047">47</a>,<a href="#pgen-0040003-b054">54</a>,<a href="#pgen-0040003-b055">55</a>], retrotransposing and recruiting novel gene sequence [<a href="#pgen-0040003-b005">5</a>], increasing genic diversity in the maize genome by the helitron-like transposons [<a href="#pgen-0040003-b056">56</a>], potentially providing greater overall genome plasticity [<a href="#pgen-0040003-b016">16</a>,<a href="#pgen-0040003-b057">57</a>], and elevating expression of a nearby insecticide resistant gene [<a href="#pgen-0040003-b058">58</a>,<a href="#pgen-0040003-b059">59</a>]. The observation reported here further demonstrates a mechanistic role for REs in mediating the origins of new genes by facilitating gene recombination. The precise mechanism for this recombination is unclear, but likely include NAHR, as implicated by both TEs and non-TE repetitive sequences being detected, and NHR as a consequence of transposon enzymatic activities [<a href="#pgen-0040003-b043">43</a>]. However, the conventional NAHR model is much more likely between the homologous repeats that are located on the same chromosome [<a href="#pgen-0040003-b022">22</a>]. Four of the 17 new genes identified are on different chromosomes from their parental genes. These four new genes may have been generated by a different homology-dependent recombination model that assumes a replication-dependent mechanism involving no crossover [<a href="#pgen-0040003-b022">22</a>], the explicit model depicted in Figure 8.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section3" xpathLocation="/article[1]/body[1]/sec[3]"><a id="s3" name="s3" toc="s3" title="Materials and Methods"></a><h3 xpathLocation="noSelect">Materials and Methods <a href="#top">Top</a></h3> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[1]/title[1]">Materials.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[1]/p[1]">In order to screen for young chimeric genes systemically, we designed an experimental genomics approach using the <span class="genus-species">D. melanogaster</span> species subgroup as a comparative model system. This subgroup includes <span class="genus-species">D. melanogaster</span> (hereafter abbreviated as mel in presented tables and figures)<i>, D. simulans</i> (sim)<i>, D. mauritiana</i> (mau)<i>, D. sechellia</i> (sec)<i>, D. yakuba</i> (yak)<i>, D. teissieri</i> (tei)<i>, D. santomea</i> (san)<i>,</i> and <i>D. erecta</i> (ere). <span class="genus-species">D. orena</span> was excluded from analyses because of its unclear placement in the phylogeny. The phylogeny of this subgroup is well resolved [<a href="#pgen-0040003-b060">60</a>,<a href="#pgen-0040003-b061">61</a>] and the divergence times among these species provide a considerable range over which to detect the presence of young genes. The polytene chromosomes of the salivary gland of <i>Drosophila</i> allow detection of gene copy number using a fluorescent <i>in situ</i> hybridization (FISH) approach. Therefore, we can use cDNA probes to visualize FISH signals that are about 100 kb away from each other in the species of <span class="genus-species">D. melanogaster</span> subgroup, and count the signal number in each species [<a href="#pgen-0040003-b034">34</a>].</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/title[1]">FISH and Southern hybridizations.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/p[1]">We carried out dual-color FISH on the polytene chromosome preparations of the aforementioned 8 species. Our probe sets comprised 5,928 full-length <span class="genus-species">D. melanogaster</span> cDNA clones from the Berkeley Drosophila Gene Collection (DGC) version 1.0 (<a href="http://www.fruitfly.org/DGC/index.html">http://www.fruitfly.org/DGC/index.html</a>) and about 1,000 cDNA clones from an early Drosophila Unigene Library (Research Genetics).</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[2]/p[2]">Probes were labeled with digoxiginin (DIG) or biotin using PCR [<a href="#pgen-0040003-b034">34</a>,<a href="#pgen-0040003-b062">62</a>]. Polytene chromosomes from four species were simultaneously squashed on a slide and then hybridized with a pair of DIG and biotin labeled probes [<a href="#pgen-0040003-b034">34</a>]. For a given probe, FISH is capable of resolving two signals across two adjacent polytene bands, which is equivalent to ~100 kb in linear DNA sequence. As a result, all duplicates we report in this study have been involved in translocations; they are not tandem duplications. The probes that revealed extra signals in a particular lineage were subject to further confirmation using southern hybridization. Genomic DNAs of the eight species were extracted using the Puregene DNA isolation kit (Gentra Systems). DNAs digested with restriction enzymes were separated on agarose gels and transferred to nylon membranes (Roche Molecular Biochemicals) by Southern blotting. The DIG-labeled probes were hybridized to the membrane to further confirm the copy number in different species. In addition, homology searches were carried out for those new genes that fell within sequenced genomes (<a href="http://genome.ucsc.edu">http://genome.ucsc.edu</a>).</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[3]/title[1]">Breakpoint analyses.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[3]/p[1]">To identify breakpoints and examine the type of sequence surrounding them, the genomic sequences of each pair of duplicate and parental copy, along with 5′ and 3′ flanking sequences, were aligned using the bl2seq software with default settings [<a href="#pgen-0040003-b063">63</a>]. The length of the 5′ and 3′ flanking sequences for each pair was chosen to ensure that it extends 1kb beyond the point where sequence identity disappears. Breakpoints of duplicates were determined as the last nucleotide showing sequence identity between parental and new copy. For a multiple-copy gene family, the parental copy was defined as the copy that has the highest similarity to the new copy. RepeatMasker (<a href="http://www.repeatmasker.org/">http://www.repeatmasker.org/</a>) was used to identify whether there is repetitive sequence within a 100 bp window centered at each breakpoint.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[4]/title[1]">Substitution analyses.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[4]/p[1]">To examine the evolutionary forces operating on the new duplicates, we calculated synonymous (Ks) and non-synonymous (Ka) divergence between all paralogs except for the pseudogene CR33318 (we included the putative pseudogene CR9337 and CR9337-r because they are still intact in the <span class="genus-species">D. simulans</span> complex). In addition, we also conducted substitution analyses between orthologous copies in different species. For 11 young duplicates we retrieved their orthologs from a second species's genome, and therefore also calculated Ka and Ks between the orthologous pairs. Estimates were obtained using MEGA 3.1 [<a href="#pgen-0040003-b064">64</a>]. A Z-test implemented in MEGA 3.1 was used to test if Ka/Ks ratios deviate from the neutral expectation (Ka/Ks = 1). We tested functional constraint in the whole gene coding region and the functional domain separately. To define the functional domains, the coding sequences of genes were translated into the protein sequences. Then we performed rps-BLAST to detect whether the newly translated protein sequences have functional domains using a cutoff line E < 0.01 on NCBI website <a href="http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi">http://www.ncbi.nlm.nih.gov/Structure/cd​d/wrpsb.cgi </a>.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[5]/title[1]">RACE, RT-PCR, and gene structure analyses.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[5]/p[1]">Our approach is capable of observing three kinds of new duplicates, (1) direct duplicates that still keep intron(s) or flanking non-coding sequences, (2) retroposed copies that have lost ancestral introns, and (3) copies that have no obvious sequence features identifying them as either created by retroposition or direct duplication. Tandem duplication can be resulted from either replication slippage or DLR, but the assumption is that those dispersed duplicates across long chromosome distance, or between chromosomes, have originated through DLR. In this study, we only considered direct dispersed duplicates that were derived through DLR. For each of these duplicate genes, we designed copy-specific RT-PCR primers. RT-PCR experiments were carried out using cDNA from 5 developmental stages: embryo, instar larva 2 (L2), instar larva 3 (L3), pupa and adult. Total RNA was extracted from these samples using RNAeasy Mini RNA extraction kit (Qiagen). To avoid contamination of genomic DNA, total RNA was treated with Dnase I (amplification grade, Invitrogen) prior to first strand synthesis. First strand cDNA was synthesized using Oligo-dT and SuperScript II Rnase H- reverse Transcriptase (Invitrogen). All RT-PCR products were sequenced for verification.</p> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[5]/p[2]">To establish the gene structures of the new genes, four types of data were used: (1) the draft genomes of <span class="genus-species">D. simulans</span> (droSim1)<i>, D. yakuba</i> (droYak1)<i>, D. sechellia</i> (droSec1)<i>, and D. erecta</i> (droEre1) (<a href="http://genome.ucsc.edu">http://genome.ucsc.edu</a>) were queried and provided addition verification and gDNA for primer design; (2) For those duplicates whose full length cDNAs are available in public databases (<a href="http://www.ncbi.nlm.nih.gov/Database/">http://www.ncbi.nlm.nih.gov/Database/</a>), we mapped the cDNA to their genomic positions if draft sequence was available; (3) For those duplicates without cDNA, and whose sequences have diverged enough to allow copy-specific primers, we carried out rapid amplification of cDNA ends (RACE); (4) For those duplicate pairs that are too similar to allow copy-specific primers, and for those that resulted in no RACE product (possibly due to low expression levels or long ends), we used the Softberry software [<a href="#pgen-0040003-b065">65</a>] to obtain a tentative chimeric gene structure prediction. We then tested these predictions using RT-PCR.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/title[1]">Chromosomal mapping.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[6]/p[1]">To establish an approximate chromosomal position (interstitial or not) for each of these genes, we used the D. melanogaster genome as a reference. We carried out BLAST queries of the D. melanogaster genome using sequence flanking each of the genes. These flanking regions were then used to query available genome draft sequence (<a href="http://www.ncbi.nlm.nih.gov/Database/">http://www.ncbi.nlm.nih.gov/Database/</a>) in order to determine orthologous chromosomal regions. The cytological positions were then extracted using NCBI's MapView (<a href="http://www.ncbi.nlm.nih.gov/mapview/">www.ncbi.nlm.nih.gov/mapview/</a>). Two new copies (CG7635 and klg), fell between sequence gaps. For these two we determined their approximate position based on our FISH images.</p> <h4 xpathLocation="/article[1]/body[1]/sec[3]/sec[7]/title[1]">TE association simulation.</h4> <p xpathLocation="/article[1]/body[1]/sec[3]/sec[7]/p[1]">To assess the significance of our observed association between TE sequences and the flanking regions of the paralogs, we carried out simulations based on the known frequencies of TEs in D. melanogaster [<a href="#pgen-0040003-b044">44</a>]. The mean TE-free region (TFR) is 23,878, with a median of 1,992. The difference between the mean and the median results from the clustering of TEs within the pericentric regions and the fourth chromosome. However, the identified new genes are non-pericentromeric regions in which the density of TEs is much lower and there are few cases of non-random insertions to one particular locus. Therefore, we carried out simulations over a range of normally distributed TFRs in a conservative assumption of the 15 kb average. The length of each TE was normally distributed with a mean of 4 kb. The total length of simulated chromosomes was kept at ~ 20 Mb. 14 breakpoints were introduced randomly into the sequence (seven paralog pairs where only one copy is associated with TE sequence) and an association was considered if the breakpoint was within 300 bp. This distance was also chosen to be conservative, given the distances observed in our data. 10,000 iterations were run and the upper 5% tail was calculated from the resulting distribution.</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" id="section4" xpathLocation="/article[1]/body[1]/sec[4]"><a id="s4" name="s4" toc="s4" title="Supporting Information"></a><h3 xpathLocation="noSelect">Supporting Information <a href="#top">Top</a></h3><a name="pgen-0040003-sg001" id="pgen-0040003-sg001"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0040003.sg001">Figure S1. </a>The Alignments of Gene Duplicate Copies and Their Flanked Repetitive Sequences</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[1]/caption[1]/p[1]">(114 KB PDF)</p> <a name="pgen-0040003-sg002" id="pgen-0040003-sg002"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0040003.sg002">Figure S2. </a>The Simulation Results of the TE Association with Gene Duplications</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[2]/caption[1]/p[1]">Vertical red line indicates the observed TE-associated genes in our paralog set. The distribution is from simulation where the mean TE-free regions are 15 kb, the mean distance at which our observation is significant at the 0.05 level [<a href="#pgen-0040003-b044">44</a>].</p> <p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[2]/caption[1]/p[2]">(3 KB PDF)</p> <a name="pgen-0040003-st001" id="pgen-0040003-st001"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0040003.st001">Table S1. </a>Expression Pattern of the New Genes and Their Parental Genes</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[3]/caption[1]/p[1]">(135 KB DOC)</p> <a name="pgen-0040003-st002" id="pgen-0040003-st002"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0040003.st002">Table S2. </a>Substitutions between Paralogous Copies</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[4]/caption[1]/p[1]">The <i>p</i>-values in black are for the tests of the null hypothesis that Ka/Ks is significantly lower than 1. The <i>p</i>-values in red are for the null hypothesis that Ka/Ks is significantly lower than 0.5. <i>p</i>-Values for paralog comparisons (red) are shown only when the Ka/Ks value is lower than 0.5.</p> <p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[4]/caption[1]/p[2]">(56 KB DOC)</p> <a name="pgen-0040003-st003" id="pgen-0040003-st003"></a><p><strong xPathLocation="noSelect"><a href="/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pgen.0040003.st003">Table S3. </a>Substitutions between Orthologous Copies</strong></p><p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[5]/caption[1]/p[1]">The <i>p</i>-values are for the tests of the null hypothesis that Ka/Ks is significantly lower than 1.</p> <p xpathLocation="/article[1]/body[1]/sec[4]/supplementary-material[5]/caption[1]/p[2]">(52 KB DOC)</p> </div> <div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" xpathLocation="noSelect"><a id="ack" name="ack" toc="ack" title="Acknowledgments"></a><h3 xpathLocation="noSelect">Acknowledgments <a href="#top">Top</a></h3> <p xpathLocation="/article[1]/back[1]/ack[1]/p[1]">We would like to thank James Shapiro for insightful discussion regarding TEs; the M. Long lab for many helpful discussions; The University of Chicago sequencing center for sequencing PCR products; and the <i>Drosophila</i> Comparative Genome Sequencing, and Analysis Consortium for the genome sequences of the <i>melanogaster</i> subgroup.</p> </div><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" class="contributions"><a id="authcontrib" name="authcontrib" toc="authcontrib" title="Author Contributions"></a><h3 xpathLocation="noSelect">Author Contributions <a href="#top">Top</a></h3><p xpathLocation="noSelect"><span class="capture-id"> ML and WW conceived and designed the experiments. SY, JRA, ML, and WW analyzed data. SY, XL, YD, QZ, YC, YZ, RZ, FB, LP, and WW performed molecular and cytological experiments. JRA conducted computer simulation. SY, JRA, ML, and WW wrote the paper.</span></p></div><div xmlns:xs="http://www.w3.org/2001/XMLSchema" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:aml="http://topazproject.org/aml/" xpathLocation="noSelect"><a id="references" name="references" toc="references" title="References"></a><h3 xpathLocation="noSelect">References <a href="#top">Top</a></h3><ol class="references" xpathLocation="noSelect"><li xpathLocation="noSelect"><a name="pgen-0040003-b001" id="pgen-0040003-b001"></a><span class="authors">Kimura M</span> (1983) The neutral theory of molecular evolution. Cambridge: Cambridge University Press. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b002" id="pgen-0040003-b002"></a><span class="authors">Ohno S</span> (1970) Evolution by gene duplication. New York: Springer. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b003" id="pgen-0040003-b003"></a><span class="authors">Ohta T</span> (1983) On the evolution of multiplegene families. Theor Popul Biol 23: 216–240. <a class="find" href="/article/findArticle.action?author=Ohta&title=On the evolution of multiplegene families."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b004" id="pgen-0040003-b004"></a><span class="authors">Gilbert W</span> (1987) The exon theory of genes. Cold Spring Harb Symp Quant Biol 52: 901–905. <a class="find" href="/article/findArticle.action?author=Gilbert&title=The exon theory of genes."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b005" id="pgen-0040003-b005"></a><span class="authors">Long M, Betrán E, Thornton K, Wang W</span> (2003) The origin of new genes: glimpes from the young and old. Nat Rev Genet 4: 865–875. <a class="find" href="/article/findArticle.action?author=Long&title=The origin of new genes: glimpes from the young and old."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b006" id="pgen-0040003-b006"></a><span class="authors">Betran E, Thornton K, Long M</span> (2002) Retroposed new genes out of the X in Drosophila. Genome Res 12: 1854–1859. <a class="find" href="/article/findArticle.action?author=Betran&title=Retroposed new genes out of the X in Drosophila."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b007" id="pgen-0040003-b007"></a><span class="authors">Emerson JJ, Kaessmann H, Betran E, Long M</span> (2004) Extensive gene traffic on the mammalian X chromosome. Science 303: 537–540. <a class="find" href="/article/findArticle.action?author=Emerson&title=Extensive gene traffic on the mammalian X chromosome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b008" id="pgen-0040003-b008"></a><span class="authors">Marques AC, Dupanloup I, Vinckenbosch N, Reymond A, Kaessmann H</span> (2005) Emergence of young human genes after a burst of retroposition in primates. PLoS Biology 3: e357 doi:<a href="http://dx.doi.org/10.1371/journal.pbio.0030357">10.1371/journal.pbio.0030357</a>. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b009" id="pgen-0040003-b009"></a><span class="authors">Patthy L</span> (1995) Protein evolution By exon-shuffling. New York: Springer-Verlag. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b010" id="pgen-0040003-b010"></a><span class="authors">Gilbert W</span> (1978) Why genes in pieces? Nature 271: 44. <a class="find" href="/article/findArticle.action?author=Gilbert&title=Why genes in pieces?"> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b011" id="pgen-0040003-b011"></a><span class="authors">Van Rijk A, de Jong WW, Bloemendal H</span> (1999) Exon shuffling mimicked in cell culture. Proc Natl Acad Sci U S A 96: 8074–8079. <a class="find" href="/article/findArticle.action?author=Van Rijk&title=Exon shuffling mimicked in cell culture."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b012" id="pgen-0040003-b012"></a><span class="authors">Van Rijk A, Bloemendal H</span> (2003) Molecular mechanisms of exon shuffling: illegitimate recombination. Genetica 118: 245–249. <a class="find" href="/article/findArticle.action?author=Van Rijk&title=Molecular mechanisms of exon shuffling: illegitimate recombination."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b013" id="pgen-0040003-b013"></a><span class="authors">Kumatori A, Faizunnessa NN, Suzuki S, Moriuchi T, Kurozumi H, et al. </span> (1998) Nonhomologous recombination between the cytochrome b(558) heavy chain gene (CYBB) and LINE-1 causes an X-linked chronic granulomatous disease. Genomics 53: 123–128. <a class="find" href="/article/findArticle.action?author=Kumatori&title=Nonhomologous recombination between the cytochrome b(558) heavy chain gene (CYBB) and LINE-1 causes an X-linked chronic granulomatous disease."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b014" id="pgen-0040003-b014"></a><span class="authors">Linardopoulou EV, Williams EM, Fan Y, Friedman C, Young JM, et al. </span> (2005) Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication. Nature 437: 94–100. <a class="find" href="/article/findArticle.action?author=Linardopoulou&title=Human subtelomeres are hot spots of interchromosomal recombination and segmental duplication."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b015" id="pgen-0040003-b015"></a><span class="authors">Zucman-Rossi J, Legoix P, Victor J-M, Lopez B, Thomas G</span> (1998) Chromosome translocations based on illegitimate recombination in human tumors. Proc Natl Acad Sci U S A 95: 11786–11791. <a class="find" href="/article/findArticle.action?author=Zucman-Rossi&title=Chromosome translocations based on illegitimate recombination in human tumors."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b016" id="pgen-0040003-b016"></a><span class="authors">Shapiro JA</span> (2005) A 21st century view of evolution: genome system architecture, repetitive DNA, and natural genetic engineering. Gene 345: 91–100. <a class="find" href="/article/findArticle.action?author=Shapiro&title=A 21st century view of evolution: genome system architecture, repetitive DNA, and natural genetic engineering."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b017" id="pgen-0040003-b017"></a><span class="authors">Voelker RA, Greenleaf AL, Gyurkovics H, Wisely GB, Huang S-M, et al. </span> (1984) Frequent imprecise excision among reversions of a P element-caused lethal mutation in Drosophila. Genetics 107: 279–294. <a class="find" href="/article/findArticle.action?author=Voelker&title=Frequent imprecise excision among reversions of a P element-caused lethal mutation in Drosophila."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b018" id="pgen-0040003-b018"></a><span class="authors">Jiang N, Bao Z, Zhang X, Eddy SR, Wessler SR</span> (2004) Pack-MULE transposable elements mediate gene evolution in plants. Nature 431: 569–573. <a class="find" href="/article/findArticle.action?author=Jiang&title=Pack-MULE transposable elements mediate gene evolution in plants."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b019" id="pgen-0040003-b019"></a><span class="authors">Kapitonov VV, Jurka J</span> (2001) Self-synthesizing DNA transposons in eukaryotes. Proc Natl Acad Sci U S A 98: 8714–8719. <a class="find" href="/article/findArticle.action?author=Kapitonov&title=Self-synthesizing DNA transposons in eukaryotes."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b020" id="pgen-0040003-b020"></a><span class="authors">Lal SK, Giroux MJ, Brendel V, Vallejos CE, Hannah LC</span> (2003) The maize genome contains a helitron insertion. Plant Cell 15: 381–391. <a class="find" href="/article/findArticle.action?author=Lal&title=The maize genome contains a helitron insertion."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b021" id="pgen-0040003-b021"></a><span class="authors">Alexander JRB, Schiestl RH</span> (2000) Homologous recombination as a mechanism for genome rearrangements: environmental and genetic effects. Hum Mol Genet 9: 2427–2334. <a class="find" href="/article/findArticle.action?author=Alexander&title=Homologous recombination as a mechanism for genome rearrangements: environmental and genetic effects."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b022" id="pgen-0040003-b022"></a><span class="authors">Babcock M, Pavlicek A, Spiteri E, Kashork CD, Ioshikhes I, et al. </span> (2003) Shuffling of genes within low-copy repeats on 22q11 (LCR22) by Alu–mediated recombination events during evolution. Genome Res 13: 2519–2532. <a class="find" href="/article/findArticle.action?author=Babcock&title=Shuffling of genes within low-copy repeats on 22q11 (LCR22) by Alu%E2%80%93mediated recombination events during evolution."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b023" id="pgen-0040003-b023"></a><span class="authors">Bailey J, Liu G, Eichler EE</span> (2003) An Alu transposition model for the origin and expansion of human segmental duplications. Am J Hum Genet 73: 823–834. <a class="find" href="/article/findArticle.action?author=Bailey&title=An Alu transposition model for the origin and expansion of human segmental duplications."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b024" id="pgen-0040003-b024"></a><span class="authors">Bailey JA, Yavor AM, Viggiano L, Misceo D, Horvath JE, et al. </span> (2002) Human-specific duplication and mosaic transcripts: the recent paralogous structure of chromosome 22. Am J Hum Genet 70: 38–100. <a class="find" href="/article/findArticle.action?author=Bailey&title=Human-specific duplication and mosaic transcripts: the recent paralogous structure of chromosome 22."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b025" id="pgen-0040003-b025"></a><span class="authors">Sharp AJ, Cheng Z, Eichler EE</span> (2006) Structural variation of the human genome. Annu Rev Genomics Hum Genet 7: 407–442. <a class="find" href="/article/findArticle.action?author=Sharp&title=Structural variation of the human genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b026" id="pgen-0040003-b026"></a><span class="authors">Stankiewicz P, Lupski JR</span> (2002) Molecular-evolutionary mechanisms for genomic disorders. Curr Opin Genet Dev 12: 312–319. <a class="find" href="/article/findArticle.action?author=Stankiewicz&title=Molecular-evolutionary mechanisms for genomic disorders."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b027" id="pgen-0040003-b027"></a><span class="authors">Ciccarelli FD, von Mering C, Suyama M, Harrington ED, Izaurralde E, et al. </span> (2005) Complex genomic rearrangements lead to novel primate gene function. Genome Res 15: 343–351. <a class="find" href="/article/findArticle.action?author=Ciccarelli&title=Complex genomic rearrangements lead to novel primate gene function."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b028" id="pgen-0040003-b028"></a><span class="authors">Inoue K, Lupski JR</span> (2002) Molecular mechanisms for genomic disorders. Annu Rev Genomics Hum Genet 3: 199–242. <a class="find" href="/article/findArticle.action?author=Inoue&title=Molecular mechanisms for genomic disorders."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b029" id="pgen-0040003-b029"></a><span class="authors">Kidwell MG, Lisch DR</span> (2001) Perspective: transposable elements, parasitic DNA, and genome evolution. Evolution 55: 1–24. <a class="find" href="/article/findArticle.action?author=Kidwell&title=Perspective: transposable elements, parasitic DNA, and genome evolution."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b030" id="pgen-0040003-b030"></a><span class="authors">Petrov DA, Lozovskaya ER, Hartl DL</span> (1996) High intrinsic rate of DNA loss in Drosophila. Nature 384: 346–349. <a class="find" href="/article/findArticle.action?author=Petrov&title=High intrinsic rate of DNA loss in Drosophila."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b031" id="pgen-0040003-b031"></a><span class="authors">Bailey J, Eichler EE</span> (2006) Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet 7: 552–564. <a class="find" href="/article/findArticle.action?author=Bailey&title=Primate segmental duplications: crucibles of evolution, diversity and disease."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b032" id="pgen-0040003-b032"></a><span class="authors">Arguello JR, Chen Y, Yang S, Wang W, Long M</span> (2006) Origination of an X-linked testes-specific chimeric gene by illegitimate recombination in Drosophila. PLoS Genetics 2: e77 doi:<a href="http://dx.doi.org/10.1371/journal.pgen.0020077">10.1371/journal.pgen.0020077</a>. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b033" id="pgen-0040003-b033"></a><span class="authors">Long MY, Langley CH</span> (1993) Natural-selection and the origin of jingwei, a chimeric processed functional gene in drosophila. Science 260: 91–95. <a class="find" href="/article/findArticle.action?author=Long&title=Natural-selection and the origin of jingwei, a chimeric processed functional gene in drosophila."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b034" id="pgen-0040003-b034"></a><span class="authors">Wang W, Brunet FG, Nevo E, Long M</span> (2002) Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster. Proc Natl Acad Sci USA 99: 4448–4453. <a class="find" href="/article/findArticle.action?author=Wang&title=Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b035" id="pgen-0040003-b035"></a><span class="authors">Wang WY, Yu HJ, Long M</span> (2004) Duplication-degeneration as a mechanism of gene fission and the origin of new genes in Drosophila species. Nat Genet 36: 523–527. <a class="find" href="/article/findArticle.action?author=Wang&title=Duplication-degeneration as a mechanism of gene fission and the origin of new genes in Drosophila species."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b036" id="pgen-0040003-b036"></a><span class="authors">Betran E, Long M</span> (2003) Dntf-2r, a young Drosophila retroposed gene with specific male expression under positive Darwinian selection. Genetics 164: 977–988. <a class="find" href="/article/findArticle.action?author=Betran&title=Dntf-2r, a young Drosophila retroposed gene with specific male expression under positive Darwinian selection."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b037" id="pgen-0040003-b037"></a><span class="authors">Force A, Lynch M, Pickett FB, Amores A, Yan YL, et al. </span> (1999) Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151: 1531–1545. <a class="find" href="/article/findArticle.action?author=Force&title=Preservation of duplicate genes by complementary, degenerative mutations."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b038" id="pgen-0040003-b038"></a><span class="authors">Lynch M, Force A</span> (2000) The probability of duplicate gene preservation by subfunctionalization. Genetics 154: 459–473. <a class="find" href="/article/findArticle.action?author=Lynch&title=The probability of duplicate gene preservation by subfunctionalization."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b039" id="pgen-0040003-b039"></a><span class="authors">Johnson ME, NISC Comparative Sequencing Program,Cheng Z, Morrison VA, Scherer S, et al. </span> (2006) Recurrent duplication-driven transposition of DNA during hominoid evolution. Proc Natl Acad Sci U S A 103: 17626–17631. <a class="find" href="/article/findArticle.action?author=Johnson&title=Recurrent duplication-driven transposition of DNA during hominoid evolution."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b040" id="pgen-0040003-b040"></a><span class="authors">Katju V, Lynch M</span> (2006) On the formation of novel genes by duplication in the Caenorhabditis elegans genome. Mol Biol Evol 23: 11056–11067. <a class="find" href="/article/findArticle.action?author=Katju&title=On the formation of novel genes by duplication in the Caenorhabditis elegans genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b041" id="pgen-0040003-b041"></a><span class="authors">López-Correa C, Dorschner M, Brems H, Lázaro C, Clementi M, et al. </span> (2001) Recombination hotspot in NF1 microdeletion patients. Hum Mol Genet 10: 1387–1392. <a class="find" href="/article/findArticle.action?author=L%C3%B3pez-Correa&title=Recombination hotspot in NF1 microdeletion patients."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b042" id="pgen-0040003-b042"></a><span class="authors">Kapitonov VV, Jurka J</span> (2003) Molecular paleontology of transposable elements in the Drosophila melanogaster genome. Proc Natl Acad Sci U S A 100: 6569–6574. <a class="find" href="/article/findArticle.action?author=Kapitonov&title=Molecular paleontology of transposable elements in the Drosophila melanogaster genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b043" id="pgen-0040003-b043"></a><span class="authors">Yang HP, Hung TL, You ZL, Yang ZH</span> (2006) Genomewide comparative analysis of the highly abundant transposable element DINE-1 suggests a recent transpositional burst in Drosophila yakuba. Genetics 173: 189–196. <a class="find" href="/article/findArticle.action?author=Yang&title=Genomewide comparative analysis of the highly abundant transposable element DINE-1 suggests a recent transpositional burst in Drosophila yakuba."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b044" id="pgen-0040003-b044"></a><span class="authors">Bergman CM, Quesneville H, Anxolabéhère D, Ashburner M</span> (2006) Recurrent insertion and duplication generate networks of transposable element sequences in the D. melanogaster genome. Genome Biol 7: R112. <a class="find" href="/article/findArticle.action?author=Bergman&title=Recurrent insertion and duplication generate networks of transposable element sequences in the D. melanogaster genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b045" id="pgen-0040003-b045"></a><span class="authors">Galindo MI, Ladeveze V, Lemeunier F, Kalmes R, Periquet G, et al. </span> (1995) Spread of the autonomous transposable element hobo in the genome of Drosophila melanogaster. Mol Biol Evol 12: 723–734. <a class="find" href="/article/findArticle.action?author=Galindo&title=Spread of the autonomous transposable element hobo in the genome of Drosophila melanogaster."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b046" id="pgen-0040003-b046"></a><span class="authors">Lorenc A, Makalowski W</span> (2003) Transposable elements and vertebrate protein diversity. Genetica 118: 183–191. <a class="find" href="/article/findArticle.action?author=Lorenc&title=Transposable elements and vertebrate protein diversity."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b047" id="pgen-0040003-b047"></a><span class="authors">Nekrutenko A, Li WH</span> (2001) Transposable elements are found in a large number of human protein-coding genes. Trends Genet 17: 619–621. <a class="find" href="/article/findArticle.action?author=Nekrutenko&title=Transposable elements are found in a large number of human protein-coding genes."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b048" id="pgen-0040003-b048"></a><span class="authors">Zhou Y, Mishra B</span> (2005) Quantifying the mechanisms for segmental duplications in mammalian genomes by statistical analysis and modeling. Proc Natl Acad Sci U S A 15: 4051–4056. <a class="find" href="/article/findArticle.action?author=Zhou&title=Quantifying the mechanisms for segmental duplications in mammalian genomes by statistical analysis and modeling."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b049" id="pgen-0040003-b049"></a><span class="authors">Conrad DF, Andrews TD, Carter NP, Hurles ME, Pritchard JK</span> (2006) high-resolution survey of deletion polymorphism in the human genome. Nat Genet 38: 75–81. <a class="find" href="/article/findArticle.action?author=Conrad&title=high-resolution survey of deletion polymorphism in the human genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b050" id="pgen-0040003-b050"></a><span class="authors">Redon RIS, Fitch KR, Feuk L, Perry GH, Andrews TD, et al. </span> (2006) Global variation in copy number in the human genome. Nature 444: 444–454. <a class="find" href="/article/findArticle.action?author=Redon&title=Global variation in copy number in the human genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b051" id="pgen-0040003-b051"></a><span class="authors">Nurminsky DI, Nurminskaya MV, De Aguiar D, Hartl DL</span> (1998) Selective sweep of a newly evolved sperm-specific gene in Drosophila. Nature 396: 572–575. <a class="find" href="/article/findArticle.action?author=Nurminsky&title=Selective sweep of a newly evolved sperm-specific gene in Drosophila."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b052" id="pgen-0040003-b052"></a><span class="authors">Gao LZ, Innan H</span> (2004) Very low gene duplication rate in the yeast genome. Science 306: 1367–1370. <a class="find" href="/article/findArticle.action?author=Gao&title=Very low gene duplication rate in the yeast genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b053" id="pgen-0040003-b053"></a><span class="authors">Lynch M, Conery JS</span> (2000) The evolutionary fate and consequences of duplicate genes. Science 290: 1151–1155. <a class="find" href="/article/findArticle.action?author=Lynch&title=The evolutionary fate and consequences of duplicate genes."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b054" id="pgen-0040003-b054"></a><span class="authors">Cordaux R, Udit S, Batzer MA, Feschotte C</span> (2006) Birth of a chimeric primate gene by capture of the transposase gene from a mobile element. Proc Natl Acad Sci USA 103: 8101–8106. <a class="find" href="/article/findArticle.action?author=Cordaux&title=Birth of a chimeric primate gene by capture of the transposase gene from a mobile element."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b055" id="pgen-0040003-b055"></a><span class="authors">Makalowski W, Mitchell GA, Labuda D</span> (1994) Alu sequences in the coding regions of mRNA: a source of protein variability. Trends Genet 10: 188–193. <a class="find" href="/article/findArticle.action?author=Makalowski&title=Alu sequences in the coding regions of mRNA: a source of protein variability."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b056" id="pgen-0040003-b056"></a><span class="authors">Morgante M, Brunner S, Pea G, Fengler K, Zuccolo A, et al. </span> (2005) Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize. Nat Genet 37: 997–1002. <a class="find" href="/article/findArticle.action?author=Morgante&title=Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b057" id="pgen-0040003-b057"></a><span class="authors">Capy P</span> (1998) A plastic genome. Nature 396: 522–523. <a class="find" href="/article/findArticle.action?author=Capy&title=A plastic genome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b058" id="pgen-0040003-b058"></a><span class="authors">Schlenke TA, Begun DJ</span> (2004) Strong selective sweep associated with a transposon insertion in Drosophila simulans. Proc Natl Acad Sci USA 101: 1626–1631. <a class="find" href="/article/findArticle.action?author=Schlenke&title=Strong selective sweep associated with a transposon insertion in Drosophila simulans."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b059" id="pgen-0040003-b059"></a><span class="authors">Brookfield JFY</span> (2004) Evolutionary genetics: mobile DNAs as sources of adaptive change? Curr Biol 14: R344–R345. <a class="find" href="/article/findArticle.action?author=Brookfield&title=Evolutionary genetics: mobile DNAs as sources of adaptive change?"> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b060" id="pgen-0040003-b060"></a><span class="authors">Powell JR</span> (1997) Progress and prospects in evolutionary biology—the Drosophila model. New York: Oxford University Press. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b061" id="pgen-0040003-b061"></a><span class="authors">Lachaise D, Harry M, Solignac M, Lemeunier F, Benassi V, et al. </span> (2000) Evolutionary novelties in islands: Drosophila santomea, a new melanogaster sister species from Sao Tome. Proc R Soc Lond B Biol Sci 267: 1487–1495. <a class="find" href="/article/findArticle.action?author=Lachaise&title=Evolutionary novelties in islands: Drosophila santomea, a new melanogaster sister species from Sao Tome."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b062" id="pgen-0040003-b062"></a><span class="authors">Wang W, Zhang J, Alvarez C, Llopart A, Long M</span> (2000) The origin of the Jingwei gene and the complex modular structure of its parental gene, yellow emperor, in Drosophila melanogaster. Mol Biol Evol 17: 1294–1301. <a class="find" href="/article/findArticle.action?author=Wang&title=The origin of the Jingwei gene and the complex modular structure of its parental gene, yellow emperor, in Drosophila melanogaster."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b063" id="pgen-0040003-b063"></a><span class="authors">Tatusova TA, Madden TL</span> (1999) Blast 2 sequences—a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett 174: 247–250. <a class="find" href="/article/findArticle.action?author=Tatusova&title=Blast 2 sequences%E2%80%94a new tool for comparing protein and nucleotide sequences."> Find this article online </a></li><li xpathLocation="noSelect"><a name="pgen-0040003-b064" id="pgen-0040003-b064"></a><span class="authors">Kumar S, Tamura K, Nei M</span> (2004) Brief Bioinform. 5: 150–163. </li><li xpathLocation="noSelect"><a name="pgen-0040003-b065" id="pgen-0040003-b065"></a><span class="authors">Salamov AA, Solovyev VV</span> (2000) Ab initio gene finding in Drosophila genomic DNA. Genome Res 10: 516–522. <a class="find" href="/article/findArticle.action?author=Salamov&title=Ab initio gene finding in Drosophila genomic DNA."> Find this article online </a></li></ol></div> </div> </div> <div style="display:none"> <div dojoType="ambra.widget.RegionalDialog" id="AnnotationDialog" style="padding:0;margin:0;"> <div class="dialog annotate"> <div class="tipu" id="dTipu"></div> <div class="comment"> <h5><span class="commentPrivate">Add Your Note (For Private Viewing)</span><span class="commentPublic">Post Your Note (For Public Viewing)</span></h5> <div class="posting pane"> <form name="createAnnotation" id="createAnnotation" method="post" action=""> <input type="hidden" name="target" value="info:doi/10.1371/journal.pgen.0040003" /> <input type="hidden" name="startPath" value="" /> <input type="hidden" name="startOffset" value="" /> <input type="hidden" name="endPath" value="" /> <input type="hidden" name="endOffset" value="" /> <input type="hidden" name="commentTitle" id="commentTitle" value="" /> <input type="hidden" name="comment" id="commentArea" value="" /> <input type="hidden" name="ciStatement" id="statementArea" value="" /> <input type="hidden" name="isCompetingInterest" id="isCompetingInterest" value="false" /> <input type="hidden" name="noteType" id="noteType" value="" /> <fieldset> <legend>Compose Your Note</legend> <span id="submitMsg" class="error" style="display:none;"></span> <table class="layout"> <tr> <td> <label for="cNoteType">This is a </label><select name="cNoteType" id="cNoteType"><option value="note">note</option><option value="correction">correction</option></select> <span id="cdls" style="visibility:hidden;margin-left:0.3em; white-space:nowrap;"><a href="/static/commentGuidelines.action?target=info%3Adoi%2F10.1371%2Fjournal.pgen.0040003#corrections">What are corrections?</a></span> <label for="cTitle" class="commentPublic"><span class="none">Enter your note title</span><!-- error message text <em>A title is required for all public notes</em>--></label> <input type="text" name="cTitle" id="cTitle" value="Enter your note title..." class="title commentPublic" alt="Enter your note title..." /> <label for="cArea"><span class="none">Enter your note</span><!-- error message text <em>Please enter your note</em>--></label> <textarea name="cArea" id="cArea" value="Enter your note..." alt="Enter your note...">Enter your note...</textarea> <input type="hidden" name="isPublic" value="true" /> </td> <td> </td> <td class="coi"> <fieldset> <legend>Declare any competing interests.</legend> <ul> <li><label><input id="isCompetingInterestNo" type="radio" checked="checked" name="competingInterest" value="false" /> No, I don't have any competing interests to declare.</label></li> <li><label><input id="isCompetingInterestYes" type="radio" name="competingInterest" value="true" /> Yes, I have competing interests to declare (enter below):</label></li> </ul> <textarea name="ciStatementArea" id="ciStatementArea" disabled value="Enter your competing interests..." alt="Enter your competing interests...">Enter your competing interests...</textarea> </fieldset> </td> </tr> <tr> <td colspan="3" class="buttons"> <input type="button" value="Cancel" title="Click to close and cancel" id="btn_cancel"/> <input type="button" value="Submit" title="Click to post your note publicly" id="btn_post" class="primary"/> </td> </tr> </table> </fieldset> </form> </div> </div> <div class="tip" id="dTip"></div> </div> </div><div dojoType="ambra.widget.ContextAction" id="ContextActionDialog" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="caTipu"></div> <div class="contextActionContent"> <h5><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> Please follow our <a href="/static/commentGuidelines.action">guidelines for notes and comments</a> and review our <a href="/static/competing.action">competing interests policy</a>. Comments that do not conform to our guidelines will be promptly removed and the user account disabled. The following must be avoided: <ul> <li>Remarks that could be interpreted as allegations of misconduct</li> <li>Unsupported assertions or statements</li> <li>Inflammatory or insulting language</li> </ul> <form name="contextActionForm" id="contextActionForm" class="clearfix buttons" method="post" action=""> <input type="button" name="Continue" value="Continue" id="ContextActionDialogContinueButton" onmouseup="ambra.displayAnnotationContext.startComment(event);" title="Add a note to this text" class="primary"/> <input type="button" name="Cancel" value="Cancel" id="ContextActionDialogCancelButton" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);" title="Close this Window"/> </form> </div> <div class="tip" id="caTip"></div> </div> </div> <div dojoType="ambra.widget.ContextAction" id="ContextActionDialogNotLogged" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="canlTipu"></div> <div class="contextActionContent"> <h5><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> You must be logged in to add a note to an article. You may log in by <a onmousedown="ambra.displayAnnotationContext.disconnect(event);" href="/user/secure/secureRedirect.action?goTo=%2Farticle%2Finfo%3Adoi%2F10.1371%2Fjournal.pgen.0040003">clicking here</a> or <a href="#" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);">cancel this note</a>. </div> <div class="tip" id="canlTip"></div> </div> </div> <div dojoType="ambra.widget.ContextAction" id="ContextActionDialogBadSelection" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="canBDTipu"></div> <div class="contextActionContent"> <h5 class="annotation icon"><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> You cannot annotate this area of the document. <a href="#" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);">Close</a> </div> <div class="tip" id="canBDTip"></div> </div> </div> <div dojoType="ambra.widget.ContextAction" id="ContextActionDialogBadRangeSelection" class="contextActionDialog"> <div class="dialog context"> <div class="tipu" id="canbrTipu"></div> <div class="contextActionContent"> <h5><img src="/images/tooltip_addannotation.gif" /> Add a note to this text.</h5> You cannot create an annotation that spans different sections of the document; please adjust your selection.<br/> <a href="#" onclick="return false;" onmouseup="ambra.displayAnnotationContext.cancelContext(event);">Close</a> </div> <div class="tip" id="canbrTip"></div> </div> </div> <div dojoType="ambra.widget.RegionalDialog" id="CommentDialog" style="padding:0;margin:0;"> <div class="dialog preview"> <div class="tipu" id="cTipu"></div> <div class="btn close" id="btn_close" title="Click to close"><a title="Click to close">Close</a></div> <div id="cmtContainer" class="comment"> <h6 id="viewCmtTitle"></h6> <div class="detail" id="viewCmtDetail"></div> <div class="contentwrap" id="viewComment"></div> <div class="contentwrap" id="viewCIStatement"></div> <div class="detail" id="viewLink"> <!--<a href="#" class="commentary icon" title="Click to view full thread and respond">View all responses</a> <a href="#" class="respond tooltip" title="Click to respond to this posting">Respond to this</a>--> </div> </div> <div class="tip" id="cTip"></div> </div> </div> <div dojoType="ambra.widget.RegionalDialog" id="CommentDialogMultiple" style="padding:0;margin:0;"> <div class="dialog multiple preview"> <div class="tipu" id="mTipu"></div> <div class="btn close" id="btn_close_multi" title="Click to close"><a title="Click to close">Close</a></div> <ol id="multilist"></ol> <br/> <div id="multidetail"></div> <div class="tip" id="mTip"></div> </div> </div> <div dojoType="dijit.Dialog" id="Rating"> <div class="dialog annotate"> <div class="tipu" id="dTipu"></div> <div class="comment"> <h5><span class="commentPublic">Rate This Article</span></h5> <div class="instructions">Please follow our <a href="/static/ratingGuidelines.action">guidelines for rating</a> and review our <a href="/static/competing.action">competing interests policy</a>. Comments that do not conform to our guidelines will be promptly removed and the user account disabled. The following must be avoided: <ol> <li>Remarks that could be interpreted as allegations of misconduct</li> <li>Unsupported assertions or statements</li> <li>Inflammatory or insulting language</li> </ol> </div> <div class="posting pane"> <form name="ratingForm" id="ratingForm" method="post" action=""> <input type="hidden" name="articleURI" value="info:doi/10.1371/journal.pgen.0040003" /> <input type="hidden" name="commentTitle" id="commentTitle" value="" /> <input type="hidden" name="comment" id="commentArea" value="" /> <input type="hidden" name="ciStatement" id="statementArea" value="" /> <input type="hidden" name="isCompetingInterest" id="isCompetingInterest" value="" /> <fieldset> <legend>Compose Your Annotation</legend> <span id="submitRatingMsg" class="error" style="display:none;"></span> <table class="layout"> <tr> <td rowspan="2"> <label for="insight">Insight</label> <ul class="star-rating rating edit" title="Rate insight" id="rateInsight"> <li class="current-rating pct0"></li> <li><a href="javascript:void(0);" title="Bland" class="one-star" onclick="ambra.rating.setRatingCategory(this, 'insight', 1);">1</a></li> <li><a href="javascript:void(0);" title="" class="two-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 2);">2</a></li> <li><a href="javascript:void(0);" title="" class="three-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 3);">3</a></li> <li><a href="javascript:void(0);" title="" class="four-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 4);">4</a></li> <li><a href="javascript:void(0);" title="Profound" class="five-stars" onclick="ambra.rating.setRatingCategory(this, 'insight', 5);">5</a></li> </ul> <input type="hidden" name="insight" title="insight" value="" /> <label for="reliability">Reliability</label> <ul class="star-rating rating edit" title="Rate reliability" id="rateReliability"> <li class="current-rating pct0"></li> <li><a href="javascript:void(0);" title="Tenuous" class="one-star" onclick="ambra.rating.setRatingCategory(this, 'reliability', 1);">1</a></li> <li><a href="javascript:void(0);" title="" class="two-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 2);">2</a></li> <li><a href="javascript:void(0);" title="" class="three-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 3);">3</a></li> <li><a href="javascript:void(0);" title="" class="four-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 4);">4</a></li> <li><a href="javascript:void(0);" title="Unassailable" class="five-stars" onclick="ambra.rating.setRatingCategory(this, 'reliability', 5);">5</a></li> </ul> <input type="hidden" name="reliability" title="reliability" value="" /> <label for="style">Style</label> <ul class="star-rating rating edit" title="Rate style" id="rateStyle"> <li class="current-rating pct0"></li> <li><a href="javascript:void(0);" title="Crude" class="one-star" onclick="ambra.rating.setRatingCategory(this, 'style', 1);">1</a></li> <li><a href="javascript:void(0);" title="" class="two-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 2);">2</a></li> <li><a href="javascript:void(0);" title="" class="three-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 3);">3</a></li> <li><a href="javascript:void(0);" title="" class="four-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 4);">4</a></li> <li><a href="javascript:void(0);" title="Elegant" class="five-stars" onclick="ambra.rating.setRatingCategory(this, 'style', 5);">5</a></li> </ul> <input type="hidden" name="style" title="style" value="" /> <label for="cTitle" class="commentPublic"><span class="none">Enter your comment title</span><!-- error message text <em>A title is required for all public annotations</em>--></label> <input type="text" name="cTitle" id="cTitle" value="Enter your comment title..." class="title commentPublic" alt="Enter your comment title..." /> <label for="cArea"><span class="none">Enter your comment</span><!-- error message text <em>Please enter your annotation</em>--></label> <textarea name="cArea" id="cArea" value="Enter your comment..." alt="Enter your comment...">Enter your comment...</textarea> </td> <td rowspan="2"> </td> <td class="coi"> <fieldset> <legend>Declare any competing interests.</legend> <ul> <li><label><input id="isCompetingInterestNo" type="radio" name="competingInterest" value="false" /> No, I don't have any competing interests to declare.</label></li> <li><label><input id="isCompetingInterestYes" type="radio" name="competingInterest" value="true" /> Yes, I have competing interests to declare (enter below):</label></li> </ul> <textarea name="ciStatementArea" id="ciStatementArea" disabled value="Enter your competing interests..." title="Enter your competing interests...">Enter your competing interests...</textarea> </fieldset> </td> </tr> <tr> <td class="buttons"> <input type="button" value="Cancel" title="Click to close and cancel" id="btn_cancel_rating"/> <input type="button" value="Submit" title="Click to post your annotation publicly" id="btn_post_rating" class="primary"/> </td> </tr> </table> </fieldset> </form> </div> </div> </div> </div> <div dojoType="ambra.widget.LoadingCycle" id="LoadingCycle" class="loadingCycler"> <img src="/images/loading.gif" width="58" height="58" title="Loading..." /> </div> </div> </div> <!-- end : main contents --> </div> <!-- end : container --> <!-- begin : footer --> <div id="ftr"> <p><span>All site content, except where otherwise noted, is licensed under a <a href="http://creativecommons.org/licenses/by/2.5/" title="Creative Commons Attribution License 2.5" tabindex="200">Creative Commons Attribution License</a>.</span></p> <ul> <li><a href="/static/privacy.action" title="PLoS Privacy Statement" tabindex="501">Privacy Statement</a></li> <li><a href="/static/terms.action" title="PLoS Terms of Use" tabindex="502">Terms of Use</a></li> <li><a href="http://www.plos.org/advertise/" title="Advertise With PLoS" tabindex="503">Advertise</a></li> <li><a href="http://www.plos.org/journals/embargopolicy.html" title="PLoS Embargo Policy" tabindex="504">Media Inquiries</a></li> <li><a href="http://www.plos.org/journals/print.html" title="PLoS in Print" tabindex="505">PLoS in Print</a></li> <li><a href="/static/sitemap.action" title="Site Map" tabindex="506">Site Map</a></li> <li><a href="http://www.plos.org" title="PLoS.org" tabindex="507">PLoS.org</a></li> </ul> <div class="powered"> <ul> <li><a href="/static/releaseNotes.action" title="Ambra | Release Notes">Ambra 0.9.4 beta</a></li> <li>Managed Colocation provided by <a href="http://www.unitedlayer.com/" title="UnitedLayer: Built on IP Services">UnitedLayer</a>.</li> </ul> </div> </div> <!-- end : footer --> <script type="text/javascript"> var _namespace=""; var loggedIn = false; var almHost = "http://alm.plos.org"; // Safari v3.1.1 "console.debug" issue (http://trac.dojotoolkit.org/ticket/6849) workaround if (/3[\.0-9]+ Safari/.test(navigator.appVersion)) { window.console = { origConsole: window.console, log: function(s){ this.origConsole.log(s); }, info: function(s){ this.origConsole.info(s); }, error: function(s){ this.origConsole.error(s); }, warn: function(s){ this.origConsole.warn(s); } }; } var djConfig = { // don't debug for IE - as dojo's firebug lite module is error prone in IE isDebug: false, parseOnLoad: true }; </script> <script type="text/javascript" src="/javascript/dojo/dojo/dojo.js"></script> <script type="text/javascript" src="/javascript/dojo/dojo/ambra.js"></script> <script type="text/javascript" src="/javascript/init_global.js"></script> <script type="text/javascript" src="/javascript/init_article.js"></script> <script type="text/javascript" src="/javascript/init_ratings.js"></script> <script type="text/javascript" src="/javascript/init_article_body.js"></script> <script type="text/javascript" src="/javascript/init_article_rhc.js"></script> <script type="text/javascript" src="/javascript/alm.js"></script> <script type="text/javascript" src="/javascript/reporting/articleViewsCumulative.js"></script> <script type="text/javascript"> var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www."); document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E")); </script> <script type="text/javascript"> var pageTracker = _gat._getTracker("UA-338393-1"); pageTracker._trackPageview(); pageTracker._setDomainName("www.plosgenetics.org"); </script> </body> </html>