<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>1471-2164-12-440</ui>
<ji>1471-2164</ji>
<fm>
<dochead>Research article</dochead>
<bibl>
<title><p>Retrotransposon-centered analysis of piRNA targeting shows a shift from active to passive retrotransposon transcription in developing mouse testes</p></title>
<aug>
<au ca="yes" id="A1"><snm>Mourier</snm><fnm>Tobias</fnm><insr iid="I1"/><email>tmourier@snm.ku.dk</email></au>
</aug>
<insg>
<ins id="I1"><p>Centre for GeoGenetics, Natural History Museum, University of Copenhagen, &#216;ster Voldgade 5-7, DK-1350 Copenhagen K, Denmark</p></ins>
</insg>
<source>BMC Genomics</source>
<issn>1471-2164</issn>
<pubdate>2011</pubdate>
<volume>12</volume>
<issue>1</issue>
<fpage>440</fpage>
<url>http://www.biomedcentral.com/1471-2164/12/440</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-12-440</pubid><pubid idtype="pmpid">21884594</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>15</day><month>4</month><year>2011</year></date></rec><acc><date><day>1</day><month>9</month><year>2011</year></date></acc><pub><date><day>1</day><month>9</month><year>2011</year></date></pub></history><cpyrt><year>2011</year><collab>Mourier; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>Piwi-associated RNAs (piRNAs) bind transcripts from retrotransposable elements (RTE) in mouse germline cells and seemingly act as guides for genomic methylation, thereby repressing the activity of RTEs. It is currently unknown if and how Piwi proteins distinguish RTE transcripts from other cellular RNAs. During germline development, the main target of piRNAs switch between different types of RTEs. Using the piRNA targeting of RTEs as an indicator of RTE activity, and considering the entire population of genomic RTE loci along with their age and location, this study aims at further elucidating the dynamics of RTE activity during mouse germline development.</p>
</sec>
<sec><st><p>Results</p></st>
<p>Due to the inherent sequence redundancy between RTE loci, assigning piRNA targeting to specific loci is problematic. This limits the analysis, although certain features of piRNA targeting of RTE loci are apparent. As expected, young RTEs display a much higher level of piRNA targeting than old RTEs. Further, irrespective of age, RTE loci near protein-coding coding genes are targeted to a greater extent than RTE loci far from genes. During development, a shift in piRNA targeting is observed, with a clear increase in the relative piRNA targeting of RTEs residing within boundaries of protein-coding gene transcripts.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Reanalyzing published piRNA sequences and taking into account the features of individual RTE loci provide novel insight into the activity of RTEs during development. The obtained results are consistent with some degree of proportionality between what transcripts become substrates for Piwi protein complexes and the level by which the transcripts are present in the cell. A transition from active transcription of RTEs to passive co-transcription of RTE sequences residing within protein-coding transcripts appears to take place in postnatal development. Hence, the previously reported increase in piRNA targeting of SINEs in postnatal testis development does not necessitate widespread active transcription of SINEs, but may simply be explained by the prevalence of SINEs residing in introns.</p>
</sec>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p>Retrotransposable elements (RTE) constitute a significant proportion of mammalian genomes. The RTEs proliferate through an RNA stage that is subsequently reverse transcribed back to genomic DNA <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The high level of divergence in RTE insertions between closely related organisms <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and the link between RTE insertions and diseases <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp> witness the ongoing activity of RTEs in mammalian genomes. Several genomic mechanisms are devised to minimize the proliferation of RTEs acting both at pre- and post-transcriptional levels <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>.</p>
<sec><st><p>Mouse retrotransposable elements</p></st>
<p>Around 40 percent of the mouse genome consists of RTE sequence, slightly lower than observed for the human genome, although this presumably is a result of the higher substitution rate in mouse, limiting the identification of old RTE sequence <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. RTEs are divided according to the presence or absence of long terminal repeats (LTR). Mammalian LTR elements consist mainly of endogenous retroviruses (ERV) that at some point during evolution have been inserted in the germline and fixed. Although the amount of sequence occupied by LTR elements is comparable between human and mouse, the level of <it>de novo </it>mutations caused by LTR element activity is extensively higher in mouse than in human <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B14">14</abbr></abbrgrp>. The most abundant ERV class in the mouse genome (~5.5%) is the Class III ERVs, which in the RepeatMasker <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> annotation - upon which this study is based - is broadly divided in two groups, the ERVL and MaLR elements. The latter is a non-autonomous transposon, meaning that the elements do not encode the enzymatic machinery required for its own transposition. The Class II ERVs (~4% of the genome), annotated as ERVK in RepeatMasker, is believed to be younger than Class III ERVs <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and consists of a broad range of clades, including the IAP elements (Intracisternal A-type Particles). Class I ERVs (ERV1 in RepeatMasker) cover less than 1% of the mouse genome. Through ectopic recombination between the flanking LTR sequences, solitary LTR sequences may be formed. In RepeatMasker, terminal LTR sequences and the internal sequences (residing between the terminal LTRs in a complete LTR retrotransposon) are annotated independently. Although the terminal and internal sequences may in many cases be determined to form a single LTR retrotransposon, for simplicity, the two annotations (termed 'LTRter' and 'LTRint', respectively) are analysed independently in this study.</p>
<p>Non-LTR retrotransposons are divided into LINEs (Long INterspersed Elements) that are autonomous, and SINEs (Short INterspersed Elements) that are non-autonomous. LINEs occupy roughly 20% of the mouse genome. The majority of mouse LINE elements belong to the L1 superfamily, which contains sub-families that are still active <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. Despite the comparable levels of genome occupied by LINE sequences in human and mouse <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, there are more than 15 times as many full-length L1 elements with intact open reading frames in the mouse genome <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Almost 1.5 million SINE elements are present in the mouse genome, making up approximately 8% of the total genome size. Unlike the human genome where a single SINE, the Alus, is dominating <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, the mouse genome harbours two successful superfamilies of SINEs, Alu and B2 that are present in equal numbers <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The evolutionary histories of the mouse SINEs are truly different; Alus are derived from a 7SL RNA, whereas B2s evolved from a tRNA sequence <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Piwi proteins and small RNAs</p></st>
<p>Piwi-associated RNAs (piRNAs) are small (24-30 nucleotides long) RNAs that bind Piwi proteins of the Argonaute family <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. The mouse genome encodes 3 Piwi proteins, MILI, MIWI and MIWI2 that all binds piRNAs in the male germline <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Initially, piRNAs from adult mouse testis were found to contain less RTE sequence than would be expected from the genomic content of RTEs, suggesting that piRNAs were not specifically targeting RTEs <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. However, a later study on piRNAs from an earlier (pre-pachytene) stage showed a high content of RTE sequence in piRNAs <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Further evidence for the involvement of mouse piRNAs in controlling RTE activity came with the finding that knockout of Mili and Miwi2 resulted in reduced piRNA levels and increased RTE transcription <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Knockout mice further showed decreased DNA methylation levels at RTE loci <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. As the temporal expression of Piwi proteins in developing mouse testis coincides with the resetting of genomic methylation <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, it is hypothesised that piRNAs act as guides for the methylation machinery <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>.</p>
<p>By analysing the piRNAs bound to MIWI2 and MILI, Aravin and colleagues <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> suggested the following scenario: In prenatal development (16.5 days postcoitum, dpc), transcripts from full-length active RTEs are the main substrates for piRNAs that primarily associate with MILI (and to a lesser extent to MIWI2). Available transcripts containing antisense RTE sequence bind this complex and antisense RTE piRNAs are formed which in turn associate primarily with MIWI2 (and MILI, respectively). Both complexes may bind complementary RTE transcripts, entering the so-called ping-pong amplification cycle of piRNAs, in which Piwi-bound piRNAs pair with complementary transcripts that are subsequently cut into new piRNAs having a 10 nucleotide overlap with the template piRNAs <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B34">34</abbr></abbrgrp>. In prenatal development, piRNAs are primarily targeting L1 and IAP RTEs, for which activity has been reported at this stage <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. In postnatal development (10 days postpartum, dpp) MIWI2 is no longer detectable, whereas MILI is present throughout germline development <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. The overall level of piRNA targeting of RTE sequences drops at 10 dpp, but interestingly, a relative increase in the piRNAs targeting B1 SINEs (members of the Alu superfamily) was observed <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
<p>This raises two fundamental questions. Firstly, do Piwi proteins discriminate between transcripts and how is RTE sequences then distinguished from other transcripts? The finding of piRNAs targeting supported a scenario with limited discrimination <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Secondly, what lies behind the apparent shift in RTEs being targeted by piRNAs during development in male mouse germline? By analysing to extent to which genomic RTE loci are targeted by piRNAs in developing mouse testes, the present study aims at assessing the transcriptional dynamics of RTE during development, and consider the relationship between RTE activity and piRNA generation further.</p>
<p>The data for such analysis should meet a range of criteria. Although numerous mouse RNA libraries are available, only a small subset is derived from wild-type developing mouse testes <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Further, as the prevalent transcription of RTEs will results in a large population of fragmented transcripts of sizes similar to piRNAs, analysis should be restricted to libraries of RNAs associated with Piwi proteins. This limits the available data to libraries from the above-mentioned study by Aravin and colleagues <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
</sec>
</sec>
<sec><st><p>Results and Discussion</p></st>
<sec><st><p>Theoretical piRNA coverage of individual RTE loci</p></st>
<p>Three libraries of small RNAs bound to MIWI2 and MILI proteins in mouse male germline <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> were reanalysed; one library with MIWI2-bound RNAs from 16.5 dpc (henceforth referred to as 'MIWI2 early') and two libraries with MILI-bound RNAs from 16.5 dpc and 10 dpp ('MILI early' and 'MILI late', respectively) (Table <tblr tid="T1">1</tblr>).</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>Piwi-RNA sequence read libraries</p></caption><tblbdy cols="9">
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4" ca="left">
            <p>
               <b>Percent of mapped reads covering:</b>
            </p>
         </c>
      </r>
      <r>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c>
            <p/>
         </c>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Library</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Stage</b>
            </p>
         </c>
         <c>
            <p/>
         </c>
         <c ca="right">
            <p>
               <b>Raw reads (&#215; 1000)</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>Mapped reads (&#215; 1000)</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>LINE</b>
            </p>
            <p>
               <b>(19.6)</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>SINE</b>
            </p>
            <p>
               <b>(7.6)</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>LTRint</b>
            </p>
            <p>
               <b>(3.0)</b>
            </p>
         </c>
         <c ca="right">
            <p>
               <b>LTRter</b>
            </p>
            <p>
               <b>(7.4)</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>MIWI2 early</p>
         </c>
         <c ca="left">
            <p>16.5 dpc</p>
         </c>
         <c ca="left">
            <p>prenatal</p>
         </c>
         <c ca="right">
            <p>1,940</p>
         </c>
         <c ca="right">
            <p>1,934</p>
         </c>
         <c ca="right">
            <p>30.2</p>
         </c>
         <c ca="right">
            <p>5.2</p>
         </c>
         <c ca="right">
            <p>34.8</p>
         </c>
         <c ca="right">
            <p>17.0</p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>MILI early</p>
         </c>
         <c ca="left">
            <p>16.5 dpc</p>
         </c>
         <c ca="left">
            <p>prenatal</p>
         </c>
         <c ca="right">
            <p>472</p>
         </c>
         <c ca="right">
            <p>470</p>
         </c>
         <c ca="right">
            <p>19.2</p>
         </c>
         <c ca="right">
            <p>5.3</p>
         </c>
         <c ca="right">
            <p>16.3</p>
         </c>
         <c ca="right">
            <p>11.9</p>
         </c>
      </r>
      <r>
         <c cspan="9">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>MILI late</p>
         </c>
         <c ca="left">
            <p>10 dpp</p>
         </c>
         <c ca="left">
            <p>postnatal</p>
         </c>
         <c ca="right">
            <p>1,327</p>
         </c>
         <c ca="right">
            <p>1,324</p>
         </c>
         <c ca="right">
            <p>7.7</p>
         </c>
         <c ca="right">
            <p>19.9</p>
         </c>
         <c ca="right">
            <p>15.9</p>
         </c>
         <c ca="right">
            <p>13.3</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>Three piRNA libraries from reference <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> were used for this study (accession numbers: MIWI2 early, GSM319957; MILI early, GSM319956; MILI late, GSM319953). Mapped reads denote the number of RNA reads mapping perfectly at least once to the mouse genome. The percentage of mapped reads mapping to loci of annotated RTEs are shown to the right. In parentheses under the RTE type the fractions of the mouse genome (mm9) occupied by the RTE sequences according to the RepeatMasker <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> annotation at the UCSC Genome Browser <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> are shown.</p>
   </tblfn></tbl>
<p>To analyse in detail the theoretical piRNA coverage of each individual RTE loci, reads from each RNA library were mapped onto the mouse genome. Only perfectly mapping reads were considered. Due to the inherent redundancy of RTE sequences, reads mapping uniquely to RTE loci are scarce and therefore all mapping reads were considered. The number of reads mapping to each RTE loci was weighted by the uniqueness of the reads, so that the count from each read was simply divided by the number of times that particular read mapped to the entire genome. Further, read coverage was weighted by library size and length of the locus, in an attempt to allow direct comparisons between libraries and RTE types. As seen from Figure <figr fid="F1">1</figr>, differences in this theoretical piRNA coverage differ markedly between RTE families and between libraries. When only considering reads that are exclusively mapping within a particular superfamily of RTEs (<it>e.g</it>. L1 LINEs) the overall patterns of coverage still remain (middle column in Figure <figr fid="F1">1</figr>). For LTR elements (both internal sequence and terminal repeats), most major trends are still observable when only considering reads mapping entirely within a single family of RTE (<it>e.g</it>. internal sequence of IAP-d elements) (right column in Figure <figr fid="F1">1</figr>). Unless otherwise noted, in the following all reads are used for analysis, and only RTE families with a least a thousand genomic members are considered (see Additional File <supplr sid="S1">1</supplr>, Table S1 for a list of these 318 RTE families). The previously reported decrease in piRNA coverage of IAP LTRs and L1 LINEs and the corresponding increase in SINE coverage during development <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> are clearly evident from Figure <figr fid="F1">1</figr> (left column). Also consistent with earlier findings <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, the median MILI early piRNA coverage of individual RTE families is highly correlated with the median coverage from MIWI2 early piRNAs, but not with the coverage from MILI late piRNAs (Additional File <supplr sid="S1">1</supplr>, Figure S1).</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>Median piRNA coverage of RTE loci</p></caption><text>
   <p><b>Median piRNA coverage of RTE loci</b>. Theoretical piRNA coverage levels (see Methods sections) are shown as colours indicating the median log2 values for all loci of belonging to a given RTE family. Only RTE families with at least a 1000 annotated loci and a median value above zero in any library are shown (albeit very low levels are not visible in this representation). For each RTE family, the superfamily it belongs to and the number of loci (in thousands) is listed. Families of internal LTR sequences are suffixed by '-int' in their superfamily. Three sets of columns are shown (All/Superfamily/Family), each set containing the three libraries (MIWI2 early, MILI early and MILI late). The left column set (All) shows coverage of all mapped piRNA reads. The middle column set (Superfamily) shows coverage of piRNAs exclusively mapping to this superfamily of RTE, and the right column set (Family) the coverage of piRNAs exclusively mapping to the particular RTE family.</p>
</text><graphic file="1471-2164-12-440-1" hint_layout="single"/></fig>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Supplementary Material</b>. Figures S1-S5 and Tables S1-S4. PDF format.</p></text>
<file name="1471-2164-12-440-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Higher piRNA coverage of younger elements</p></st>
<p>The RepBase database of repeated sequences <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> contain consensus sequences for individual RTE families, and RepeatMasker <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> annotation is based on sequence similarity to these consensus sequences. As the vast majority of RTE loci are under no negative selection (but see, for example <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>) the level of divergence between genomic loci and the RepBase consensus sequence can be taken as a proxy for the age of the RTE family. When plotting median piRNA coverage of RTE families against their median divergence, a clear trend of highly covered RTEs being relatively young is observed irrespective of RTE type (Figure <figr fid="F2">2</figr>). For all types of RTEs, the average piRNA coverage of younger elements is significantly higher than coverage of older elements (Additional File <supplr sid="S1">1</supplr>, Table S2). Also, the high level of variation in piRNA coverage between individual RTE loci is evident from the percentiles shown in Figure <figr fid="F2">2</figr>.</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>RTE age and piRNA coverage</p></caption><text>
   <p><b>RTE age and piRNA coverage</b>. The median millidivergence from consensus (as a proxy for age) is plotted against the average piRNA coverage (all reads) values. RTE families are coloured according to their type as indicated on the left chart. To allow for easier comparison between RTE types, coverage values are indexed so that the family with the highest average coverage is set to a value of 1. Error bars denote 25 and 75 percentiles for both piRNA coverage and millidivergence.</p>
</text><graphic file="1471-2164-12-440-2" hint_layout="double"/></fig>
</sec>
<sec><st><p>Gene expression levels and piRNA coverage</p></st>
<p>To test the piRNA coverage in the genomic context of protein-coding genes, all known genes with at least 20 kb (kilo base pairs) to the nearest neighbouring gene (in both directions) and with available Affymetrix expression data from testis tissue were retrieved <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. These 3307 genes were grouped into highly expressed genes (25% highest expression signals, 827 genes), lowly expressed genes (25% lowest, 829 genes), and medium expressed genes (rest, 1651 genes). For each gene set, the piRNA coverage of RTE residing within 10 kb upstream of their annotated transcription start sites to within 10 kb of their stop sites was recorded (Figure <figr fid="F3">3</figr>). For all types of RTE elements, piRNA coverage in the context of highly expressed genes was significantly higher than both medium and lowly expressed genes. This was found for all piRNA libraries (Additional File <supplr sid="S1">1</supplr>, Table S3).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Expression levels and piRNA coverage around genes</p></caption><text>
   <p><b>Expression levels and piRNA coverage around genes</b>. Genes are divided according to their expression levels in adult testis (high/medium/low) as indicated below charts. The median piRNA coverage of RTEs around genes are shown as rectangles with error bars denoting 25 and 75 percentiles. Coverage values are normalised by the number of base pairs occupied by RTEs around the genes. Using a Mann-Whitney U test, all values for highly expressed genes are significantly higher than the corresponding values for lowly expressed genes for all four RTE types. Significance levels shown in Additional File <supplr sid="S1">1</supplr>, Table S3.</p>
</text><graphic file="1471-2164-12-440-3" hint_layout="double"/></fig>
<p>For some RTE types, the relative number of young and old loci differ between the vicinities of highly expressed genes and lowly expressed genes, (Additional File <supplr sid="S1">1</supplr>, Figure S2), suggesting that the higher levels of piRNA coverage of RTE near highly expressed genes could simply be explained by the age of the RTE sequences. However, when repeating the analysis without the youngest RTE sequences, essentially similar results and significance levels are found (Additional File <supplr sid="S1">1</supplr>, Table S3).</p>
<p>Interestingly, when assessing the piRNA coverage of RTE sequence near transcription start sites (TSS), peaks are observed immediately upstream of TSS on the reverse strand, and for piRNAs not targeting RTEs, also immediately downstream of TSS on the forward strand (Additional File <supplr sid="S1">1</supplr>, Figure S3). Such a pattern resembles that of the recently discovered short transcripts generated around TSS (the TSS-associated RNAs) <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp> suggesting that these piRNAs may in fact be TSS-associated RNAs. It is uncertain if this represents experimental contamination of non-piRNAs or if these TSS-associated RNAs provide the transcripts that a processed into piRNAs, although the presence of RNA reads smaller than the usual 24-30 nucleotides - especially among early MILI piRNAs - hints that a contribution from the former scenario cannot be ruled out (Additional File <supplr sid="S1">1</supplr>, Figure S4). Assuming all RNAs mapping within 1000 base pairs of an annotated TSS are TSS-associated RNAs and removing these from the analysis does not change any of the presented conclusions (data not shown).</p>
</sec>
<sec><st><p>piRNA coverage and distance to genes</p></st>
<p>The age of RTEs and their genomic distance to protein-coding genes is not independent <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B47">47</abbr></abbrgrp>. If RTEs residing near genes are in general relatively young, one would expect these RTEs to display high levels of piRNA coverage as a result of this. To test if proximity to genes affected piRNA coverage independent of RTE age, members of each RTE family were divided into three equally sized groups based on their divergence from their consensus sequence (called 'young', 'median' and 'old' loci, respectively). Within each age-group, members were further divided into sub-groups according to genomic location; i) RTE loci residing inside the boundaries of known genes (termed 'genic'), ii) RTE loci in intergenic regions in proximity to known genes ('proximal'), and iii) RTE loci in intergenic regions distal to known genes ('distal'). The groups 'proximal' and 'distal' defined as loci closer or further away from genes, respectively, than the median distance of all non-'genic' loci from the RTE family. An overview of the grouping procedure is presented in Figure <figr fid="F4">4</figr>. The assumption that younger RTE members tend to reside closer to genes are confirmed by the observation that for 95% of all RTE families, the fraction of loci being proximal to genes is higher for young loci than for old loci (Wilcoxon Signed Rank, p &lt; 2.2 &#215; 10<sup>-16</sup>; values not shown).</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>Grouping of RTE loci</p></caption><text>
   <p><b>Grouping of RTE loci</b>. A schematic overview of the procedure used to group RTE loci within each family. A hypothetical genome is shown on top with a single protein-coding gene (exons denoted by boxes). RTE loci are shown as vertical lines, with age indicated by increasing colour darkness. The RTE loci are first divided in three equally sized groups (rows below genome) based on age, then divided according to their genomic location (columns below genome). The border between proximal and distal loci is set to the median of the distances between all non-genic loci and the nearest gene. Within each age group, the location groups can now be directly compared against each other. Note that the numbers of RTEs in age groups are equal by definition, whereas this may not be the case in the location groups.</p>
</text><graphic file="1471-2164-12-440-4" hint_layout="single"/></fig>
<p>RTE loci proximal to genes have - irrespective of age - significantly higher piRNA coverage than similar RTE loci distal to genes (Figure <figr fid="F5">5</figr>). With internal LTR sequences belonging to the 'old' group as the only exception, loci proximal to genes have significantly higher coverage than genic loci in prenatal development. Interestingly, in postnatal development no RTE group displays a significantly higher coverage for proximal loci than for genic loci, and furthermore, for all RTE groups, genic loci have significantly higher coverage than loci distal to genes (Figure <figr fid="F5">5</figr>). Thus, coverage by MILI late piRNAs is enriched in genic regions, an observation that is further supported by the fact that the total coverage of MILI late piRNAs mapping to genic regions is increased for all types of RTEs (Figure <figr fid="F6">6</figr>).</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>piRNA coverage and genomic context</p></caption><text>
   <p><b>piRNA coverage and genomic context</b>. A) Schematic depiction of the 3 categories of RTE loci (shown as red boxes). Genic RTEs reside inside the boundaries of protein-coding genes (exons shown as white boxes), proximal RTEs reside close to genes, and distal RTEs reside far from genes. The three RTE boxes are connected by a triangle, and in the remainder of the figure this triangle will symbolise the three categories of RTE loci. B) Example of the triangle graphic. The corners of the triangle correspond to the three categories of RTE loci (highlighted by their first letter in this example), and larger-than and smaller-than signs denote the relative levels of piRNA coverage of the categories. One thin sign corresponds to a non-significant difference, one bold sign to a significant difference at the 0.05 level, and two bold signs to a significant difference at the 0.001 level (see Methods section). C) Symbolic presentations of the real differences in piRNA coverage levels. RTEs are grouped into the 4 shown types (left), and differences are shown for the 3 libraries (top), with RTE loci divided according to age (for each family independently, see main text for details). Absolute values and significance levels are available in Additional File <supplr sid="S1">1</supplr>, Table S4.</p>
</text><graphic file="1471-2164-12-440-5" hint_layout="double"/></fig>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>Proportion of piRNA coverage targeting genic RTE loci</p></caption><text>
   <p><b>Proportion of piRNA coverage targeting genic RTE loci</b>. The fractions of total piRNA coverage that are mapped to genic loci are shown for the four RTE types. Green bars denote different read libraries as indicated on the right. Gray bars show the proportion of loci residing in a genic context. Values are shown for A) All reads and B) Reads mapping exclusively within a single RTE family.</p>
</text><graphic file="1471-2164-12-440-6" hint_layout="single"/></fig>
</sec>
<sec><st><p>Strand bias in piRNA coverage of genic RTEs</p></st>
<p>Aravin and colleagues <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> showed that in early development, the substrate for piRNA generation is provided by actively transcribed RTE elements. Later in development, active transcription of RTEs should then be repressed, and mRNA sequences from active full-length RTE loci are no longer widespread. This suggests that co-transcription of RTE sequences along with mRNAs from protein-coding genes (predominantly in intronic regions) could now take on a relatively larger role in providing RTE sequence transcripts. A prerequisite for the generation of piRNAs is the presence of transcripts with complementary sequences. Although active RTEs need to be transcribed from their forward strand, the RTE sequences scattered around the highly transcribed genome could produce transcripts in both orientations. But if as suggested, transcribed RTE sequences in postnatal mouse testes are mainly provided from co-transcription with genes, the transcriptional orientation of a given genic RTE loci is to a large extent determined by the orientation of the host gene. One can therefore test if the strand of piRNAs mapping uniquely to genic RTE loci corresponds to the orientation of the RTE relative to the host gene. Of course, amplification from the ping-pong cycle may generate multiple piRNAs, which may map on both strands of a genic RTE locus (although the efficiency of the ping-pong cycle may decrease in postnatal development as MIWI2 is no longer expressed <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>), potentially blurring the picture. As seen from Figure <figr fid="F7">7</figr>, a clear pattern of high sense coverage of genic RTEs in the forward orientation and high antisense coverage of genic RTEs in the reverse orientation is seen for postnatal piRNAs, but not for prenatal piRNAs.</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>Strand bias</p></caption><text>
   <p><b>Strand bias</b>. The log2 ratio of sense and antisense coverage from uniquely mapping piRNAs shown for the 4 main RTE types. Each RTE type is divided into genic loci on the forward strand relative to the host gene ('forward'; green columns), genic loci on the reverse strand ('reverse'; orange columns), and loci residing outside genes ('nongenic'; grey columns). All values are shown for the three piRNA libraries.</p>
</text><graphic file="1471-2164-12-440-7" hint_layout="single"/></fig>
</sec>
</sec>
<sec><st><p>Conclusions</p></st>
<p>As reported previously, RTE families are targeted very differently by piRNAs in developing mouse testes. By focusing on the total population of RTE loci, the present reanalysis of published data reveal further differences in piRNA targeting between individual members of RTE families. The available data for this analysis is arguably limited and the presented data relies on a single set of experiments. Although deep-sequencing techniques ideally should provide sequences from all available transcripts in a neutral fashion, biases may be introduced experimentally, especially during construction of libraries <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Furthermore, considerable biological differences in RTE sequences have been reported between mouse strains <abbrgrp><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr></abbrgrp>. Nevertheless, the vast majority of RTE sequences will be shared among all extant mice, and the results presented here are all of a global genomic character with no predictions for individual loci, suggesting a fair generality of the findings.</p>
<p>Transcriptional activity is correlated between genomic regions residing near each other <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>, and the observation that piRNA targeting of RTEs is higher around highly expressed genes, may simply reflect that transcription of RTEs is more permissible near highly expressed genes. A correlation between transcription levels of LTR sequences and their neighbouring genes has previously been reported in fission yeast <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. This further supports the notion that RTE transcripts are not specifically recognized as RTEs by the Piwi proteins, but are largely triggering the piRNA response in a manner proportional to their presence. It should be stressed that the reported preference by MILI for sense RTE sequences and the corresponding preference by MIWI2 for antisense sequences <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> suggest some level of discrimination of transcripts.</p>
<p>In postnatal testis development, piRNA targeting is shifted towards loci residing in introns of protein-coding genes. If, as assumed, active transcription of RTE loci is repressed at this stage, one would expect a higher proportion of RTE sequences in the total transcriptome to be derived from co-transcription of intronic RTE loci. This observation could at least in part explain the previously observed increase in piRNAs targeting SINE elements in postnatal stages <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, as SINE elements are the most abundant RTEs in introns (Additional File <supplr sid="S1">1</supplr>, Figure S5). Therefore, the increased piRNA response directed at SINE sequences does not necessitate transcription of active SINE elements in postnatal development. In fact, as SINE elements are non-autonomous, presumably using the enzymatic machinery provided by LINE elements <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>, there should be no basis for SINE proliferation in postnatal development if the prenatal silencing of LINE persists. Yet, SINE transcription may take place without subsequent transposition, and the known functional effects of mammalian SINE transcription <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp> and the recently reported SINE RNA toxicity <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> suggest both active SINE transcription in later development, and the possible need for regulation.</p>
<p>On an evolutionary time scale, RTE activity has contributed hugely to the evolution of mammalian genomes <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>, and when attempting to understand the diversity of present eukaryotic life it is essential to include the history and activity of RTEs. However, RTEs are not just silent passengers that occasionally spring into action, but have to be dealt with within each individual's life history. In this respect, the indirect approach of analysing small RNAs generated to repress RTE activity in the germline may produce further valuable knowledge on the activity of RTEs during development.</p>
</sec>
<sec><st><p>Methods</p></st>
<sec><st><p>Data and Annotations</p></st>
<p>Small RNA libraries accession numbers GSM319953 (MILI late), GSM319956 (MILI early) and GSM319957 (MIWI2 early) were retrieved through DeepBase (<url>http://deepbase.sysu.edu.cn/</url>) <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and mapped to the mouse genome (mm9 assembly) using bowtie <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Prior to mapping, reads were filtered and sequences with ambiguous base calls and low-complexity sequences were removed. The latter was done measuring the linguistic complexity <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> of the sequences in 16 bp windows, and excluding reads with an average complexity of less than 0.75. Preliminary tests showed that this would remove highly repetitive reads with very large numbers of genomic mappings (not shown). For each library, this procedure filtered out between 0.14 and 0.17% of all raw reads. RepeatMasker and known gene annotations were downloaded from the UCSC Genome Browser <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B40">40</abbr><abbr bid="B67">67</abbr></abbrgrp>. A set of non-overlapping TSS was selected by grouping all known genes according to their assignment to ENSEMBL genes <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. For each ENSEMBL gene, the most abundant genomic start point was selected. If more than one point was found to have the highest abundance, the one furthest upstream of these was chosen. Gene expression levels were assessed from the 'testis' signal intensities in the Mouse GNF1M Gene Atlas from BioGPS (<url>http://biogps.gnf.org/</url>) <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>.</p>
</sec>
<sec><st><p>Mapping and coverage</p></st>
<p>For each RTE loci the number of reads mapped within the locus were recorded. Reads were assigned a score of 1/(number of genomic mappings of read), so that only uniquely mapping reads scored 1. The read score were then divided by the size of the RTE loci (in kilo-base pairs). Finally, scores were divided by the total number (in millions) of mapped reads from the library in question.</p>
</sec>
<sec><st><p>Statistical testing</p></st>
<p>To test for difference between RTE loci from different genomic regions (data presented in Figure <figr fid="F5">5</figr>), all RTE families were first split in 3 groups based on age where after members in each group were divided according to their genomic context (genic, proximal, distal). Hence, for each RTE family, nine sets of loci were formed, and the average piRNA coverage for each set was recorded. To test for difference between two groups (for example, between old LINE loci being genic or distal), pairs of average values were collected for the 90 LINE families (Additional File <supplr sid="S1">1</supplr>, Table S1) and tested using a Wilcoxon Signed-Rank Test. Bonferroni corrections (n = 108 in Figure <figr fid="F5">5</figr>) were calculated as: p<sub>corrected </sub>= 1-(1-p)<sup>n</sup>. All statistical analyses were carried out using R <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>.</p>
</sec>
</sec>
<sec><st><p>Competing interests</p></st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>TM conceived of the study, analyzed the data and wrote the manuscript.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>This work was supported by a grant from the Lundbeck Foundation. I am indebted to JianHua Yang for invaluable advice on small RNAs, for preparing and providing RNA reads from DeepBase, and for critical reading of the manuscript.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>Mobile Elements: Drivers of Genome Evolution</p></title><aug><au><snm>Kazazian</snm><fnm>HH</fnm><suf>Jr</suf></au></aug><source>Science</source><pubdate>2004</pubdate><volume>303</volume><issue>5664</issue><fpage>1626</fpage><lpage>1632</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1089670</pubid><pubid idtype="pmpid" link="fulltext">15016989</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>Lineage-specific expansions of retroviral insertions within the genomes of African great apes but not humans and orangutans</p></title><aug><au><snm>Yohn</snm><fnm>CT</fnm></au><au><snm>Jiang</snm><fnm>Z</fnm></au><au><snm>McGrath</snm><fnm>SD</fnm></au><au><snm>Hayden</snm><fnm>KE</fnm></au><au><snm>Khaitovich</snm><fnm>P</fnm></au><au><snm>Johnson</snm><fnm>ME</fnm></au><au><snm>Eichler</snm><fnm>MY</fnm></au><au><snm>McPherson</snm><fnm>JD</fnm></au><au><snm>Zhao</snm><fnm>S</fnm></au><au><snm>Paabo</snm><fnm>S</fnm></au><etal/></aug><source>PLoS Biol</source><pubdate>2005</pubdate><volume>3</volume><issue>4</issue><fpage>e110</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.0030110</pubid><pubid idtype="pmcid">1054887</pubid><pubid idtype="pmpid" link="fulltext">15737067</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition</p></title><aug><au><snm>Akagi</snm><fnm>K</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Stephens</snm><fnm>RM</fnm></au><au><snm>Volfovsky</snm><fnm>N</fnm></au><au><snm>Symer</snm><fnm>DE</fnm></au></aug><source>Genome Research</source><pubdate>2008</pubdate><volume>18</volume><issue>6</issue><fpage>869</fpage><lpage>880</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.075770.107</pubid><pubid idtype="pmcid">2413154,2413154</pubid><pubid idtype="pmpid" link="fulltext">18381897</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>Natural genetic variation caused by transposable elements in humans</p></title><aug><au><snm>Bennett</snm><fnm>EA</fnm></au><au><snm>Coleman</snm><fnm>LE</fnm></au><au><snm>Tsui</snm><fnm>C</fnm></au><au><snm>Pittard</snm><fnm>WS</fnm></au><au><snm>Devine</snm><fnm>SE</fnm></au></aug><source>Genetics</source><pubdate>2004</pubdate><volume>168</volume><issue>2</issue><fpage>933</fpage><lpage>951</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1534/genetics.104.031757</pubid><pubid idtype="pmcid">1448813</pubid><pubid idtype="pmpid" link="fulltext">15514065</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Recently mobilized transposons in the human and chimpanzee genomes</p></title><aug><au><snm>Mills</snm><fnm>RE</fnm></au><au><snm>Bennett</snm><fnm>EA</fnm></au><au><snm>Iskow</snm><fnm>RC</fnm></au><au><snm>Luttig</snm><fnm>CT</fnm></au><au><snm>Tsui</snm><fnm>C</fnm></au><au><snm>Pittard</snm><fnm>WS</fnm></au><au><snm>Devine</snm><fnm>SE</fnm></au></aug><source>American journal of human genetics</source><pubdate>2006</pubdate><volume>78</volume><issue>4</issue><fpage>671</fpage><lpage>679</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1086/501028</pubid><pubid idtype="pmcid">1424692</pubid><pubid idtype="pmpid" link="fulltext">16532396</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>Retrotransposable elements and human disease</p></title><aug><au><snm>Callinan</snm><fnm>PA</fnm></au><au><snm>Batzer</snm><fnm>MA</fnm></au></aug><source>Genome Dyn</source><pubdate>2006</pubdate><volume>1</volume><fpage>104</fpage><lpage>115</lpage><xrefbib><pubid idtype="pmpid">18724056</pubid></xrefbib></bibl><bibl id="B7"><title><p>Alu repeats and human disease</p></title><aug><au><snm>Deininger</snm><fnm>PL</fnm></au><au><snm>Batzer</snm><fnm>MA</fnm></au></aug><source>Molecular genetics and metabolism</source><pubdate>1999</pubdate><volume>67</volume><issue>3</issue><fpage>183</fpage><lpage>193</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/mgme.1999.2864</pubid><pubid idtype="pmpid" link="fulltext">10381326</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Mobile elements and disease</p></title><aug><au><snm>Kazazian</snm><fnm>HH</fnm></au></aug><source>Current Opinion in Genetics &amp; Development</source><pubdate>1998</pubdate><volume>8</volume><issue>3</issue><fpage>343</fpage><lpage>350</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0959-437X(98)80092-0</pubid><pubid idtype="pmpid" link="fulltext">21745151</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Keeping active endogenous retroviral-like elements in check: the epigenetic perspective</p></title><aug><au><snm>Maksakova</snm><fnm>IA</fnm></au><au><snm>Mager</snm><fnm>DL</fnm></au><au><snm>Reiss</snm><fnm>D</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2008</pubdate><volume>65</volume><issue>21</issue><fpage>3329</fpage><lpage>3347</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00018-008-8494-3</pubid><pubid idtype="pmpid" link="fulltext">18818875</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>APOBEC3 proteins: major players in intracellular defence against LINE-1-mediated retrotransposition</p></title><aug><au><snm>Schumann</snm><fnm>GG</fnm></au></aug><source>Biochemical Society transactions</source><pubdate>2007</pubdate><volume>35</volume><issue>Pt 3</issue><fpage>637</fpage><lpage>642</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">17511669</pubid></xrefbib></bibl><bibl id="B11"><title><p>Transposable elements in the mammalian germline: a comfortable niche or a deadly trap?</p></title><aug><au><snm>Zamudio</snm><fnm>N</fnm></au><au><snm>Bourc&apos;his</snm><fnm>D</fnm></au></aug><source>Heredity</source><pubdate>2010</pubdate><volume>105</volume><issue>1</issue><fpage>92</fpage><lpage>104</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/hdy.2010.53</pubid><pubid idtype="pmpid" link="fulltext">20442734</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Initial sequencing and comparative analysis of the mouse genome</p></title><aug><au><snm>Waterston</snm><fnm>RH</fnm></au><au><snm>Lindblad-Toh</snm><fnm>K</fnm></au><au><snm>Birney</snm><fnm>E</fnm></au><au><snm>Rogers</snm><fnm>J</fnm></au><au><snm>Abril</snm><fnm>JF</fnm></au><au><snm>Agarwal</snm><fnm>P</fnm></au><au><snm>Agarwala</snm><fnm>R</fnm></au><au><snm>Ainscough</snm><fnm>R</fnm></au><au><snm>Alexandersson</snm><fnm>M</fnm></au><au><snm>An</snm><fnm>P</fnm></au><etal/></aug><source>Nature</source><pubdate>2002</pubdate><volume>420</volume><issue>6915</issue><fpage>520</fpage><lpage>562</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature01262</pubid><pubid idtype="pmpid" link="fulltext">12466850</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Initial sequencing and analysis of the human genome</p></title><aug><au><snm>Lander</snm><fnm>ES</fnm></au><au><snm>Linton</snm><fnm>LM</fnm></au><au><snm>Birren</snm><fnm>B</fnm></au><au><snm>Nusbaum</snm><fnm>C</fnm></au><au><snm>Zody</snm><fnm>MC</fnm></au><au><snm>Baldwin</snm><fnm>J</fnm></au><au><snm>Devon</snm><fnm>K</fnm></au><au><snm>Dewar</snm><fnm>K</fnm></au><au><snm>Doyle</snm><fnm>M</fnm></au><au><snm>FitzHugh</snm><fnm>W</fnm></au><etal/></aug><source>Nature</source><pubdate>2001</pubdate><volume>409</volume><issue>6822</issue><fpage>860</fpage><lpage>921</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/35057062</pubid><pubid idtype="pmpid" link="fulltext">11237011</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>Retroviral elements and their hosts: insertional mutagenesis in the mouse germ line</p></title><aug><au><snm>Maksakova</snm><fnm>IA</fnm></au><au><snm>Romanish</snm><fnm>MT</fnm></au><au><snm>Gagnier</snm><fnm>L</fnm></au><au><snm>Dunn</snm><fnm>CA</fnm></au><au><snm>van de Lagemaat</snm><fnm>LN</fnm></au><au><snm>Mager</snm><fnm>DL</fnm></au></aug><source>PLoS genetics</source><pubdate>2006</pubdate><volume>2</volume><issue>1</issue><fpage>e2</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.0020002</pubid><pubid idtype="pmcid">1331978</pubid><pubid idtype="pmpid" link="fulltext">16440055</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>RepeatMasker Open-3.0</p></title><aug><au><snm>Smit</snm><fnm>AFA</fnm></au><au><snm>Hubley</snm><fnm>R</fnm></au><au><snm>Green</snm><fnm>P</fnm></au></aug><pubdate>1996</pubdate><url>http://www.repeatmasker.org</url></bibl><bibl id="B16"><title><p>Murine endogenous retroviruses</p></title><aug><au><snm>Stocking</snm><fnm>C</fnm></au><au><snm>Kozak</snm><fnm>CA</fnm></au></aug><source>Cell Mol Life Sci</source><pubdate>2008</pubdate><volume>65</volume><issue>21</issue><fpage>3383</fpage><lpage>3398</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00018-008-8497-0</pubid><pubid idtype="pmpid" link="fulltext">18818872</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>The evolution of modern lineages of mouse L1 elements</p></title><aug><au><snm>Mears</snm><fnm>ML</fnm></au><au><snm>Hutchison</snm><fnm>CA</fnm></au></aug><source>J Mol Evol</source><pubdate>2001</pubdate><volume>52</volume><issue>1</issue><fpage>51</fpage><lpage>62</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">11139294</pubid></xrefbib></bibl><bibl id="B18"><title><p>A novel active L1 retrotransposon subfamily in the mouse</p></title><aug><au><snm>Goodier</snm><fnm>JL</fnm></au><au><snm>Ostertag</snm><fnm>EM</fnm></au><au><snm>Du</snm><fnm>K</fnm></au><au><snm>Kazazian</snm><fnm>HH</fnm><suf>Jr</suf></au></aug><source>Genome Res</source><pubdate>2001</pubdate><volume>11</volume><issue>10</issue><fpage>1677</fpage><lpage>1685</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.198301</pubid><pubid idtype="pmcid">311137</pubid><pubid idtype="pmpid" link="fulltext">11591644</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>An actively retrotransposing, novel subfamily of mouse L1 elements</p></title><aug><au><snm>Naas</snm><fnm>TP</fnm></au><au><snm>DeBerardinis</snm><fnm>RJ</fnm></au><au><snm>Moran</snm><fnm>JV</fnm></au><au><snm>Ostertag</snm><fnm>EM</fnm></au><au><snm>Kingsmore</snm><fnm>SF</fnm></au><au><snm>Seldin</snm><fnm>MF</fnm></au><au><snm>Hayashizaki</snm><fnm>Y</fnm></au><au><snm>Martin</snm><fnm>SL</fnm></au><au><snm>Kazazian</snm><fnm>HH</fnm></au></aug><source>Embo J</source><pubdate>1998</pubdate><volume>17</volume><issue>2</issue><fpage>590</fpage><lpage>597</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/emboj/17.2.590</pubid><pubid idtype="pmcid">1170408</pubid><pubid idtype="pmpid" link="fulltext">9430649</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>L1Base: from functional annotation to prediction of active LINE-1 elements</p></title><aug><au><snm>Penzkofer</snm><fnm>T</fnm></au><au><snm>Dandekar</snm><fnm>T</fnm></au><au><snm>Zemojtel</snm><fnm>T</fnm></au></aug><source>Nucl Acids Res</source><pubdate>2005</pubdate><volume>33</volume><issue>suppl_1</issue><fpage>D498</fpage><lpage>500</lpage><xrefbib><pubidlist><pubid idtype="pmcid">539998</pubid><pubid idtype="pmpid" link="fulltext">15608246</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Alu repeats and human genomic diversity</p></title><aug><au><snm>Batzer</snm><fnm>MA</fnm></au><au><snm>Deininger</snm><fnm>PL</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2002</pubdate><volume>3</volume><issue>5</issue><fpage>370</fpage><lpage>379</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg798</pubid><pubid idtype="pmpid" link="fulltext">11988762</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>Repeat sequence families derived from mammalian tRNA genes</p></title><aug><au><snm>Daniels</snm><fnm>GR</fnm></au><au><snm>Deininger</snm><fnm>PL</fnm></au></aug><source>Nature</source><pubdate>1985</pubdate><volume>317</volume><issue>6040</issue><fpage>819</fpage><lpage>822</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/317819a0</pubid><pubid idtype="pmpid">3851163</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>Mighty Piwis defend the germline against genome intruders</p></title><aug><au><snm>O&apos;Donnell</snm><fnm>KA</fnm></au><au><snm>Boeke</snm><fnm>JD</fnm></au></aug><source>Cell</source><pubdate>2007</pubdate><volume>129</volume><issue>1</issue><fpage>37</fpage><lpage>44</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2007.03.028</pubid><pubid idtype="pmpid" link="fulltext">17418784</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Small RNAs in transcriptional gene silencing and genome defence</p></title><aug><au><snm>Moazed</snm><fnm>D</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>457</volume><issue>7228</issue><fpage>413</fpage><lpage>420</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07756</pubid><pubid idtype="pmpid" link="fulltext">19158787</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>Small RNAs in mammalian germline: Tiny for immortal</p></title><aug><au><snm>Tang</snm><fnm>F</fnm></au></aug><source>Differentiation</source><pubdate>2010</pubdate><volume>79</volume><issue>3</issue><fpage>141</fpage><lpage>146</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.diff.2009.11.002</pubid><pubid idtype="pmpid" link="fulltext">20227007</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Biogenesis and germline functions of piRNAs</p></title><aug><au><snm>Klattenhoff</snm><fnm>C</fnm></au><au><snm>Theurkauf</snm><fnm>W</fnm></au></aug><source>Development</source><pubdate>2008</pubdate><volume>135</volume><issue>1</issue><fpage>3</fpage><lpage>9</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">18032451</pubid></xrefbib></bibl><bibl id="B27"><title><p>A germline-specific class of small RNAs binds mammalian Piwi proteins</p></title><aug><au><snm>Girard</snm><fnm>Al</fnm></au><au><snm>Sachidanandam</snm><fnm>R</fnm></au><au><snm>Hannon</snm><fnm>GJ</fnm></au><au><snm>Carmell</snm><fnm>MA</fnm></au></aug><source>Nature</source><pubdate>2006</pubdate><volume>442</volume><issue>7099</issue><fpage>199</fpage><lpage>202</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">16751776</pubid></xrefbib></bibl><bibl id="B28"><title><p>Characterization of the piRNA complex from rat testes</p></title><aug><au><snm>Lau</snm><fnm>NC</fnm></au><au><snm>Seto</snm><fnm>AG</fnm></au><au><snm>Kim</snm><fnm>J</fnm></au><au><snm>Kuramochi-Miyagawa</snm><fnm>S</fnm></au><au><snm>Nakano</snm><fnm>T</fnm></au><au><snm>Bartel</snm><fnm>DP</fnm></au><au><snm>Kingston</snm><fnm>RE</fnm></au></aug><source>Science</source><pubdate>2006</pubdate><volume>313</volume><issue>5785</issue><fpage>363</fpage><lpage>367</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1130164</pubid><pubid idtype="pmpid" link="fulltext">16778019</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Developmentally Regulated piRNA Clusters Implicate MILI in Transposon Control</p></title><aug><au><snm>Aravin</snm><fnm>AA</fnm></au><au><snm>Sachidanandam</snm><fnm>R</fnm></au><au><snm>Girard</snm><fnm>A</fnm></au><au><snm>Fejes-Toth</snm><fnm>K</fnm></au><au><snm>Hannon</snm><fnm>GJ</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>316</volume><issue>5825</issue><fpage>744</fpage><lpage>747</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1142612</pubid><pubid idtype="pmpid" link="fulltext">17446352</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>MIWI2 is essential for spermatogenesis and repression of transposons in the mouse male germline</p></title><aug><au><snm>Carmell</snm><fnm>MA</fnm></au><au><snm>Girard</snm><fnm>A</fnm></au><au><snm>van de Kant</snm><fnm>HJ</fnm></au><au><snm>Bourc&apos;his</snm><fnm>D</fnm></au><au><snm>Bestor</snm><fnm>TH</fnm></au><au><snm>de Rooij</snm><fnm>DG</fnm></au><au><snm>Hannon</snm><fnm>GJ</fnm></au></aug><source>Dev Cell</source><pubdate>2007</pubdate><volume>12</volume><issue>4</issue><fpage>503</fpage><lpage>514</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.devcel.2007.03.001</pubid><pubid idtype="pmpid" link="fulltext">17395546</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>A piRNA pathway primed by individual transposons is linked to de novo DNA methylation in mice</p></title><aug><au><snm>Aravin</snm><fnm>AA</fnm></au><au><snm>Sachidanandam</snm><fnm>R</fnm></au><au><snm>Bourc&apos;his</snm><fnm>D</fnm></au><au><snm>Schaefer</snm><fnm>C</fnm></au><au><snm>Pezic</snm><fnm>D</fnm></au><au><snm>Toth</snm><fnm>KF</fnm></au><au><snm>Bestor</snm><fnm>T</fnm></au><au><snm>Hannon</snm><fnm>GJ</fnm></au></aug><source>Molecular cell</source><pubdate>2008</pubdate><volume>31</volume><issue>6</issue><fpage>785</fpage><lpage>799</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2008.09.003</pubid><pubid idtype="pmcid">2730041</pubid><pubid idtype="pmpid" link="fulltext">18922463</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>DNA methylation of retrotransposon genes is regulated by Piwi family members MILI and MIWI2 in murine fetal testes</p></title><aug><au><snm>Kuramochi-Miyagawa</snm><fnm>S</fnm></au><au><snm>Watanabe</snm><fnm>T</fnm></au><au><snm>Gotoh</snm><fnm>K</fnm></au><au><snm>Totoki</snm><fnm>Y</fnm></au><au><snm>Toyoda</snm><fnm>A</fnm></au><au><snm>Ikawa</snm><fnm>M</fnm></au><au><snm>Asada</snm><fnm>N</fnm></au><au><snm>Kojima</snm><fnm>K</fnm></au><au><snm>Yamaguchi</snm><fnm>Y</fnm></au><au><snm>Ijiri</snm><fnm>TW</fnm></au><etal/></aug><source>Genes Dev</source><pubdate>2008</pubdate><volume>22</volume><issue>7</issue><fpage>908</fpage><lpage>917</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1640708</pubid><pubid idtype="pmcid">2279202</pubid><pubid idtype="pmpid" link="fulltext">18381894</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>Imprinting and epigenetic changes in the early embryo</p></title><aug><au><snm>Weaver</snm><fnm>JR</fnm></au><au><snm>Susiarjo</snm><fnm>M</fnm></au><au><snm>Bartolomei</snm><fnm>MS</fnm></au></aug><source>Mamm Genome</source><pubdate>2009</pubdate><volume>20</volume><issue>9-10</issue><fpage>532</fpage><lpage>543</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/s00335-009-9225-2</pubid><pubid idtype="pmpid" link="fulltext">19760320</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>The Piwi-piRNA pathway provides an adaptive defense in the transposon arms race</p></title><aug><au><snm>Aravin</snm><fnm>AA</fnm></au><au><snm>Hannon</snm><fnm>GJ</fnm></au><au><snm>Brennecke</snm><fnm>J</fnm></au></aug><source>Science</source><pubdate>2007</pubdate><volume>318</volume><issue>5851</issue><fpage>761</fpage><lpage>764</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1146484</pubid><pubid idtype="pmpid" link="fulltext">17975059</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Tightly regulated, developmentally specific expression of the first open reading frame from LINE-1 during mouse embryogenesis</p></title><aug><au><snm>Trelogan</snm><fnm>SA</fnm></au><au><snm>Martin</snm><fnm>SL</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1995</pubdate><volume>92</volume><issue>5</issue><fpage>1520</fpage><lpage>1524</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.92.5.1520</pubid><pubid idtype="pmcid">42551</pubid><pubid idtype="pmpid" link="fulltext">7878012</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Germ line-specific expression of intracisternal A-particle retrotransposons in transgenic mice</p></title><aug><au><snm>Dupressoir</snm><fnm>A</fnm></au><au><snm>Heidmann</snm><fnm>T</fnm></au></aug><source>Molecular and cellular biology</source><pubdate>1996</pubdate><volume>16</volume><issue>8</issue><fpage>4495</fpage><lpage>4503</lpage><xrefbib><pubidlist><pubid idtype="pmcid">231448</pubid><pubid idtype="pmpid" link="fulltext">8754850</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>Two mouse piwi-related genes: miwi and mili</p></title><aug><au><snm>Kuramochi-Miyagawa</snm><fnm>S</fnm></au><au><snm>Kimura</snm><fnm>T</fnm></au><au><snm>Yomogida</snm><fnm>K</fnm></au><au><snm>Kuroiwa</snm><fnm>A</fnm></au><au><snm>Tadokoro</snm><fnm>Y</fnm></au><au><snm>Fujita</snm><fnm>Y</fnm></au><au><snm>Sato</snm><fnm>M</fnm></au><au><snm>Matsuda</snm><fnm>Y</fnm></au><au><snm>Nakano</snm><fnm>T</fnm></au></aug><source>Mech Dev</source><pubdate>2001</pubdate><volume>108</volume><issue>1-2</issue><fpage>121</fpage><lpage>133</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0925-4773(01)00499-3</pubid><pubid idtype="pmpid" link="fulltext">11578866</pubid></pubidlist></xrefbib></bibl><bibl id="B38"><title><p>Mili, a mammalian member of piwi family gene, is essential for spermatogenesis</p></title><aug><au><snm>Kuramochi-Miyagawa</snm><fnm>S</fnm></au><au><snm>Kimura</snm><fnm>T</fnm></au><au><snm>Ijiri</snm><fnm>TW</fnm></au><au><snm>Isobe</snm><fnm>T</fnm></au><au><snm>Asada</snm><fnm>N</fnm></au><au><snm>Fujita</snm><fnm>Y</fnm></au><au><snm>Ikawa</snm><fnm>M</fnm></au><au><snm>Iwai</snm><fnm>N</fnm></au><au><snm>Okabe</snm><fnm>M</fnm></au><au><snm>Deng</snm><fnm>W</fnm></au><etal/></aug><source>Development</source><pubdate>2004</pubdate><volume>131</volume><issue>4</issue><fpage>839</fpage><lpage>849</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1242/dev.00973</pubid><pubid idtype="pmpid" link="fulltext">14736746</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>deepBase: a database for deeply annotating and mining deep sequencing data</p></title><aug><au><snm>Yang</snm><fnm>JH</fnm></au><au><snm>Shao</snm><fnm>P</fnm></au><au><snm>Zhou</snm><fnm>H</fnm></au><au><snm>Chen</snm><fnm>YQ</fnm></au><au><snm>Qu</snm><fnm>LH</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>Database</issue><fpage>D123</fpage><lpage>130</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp943</pubid><pubid idtype="pmcid">2808990</pubid><pubid idtype="pmpid" link="fulltext">19966272</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>Repbase Update, a database of eukaryotic repetitive elements</p></title><aug><au><snm>Jurka</snm><fnm>J</fnm></au><au><snm>Kapitonov</snm><fnm>VV</fnm></au><au><snm>Pavlicek</snm><fnm>A</fnm></au><au><snm>Klonowski</snm><fnm>P</fnm></au><au><snm>Kohany</snm><fnm>O</fnm></au><au><snm>Walichiewicz</snm><fnm>J</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><issue>1-4</issue><fpage>462</fpage><lpage>467</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084979</pubid><pubid idtype="pmpid" link="fulltext">16093699</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>Thousands of human mobile element fragments undergo strong purifying selection near developmental genes</p></title><aug><au><snm>Lowe</snm><fnm>CB</fnm></au><au><snm>Bejerano</snm><fnm>G</fnm></au><au><snm>Haussler</snm><fnm>D</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2007</pubdate><volume>104</volume><issue>19</issue><fpage>8005</fpage><lpage>8010</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0611223104</pubid><pubid idtype="pmcid">1876562</pubid><pubid idtype="pmpid" link="fulltext">17463089</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Alu and b1 repeats have been selectively retained in the upstream and intronic regions of genes of specific functional classes</p></title><aug><au><snm>Tsirigos</snm><fnm>A</fnm></au><au><snm>Rigoutsos</snm><fnm>I</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2009</pubdate><volume>5</volume><issue>12</issue><fpage>e1000610.</fpage><xrefbib><pubidlist><pubid idtype="pmcid">2784220</pubid><pubid idtype="pmpid" link="fulltext">20019790</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Analysis of transposon interruptions suggests selection for L1 elements on the &#215; chromosome</p></title><aug><au><snm>Abrusan</snm><fnm>G</fnm></au><au><snm>Giordano</snm><fnm>J</fnm></au><au><snm>Warburton</snm><fnm>PE</fnm></au></aug><source>PLoS genetics</source><pubdate>2008</pubdate><volume>4</volume><issue>8</issue><fpage>e1000172</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pgen.1000172</pubid><pubid idtype="pmcid">2517846</pubid><pubid idtype="pmpid" link="fulltext">18769724</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>A gene atlas of the mouse and human protein-encoding transcriptomes</p></title><aug><au><snm>Su</snm><fnm>AI</fnm></au><au><snm>Wiltshire</snm><fnm>T</fnm></au><au><snm>Batalov</snm><fnm>S</fnm></au><au><snm>Lapp</snm><fnm>H</fnm></au><au><snm>Ching</snm><fnm>KA</fnm></au><au><snm>Block</snm><fnm>D</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Soden</snm><fnm>R</fnm></au><au><snm>Hayakawa</snm><fnm>M</fnm></au><au><snm>Kreiman</snm><fnm>G</fnm></au><etal/></aug><source>Proc Natl Acad Sci USA</source><pubdate>2004</pubdate><volume>101</volume><issue>16</issue><fpage>6062</fpage><lpage>6067</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.0400782101</pubid><pubid idtype="pmcid">395923</pubid><pubid idtype="pmpid" link="fulltext">15075390</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs</p></title><aug><au><snm>Jacquier</snm><fnm>A</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2009</pubdate><volume>10</volume><issue>12</issue><fpage>833</fpage><lpage>844</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">19920851</pubid></xrefbib></bibl><bibl id="B46"><title><p>Divergent transcription from active promoters</p></title><aug><au><snm>Seila</snm><fnm>AC</fnm></au><au><snm>Calabrese</snm><fnm>JM</fnm></au><au><snm>Levine</snm><fnm>SS</fnm></au><au><snm>Yeo</snm><fnm>GW</fnm></au><au><snm>Rahl</snm><fnm>PB</fnm></au><au><snm>Flynn</snm><fnm>RA</fnm></au><au><snm>Young</snm><fnm>RA</fnm></au><au><snm>Sharp</snm><fnm>PA</fnm></au></aug><source>Science</source><pubdate>2008</pubdate><volume>322</volume><issue>5909</issue><fpage>1849</fpage><lpage>1851</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1126/science.1162253</pubid><pubid idtype="pmcid">2692996</pubid><pubid idtype="pmpid" link="fulltext">19056940</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Retroelement distributions in the human genome: variations associated with age and proximity to genes</p></title><aug><au><snm>Medstrand</snm><fnm>P</fnm></au><au><snm>van de Lagemaat</snm><fnm>LN</fnm></au><au><snm>Mager</snm><fnm>DL</fnm></au></aug><source>Genome Res</source><pubdate>2002</pubdate><volume>12</volume><issue>10</issue><fpage>1483</fpage><lpage>1495</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.388902</pubid><pubid idtype="pmcid">187529</pubid><pubid idtype="pmpid" link="fulltext">12368240</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Experimental design, preprocessing, normalization and differential expression analysis of small RNA sequencing experiments</p></title><aug><au><snm>McCormick</snm><fnm>KP</fnm></au><au><snm>Willmann</snm><fnm>MR</fnm></au><au><snm>Meyers</snm><fnm>BC</fnm></au></aug><source>Silence</source><pubdate>2011</pubdate><volume>2</volume><issue>1</issue><fpage>2.</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1758-907X-2-2</pubid><pubid idtype="pmcid">3055805</pubid><pubid idtype="pmpid" link="fulltext">21356093</pubid></pubidlist></xrefbib></bibl><bibl id="B49"><title><p>Extensive variation between inbred mouse strains due to endogenous L1 retrotransposition</p></title><aug><au><snm>Akagi</snm><fnm>K</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Stephens</snm><fnm>RM</fnm></au><au><snm>Volfovsky</snm><fnm>N</fnm></au><au><snm>Symer</snm><fnm>DE</fnm></au></aug><source>Genome Res</source><pubdate>2008</pubdate><volume>18</volume><issue>6</issue><fpage>869</fpage><lpage>880</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.075770.107</pubid><pubid idtype="pmcid">2413154,2413154</pubid><pubid idtype="pmpid" link="fulltext">18381897</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>The mouse intracisternal A particle-promoted placental gene retrotransposition is mouse-strain-specific</p></title><aug><au><snm>Chang-Yeh</snm><fnm>A</fnm></au><au><snm>Mold</snm><fnm>DE</fnm></au><au><snm>Brilliant</snm><fnm>MH</fnm></au><au><snm>Huang</snm><fnm>RC</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1993</pubdate><volume>90</volume><issue>1</issue><fpage>292</fpage><lpage>296</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.90.1.292</pubid><pubid idtype="pmcid">45646</pubid><pubid idtype="pmpid" link="fulltext">7678343</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Evolution of chromosome organization driven by selection for reduced gene expression noise</p></title><aug><au><snm>Batada</snm><fnm>NN</fnm></au><au><snm>Hurst</snm><fnm>LD</fnm></au></aug><source>Nat Genet</source><pubdate>2007</pubdate><volume>39</volume><issue>8</issue><fpage>945</fpage><lpage>949</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng2071</pubid><pubid idtype="pmpid" link="fulltext">17660811</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>Ripples from neighbouring transcription</p></title><aug><au><snm>Ebisuya</snm><fnm>M</fnm></au><au><snm>Yamamoto</snm><fnm>T</fnm></au><au><snm>Nakajima</snm><fnm>M</fnm></au><au><snm>Nishida</snm><fnm>E</fnm></au></aug><source>Nat Cell Biol</source><pubdate>2008</pubdate><volume>10</volume><issue>9</issue><fpage>1106</fpage><lpage>1113</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ncb1771</pubid><pubid idtype="pmpid" link="fulltext">19160492</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>Chromatin remodelling is a major source of coexpression of linked genes in yeast</p></title><aug><au><snm>Batada</snm><fnm>NN</fnm></au><au><snm>Urrutia</snm><fnm>AO</fnm></au><au><snm>Hurst</snm><fnm>LD</fnm></au></aug><source>Trends in Genetics</source><pubdate>2007</pubdate><volume>23</volume><issue>10</issue><fpage>480</fpage><lpage>484</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.tig.2007.08.003</pubid><pubid idtype="pmpid" link="fulltext">17822800</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>Large-scale transcriptome data reveals transcriptional activity of fission yeast LTR retrotransposons</p></title><aug><au><snm>Mourier</snm><fnm>T</fnm></au><au><snm>Willerslev</snm><fnm>E</fnm></au></aug><source>BMC genomics</source><pubdate>2010</pubdate><volume>11</volume><fpage>167</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-11-167</pubid><pubid idtype="pmcid">2848245</pubid><pubid idtype="pmpid" link="fulltext">20226011</pubid></pubidlist></xrefbib></bibl><bibl id="B55"><title><p>Human LINE retrotransposons generate processed pseudogenes</p></title><aug><au><snm>Esnault</snm><fnm>C</fnm></au><au><snm>Maestre</snm><fnm>J</fnm></au><au><snm>Heidmann</snm><fnm>T</fnm></au></aug><source>Nat Genet</source><pubdate>2000</pubdate><volume>24</volume><issue>4</issue><fpage>363</fpage><lpage>367</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/74184</pubid><pubid idtype="pmpid" link="fulltext">10742098</pubid></pubidlist></xrefbib></bibl><bibl id="B56"><title><p>LINE-mediated retrotransposition of marked Alu sequences</p></title><aug><au><snm>Dewannieux</snm><fnm>M</fnm></au><au><snm>Esnault</snm><fnm>C</fnm></au><au><snm>Heidmann</snm><fnm>T</fnm></au></aug><source>Nat Genet</source><pubdate>2003</pubdate><volume>35</volume><issue>1</issue><fpage>41</fpage><lpage>48</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1223</pubid><pubid idtype="pmpid" link="fulltext">12897783</pubid></pubidlist></xrefbib></bibl><bibl id="B57"><title><p>B2 RNA binds directly to RNA polymerase II to repress transcript synthesis</p></title><aug><au><snm>Espinoza</snm><fnm>CA</fnm></au><au><snm>Allen</snm><fnm>TA</fnm></au><au><snm>Hieb</snm><fnm>AR</fnm></au><au><snm>Kugel</snm><fnm>JF</fnm></au><au><snm>Goodrich</snm><fnm>JA</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2004</pubdate><volume>11</volume><issue>9</issue><fpage>822</fpage><lpage>829</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb812</pubid><pubid idtype="pmpid" link="fulltext">15300239</pubid></pubidlist></xrefbib></bibl><bibl id="B58"><title><p>The SINE-encoded mouse B2 RNA represses mRNA transcription in response to heat shock</p></title><aug><au><snm>Allen</snm><fnm>TA</fnm></au><au><snm>Von Kaenel</snm><fnm>S</fnm></au><au><snm>Goodrich</snm><fnm>JA</fnm></au><au><snm>Kugel</snm><fnm>JF</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2004</pubdate><volume>11</volume><issue>9</issue><fpage>816</fpage><lpage>821</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb813</pubid><pubid idtype="pmpid" link="fulltext">15300240</pubid></pubidlist></xrefbib></bibl><bibl id="B59"><title><p>Human Alu RNA Is a Modular Transacting Repressor of mRNA Transcription during Heat Shock</p></title><aug><au><snm>Mariner</snm><fnm>PD</fnm></au><au><snm>Walters</snm><fnm>RD</fnm></au><au><snm>Espinoza</snm><fnm>CA</fnm></au><au><snm>Drullinger</snm><fnm>LF</fnm></au><au><snm>Wagner</snm><fnm>SD</fnm></au><au><snm>Kugel</snm><fnm>JF</fnm></au><au><snm>Goodrich</snm><fnm>JA</fnm></au></aug><source>Molecular cell</source><pubdate>2008</pubdate><volume>29</volume><issue>4</issue><fpage>499</fpage><lpage>509</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2007.12.013</pubid><pubid idtype="pmpid" link="fulltext">18313387</pubid></pubidlist></xrefbib></bibl><bibl id="B60"><title><p>DICER1 deficit induces Alu RNA toxicity in age-related macular degeneration</p></title><aug><au><snm>Kaneko</snm><fnm>H</fnm></au><au><snm>Dridi</snm><fnm>S</fnm></au><au><snm>Tarallo</snm><fnm>V</fnm></au><au><snm>Gelfand</snm><fnm>BD</fnm></au><au><snm>Fowler</snm><fnm>BJ</fnm></au><au><snm>Cho</snm><fnm>WG</fnm></au><au><snm>Kleinman</snm><fnm>ME</fnm></au><au><snm>Ponicsan</snm><fnm>SL</fnm></au><au><snm>Hauswirth</snm><fnm>WW</fnm></au><au><snm>Chiodo</snm><fnm>VA</fnm></au><etal/></aug><source>Nature</source><pubdate>2011</pubdate><volume>471</volume><issue>7338</issue><fpage>325</fpage><lpage>330</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature09830</pubid><pubid idtype="pmpid" link="fulltext">21297615</pubid></pubidlist></xrefbib></bibl><bibl id="B61"><title><p>Reverse transcription in genome evolution</p></title><aug><au><snm>Mourier</snm><fnm>T</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><issue>1-4</issue><fpage>56</fpage><lpage>62</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084938</pubid><pubid idtype="pmpid" link="fulltext">16093658</pubid></pubidlist></xrefbib></bibl><bibl id="B62"><title><p>Alu-SINE exonization: en route to protein-coding function</p></title><aug><au><snm>Krull</snm><fnm>M</fnm></au><au><snm>Brosius</snm><fnm>J</fnm></au><au><snm>Schmitz</snm><fnm>J</fnm></au></aug><source>Mol Biol Evol</source><pubdate>2005</pubdate><volume>22</volume><issue>8</issue><fpage>1702</fpage><lpage>1711</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/molbev/msi164</pubid><pubid idtype="pmpid" link="fulltext">15901843</pubid></pubidlist></xrefbib></bibl><bibl id="B63"><title><p>Impact of transposable elements on the evolution of mammalian gene regulation</p></title><aug><au><snm>Medstrand</snm><fnm>P</fnm></au><au><snm>van de Lagemaat</snm><fnm>LN</fnm></au><au><snm>Dunn</snm><fnm>CA</fnm></au><au><snm>Landry</snm><fnm>JR</fnm></au><au><snm>Svenback</snm><fnm>D</fnm></au><au><snm>Mager</snm><fnm>DL</fnm></au></aug><source>Cytogenet Genome Res</source><pubdate>2005</pubdate><volume>110</volume><issue>1-4</issue><fpage>342</fpage><lpage>352</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1159/000084966</pubid><pubid idtype="pmpid" link="fulltext">16093686</pubid></pubidlist></xrefbib></bibl><bibl id="B64"><title><p>Transposable elements have rewired the core regulatory network of human embryonic stem cells</p></title><aug><au><snm>Kunarso</snm><fnm>G</fnm></au><au><snm>Chia</snm><fnm>NY</fnm></au><au><snm>Jeyakani</snm><fnm>J</fnm></au><au><snm>Hwang</snm><fnm>C</fnm></au><au><snm>Lu</snm><fnm>X</fnm></au><au><snm>Chan</snm><fnm>YS</fnm></au><au><snm>Ng</snm><fnm>HH</fnm></au><au><snm>Bourque</snm><fnm>G</fnm></au></aug><source>Nat Genet</source><pubdate>2010</pubdate><volume>42</volume><issue>7</issue><fpage>631</fpage><lpage>634</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng.600</pubid><pubid idtype="pmpid" link="fulltext">20526341</pubid></pubidlist></xrefbib></bibl><bibl id="B65"><title><p>Ultrafast and memory-efficient alignment of short DNA sequences to the human genome</p></title><aug><au><snm>Langmead</snm><fnm>B</fnm></au><au><snm>Trapnell</snm><fnm>C</fnm></au><au><snm>Pop</snm><fnm>M</fnm></au><au><snm>Salzberg</snm><fnm>SL</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><issue>3</issue><fpage>R25</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-3-r25</pubid><pubid idtype="pmcid">2690996</pubid><pubid idtype="pmpid" link="fulltext">19261174</pubid></pubidlist></xrefbib></bibl><bibl id="B66"><title><p>Sequence complexity and DNA curvature</p></title><aug><au><snm>Gabrielian</snm><fnm>A</fnm></au><au><snm>Bolshoy</snm><fnm>A</fnm></au></aug><source>Comput Chem</source><pubdate>1999</pubdate><volume>23</volume><issue>3-4</issue><fpage>263</fpage><lpage>274</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0097-8485(99)00007-8</pubid><pubid idtype="pmpid" link="fulltext">10404619</pubid></pubidlist></xrefbib></bibl><bibl id="B67"><title><p>The UCSC genome browser database: update 2007</p></title><aug><au><snm>Kuhn</snm><fnm>RM</fnm></au><au><snm>Karolchik</snm><fnm>D</fnm></au><au><snm>Zweig</snm><fnm>AS</fnm></au><au><snm>Trumbower</snm><fnm>H</fnm></au><au><snm>Thomas</snm><fnm>DJ</fnm></au><au><snm>Thakkapallayil</snm><fnm>A</fnm></au><au><snm>Sugnet</snm><fnm>CW</fnm></au><au><snm>Stanke</snm><fnm>M</fnm></au><au><snm>Smith</snm><fnm>KE</fnm></au><au><snm>Siepel</snm><fnm>A</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2007</pubdate><volume>35</volume><issue>Database</issue><fpage>D668</fpage><lpage>673</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkl928</pubid><pubid idtype="pmcid">1669757</pubid><pubid idtype="pmpid" link="fulltext">17142222</pubid></pubidlist></xrefbib></bibl><bibl id="B68"><title><p>Ensembl 2007</p></title><aug><au><snm>Hubbard</snm><fnm>TJ</fnm></au><au><snm>Aken</snm><fnm>BL</fnm></au><au><snm>Beal</snm><fnm>K</fnm></au><au><snm>Ballester</snm><fnm>B</fnm></au><au><snm>Caccamo</snm><fnm>M</fnm></au><au><snm>Chen</snm><fnm>Y</fnm></au><au><snm>Clarke</snm><fnm>L</fnm></au><au><snm>Coates</snm><fnm>G</fnm></au><au><snm>Cunningham</snm><fnm>F</fnm></au><au><snm>Cutts</snm><fnm>T</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2007</pubdate><volume>35</volume><issue>Database</issue><fpage>D610</fpage><lpage>617</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkl996</pubid><pubid idtype="pmcid">1761443</pubid><pubid idtype="pmpid" link="fulltext">17148474</pubid></pubidlist></xrefbib></bibl><bibl id="B69"><title><p>R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL</p></title><aug><au><cnm>R Development Core Team</cnm></au></aug><pubdate>2007</pubdate><url>http://www.R-project.org</url></bibl></refgrp>
</bm>
</art>