BACKGROUND: Transposable element (TE)-derived sequence dominates the landscape of mammalian genomes and can modulate gene function by dysregulating transcription and translation. Our current knowledge of TEs in laboratory mouse strains is limited primarily to those present in the C57BL/6J reference genome, with most mouse TEs being drawn from three distinct classes, namely short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs) and the endogenous retrovirus (ERV) superfamily. Despite their high prevalence, the different genomic and gene properties controlling whether TEs are preferentially purged from, or are retained by, genetic drift or positive selection in mammalian genomes remain poorly defined.

RESULTS: Using whole genome sequencing data from 13 classical laboratory and 4 wild-derived mouse inbred strains, we developed a comprehensive catalogue of 103,798 polymorphic TE variants. We employ this extensive data set to characterize TE variants across the Mus lineage, and to infer neutral and selective processes that have acted over 2 million years. Our results indicate that the majority of TE variants are introduced through the male germline and that only a minority of TE variants exert detectable changes in gene expression. However, among genes with differential expression across the strains, there are twice as many TE variants identified as being putative causal variants as expected.

CONCLUSIONS: Most TE variants that cause gene expression changes appear to be purged rapidly by purifying selection. Our findings demonstrate that past TE insertions have often been highly deleterious, and help to prioritize TE variants according to their likely contribution to gene expression or phenotype variation.

Original publication

DOI

10.1186/gb-2012-13-6-r45

Type

Journal article

Journal

Genome Biol

Publication Date

15/06/2012

Volume

13

Keywords

Algorithms; Animals; Brain; Chromosome Mapping; Computational Biology; DNA Transposable Elements; Endogenous Retroviruses; Evolution, Molecular; Female; Gene Expression Profiling; Gene Expression Regulation; Gene Library; Genome; Genomics; Long Interspersed Nucleotide Elements; Male; Mice; Mice, Inbred BALB C; Mice, Inbred C3H; Mice, Inbred C57BL; Mice, Inbred NOD; Molecular Sequence Annotation; Phylogeny; Selection, Genetic; Sensitivity and Specificity; Short Interspersed Nucleotide Elements; Transcription, Genetic