Repbase Reports

2004, Volume 4, Issue 2
February 29, 2004
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 39

L1-1_CR

L1-1_CR is a L1-like non-LTR retrotransposon - a consensus.

Submitted:
29-Feb-2004
Accepted:
29-Feb-2004
Key Words:
non-LTR retrotransposon; endonuclease; reverse transcriptase; L1 superfamily; L1-1_CR
Source:
consensus
Organism:
Chlamydomonas reinhardtii
Taxonomy:
Eukaryota; Plantae; Thallobionta; Chlorophycota; Chlorophyceae; Volvocales; Chlamydomonadaceae
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
L1-1_CR, a family of L1-like non-LTR retrotransposons from the green algae genome.
Journal:
Repbase Reports 4:(2) p. 39 (2004)
Abstract:
Several hundred copies of L1-1_CR are present in the C. reinhardtii genome. They are ~98% identical to the consensus sequence and constitute nearly 1% of the genome (some elements are less than 1% divergent from each other). L1-1_CR elements usually are flanked by 2-14-bp target site duplications. Many elements are inserted as a 15-50 bp long 3' terminal portion of the consensus, including the polyA tail. L1-1_CR1 encodes the 350-aa L1-1_CR1p (pos. 1057-2106) and 1534-aa L1-1_CR2p (pos. 2109-6710) proteins. L1-1_CR1p is not similar to known proteins; L1-1_CR1p is composed of the L1-like endonuclease and reverse transcriptase domains. L1-1_CR1p: MALPRRRPGVGDGCNQPRTHRRRRRSNSPAPKGAVAPPSSRSRGQGAWAAGCLPRRTVLARRRTCPLGGR PGDSRLVGAAGRGARVEERGGQCPSRRPRRGRHCRRHNRRVVVRAGAGVSADTAGRAGGGAGSEAEYGAV CLTRDTCVAIGSDLVTHQNRCADVQGYEWAQLGRDTLGGRLLAYLRDQGATVPDWAVCRVPAGRTAVLAQ LHDEFTGWWRLYSAQPADTPLPSDVTEQLDDAAQQVQTAYMDYVLLDARTLRSAKRGPADPGGASGPSRR QRQHRSRSTSSLMSLGSAPLRPGGASSGAASMSTSSGGAPPSQGGGRRGKRHSSNSNRNRHGAAVAGGGS L1-1_CR2p: MRPPVGAANLRLLTVNVNGLGSPFKARALVSHLQQVGADVAMVQETHATDTTALESCLRAAQGACLPWRH CLAASPAASPHSCGAAILARSRLSLPGCVLQPPSTDAAGRVVCWDWDVGHLRLRFVCVYAPTAVADKPAF FAGLHPHLATDRVLVVGGDWNCVTDASQEAAPSPSRAAGAPQLASLLAQFSLVDPWASKRGGAKGYTHPA TPKPATPARLDRWYVSATAAPWVVDVARTYGAPGDHNGVLLTLSLPDLPHAHREQWRFPTYLLFHPSLRL ELEQRLEAHVAANPVASTGDGACTQWEADKFFLREAATSIHRRHARQTRDGLHGVVLAADAAAALADRPG ASAAQRQAAAMANLAVREERAAAAAASHNARAALMEEHGERGTRWFHRQADEPAAGAQEPITHLKVPGQP APVALTGPGTRNTVSAAAAAMYSSTSPTGLFRVQPVCTASQQQLLAAIDRKVPADLQAAAEGSGDGALSD AELMAALAGSANGKAPGSDGVPYEVYKVFWALLGPRLCAAAAAAFAAAADAHDGGEMAAALPASWREGII TLIYKGKSLDRAELASYRPITLLNCDFKMVSKAVSARLQPALDAVVDELQTAFITGRWIGDNALYLQGLI EWMRLDVGADGTPRQGGALYFLDIEKAYDRVHRQWLYASAEGLGFGPRMLRWIRLLTANGSARVCVNGML SDAFPVLNGLPQGSTASPPLWVIQMQPLTSFLRRQVEQGALRTPLLPSGEQAPPAAHHADDTTLTARDPA VDGPVLMAAVQLFCRASNARVHPDKSKAMGLGRFAHLTGPCPHTGVPFTTGAVTHLGVPLSWDSDAAAAD LYTRRARGMAFVARLWAALSLTLVGRVHIAKQVLAAKLAYHFSFLNPSPAQLKELTDLVDHFAARSMHAE DASLVSHGNPLLLPKRETACLPYKDGGVNHVDLPAFLSALQAKTFALLAQPGRQPWKMLTRALLTHVRPD SATTWAWVYSDAPAPAGLPARLAAAVGHVRSAGVEQHPPQPATQPPAAPPQWRVSLDQLWVANAAGAVSY VHYTGRLLEPGPGVLPPAVDGAWQPACVLQHRKPRHLWTFEERAAYDAASPGERAGAWPRAPYFLAPEAG VVVHPEHCRIAGVSLADYTVRDVRRAITAANPAAPPAPARPAAMPCPAPAQQAGGSGTQPAAQSRLAERE AEWQRAAAQLTTTAAQHFHNNPVALDPWLHRTSAAAGLQNTPARELQSYASPSQQSGEGPRRSARLQEQA AGGAGPSTGPATAAAAAAAAVEGDPRMPPPDASLLRGTWRRLWDSHASRGAKVLVYRLQHAYLPCGLYRA GKGIRPRVTTGCGGLGAHCPHPACGPPGPRAWASLTHIFLECPAYAQARTWLQQLWACVAPQAAAPPVTD AGFMLGDRMGMWASGPRGAGALLWSTLRATFLYAVWCAYWSREPAKQTSEHVVREVVSELRRVMQLRFTA ATLTPETLSALPTQLLTAQLKAAKLEHFVAIWSAGGALCEVEEVQGGSPKLNLRLTLASPVQAP
Derived:
[1] (Consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2020 - Genetic Information Research Institute