Repbase Reports

2004, Volume 4, Issue 7
July 31, 2004
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 198

RandI-3

RandI-3 is a family of non-LTR retrotransposons; this entry reports its consensus sequence.

Submitted:
00-Jul-2004
Accepted:
31-Jul-2004
Key Words:
non-LTR retrotransposon; RandI superfamily; AP endonuclease; reverse transcriptase; RNaseH; RandI-3
Source:
Chlamydomonas reinhardtii
Organism:
Chlamydomonas reinhardtii
Taxonomy:
Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Volvocales; Chlamydomonadaceae; Chlamydomonas
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
RandI-3, a family of RandI non-LTR retrotransposons from the Chlamydomonas reinhardtii genome
Journal:
Repbase Reports 4:(7) p.198 (2004)
Abstract:
RandI-3 is a family of RandI non-LTR retrotransposons. Approximately 20 copies of RandI-3 are present in the genome. They are ~1% divergent from the consensus sequence and are likely mobile. RandI-3 elements are usually flanked by 10-15-bp target site duplications. The consensus sequence encodes the 3671-aa RandI-3p protein (nucleotide pos. 98-11110), which is composed of AP endonuclease (aa pos. 1030-1300), reverse transcriptase (aa pos. 1650-2000), and RNase H1 (aa pos. 3170-3340) domains.
RandI-3p: MATMLVLITYVMLILGVLVRTSHWPEEVRPFHTPEVLQALAGLLARSWTPSLTSGGPEEVATTA MHRMPPQKYAPFCIPATLVATNPAICSAGSRLSSGSYGSQQMNVRSAAPTPTQQSQQPLPGTRK QRLCNETNTTLPVSLRPPAPDVQNLDLSRDRRRPATTSPRAANSCTSRTLKSPTLQHYVSTSLQ PPASDGQNSDMSRGPATTSPMAANPFSPKTLKSPASQHYSTAERPPDDTPHDPGPGSSARGPRA PAALLCHTRTASPALQLWCQTMGPYPIVQLQMHNDTDPLIATVSYISAHITSALCWTAVGTFHG SCALLRHTIASLRTLLTHTAAYMTTCTGTLIAHSWRTAASITLWPPALTLVAASAARLAWSART MLSHPSAGNLTTLTFTLLATLTLSQPLLLLISTHQHISTTLTLLTTTAAIYATLRKTHGAPLIL LGLPVYATCRAVAVKPAYTMRTLLTILLLLTNIQATAEAPELTPAAAPLAMQLLTRIPGSPTLN TGVVGLTYLLSQTHIHPQPACLALTTNLATYRYLMDVTTPWSHTLGSPIPPPIQLRGGNPIEHT PATNRVAAATESGRKPERIQWPRQIQHTFFREQQQLNFCTVHAINNMLGKPWLLATDVLAFCQV SHTAMADTPALYTLWGHHFRYHGLFHEAAINHYLRHRPPAVDHTGARLLLRTVAKTLTPGHTQD HIISLLPDCSKGLGLMVHDGRHAYAIRKENDTWYLVDSLAHIGRKPMTQELWAATKGDLFCLHP LHPDMPPDPLTYPTIPGLPEPPPPPTPTTQPRPRKGTVQLTLNTTMTRTLNRAPLNPNPENSTT SNRAQTRPQTQTTLANLVARSQTHAEPQGTQLTPPEDPQEPPPRTHAPTHEHGRPPTPPSAARP TRRLRRNDQISPHSPTRHRTQQDATTENQQPQAPPPANDPVPTPQPRPTWAQVLTNNASAVGTA TLHVLTRLSPAPRHRPDTPTSAARPTQRPRQELPGSPTTHTATAPPDPPPTNPTATPPVTEQAA EPAASWLHVVTHNVRGLSTNLLSTVCHLREWDADIVILTETKLGPRTRWLKDTLRHEKLSYRTF TSTKPGTAQYKRRSASVLMAISKRYHSGSNIHTATIPPHLLGHLAHCTIRTPHSIPLHILGVYC PEDMPTRRRIYTYCTSTLKAAAAAGEHVLIGGDFNAVLTAADRTGPLDDADRHHARFVSSHRLH RITEPNGTTSMTYYQTRDGHPQAISRIDDILICQHTSNALVAAATEPGGNTPTLKVQPSGGLFD HSSVHIHLPTFPMRLWTAPTGNRTGNTTQPPTGPWPQVVLPISTSTLEAVRTKIEHTLASPIAN LAAALQPASTAIRDHLDRYTTGATNATELDTQLKGGPTVQAPNLDALAGQLHDILKDGLNILVD MCPKKPPRTGAFAPRRISKKIRRAHEELRQLRAAIADNDTLHTPGNAATGAPTTPQPSPCPPTE
GEPTTPTPTRGPRPPPPNQTVSDLINKIKHVGSALRQTLKKLTQSEVQKSRARFQHKLATRAKQ AHKDIFSNQEQARGIPAVLHPHTKAHCTDKASILEAIHTYFQDQGATITGPRTGNYKPEDRHPN TRYPFDPSNRTAPDTFKLHSTCSPTSHALLPSISDSANFDTCIDHLSRNKSPGPDGIPNELLRA LPTPLRTALHNLLILMWVRAHTPASWTASETVLLPKPGDPLLLKNYRPIAQANTTYKLWTSLIT VAISDISHDLDLFSETQEGFLRYRNTERQTLNLVHALEDAGLTNQDIYVMYADFSSAFNTISHD RLLEIMWDLGMPHDLLRVVQNLYANARTYIRTEHGLTSPVHIERGTVQGDTLSPVLFLMFIEPL IWWLHVGGRGYSYGCLPNHLNNRFHCSSAAYTDDLAVLTNTLSDLRIQCDKIHRYSAWAGLQVN HAKCRVTGILHRRAQQDKGLNGPTCNRTLKSMLENKIHIGDKPVPYLPATEPFKYLGVQITMNL HWGPQFAYLCDAIKEKSANLQTSLASPEQCLRIIKSCIQSMAAYSFAVMPYAENDIRTLDAMIA RLAKKCYRLTPGFPTRATLAPADMSGLGVGSLLPLYAQKATRALTLALNDPGRLGMVTHALLEL QSRIAGTAAIHKLRRESHFYTTLKQLSIMREYDITLWDHGAPYAGPINTILQAVNRNPRPIPHH LVHTLAQAGLDLRALISTERTKEGLTCLIPSDSLFLRLPRNQCKRQHRIALNRLSLCLSAEGAP TRAAPYDQPETYNCHLPLPQQYRALPATPAFAALLPQSATATGAPQTTLMAHRQHMTTLHPEPN STQPHAPTAFNATAPPRAAQPRGVPTRQAALTRHLRPVETACTLATIPAPPTQAGTPDPNQTAT PIALAQGTAPTLPPADTPATQPQTQTNTNKRKRINLQVKKVKGPSASKRLERRIEAEPRHDPLG TRGQRDHTGRHGFVLDHGEPVAGDTVPADFVMWAPRFYSMLQKDMSGKTHIPRLGIPPGKQKFP TTLPRPRYTRQIPTHLTTPYFLNDGEPHNPVQTYLRDHNATKAPLCAYDTPARWAQAGDFIIEP EPPRWPTITGADPANPDDPREADPDPNGMSRKAYMAERALLRQRTQHQDGRPRLITRWMGIRAQ VMTGGLQWPKNTSDATRAASQLIFSLYYDLESVSHVHAGPIHKLIAAPARNERPQTALYYEVAW KPTTIVRGAAAAYTIMKYTGTPHAHLAYGQSCSRYFVHMHWNVTSEPASKCPTLQGWRAAHAAH LQRLSKPPSEPTPPPPRDSHLSPAQRQGRWTPPDVEGAARRRTTRQNTQINTKPCNPYHDIYPT GAYTIQITDTYTDAQTLAHIHGPDGRHISSITMGTLHTLHTRYKHALASNPTETASLRSPGGFP EDVAKLLIRTRVDEKGKPCKTAPALERSLPTCLKTTLLDALAAAHPDETNPALMERFASPLNCS NRVCQYWSAQPADALFGATTDVLSAPFTGLSIAHPGPNPAAYTKALAWAVATTELLEGQMDPAI TLMVIPHNHKSPHTRLFSSPHVTHIAYLPPDTPATNLDEGLGDLSPAITLRHGLDLLLVANTTG VQAFTARNNLANLQAALTDIGAQPPRPTRTTAAVARWPRRITLPRDRHQPTSRPYSHRAQTRAR QATHRTPPTPPAPPTCTVSPPIQAELNALRDCFPAISRPLYDAATVIWTDGSCIKIAVGDDGAE MNQLGACAWTATTGEPLYVEPGGLNSTNTIQRAELSAILAALRHYRPRAQLLGLTKLVIASDSL VSLYLIRRAISDPMKLTLSKHKNLLSDIVKELNECADELLPVVFLKVRSHTGLAGNDEADKGAA LVAQRDAGAMSQAVHLPPDEPWANIWWPKRNSDNFYISDLNRGVLSSLPPRAQYGFTDSTTVLD 
NWAAAADGACPQSSNLTLASPARHPWLVKQILYARYGYLFNATLKRRYGLGGNGLCPLCEAPDS GGHILGGCSQRQMKGAYINRHNIAVQRVAKCISKGSYGGGYMVMDAGPLADLPEYASGQRPPAW LFRNGTPPEVIARLRPDILFIPALDTTTTQREDYRPPRNRSRTPIYILEIGYSSDSQHLHKREE KSQQHQELKQLLEEAGWTVHYSPAECIALGVTGTIPANLVPLLTTLGVSPKAAKTCGHKLHHHA VNSAGGIIKARRIMEYGHTRTPG
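The coordinates quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the Repbase entry. It assumes that all positions are 1-based and inclusive, and that the nucleotide range 98-11110 covers exactly the codons of RandI-3p; the abstract does not say whether a stop codon falls inside that range. The names `codon_count`, `aa_range_to_nt`, and `DOMAINS` are hypothetical helpers introduced here.

```python
# Illustrative sketch: relate the ORF and domain coordinates from the abstract.
# Assumption: positions are 1-based and inclusive, and nucleotide 98 is the
# first base of codon 1 of RandI-3p.

ORF_START, ORF_END = 98, 11110  # nucleotide positions on the consensus

# Domain boundaries in amino-acid positions, as given in the abstract.
DOMAINS = {
    "AP endonuclease": (1030, 1300),
    "reverse transcriptase": (1650, 2000),
    "RNase H1": (3170, 3340),
}


def codon_count(start: int, end: int) -> int:
    """Return the number of whole codons in an inclusive nucleotide range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range length {length} is not a multiple of 3")
    return length // 3


def aa_range_to_nt(aa_start: int, aa_end: int, orf_start: int = ORF_START):
    """Map an inclusive amino-acid range to inclusive nucleotide positions."""
    nt_start = orf_start + 3 * (aa_start - 1)  # first base of the first codon
    nt_end = orf_start + 3 * aa_end - 1        # last base of the last codon
    return nt_start, nt_end


def domain_nt_ranges() -> dict:
    """Nucleotide coordinates of each annotated domain on the consensus."""
    return {name: aa_range_to_nt(*aa) for name, aa in DOMAINS.items()}


if __name__ == "__main__":
    print("codons in ORF:", codon_count(ORF_START, ORF_END))
    for name, (nt_start, nt_end) in domain_nt_ranges().items():
        print(f"{name}: nt {nt_start}-{nt_end}")
```

Under these assumptions the range 98-11110 spans 3671 codons, which matches the stated protein length, and the AP endonuclease domain (aa 1030-1300) maps to nucleotides 3185-3997 of the consensus.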
Derived:
[1] (consensus)