Repbase Reports

2004, Volume 4, Issue 7
July 31, 2004
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 196

RandI-1

RandI-1 is a family of non-LTR retrotransposons; this entry is a consensus sequence.

Submitted:
00-Jul-2004
Accepted:
31-Jul-2004
Key Words:
non-LTR retrotransposon; RandI superfamily; AP endonuclease; reverse transcriptase; RNaseH; RandI-1
Source:
Chlamydomonas reinhardtii
Organism:
Chlamydomonas reinhardtii
Taxonomy:
Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Volvocales; Chlamydomonadaceae; Chlamydomonas
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
RandI-1, a family of RandI non-LTR retrotransposons from the Chlamydomonas reinhardtii genome
Journal:
Repbase Reports 4:(7) p.196 (2004)
Abstract:
RandI-1 is a family of RandI non-LTR retrotransposons. Approximately 100 copies of RandI-1 are present in the genome. They are ~1% divergent from the consensus sequence and are likely mobile. The RandI-1 elements are usually flanked by 10-15-bp target site duplications. The 5'-terminal portion of the consensus sequence may still be incomplete. The consensus sequence encodes a 3571-aa RandI-1 protein (nucleotide pos. 2-10714) composed of the AP endonuclease (aa pos. 990-1250), reverse transcriptase (aa pos. 1600-1900), and RNaseH (aa pos. 3040-3200) domains.
RandI-1p:
XRMHTNTPLTNLASSSLGNMWKMVARRKVERKSAGICVLICFALCMLTTNATTAGNASGSAAAD RWFDRGPPRHSSAFALLGNHDRTMPIQKWSARGLTTCTESSAKGGCSHVCNSRATGAQLPALSH TDGPIATHGINHAHGSVWRENFSFAHSMFTVHSNPANATLLSDRCSQKGIFDTSILHTTCYTAS KRAGAEPFGMSPKRPSQSPTAAKHPSFEELISSKHIPVASEPLANDVPERGRAEPPVFTANSQL YENASLLTEPPPTSSERRQYRSPSGLLSRAKVKPKPGCTNIRNVMITALQPVGEGLLSTNLKIA QACRSGVTKLCTAEGAYRSYATKTIGLLGESLLTSVLALGDLGSRHGGTMLRSGASALHSMAET AYGPQPTLNCSAPWENSPLLRLHLTLVVHFHWAGAGHVRAAMHSHTIAIRVGAHLATRSRLAIY QTAAQTGKALHTILDKAIVSLTIITATGAMYMLWTHLLHAETRTASVNSEICANGAILSLLLWG PTLTTLALSPLCVIALALGITAIPAALGLLFALTVTSILAVLCLPALLKHVVQVTRQIASATWA RRKLLLWGTAMAIILRNLYLTLPDPCSVQPAQPALWGPALVTGLLTASQATTLLTLTYAVNRTH TMPYVKPTLRGALPTDPTAPPAPADPRTHAAVNIRPPAALHWPPNPNTTYRETQRQHFCSVHAL NNSLGLAWLDPLDVLSYAKRVHAHLTATQDPNALFWKECYCPNSGAFSEFLLNHYLYHNATISN IFAYPNRKLIMRRTHFPRLNGDISKEKVLESLPVAARTRGFTVHQYTVRHTIAVRYEAGQWRVI DSVNSPIHNTVLHDNTWNTLDGEVWCLDAVRASDTATLQDALGQLREIVPPPRPPPHMEATRPV QTGPAATGRPAPEPAQHAQDNSPLRAAHRAPQHAAPHTTAPAQPHNANAGIAPRLNTTGTRQTI LNWLVRPTARNHAHMEPNQAHTNTRPPSSNTTLHIVTHNVRGLLSEILSTGQPGNLSFTLCNLA QWKADIVVLTETKLTGKTDSIKRAFRNEGYRLYCSTTPRVATARGSAGVAVAISARYSDLGCVT HHTPPPTLHGYVSHVQIKTPGSTPLTVLGIYAPEDMQTRKAIYTYCQQVVGRADACGHHLVTAG DFNAVARAHERDSPIDTADRAHQRFLADSGLRPIRGDTTTTAEWSYEQTRPGMAPYHSRIDDIL LCPATRAACTEAREYTSTVAGNFDHKPVHAELLAADLQLWPAPQAGARNPAPQQQTQQRWAEVA LPVTQKQLAAAAIRLEEALVEATADLHSATRQATQSIEHALTRHSMDPTGYPASVMHRDLAQDT SIQKADINQLAEQLASALDTGLTCLLEECTRKAPFTGKHHASRSTARVLKPLWDKEVALKTQLT
NLTQGPQALPEVDAANAAAKLRNEIKACQAEHRQLVADRAKAQREAAATALQHTLATRPAQGHK RIFQKEDMERGLPAVRNPETGEVTTDSTSILAILETHFRKLSAPPRGTRTGDFRLPSNATRGYP FEKADATDQFTLDRNRHPDTHSMLPSMADTANFEQCISHLSRNKATGPDGIPNELLRILPSGMK RNLHCILQIMYVKSQIPETWAASETVLLPKPGDALDIKNKRPIALANTCYKLYTSMLTLGIGEL AGPLQLFSEAQEGFRAYCNTERQVLNLVHALEDAALFGKDVYAVYVDYSSAFNTIDQDRLLQIM FDLGLPTDLIRAVRNLYAHATTRIRTEHGSTSAIPIERGTVQGDTLSPVLFILFMEPLVRWLHA GGRGYHYGCLTPSENLQYHCSAAAYADDLAALTNSLDDLQVQCDKIASYAEWASLRVNHTKCAT TAIWHDKSRSDPNLDGPTGKATLAAMRRNMTNTIKIGTTPVPYFPPTQPYKYLGVQLTFSLDWS AHVARVTEIVKDKGTAIATSLATPAQRLRMIQQCVHTTVAYGFAAMPFTKQDITTLDTTLAGFA KRCYGLPRSFPTRTSLLPANEYGLGLGSLLPQYARVAQRALVLALNDSGRLGIVTRALLPRQAS IAGPTQAHLLPAHRSHHLTTLKQMTLAKEYGVTLYQNGSAFTAPTWSIAAALEAEAEARGVEPL PIEYVLPLADLRLELSHLVDRNTGKHLITSSDLEKHMGASRVRHKHKVALNRLSLALSMAARAG ENAPAHGSPAPLTTAQRALPDVAAIVALAMRADPLPLGMANPLDRYLQPLPEAPPAQANTTQAP ADLAPPSPAGAAQAHPGAQGAAAPEVPADGATQPSLRPPARARKPATTRRPPQNQTRARRVTKH AAQRARIAETTRTTPEDLLTGDRAAINHALRLTNGDESLPVGTRGYETACSVFQALAKDMTGHM RICHPHPQDPPSLAHAVEPRQQRRGLPPALTSRLTANAANPEPNAISEYLNNTGLPMEVNQAAA HNDPMDADSPTPPLTAPAAQAAGGHGARPSWRLARARITHDLVPRPADNRTNGDTQLTYDLDND GQELAQVLHSENKQTVTDKELTRKRKAREDTSGTREKYWKVQWKPSICPAGIVSAFVRMGYATI QNARCYKHPLATRPGPQPAQTEAEPPAETQAHAISPEEPAQAAEPAAPQPTGAPLFSSWTTKLL RHVTWAPSTDRERDLKHSPQWEELLEECKTRLANGTNQGPRPRTVRPAPDRNLTARQRQGRHDA EPLDTASRARDIRTTTTITTSPCNPYKDIVAPGAYTITTTGGTRRDPAEAHVHEPSGRWLGTIT YPRLLTLWERFRHTGNQRPNAFEEAVAALIMRYRYDPAQQDRKAMPMHQVSLPAGTVAAIVQCL RVTQAVHIREMFASPLNSSTAAHEYWTRDPADGAFGALHDAYQTAWTGLQYAHPPSTPHDARKA LMWALACAEAMRDSQEPTLTVLALPKAATFPHTQWLQHPLCHELANWSAGTAGLDTGLGTNTQA ERQKGLRLVIVGNPAGLQCFAPRLKRLVDTLKRGANAPTHISDPRTWTHTATAPTCPALPNSLL KQAKYQHANPKTIHAMADEARRFPNARFQTHHALAHDVNGTVWTDGSVSKIKTDNGKEVQVAGA CAWFSDSRVVYVNPNGAGCTNTITRAELAAIRAALAEFGGEGKEFATKKLTIASDSVASLYLIK RAINEPRRLHLSKHRDLLDSVVALLHARHARGAPTALIKVISHTGLHGNEEADKGAAAVATQEK PADVSEPADNDPHARHWWPTFTVTRDDDSTATHYVSDLNRGLTRAMAPSCRGGYANKTLYTEKW AEAAPSLDPRASNAYMTNSDASHGLRRFIQNARQGGLICPARLALFKLRDSDKCGLCAHTQRSR 
GSDPERGTAGHLAGHCAHPQLVGARIAKHNTAVRTIAECLHHGHNGGGYMIMDATSRAELPEYC AGMRPPSWLCPQVPAADLNRMRPDILFIPNLPRSEAERFMTSPPANKGAYPVYILEVGYTSDLH HSEKLIQKQAQHATLATAMRAAGWTVHYDTKHVITLGHGGTVPRTLETLLKDLGAKPQAAKACC TRLHMHSVTTLRGTANLYYRLEREMGIANTRCRPGGRTQGTANGGPRAGEPG
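The coordinates quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names (orf_length_nt, aa_to_nt_range) are hypothetical, and it assumes that all positions are 1-based and inclusive and that the nucleotide range 2-10714 covers exactly the translated codons with no stop codon. Under those assumptions it confirms that the range holds 3571 codons and maps each domain's amino-acid range onto consensus nucleotide positions.

```python
# Coordinates copied from the abstract; the helper names are hypothetical.
ORF_START, ORF_END = 2, 10714      # nucleotide positions on the consensus
PROTEIN_LENGTH = 3571              # amino acids, as stated in the abstract
DOMAINS = {
    "AP endonuclease": (990, 1250),
    "reverse transcriptase": (1600, 1900),
    "RNaseH": (3040, 3200),
}


def orf_length_nt(start, end):
    """Length in nucleotides of a 1-based, inclusive coordinate range."""
    return end - start + 1


def aa_to_nt_range(orf_start, aa_start, aa_end):
    """Map a 1-based inclusive amino-acid range to nucleotide positions.

    Assumes residue 1 is encoded by the codon starting at orf_start.
    """
    nt_start = orf_start + 3 * (aa_start - 1)
    nt_end = orf_start + 3 * aa_end - 1
    return nt_start, nt_end


nt_len = orf_length_nt(ORF_START, ORF_END)
codons = nt_len // 3
print(f"ORF length: {nt_len} nt, {codons} codons, remainder {nt_len % 3}")
print(f"Matches stated protein length: {codons == PROTEIN_LENGTH}")

for name, (aa_start, aa_end) in DOMAINS.items():
    nt_start, nt_end = aa_to_nt_range(ORF_START, aa_start, aa_end)
    print(f"{name}: aa {aa_start}-{aa_end} -> nt {nt_start}-{nt_end}")
```

The domain boundaries in the abstract are approximate, so the mapped nucleotide ranges should be read as rough guides rather than exact feature coordinates.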
Derived:
[1] (consensus)