Repbase Reports

2004, Volume 4, Issue 7
July 31, 2004
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 199

RandI-4

RandI-4 is a family of non-LTR retrotransposons; this entry describes its consensus sequence.

Submitted:
00-Jul-2004
Accepted:
31-Jul-2004
Key Words:
non-LTR retrotransposon; RandI superfamily; AP endonuclease; reverse transcriptase; RNase H1; RandI-4
Source:
Chlamydomonas reinhardtii
Organism:
Chlamydomonas reinhardtii
Taxonomy:
Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Volvocales; Chlamydomonadaceae; Chlamydomonas
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
RandI-4, a family of RandI non-LTR retrotransposons from the Chlamydomonas reinhardtii genome
Journal:
Repbase Reports 4:(7) p.199 (2004)
Abstract:
RandI-4 is a family of RandI non-LTR retrotransposons. Approximately 50 copies of RandI-4 are present in the genome. The 5' terminal portion of the master RandI-4 element is not completely reconstructed. Most RandI-4 elements are shorter than 100 bp and represent the 3' terminus of the RandI-4 consensus sequence. The consensus sequence encodes the 2548-aa RandI-4p protein (nucleotide pos. 1-7644), composed of AP endonuclease (aa pos. 130-350), reverse transcriptase (aa pos. 770-1070), and RNase H1 ribonuclease (aa pos. 2030-2190) domains.
RandI-4p:
RANDIPKLGPTPAALAQDPPTNPTGGPPAGPNTTNSPSPPTSANPGPAARSNTATRIVPKPAGP TKRPARRTPNPTPKLRKLATGTPSAPPPAKRLHQTTIATSHKRRHTATLQNTPTEPTNGQAQAS PSPPPDGRISILTLNTRELLSHHESILHTVTTHTPDILFLTETKIPTRLHYSMGSAKKGLSSHY QTYTSNAPYNPREGTRPHAGVMVGVTHRFSRPSGTTDMNAHNTDSRLAGHVCHIRIAPPHSTPV HAIGIYAPSDDATRAAIYKYCTDQAAKTTAAKEHLLIGGDWNAVLFPSDRSTGQLNATDRAHIA FTTSAGLHPLTPPTHEHPRPHTYLQDQATAPHMSRIDDLLVSHHTHATITTTGGGQEAVHECGD ALDHRALLSSVPLAALRLYPFPTAPAAPSDPDAWVNYHPPTKDQREKARIKMEETIGQEIADLL NTLHPAHLAITNKLYPAGLTPAQAERNTPSPAEFRALPEVSAADTTSISNTLESILQKCLDVLK SYCDKKPPRASTTSPRHHTSKTVTRLLTATSKLKERLTAALTVTNAAGATPPTTSPHPAPTPPP NTTSPTPTPPTPAEADTPTAIRSQLLNARTLLRAAKKEARKLNTKRAARAFRRLVATQPKRAHQ KIFDETDQPRGTPALTDPTTKEVHTAPEKILSSIHQHFTAQLLPINGPRTGHYDIPHSEKQYPW SRPDAPDRMPALSKPTDAPATLLPGIADSTTFFGCVNHLSRRKAPGPDGIPNELLQTIPTSLQT AIHQLMTLMWVKAHIPTHWTQSDTVLLPKKGDPLLIQNKRPIALANTLYKLYTSLITVNAALFS EQAHIFTESQEGFLRGRNTERQVQNLLHAIEDAATTGRDLFLLYIDFTSAFNTIDHDKLLIVMH DLGYPTDLIEVVRDLYGKAATSVRTEHGTTPPIPIQRGTVQGDVLSPLLFIIFIEPLLRWLHVG GRGYQYGCLTKAENDKHNLSSGAFADDLVTATTQISNLRIQVEKIEKYAAWGGLKVNVSKCQLQ GILHGRARADPTLGGPRSKPHQAALARMLEGQIRIDGTAIPYKPPHESYKYLGIHITPTLDWSA QVQYIREKIKKRGQRIATSLATPDQKLRMIQLTIHPAVAYSLPFMPYSTHDVTSLDSLIASTAK MCYGLKRGFPTRAILQPTNALGLGVGSIRELYVRRAGKALVQALNDGGRLGAVTRAMLHLQHSV AQKIPADQLRTQCRFYTSLRQLSLITDAGLELTQQGSAYHTTLHGLSALLARTVPPGADAEQAL PFRLIQPLLELTSDFRELLTPDRTCVIDANTLKNTWPRAQRRHRLALNRLTLALHAETRTRAKT HNSTAPLTTAQRTLPDGHTFTGLHTAHTAPAAVDWGTLNRFLRQRAPATGQTANAGETAAATTA
AATTAAAEPEEDHHAGENHAPAQTRRPGTQQRKRACIKMVREPEGRGSPPADAPAAFDWDPSLD PLTEAHCADLIHTAPSLLGKSKPQNPAAHLRGLLLGPTPPATFTHRHKLSQEEQDAYLAATSDT WKHFSETYRTKALTTTNTLYGQQERVQQVIAHTTSKGAPFYRVQWEPTVITPGSLTLYARNRYR PKPGGLTWLLTPGNEPRVLVEWEDLEEPAAILDAQDNKQELLDALNARLAVPLGPRPPPPATDT HLSHDQRQGRWTDASNDTARGQRTLRHYVDINVEPCNPSRDIHPTQAYTVQLGTVLDPGSPACG THLAHVYDPRGRCIGTISAEQLSLLWQRYQHAQTTWPKVHAKHDQGRGFAAAIARLLLRYKPGG GSKSHPVTEKGECMLCEGLTQTLHTAIGSTTELFASPLNVSDAATTYYTLYEEDMLFGANYDAY SQPWDGVAVACPEPNTEDGAHAVRWALAWAEHTETLTNTRNPPTTATLLILPTQRNAPYTRYLA HPRVTLLGSISATTTHALLGQIHNDTGITLKHDLVLVSNTPGAQLAQPIRDALGTSLYLSGMPV DPQPQLTWAHTRWVTKQVRRMTRTRPAAPRVTPPPVPATPATLAPFAATQPLRLNPYEGFYTDG SVQKDALGQPSAWGAAAIDARTLRCYQIDVNPGQHGCNTITRCELSAIHQVLKLDEVQALPEVT VYTDSAASLFLIRKTINQPQLLTFSKHKALLDNILALLKRRSHAGRRTNLLKVKAHSGIAHNEL ADQAAVKTAKGEQIPGAEWLVEESDNDPRSKYYWIRKIIKPTVEGVPREKNEGHYAHDLNRGIA TLAAPHIAQGHTRETLYTGFVANAIPDLDIKSSTRTMNLQSGRHRKAKRCMDYLYGQLWNNKLA ARFGRTVTGRRAPRAGPDTPAPCPLCRGPDSGGHILGGCHNPQLRAMYIKRHNIAVQTIARAIS RGDCGGCYMVMDATSKADLPDYAADNRLPPWLLPDQEGKPAGRHLHYRPDILLIPSISLAAALN PDFVVLPSEHDTIHIIEAGYTADTNHAAKQHEKAQQHQALAADLREAGWKVQYTPQSAISLGFA GTIRKDLHPLLTSLGVTSHKARQCCDTLHDHAVATLNHIVLTRRRLERGLPPGDPGGT
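
The abstract gives the domain boundaries in amino-acid coordinates and the coding region in nucleotide coordinates (pos. 1-7644, which is exactly 3 x 2548). As a minimal sketch, the domain boundaries can be converted to consensus nucleotide coordinates as shown below. This assumes a single uninterrupted reading frame that starts at nucleotide 1, with 1-based inclusive coordinates throughout; the helper name `aa_range_to_nt` and the constants are hypothetical and are not part of the Repbase entry.

```python
# Coordinates taken from the abstract (1-based, inclusive).
ORF_START_NT = 1
ORF_END_NT = 7644
PROTEIN_LENGTH_AA = 2548

DOMAINS_AA = {
    "AP endonuclease": (130, 350),
    "reverse transcriptase": (770, 1070),
    "RNase H1": (2030, 2190),
}


def aa_range_to_nt(aa_start, aa_end, orf_start=ORF_START_NT):
    """Map an amino-acid range to the nucleotide range that encodes it.

    Hypothetical helper. Assumes one continuous reading frame beginning
    at `orf_start`, with no frameshifts or introns.
    """
    nt_start = orf_start + 3 * (aa_start - 1)  # first base of the first codon
    nt_end = orf_start + 3 * aa_end - 1        # last base of the last codon
    return nt_start, nt_end


# Consistency check: the stated ORF span is exactly three bases per residue.
assert ORF_END_NT - ORF_START_NT + 1 == 3 * PROTEIN_LENGTH_AA

DOMAINS_NT = {name: aa_range_to_nt(*aa) for name, aa in DOMAINS_AA.items()}

for name, (nt_start, nt_end) in DOMAINS_NT.items():
    print(f"{name}: nt {nt_start}-{nt_end}")
```

Under these assumptions the three domains fall at approximately nt 388-1050, 2308-3210, and 6088-6570 of the consensus. The amino-acid boundaries in the abstract are themselves rounded, so the nucleotide ranges are approximate as well.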
Derived:
[1] (consensus)