Repbase Reports

2003, Volume 3, Issue 5
May 31, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 89

GYPSY8-I_AG

GYPSY8-I_AG is an internal portion of the GYPSY8_AG LTR retrotransposon.

Submitted:
31-May-2003
Accepted:
31-May-2003
Key Words:
LTR retrotransposon; Gypsy clade; 4-bp TSD; CLP protease; aspartyl protease; reverse transcriptase; integrase; GYPSY8_AG; RETRO23_AG_LTR; GYPSY8-LTR_AG; GYPSY8-I_AG
Source:
Anopheles gambiae str. PEST
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
GYPSY8_AG, a family of LTR retrotransposons from African malaria mosquito.
Journal:
Repbase Reports 3:(5) p. 89 (2003)
Abstract:
GYPSY8_AG is a young family of gypsy-like LTR retrotransposons. GYPSY8-I_AG, an internal portion of GYPSY8_AG, is flanked by identical GYPSY8-LTR_AG LTRs. The internal sequence encodes the GYPSY8_AG1p 369-aa protein (pos. 1088-2194), and the 1218-aa GYPSY8_AG2p pol-like protein (pos. 2203-5856), composed of the aspartil protease (aa 9-159), reverse transcriptase (aa 313-486) and integrase domains (aa 942-1100). GYPSY8_AGp1 is not homologous to gag-like proteins encoded by known gypsy-like elements. This protein is similar remotely to the CLP proteases, although the significance of this similarity is low. GYPSY8_AG1p: MITISDRIEILKHIFKKIQDPNQRTCTKTQLRIQAEETFKEIQKEIEKNKFKYTFNKLLEFSKISNALIHN IIAMSTSKTNDDSHNKTSDCSTNSLEDKNTNLTLLTTNKLSFKLLAQTISAFLQLHKKSKMAFDLNSVGFS VLTGMQPFEGKASELTKFIDTVNTVKSIISSSNSLIAINLLLTKLGKEPRELFKELPKSFDEIIVTIKGRY TNTISVAKIYKQLCDRRKGKFENFNTFAKSLNELADQLQTAYIQENMTPDIAKTLTQQNVIIKLKEAGISR DSQLILDIKEFSSITDMLDTVKSYEGKNPPHEQNSRTTNFTPNANAFKAAHSRVMHTQIQTNTPDKKEEEE GDRFLDDDQNQYPQ GYPSY8_AG2p: MTGIKTNNFIKVKLNIANGKNTTLVIDSGAEVSLFKASSLKKQIQLKKNKELTLIGITTDTMKTKGYTQAT IHFGDKNVEHTFFIIKDLPTQADGVLGMDFISKFQCDILFSTWMLQFRNGNDIIEHPVDDSINGIIEIPPR SEVIRKLTLKPITEDSVIFSKEIKPGVFIGNTIISKTDPNIKLINTTESTAFINTGTIRPQIEPLKNYEIF LANNYNTPERTRIIQDKVHIEQVPEIAKRNLKNLIAEFSDIFCLENEPITTNNFYKQPIELMDNNPSYIPN YKQIHSQADEINQQVDKMLKNDIIEHSVSAYNSPILLVPKKSIDGSKKWRLVVDFRQLNKKILPDKFPLPR IDTILDQLGRAKYFSTLDLMSGFHQIELEPASRRFTAFSTPTGHYQFTRMPFGLNISPNSFQRMMAIAMAG LSPELAFVYIDDIIVTGCSAQHHISNLSKVFNKLRKCNLKLNPEKCCFFKTEVTYLGHKITDKGIYPDDSK YEIIEKFPVPKNANDARRFVAFCNYYRKFVQNFAKIAKPINNLIKKDVKFDWTKECQEAFEKLKQSLLSPT ILQYPDFTKQFIITTDASDTACGAVLSQITDGNDLPVAFASKSFTPGEKNKPIIEKELTAIHWAINYFKPY IYGRKFIVKTDHRPLAYLFGMKNPTSKLTRMRLDLEEFDFDVQFLAGKANVAADALSRVVITSDELKAQIP TNIMLATTNNIQNLKSTILMVHTRAMVKQKEAKKVTPHKPIDTRSDQPTMWTTDTPSKTRKLLKIRTSISD NNITFEVCNSTYKKVLGKVNAKAEKNGSQALELALLNICKIANNYKNKKLAWSLHDQIFTQYSHQTLKEIA NRVIDKYEFILFTPPRWVETEHDRLRIIHDYHMTPSGGHVGQFRLYRKIRDTYTWKNMRNDIKNFINKCEA CLVNKVNRHTKEQTVITTTPNKPFNVISIDTVGPLTKTNNNYRYAITIQCDLTKYVVIIPIHNKEANTIAK ALVENFILTFGTFIELKSDQGLEYNNEILNKITEILQIKQTFSTAYHPQTIGALERNHRCLNEYLRSFVNE HHDDWDDWIKFYEFVYNTTEHTDTGYTPYELIFGRKANLPQEIYQNKIEPVYNVEQYYNEMKFKLQKSREI AQKNLVSSKENRQNILNKNTNSLKLEIGDIVYLTNENRKKLDPVYIGPFTVTNIQEPNCTIIHKTTNKSST VHKNRLIKSKE
Derived:
Positions 1735905 1730002 Accession No AAAB01008859.1 GenBank
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2020 - Genetic Information Research Institute