Repbase Reports

2003, Volume 3, Issue 5
May 31, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 87

GYPSY7-I_AG

GYPSY7-I_AG is an internal portion of the GYPSY7_AG LTR retrotransposon - a consensus sequence.

Submitted:
31-May-2003
Accepted:
31-May-2003
Key Words:
LTR retrotransposon; Gypsy clade; Gypsy group; 4-bp TSD; gag; protease; reverse transcriptase; integrase; env; GYPSY7_AG; GYPSY7-LTR_AG; GYPSY7-I_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
GYPSY7_AG, a family of LTR retrotransposons from African malaria mosquito.
Journal:
Repbase Reports 3:(5) p. 87 (2003)
Abstract:
GYPSY7_AG is a young family of gypsy-like LTR retrotransposons. GYPSY7_AG belongs to the Gypsy group of the Gypsy superfamily. GYPSY7-I_AG, an internal portion of GYPSY7_AG, is flanked by GYPSY7-LTR_AG LTRs. The GYPSY7-I_AG consensus sequence was reconstructed based on multiple alignment of 5 copies; they are ~0.4% divergent from the consensus sequence. The consensus sequence encodes the Gypsy7_AG1p 434-aa gag-like protein (pos. 507-1808), the 1061-aa Gypsy7_AG2p pol-like protein (pos. 1756-4939), composed of the protease, reverse transcriptase and integrase domains, and the Gypsy7_AG3p env protein (pos.4962- 6506). GYPSY7_AG1p: MEALAGRIAALEARFSESNVTDDFQDPPLFFTKQDGSAVDPESFEKIPGVVKDLPIFCGDPSELNSWIND VDGIIRLYQTISSHSLEKQNKFHMICKFIRRKIRGEANDALVASNVGINWNMMRKTLITYYGEKRDLETL DFQLMSVYQKGRTLEVYYDEVNRLLSLIANQIQTDDRFNHPEASKAMIGTYNKKAIDAFIRGLDGDVYKF IRNYEPTSLAAAYSYCISFQNLECRKMLTKPKHFNTPPSAPRNQIPLPTPHLPPRVFQHQQRPMTANNVR PHFAHHPPIQNFAGNFTQRPVWNQPNQQRPIFQRTNFNQPNQMKNFTQQRNNFRQNGPEPMEIDPSIRSH QVNYANRPNSSNIRPLKRQRAFNIEAVPRRELEPTSYEDNLYDDDVESQASYERYMRNVEKQEKLNENSH YDEISREAELNFLG GYPSY7_AG2p: KFSLRRNFSRSRIKFFRLKSALPYFIYHGKAGQQIKILIDTGSNKNFINPLHAKISHDVIKPFFVSSVGG DLLITKYSQAQIFAPYSDVNVKFYHLQGLKSFDAIIGYDTIKEMGAFVDAKRDNLVLENFIIPLSLHPLQ EVNRIEIRDTHLNHQEKEKLHLFLNKFQDLFQPPDEKLPFTTKVEATIATNDTEPIYCKSYPYPLSLKQE VETQIKKLLNNGIIRPSRSPYNSPVWIVPKKVDASNEKKYRLVIDYRKINLKTKSDRYPIPDTSTVLANL GNNKYFTTLDLASGFHQIRLAEKDIEKTAFSINNGKYEFLRLPFGLKNAPSIFQRVMDDVLREHIGKICH VYIDDIIVFGKTFDEHLKNLEIVLNTLREANFKIQPDKSEFLRTEVEFLGFIVSEYGLKPNEKKIESILK YPEPQTIRELRSFLGLSGYYRRFVKNYAALAKPLTKLLRGEDGQGHCKITKNQSKNFPIKLDDDAKRAFK TLKEVLSSDDVLAYPDFDHDFILTTDASDKAIGAVLSQNVNGVEKPITFISRTLSKTEENYATNEKEMLA IVWALHSLRNYIYGAKIIILTDHQPLTYAMSPKNNNAKLKRWKAFIEEHNYELSYKPGKTNVVADALSRI QINSLTPTQHSAEEDDLSFIPSTEAPINVFRNQLIFQKGTISSYEFVNPFPKFKRHTFIEPQFSIDFIKD KLKRFMIPGIINGIFTDEPTMGIIQETFKNLFNISTMKARFSQTQVQDICDQEQQIEEIRKIHNFAHRNA KENSLQAIKKFYFPSMRNKIEQYVKNCETCKVEKYERRPPEYIPVKTPIPKYPGEIVHVDIFAYNANFLF ISSMDKFSKYLKLKPIKSKSIADVKEVLLQLLYDWNLPRQIIFDNECTFVSNVIEQSILNLGVSIFKTPV NRSESNGQVERCHSTIREIARCTKGLNPDMSLITLIQQAVYKYNNTIHSFTKETPRKVYIGEQSEELSFR DRSKLKEKIESKIIKIFEEKNEKIKDDKYQDYEPNQFAYEKNKTMNKRDSRYKTVVVKENHPTYIIDSNN RKIHKINLRKN GYPSY7_AG3p: LIFYYRIAFFTLYGVLQASINIFDLTNNPLAIVPLGQAKIRIGYLRTIHPIDLTELEEIISRVFENSTNS TGKSPLQSLINLKLEKLNATISKIRPRRLRTKRWNSIGTAWKWIAGSPDAEDLTIINTTLNSLILQNNEQ LLINNGLSRRFQETTNIANHVIDLQNRIQREHQTEIQQIIKIANLDALQAHIKTLQEAILAAKHGIPNSE LLSIEDLNTVAEFLAQNGIYYTSVEEMLTQATAQVTMNSTHVIFMLKFPRLSYETYEYNYIDSIIQNDKR ILIKHNYIIRNLTHMFELPQPCIDQSSHQLCESKDLEEPSRCIRQLVQGEHTECMYEKVYSTGLVKHINN ANILLNDATAEISSNCSNINHILNGSYLIQFHNCNIFINGELFPSTEVSITGKPYISTLGLIAKEDGIRD EPSIEHLRNITLQHREKLHTISLVNNSLTWKLHIFGSIGLTTIVLITIAILYFITSIRRTKISLNIPTNN TNRQDVHHIETFVKKPTTFHALGRL
Derived:
[1] (Consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2020 - Genetic Information Research Institute