Repbase Reports

2003, Volume 3, Issue 4
April 30, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 75

GYPSY2-I_AG

GYPSY2-I_AG is the consensus sequence of the internal portion of the GYPSY2_AG LTR retrotransposon.

Submitted:
30-Apr-2003
Accepted:
30-Apr-2003
Key Words:
LTR retrotransposon; Gypsy clade; 4-bp TSD; gag; AP protease; reverse transcriptase; integrase; GYPSY2_AG; GYPSY2-LTR_AG; GYPSY2-I_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V., Pavlicek,A. and Jurka,J.
Title:
GYPSY2_AG, a family of LTR retrotransposons from African malaria mosquito.
Journal:
Repbase Reports 3:(4) p. 75 (2003)
Abstract:
GYPSY2_AG is a family of gypsy-like LTR retrotransposons. GYPSY2-I_AG, the internal portion of GYPSY2_AG, is flanked by GYPSY2-LTR_AG LTRs. The GYPSY2-I_AG consensus sequence was reconstructed from a multiple alignment of 20 copies, which are ~1% divergent from the consensus. The consensus sequence encodes the 1408-aa GYPSY2_AGp polyprotein (exons 846-1789 and 1874-5156, predicted by FGENESH), composed of putative gag-like (pos. 1-300), AP protease (pos. 325-410), reverse transcriptase (pos. 514-681), and integrase (pos. 1030-1200) domains.
GYPSY2_AGp:
MLHSPPVRDVSTPDGVTPSADPAASGSKSPHVPTPPVPNTPRVPGPSACDAMFMPPESQI
DTLNAMQLKPPEMDTTDIQTFFFALENWFDAWNITTNQHIRRFNILRTRIPLRVLPELRP
LLENIRQYATDRYEVAKRAIIEHFEESQRSRLHRLLAEMNLGDRKPSQLLAEMRRAANGA
MTDSMLVDLWIGRLPPYVQSAVIATNTDTNDRAKVADSVMDSFALYHRTGPYQTIHEVRN
EDFERLSRHVTELGQRLDAVLSKLNERERARPRSRTRQRQPNQDAVTPSGHCYYHTQYGQ
AARNCRAPCSFNNRRYRLVITDPKTNIKFLIDTGADVSVIPRQHSSVPSKPSTMKLFAAN
STPIQVYGESLYTLDLGLRRSFLWNFIIADVGTAIIGADFLQHFHLLVDLRKKCLVDALT
NVRSTGVPSQNPSEPTVKVCDSTSPIATLLKEFPGLTALSTPGTLLQSEVTHRIETTGQP
TFARPRRLPPEKYAAARKEFESLVQLGVCRPSNSSWASPLHMTKKADGTWRPCGDYRALN
AKTVPDRYPLPFLQDFTMHLQDKIIFSKVDLHKAYHQIPIHPDDIAKTAITTPFGLYEFT
TMPFGLRNAAQTFQRLIHDVLRGLEFVFPYIDDMIVASTSEAEHHEHLRQLFERLEKHQL
AINPAKCEFYRNEISFLGHLVNASGIRPLPDRVQAISELPQPTTIMELKKFLAMINYYRR
FLPHALETQGILLEMTPGNKKKDRTPLTWSLEASEAFAQCKEQLKRATLLAHPVKNAELS
LWTDASDFAAGAVLHQRTNEDLQPLGFFSKRLEKAQQKYSTYDRELTAIYLAIRHFRYQL
EGREFCIYTDHKPLTFAFRQTHDNASPRRARQLDFIGQFSTDIRHIAGKDNVTADLLSRI
ETVHATPTIDYERLAEEQERDPELSDILSGKIQTDLFLQKTPIPGSPKSLYADCPGGIIR
PYITRSFRTQLLHAVHDLSHPGARATARLITERFVWLNARKESQDFARNCLACQRAKVGR
HVKSPLIPYPATTARFSHINVDIIGPFPISNGNRYCLTIIDRFTRWPEAIPISDITASTV
VSALLFHWIARFGVPAHVTTDQGRQFESSLFKELTKALGTKHIRTTAYHPQANGIIERWH
RTLKAAITCKDTARWSEHLPLILLGLRTTFKNDINASPAELVYGTTLTIPAEFFIAKPQN
ALADQSDFAKTLEETMSSIRPQSTAWHTNRTPFVHSDLNKCTHVFIRDDTVRPALTTPYH
GPYKVLTRNPKSFQILLRGQPTLVSIDRLKPAYGAEEEATPAPQCSWEGLTTNLLPPTTD
HSETLPLPDVQANSDRRDATAASKPTSREQPVRNQTTPAPPSHPTTSRQTDRAAVDAPPP
SILRRNDQTVSTGVTRSQRKVIIPLRYR
Derived:
[1] (Consensus)
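
The coordinates quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record: it sums the two predicted exons, converts the total to codons, and reports the length of each annotated domain. The names EXONS, DOMAINS, span_length, coding_summary and slice_domain are hypothetical helpers introduced here. It assumes that all coordinates are 1-based and inclusive, and that the FGENESH exon range includes the terminal stop codon, which would account for the one codon beyond the 1408 residues.

```python
# Coordinates copied from the abstract; assumed 1-based and inclusive.
EXONS = [(846, 1789), (1874, 5156)]      # nucleotide positions in GYPSY2-I_AG
DOMAINS = {                              # amino-acid positions in GYPSY2_AGp
    "gag-like": (1, 300),
    "AP protease": (325, 410),
    "reverse transcriptase": (514, 681),
    "integrase": (1030, 1200),
}
PROTEIN_LENGTH = 1408                    # polyprotein length stated in the abstract


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def coding_summary(exons):
    """Return (total nucleotides, whole codons, leftover nucleotides)."""
    total_nt = sum(span_length(start, end) for start, end in exons)
    codons, remainder = divmod(total_nt, 3)
    return total_nt, codons, remainder


def slice_domain(protein, start, end):
    """Extract a 1-based, inclusive region from a protein string."""
    return protein[start - 1:end]


total_nt, codons, remainder = coding_summary(EXONS)
print(f"coding length: {total_nt} nt = {codons} codons, remainder {remainder}")
# Assumption: the last codon is a stop codon, so residues = codons - 1.
print(f"residues if a stop codon is included: {codons - 1} "
      f"(abstract states {PROTEIN_LENGTH})")

for name, (start, end) in DOMAINS.items():
    print(f"{name}: pos. {start}-{end}, {span_length(start, end)} aa")
```

The two exons total 944 + 3283 = 4227 nt, or 1409 codons with no remainder, which matches a 1408-aa product followed by a stop codon under the assumption above. To pull out a domain, pass the GYPSY2_AGp sequence (with whitespace removed) to slice_domain, for example slice_domain(protein, 325, 410) for the AP protease region.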