2003, Volume 3, Issue 4
April 30, 2003
Copyright © 2001-2007 - Genetic Information Research Institute, Mountain View, California
ISSN# 1534-830X
Page 73
GYPSY1-I_AG

GYPSY1-I_AG is the consensus sequence of the internal portion of the GYPSY1_AG LTR retrotransposon.
Submitted: 30-Apr-2003
Accepted: 30-Apr-2003
Key Words: LTR retrotransposon; Gypsy clade; 4-bp TSD; gag; AP protease; reverse transcriptase; integrase; GYPSY1_AG; GYPSY1-LTR_AG; GYPSY1-I_AG
Source: consensus
Organism: Anopheles gambiae str. PEST
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1]
Authors: Kapitonov,V.V., Pavlicek,A. and Jurka,J.
Title: GYPSY1_AG, a family of LTR retrotransposons from African malaria mosquito.
Journal: Repbase Reports 3:(4) p. 73 (2003)
Abstract: GYPSY1_AG is a family of gypsy-like LTR retrotransposons. GYPSY1-I_AG, the internal portion of GYPSY1_AG, is flanked by GYPSY1-LTR_AG LTRs. The GYPSY1-I_AG consensus sequence was reconstructed from a multiple alignment of 12 copies, each less than 1% divergent from the consensus. Some copies of GYPSY1_AG are 100% identical to each other and may be active retroelements. The consensus sequence encodes the 239-aa gag-like protein Gypsy1_AG1p (pos. 67-782) and the 1029-aa protein Gypsy1_AG2p, which is composed of the AP protease (pos. 40-130), reverse transcriptase (pos. 227-394), and integrase (pos. 744-900) domains.

GYPSY1_AG1p:
MSTENVSNEETSPATAAVSVKLPEFWKNDPSLWFSQAEIQFLLAGVHKDETKFYHIVAKLEQSVLCHIAD
YVKQPPATGKYEAVKQRLISRFELTEQAKMDQLLGSYDFGDLRPTHLLTKMQELAAGLNVNDSLLKRLFL
QKLPANIRAILSIHDGSLSKLAEMADKMIEMAPQTSVIHASVQKETTENLAEEVAAMKVELRQMKARQPE
RGRLRSTSQNRSNENICWYHRKYGNRATRCRSPCQYHQSKN

GYPSY1_AG2p:
MSKPLPVSSVKKLDFRPSEIGEVGGLRISRRLQIFDKSSGIRFLIDTGSDVSIIPASKIEKTREPSPFLL
HAANGTKIRTYGSKFVSVDLGLRRKFSWNFLQADVTSAIIGADFLAHFGLLVDLGNRKLIDGGTKLHTVC
GLSKSSVYGVTTIAKDHPFRDLLVEFREITAPPTMRTEVRHNVTHHIQTTGPPVASKPRRMPPDKLQAAK
KEFETMMELGICRPSKSSWASPLHCVPKKNGQWRFVGDYRSLNRITVPDRYPVPHIHDLLNNFLGKNCFT
TLDLVRAYHFVPVEESDVPKTAVITPFGLFEFTKMQFGLCNASQTFQRFMHHVFGDLDFVVVFVDDICIA
SSNEEEHLSHVRTVFERLKSNGLVLNLDKCKFVQKEVNFLGYHINASGIKPQANRVQAVVDYSRPITVKD
LRRFLALLNGYKRFIRNAVSLQQPLQALIIGNRKNDTRKLQWTIAADEAFVKCKESLANAALLSYPDSSK
RMGLMIDASDTAAGATLQQNVAGAWQPLGFFSQKFSPSQKKYSVFGRELTAMKLAVQYFRHLVEGREFTI
YTDHRPLTYALNSNSNHLPHEERYLQYISSFTKDIRHISGKDNSAADALSRVNTISAPSTVDFELLSKAQ
HDDPELQKLLADRTTSMNLQLRSSVSTNQLLYCDVSDNVHVRPYVPEKLRLEVLRNIHCLSHPGVRATRK
MVARRFVWPSMNRDVARFVRSCIDCQRSKIHRHTSAALNEFELPKSRFRHVHIDLVGPLPTSNGKRYLLT
MIDRFSRWPEAVPLPDILAETVAKAFCECWISRFGVPETITTDQGRQFESELFTELTRLLGALRIRTTAY
HPEANGLIERFHRTLKTSLTCVDSKRWCDKLPLVLLGLRTAIREDIDCSVAEMTYGQPLRIPGDFLEPSK
TEICRSEFAKLLCRTMQQIGPIRNSHHDKRSVFVPKDLQSCKSVFVRIDSVKRPLTHPYEGPFQIIERHE
KYMDLNMNGEKRRISIDRIKPAYICEKDSNEDNERTKVTPSGHRVRFLA
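The domain coordinates quoted in the abstract can be converted into domain lengths, or used to cut a domain out of the protein sequence, with a short Python sketch. It assumes that the positions are 1-based, inclusive, and refer to residues of the 1029-aa protein; the record does not state this explicitly. The names `DOMAINS`, `domain_length`, and `slice_domain` are illustrative and are not part of Repbase.

```python
# Domain coordinates of the 1029-aa protein, as quoted in the abstract.
# Assumption: positions are 1-based, inclusive amino-acid positions.
DOMAINS = {
    "AP protease": (40, 130),
    "reverse transcriptase": (227, 394),
    "integrase": (744, 900),
}
PROTEIN_LENGTH = 1029  # length stated in the abstract


def domain_length(start, end):
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


def slice_domain(protein, start, end):
    """Return the residues of a 1-based, inclusive interval."""
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]


lengths = {}
for name, (start, end) in DOMAINS.items():
    # Sanity check: every domain must lie inside the protein.
    assert 1 <= start <= end <= PROTEIN_LENGTH
    lengths[name] = domain_length(start, end)
    print(f"{name}: pos. {start}-{end}, {lengths[name]} aa")
```

To extract a domain, pass the full protein sequence as a single string with whitespace removed, for example `slice_domain(sequence, *DOMAINS["integrase"])`.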
Derived: [1] (Consensus)