| 2004, Volume 4, Issue 3 |
| March 31, 2004 |
| Copyright © 2001-2007 - Genetic Information Research Institute, Mountain View, California |
| ISSN# 1534-830X |
| Page 76 |
GYPSY42-I_AG |
|||
|---|---|---|---|
GYPSY42-I_AG is an internal portion of retrotransposon GYPSY42_AG - a consensus sequence. |
|||
|
Submitted: 31-Mar-2004 |
Accepted: 31-Mar-2004 |
||
|
Key Words: LTR retrotransposon; Gypsy clade; GYPSY lineage; 4-bp TSD gag; AP protease; Reverse Transcriptase; RNase-H; integrase GYPSY42_AG; GYPSY42-LTR_AG; GYPSY42-I_AG |
|||
|
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
| [1] |
Authors: Tubio,J.M.C., Costas,J.C. and Naveira,H.F. |
||
|
Title: GYPSY42_AG, a member of the Gypsy lineage of the Ty3/gypsy group of LTR retrotransposons in Anopheles gambiae. |
|||
|
Journal: Repbase Reports 4:(3) p. 76 (2004) |
|||
Abstract: GYPSY42_AG is a family of gypsy-like LTR retrotransposons that, according to the aminoacid sequence of its Reverse Transcriptase, RNase and Integrase is phylogenetically grouped with representatives of the GYPSY lineage of other organisms. GYPSY39_AG, GYPSY40_AG, GYPSY41_AG, GYPSY43_AG, GYPSY44_AG, GYPSY45_AG, GYPSY46_AG and GYPSY47_AG are other members of this same lineage in Anopheles gambiae. The GYPSY42-I_AG consensus was reconstructed after multiple alignment of 5 copies. The consensus encodes the 373-aa GYPSY42_AG1p gag-like poliprotein (pos. 481-1599) and the 1138?aa GYPSY42_AG2p pol-like poliprotein (pos. 1560-4973). The sequence of the LTRs flanking GYPSY42-I_AG is deposited as GYPSY42-LTR_AG. GYPSY42_AG1p: MHVNSSANVKPRIELSVLFKNLSASEESSDSELSGEYSNIPLQDSSLPNLDQLNINTIEM EPTEQLKIMNQTIADLQEKIAVLTIQQSQPSIDVASFFRIPDPIKSLPSFDGNRKQLSTW LTTTEETLNLFKDRVTGEVFKMYLTAVINKIEGKARDILCLAGSINDFESLKEILFDAFG DRQELSTYKCKLWQNKMVDGMTIHKYYQKTKEIIQCIKTIAKQTQAYKDNWAVINQFIDE DGLAAFISGLKGMYFGHIQAARPKDIEEAYAFLCKFKSHEITADCMVQKPPNQQKNSFFQ NNRTATTQKSHFAQNQNINREPYPQPMEVDNSMRSRLTLNKRTINNFEVASQDDNSANCE QNFHLDSPSTSIT GYPSY42_AG2p: TKFSLGFAINQHNIKYCSNFLPYIKVKDSKTNRQIRMLIDTGANKNIIRPGIIKNTIKTE QVSIKNIFGTKIIQEKAICKLLGPNIPAQTYYIMEFHDFFDGIIGTEFLSQTNTVIDFKN NVVVINETKIWFEKLFSSKKFYHHTISIETDQNGDWCVPTFENLSENIIIEPGLYSSIDN KTFVKVLSTSKTTPHIPKLHFTVNNFETLTPIPSACNDIPTKKIIETLIRTDHLSFYEKS KLFETVIKNHNVLLKLNEKLTSTTIIKHKINTTDDLPVYTKTYRYPHVYKQDVETQIRDM LDSGIIQPSTSPYSSPIWVVQKKMDASGKKKVRVVIDYRKLNDKTINDKFPMPEIEDILD SLGKSQYFTILDLKSGFHQIEMHPEHQEKTAFSTSHGHFEFTRMPFGLKNAPATFQRAMN NILAELIGKICYVYLDDIVIVGTNLEDHLKNVSTVLGRLAQFNLKIQLDKCEFLKRETEF LGHIISPDGIRPNPEKVKKILDWPIPSNEKQIRQFLGLSGYYRRFIKDYSKITKHLTKYL KKDQTININDPEYIDSFSKLKETIASDQILAYPDFNLPFVLTTDASDFAVGAVLSQIQNK VERPIAFASRTLNKAEINYSTIEKEALAIIWAIRKYKAYLYGNEFKLFTDHKPLTFIKTS IKNNRILNWRLELENYQYSVEYKEGRANVVADALSRKTENTNEINSTNTTILATNHSGST SDDFYIKSSERPLNYYRNQIVFELVAQHEDLIEIPFPNYKRTIIRRTDYDESKITDILRK FHNGKQTAILASQNLIQIIQNSYKNHFSCSGYIVMTHSQVKDVASVEEQNQLITREHERA HRGIHEIENQMKRSYFFPKMHDRIKSAINACPVCNMHKYERKPYNIKISPRAATDKPMER GHMDIFSINSKSFLSLADSFSKFAQMIPIDTKNLVDVKNALAKYFSTFGIPLQIITDHET TFRSIQLKNFLCNLGCSLTYASSSESNGQVEKTHSTIIEIYNTNKHKFVDMDTEALIPIA VSLYNATVHSATGYTPNEILFNQTNEMRPITIHEQAEKIFANAKTNIERSRQNQMKGNIR KETPPLIREGQEVYVKPNIRKKLDPRARNTTVNNVTDRTFENSRHIKRHKNKIHRIRS
|
|||
|
Derived: [1] (consensus) |
|||
|
Download Sequence - Format: IG, EMBL, FASTA |
|||
|
References: |
|||
© 2001-2008 - Genetic Information Research Institute