Repbase Reports

2004, Volume 4, Issue 3
March 31, 2004
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 74

GYPSY41-I_AG

GYPSY41-I_AG is an internal portion of retrotransposon GYPSY41_AG - a consensus sequence.

Submitted:
31-Mar-2004
Accepted:
31-Mar-2004
Key Words:
LTR retrotransposon; Gypsy clade; GYPSY lineage; 4-bp TSD gag; AP protease; Reverse Transcriptase; RNase-H; integrase GYPSY41_AG; GYPSY41-LTR_AG; GYPSY41-I_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Tubio,J.M.C., Costas,J.C. and Naveira,H.F.
Title:
GYPSY41_AG, a member of the Gypsy lineage of the Ty3/gypsy group of LTR retrotransposons in Anopheles gambiae.
Journal:
Repbase Reports 4:(3) p. 74 (2004)
Abstract:
GYPSY41_AG is a family of gypsy-like LTR retrotransposons that, according to the aminoacid sequence of its Reverse Transcriptase, RNase and Integrase is phylogenetically grouped with representatives of the GYPSY lineage of other organisms. GYPSY39_AG, GYPSY40_AG, GYPSY42_AG, GYPSY43_AG, GYPSY44_AG, GYPSY45_AG, GYPSY46_AG and GYPSY47_AG are other members of this same lineage in Anopheles gambiae. The GYPSY41-I_AG consensus was reconstructed after multiple alignment of 5 copies. The consensus encodes the 360-aa GYPSY41_AG1p gag-like poliprotein (pos. 501-1580), the 1180?aa GYPSY41_AG2p pol-like poliprotein (pos. 1538-5077) and the 519-aa GYPSY41_AG3p env-like poliprotein (pos. 5014-6570). The sequence of the LTRs flanking GYPSY41-I_AG is deposited as GYPSY41-LTR_AG. GYPSY41_AG1p: MKATERFDSHRDLSTSDNLETEGDKMEEIATQLVEMMRAITSLQNQYTALSASTSSSNAG NNRAFDDYFRIPDPIKSLPTFEGNRKQLASWLSTADNTLALFKDLVPAAVYQMYVTAVTN KICGKAKDILCLSGSPQNFDEIKEILISSLGDRQELSTYKCQMWQNKMTDGMSIHKYYHQ TKEIVQNIKTLAKQNEQYRTNWVAINAFIDEDALAAFIAGLRGNYFGHAQAARPKDIEDA YAFLCKFKATEQNAGSLTKNVQTPSNKPPFKNKFNQNESTSYNKPTKAISEKKFSIKNSD KPEPMDVDASMRSKYAQNKKQFHNNEVETEQESNDSDSDDETDHFNEVNFRLAGSLKNNT GYPSY41_AG2p: RSKFSPSRKSEKQYLNSKKKYNYLPYLRTKQGLNLLIDSGANKNLIQPGVLKTKKEIKQI EITNIVGKQIIDTCGKTNLLYKEIPSQKYYELKFHNFFDGLIGSQFLAENEAILNYRKQT LEISKVIMPFEKYFPNEKNYNHVVTLPTNTDGEWIVYEPTKLCKKITVQPGVYSAKNKKT TILLQTNRPKPPNIQHKALEITVNNFETVTPLPMKPESKITSEMLSEIIRTSHLSTLEKD HLFRTIIKNQNVLLKAGEKLSATPDVKHKITTTNDAPVFTKSYRYPHAFKNDVEEQINEL LRNGIITHSTSPYSSPIWVVPKKVDASGKRKIRVVIDYRKLNEKTIDEKFPIPQIEEILD SLGKSVYFTTLDLKSGFHQIEMDSNDKGKTAFSTAQGHFEFNRMPFGLKNAPAAFQRAMN SVLTGLIGNICFVYLDDIIIIGKNLENHIENLNTVLERLSKFNLKIQLDKCEFLRKETEF LGHVITQEGIKPNPDKITKILEWKLPSTQKEIKQFLGLSGYYRRFIKDYSKLTKPLSKCL KKDTKINTQDEEYKTSFNSLKQIIASDQILAYPDFERPFILTTDASNYALGAVLSQIQEG KERPIAFGSRTLNEAESRYSTTEKEALAIIWSVQKYKSYLYGHKFTLVTDHKPLTFIKTS TKNSKILRWRLELENFDFDIQYKEGKANVVADALSRKTEILTNTNINQDSSISGTPKNNV SNTIDVNFEESSSISESLQHNNPSPNNTNSDSQTMHSADTSDDYFIHFSERPINYYRNQI IFRKSHITTDITETPFNNYKRAIICRNDFDELTILDSLKNFHNNKQTAIMAPDESTISLI QSVYRQYFNQHGHFVLTHLQVEDVSNEQRQDIIIAKEHERAHRGIHEVHNQLTRCYFFPH MMTKIKKLINLCKICNVHKYERKPYNIKITPRPIETTPFSRVHIDIFGIDKHNYLTFVCA FSKFLQTIEIPSRNLTDIRKALAHFITTFGAPRKIICDHETTFRSLQLQSFLANLGTELE FSSSSETNGQVERTHSTIIELFNTNKHKFRDLSSPEIIKVVTALYNETVHSSTGFTPNEI IFNRTSNRNPEQIIQTTRNIYEKVSQKLHNASRNMQKYNDEKETPPEIETGKQIFVKKGV RKKLDPRFNEKTCLNANDKTVTMARNIKRNKNKLRRIKSQ GYPSY41_AG3p: NSYNGKKYQEEQKQVKENQVPVIRSFRFLYLSQAWRSKMQYFPITFMLILLTQLSQSKEL EIIDLNRQPIFFLKTRTCRLQTGSIKFIHPINMLTLENAINTITHFSYENINNELKEIVR LKVKLLYSNFQQLKPKHRTARSLEILGTAWKWIGGSPDADDLRIINTTMNELTENNNKQY RINKQFDHRLRTLTDTINQLTKEREQVMLNELETIKTIMNIDIINHVLEEIQEAISWTKV SVVSNKILSSPEINSIKTILEDQGVKVELPDEALKLVQPIIAINSNSILYILKIPQLADE EATMLEVFPLSIDNRIIVETPTHLIKTRNKVFKPAKPDEYIQNQYREYIDKCTSNLILGR KSDCSTARKNNTTIKLISDGLIIVDNAKGAALSSSCGPDDKLVSGNLLIRFNDCEVTIMN QTFSSKTISSTVEPYWGAVSITEVRWQHHKPMIRQDAFENIGTMQHSYLQQFNSAWNWSL LGGVLVSTIFTLSLAIFVFTFYKRSIRTIADVLPKIADA
Derived:
[1] (consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2024 - Genetic Information Research Institute