| 2003, Volume 3, Issue 2 |
| February 28, 2003 |
| Copyright © 2001-2007 - Genetic Information Research Institute, Mountain View, California |
| ISSN# 1534-830X |
| Page 12 |
BEL3-I_AG |
|||
|---|---|---|---|
BEL3-I_AG is the consensus sequence of the internal portion of the BEL3_AG LTR retrotransposon. |
|||
|
Submitted: 28-Feb-2003 |
Accepted: 28-Feb-2003 |
||
|
Key Words: LTR retrotransposon; Bel clade; 5-bp TSD; PHD zinc finger; reverse transcriptase; integrase; BEL3_AG; BEL3-LTR_AG; BEL3-I_AG |
|||
|
Source: consensus |
Organism: Anopheles gambiae str. PEST |
Taxonomy: Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles |
|
| [1] |
Authors: Kapitonov,V.V. and Jurka,J. |
||
|
Title: BEL3_AG, a family of Bel/Pao-like LTR retrotransposons from African malaria mosquito. |
|||
|
Journal: Repbase Reports 3:(2) p. 12 (2003) |
|||
Abstract: BEL3_AG is a young family of Bel/Pao-like LTR retrotransposons. BEL3-I_AG, the internal portion of BEL3_AG, is flanked by BEL3-LTR_AG LTRs. The BEL3-I_AG consensus sequence was reconstructed from a multiple alignment of 4 copies, each less than 1% divergent from the consensus. There is no tRNA-like PBS at the 3' end of BEL3-I_AG. The consensus sequence encodes one protein, the 1749-aa BEL3-I_AGp (positions 204-5450). BEL3-I_AGp is composed of the PHD domain (aa positions 12-59), the reverse transcriptase domain (aa positions 740-940), and the integrase domain (aa positions 1440-1610).
BEL3-I_AGp:
MQEESAVIADFTCAVCDKADSVDSLLQCDFCDKWYHYECAHVDKTVETRAWWCAECEVKTKLASGKKDDEI
ARLKREMEALKATTEKALALIREKDAEVARLSKSSERTSFSPGESLPCSTAKRISVNEEGDLSQSQIAARQ
AVRYELPSFNGNPEEWPIFLSTFRRSSRTFGFTEDENILRLQGALRGKALRTVQGRLRHADNLEEILSALE
KSYGRPDVLVNTLLEQIRESPPIKSERLDSFIEYGDLVAEICSTIKASGTSDRLYDAALLQELVDRMPAYL
RWSWGMHSQELKSVTMSEFGAWIQKATDGAMAVTPPQLKKKTTTRQVHAQHVETHQPQPRRHRECALCNSD
TCGTIAECRVFNRLSVADRWDKVRTLKLCKRCLGKHYGPCSKRDDCGVQGCVAKHHRKLHRVTSEERVEIN
HHGTRSDGTLLRYVPVKLHGESGPICTHALLDEGSTVTLMEQELAGQLGVSGVLDPLCLQYSAGERRDERD
SERVAVQVSSAEENASAFSMADVRTVSRLSLPIQSVDVNELKRKYKHLEAIPAASYEAVSPRLLIGIDHYR
LTRPLKTIEGQPGQPTATKTRLGWLIFGKCTDNANDTSIVQPESSYHVCDCQGETSRADRMMAAYFEVEGY
GPAKEPLLSKEDQRAMSILQNNTKHVDGRYTTGLLWRSDNVFMPENRQMALSRMECLERKMSRDTSLAEKI
NAILEDYLEKGYARPIRADELKTFYPRKWYLPVFPVTNPHKPNKVRLVWDAAAEVRGISLNKKLLTGPDLL
TPLQAVLFRFREYRVAVAADIREMYHQVRICDDDVHSQRFLWRWGNTNAEPQEFVMLRMTFGAACSPSTAQ
FVKNENAEKYRSLYPRAVRCIHEEHYVDDMLTSVETEPEAIELAYQVSLIHNNAGFSLHNWLSNSIRVVTA
VKGTESTLKEMDFEPCLKPEKVLGMWWDTTTDSFGFKLSRVRHLELARKDKPPSKRQMLRTLMSIYDPLGL
IAGVLFYLKVLLQEVWRLHLGWDDEVPEEIQHKWDAWMERLPELESFIIPRCYRQLASLTESSLQLHVFVD
AGADGYAAVAYFRFECHGRIEVSLVGSKAKVAPLKYLSVPRLELQAAVMGCRIASSITSAHRETISGSYFW
TDSTDVIDWINADHRKYSIFVAHRVAEVLDTTNVDDWRWLPTKLNVADEATKWTNLQHHLASERWFSGPEF
LQLPEAEWNIPRRVPSETSEEVRKKDRLKLVGIHIARPIFIDYERFSRWTRLVRTMAYVCRYVNIITKTKS
PSTGPLNRDEIQRAETVILRDVQRNAFTDEYAILWKARENSTTPSWKSPIPRSSLLFKRSPYMDEDGLLRL
SGRIDRCRYVDPGRKRPILLPRRHRVSELIVDDVHRRYKHGSQETVVNEVRQRFDIPALRSVCRHVRLQCR
TCTLLYAKPASPEMCELPAARLAAFSRPFSYTGIDYFGPMVIVNGRKTEKRWGVLFTCLTVRAVHIELVQS
LSTSDCLMAVRSFMARRGTPIEIVSDRGTNFVGADRELKEAAERVDSAILNEFGSPDPVWKFNPPAAPHFG
GSWERMIQSVKRMLSRTLTERHPTEAVLSAALIEVENMLNSRPLTHVPVDGEDEEPLTPNHFLLGSSAGMK
PLVKPDDSPAGLKQNWRAVQAKMNELWKKWIKTYLPTLVRRTKWFESCKPIETGDVVLIVDENSPRNCWPR
GRVERVVPSKDGVIRRVVIKTAKGTMLERPVVKLVSLNVAPRVIV
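
The coordinates quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that all positions are 1-based and inclusive (the entry does not state this explicitly), and the names `span_length`, `DOMAINS`, and the other identifiers are hypothetical helpers, not part of any Repbase tool.

```python
# Consistency check of the coordinates quoted in the abstract.
# Assumption: every range is 1-based and inclusive.

def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate range."""
    return end - start + 1

# Values taken from the abstract.
ORF_START, ORF_END = 204, 5450   # nucleotide positions in the consensus
PROTEIN_LENGTH = 1749            # amino acids in BEL3-I_AGp
DOMAINS = {                      # amino-acid positions within BEL3-I_AGp
    "PHD": (12, 59),
    "reverse transcriptase": (740, 940),
    "integrase": (1440, 1610),
}

# The ORF span should be a whole number of codons.
orf_nt = span_length(ORF_START, ORF_END)
codons, remainder = divmod(orf_nt, 3)

# Length of each annotated domain, in amino acids.
domain_lengths = {name: span_length(s, e) for name, (s, e) in DOMAINS.items()}

# Every domain should lie inside the protein.
domains_in_range = all(
    1 <= s <= e <= PROTEIN_LENGTH for s, e in DOMAINS.values()
)

print(f"ORF span: {orf_nt} nt = {codons} codons (remainder {remainder})")
for name, length in domain_lengths.items():
    print(f"{name}: {length} aa")
print(f"All domains within the protein: {domains_in_range}")
```

Under that assumption, the span 204-5450 covers 5247 nucleotides, which is exactly 1749 codons and matches the stated protein length, so the quoted range appears to exclude a stop codon.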
|
|||
|
Derived: [1] (Consensus) |
|||