Repbase Reports

2003, Volume 3, Issue 3
March 31, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 35

BEL14-I_AG

BEL14-I_AG is an internal portion of the BEL14_AG LTR retrotransposon - a consensus sequence.

Submitted:
31-Mar-2003
Accepted:
31-Mar-2003
Key Words:
LTR retrotransposon; Bel clade; 5-bp TSD; PHD domain; protease; reverse transcriptase; integrase; BEL14_AG; BEL14-LTR_AG; BEL14-I_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V., Pavlicek,A. and Jurka,J.
Title:
BEL14_AG, a family of Bel/Pao-like LTR retrotransposons from African malaria mosquito.
Journal:
Repbase Reports 3:(3) p. 35 (2003)
Abstract:
BEL14_AG is a young family of Bel/Pao-like LTR retrotransposons. BEL14-I_AG, an internal portion of BEL1_AG is flanked by BEL14-LTR_AG LTRs. The BEL14-I_AG consensus sequence was reconstructed based on multiple alignment of 16 copies; they are ~1.5% divergent from the consensus sequence. The consensus sequence encodes a 1898-aa BEL14_AGp Bel-like protein (pos. 57-5750). BEL14_AGp is composed of the PHD (pos. 9-55), protease (pos. 277-400), reverse transcriptase (pos. 900-1060) and integrase (pos. 1600-1765) domains. BEL14_AGp: MGPKKARGCKACGNQVDDTLYVQCDECDAWWHFSCAGITASVEAVEKCAWLCEECARKTLREQSSPREGN KEPKEGTSKHVDGDLVRNLSLEAATDGGARPVTNPQRRPLLSLDEANDEIAPGTSTHVAGGPVHNLNQDA ATEGGVRPVMTPKRRPLSSLDEADRGKTSVSSNIVHRGSCPNLNLDAASDDVARQLAVLKRRQEVEKRRM ELELQLKFVQEEEALLGFGENKSFSISPQLNSFQTEKRTVKRSEEEKEEPDLTPRQEAARHMVSKELPVF SGDPAEWPIFISHYEYTTRRCGYSNWENMLRLQKCLKGPALEAVRSRLVLPDVVPQVIEKLRSKYGRPVH LIKTFIEKVRKIPAPQTDKLDSLVEYGEAVQCMVDHMVAAGERAHITNPLLLQEVVGKLPTDQQLRWSHH IRGMTSVDLSTFSDYMEDLAEDAARLTTIDSPSVRGTSKGRPTKGYVHAHVDPDGATTSSAAERQCVSCN VAGHVLSTCTNFRGLPVKDRWRRARELSVCFSCLEKHNWRSCKNRSRCGINDCAFRHHALLHDPDAIESP STADRERRHFPRTSGSQTHQVINNYHQSNPMSALFRIVPVTAYGPGVMIKTFAFLDEGSSMTLMDEDLAK QLGVKGDRRPLCIKWTGDTTRVEPASMMIDLQIGPVTSTKRFTLKAVRTVTSLSLPQQTFTMDDKRWDHL KQLPLPEYRDARPQLLIGLDNLRLAVPLKTREGLAGEPVAVKTRLGWCVYGKTAGSQIGRVLHMCECGAS DENSTIQGALRKFYELEQLGTVSSDVPDPDERRALTILETTTVRIGNRFESGLLWKTDNVELPSSLGMAR RRLECLERRMERDPKLKTVVHHHIADMMEKGYIHKATSAELAECNSKRIWYLPLGVVTNPKKPGKVRIIW DAAAKVQGTSLNDMLLKGPDELISLPGVLFRFRMYGIAVCADVKEMFLQIRMRDEDKHAQRFLWREDPAD DIATYFVDVVTFGSACSPATAQYVKNRNAKEHAEKYPRAVRGILTSTYVDDYLDSFGTFEEASRVSREVR GIFSNGGFVLRNWVSNNPVVLERLGGESSSPGMKSLTSTADDGERVLGLRWNPSSDQLSFYTQACVGMAE IFETECTPTKREVLKCVMSLFDPLGLLANFTIHGRILIQDLWRAGTGWDEAISPSQMRDWRRWVDVFPLI AQLRIPRCYFPEAREKVYENAELHLFVDASQLAYACVLYLRVVDSEGEPHCTMLCGKAKVAPLKPLTIPK MELQACLLGARLLKSTEQHHPISVKKRVLWTDSTVALSWIHADPRNYRPFVANRVAEIQENTNVNEWRWV PTQDNPADEATKWKGRANFNWDGIWFQGPSFLLQDEESWPTRRLVSTTPEEEIRRVNLHREKLNPGLLPL KAERFSRLERMIRTLAWIVRYVDNLMRKVGGAPLHLGILSQDELERAETIAWKQAQGEYFQDEVRVLSVG EGTGRSTVPKESPIYGLLPYADERGVLRMRGRIGAAPELPYAARYPIVLPRDAWITHLLVDKFHRRFRHA NNETVVNELRQYFQIPKMRRLVSKVVRQCVFCHIRRTLPQIPPMAPLPKQRLTAFVRPFTFVGLDYFGPL LVRRGRAQEKRWVALFTCLTIRAIHLEVVSSLSTDSCILAVRRFVARRGAPVEVFSDNGTNFVGASQQLR KEIDERNDALAATFTNANTRWTFNPPGAPHMGGVWERMVRSVKAAMSTMTELQRTPDDETLLTVIVEAEG MINTRPLTYIPLESADQESLTPNHFLLGSSSGVKQRPVAPTSLQTGLRSNWKMVQHILDGFWRRWIKEYL PVLARQSKWFETVREIEVGDIVLIVDGGARNQWKRGIVERVVSGADGRIRQAWVRTNTGTLRRPAAKLAL LEIRKGDK
Derived:
[1] (Consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2024 - Genetic Information Research Institute