Repbase Reports

2003, Volume 3, Issue 2
February 28, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 26

P4_AG

P4_AG, a P-like DNA transposon - a consensus sequence.

Submitted:
28-Feb-2003
Accepted:
28-Feb-2003
Key Words:
DNA transposon; P superfamily; composite transposon; HATN2_AG; P4_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
P4_AG: a family of P-like DNA transposons from African malaria mosquito.
Journal:
Repbase Reports 3:(2) p. 26 (2003)
Abstract:
The A. gambiae genome harbors many divergent families of P-like DNA transposons. One of those families is P4_AG. P4_AG elements are flanked by 8-bp target site duplications. Terminal inverted repeats are 27 bp long (4 mismatches). Subterminal inverted repeats are 28 bp long (4 mismatches), their positions are 64-91 and 7606-7580. The P4_AG consensus sequence was reconstructed from 6 copies that are only ~2% divergent from each other. Presumably, P4_AG copies have multiplied in the genome during last 2 million years. P4_AG elements carry a copy of the hAT-like HATN2_AG transposon, that was inserted into an ancestral form of P4_AG. The P4_AG encodes a 877-aa P-like DNA transposase called P4_AGp. Putative exons (based on FGENESH): 310-570, 2573-4146, 4211-5011. P4_AGp: MSTCAASFCQHSRYIVKKMGLDVIFHKFPTDPTLLRKWVEFCQREEAWVPSISSILCSAHFNKTDYQLIN SPSKANRKILKKLKPSAFPSVIKSQAREPSNNVVQQCSTNNVVQQIETDEGRIDTSVEHHDDDLPSDNIT HAKCQNCVQNETEIELLNQTLKKTQDKCNNLLEVNTFLSKQLEIVSKELTQSQKEIELLKTNHNKFKDVA ISPNEFTTRMKNVLKDTLTSNQIDLITEERKRVRWTKAELSKFFTLRYLGKRAYQYLRDDLNFPAASIST LQRYGRTLNLKQGILDDVINLLKNITVDLPECHRECVLSFDEMKVNRILEYDPASDEVLGPHNYLQVVMA RGLFKNWKQPVFIGFDQQMTKEILFELIKRLYAIKINVVAIVSDNCQSNIGCWKDLGAHDYCHPFFSHPI TKCNIYVFPDAPHLLKLIRNWLIDTGFEYNNKLIKADKLFELVAYRNAAELTPVHKLTQNHLVMTPQERQ NVRRAAEVLSRTTAIALQRYFPDDCDAQELASFIEKVDMWFSVANSYSPCAKLHYKKSFNANENQLAALS DMFELMSNITALGKKSMQVFQKSLLMHITSLKMLYEDMRKKHSIVYISTYKLNQDVLENFFSQLRQIGGV HDHPSPLHCMYRIRMMILGKSPTTLKNHTELKNDDVENSHEHHEEFLSATVFSVADIPQSVPDISVMEKT NQICQAIEECSQESDLISTVSSTCNVQSAQESDGLQYVMGYIANKYNTKYPELDLGVQTFKLTTDHCYSQ PPTFVQHLSAGGLFEPSPTFLLLGNRMEKIFLKMHPDGTFSKTKKIVAKIAKNIQNQISELPVEIIRTFA KQRMIVRMRFLNLKSSTENLMKSKRKHVNQHGKGAKK
Derived:
[1] (Consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2019 - Genetic Information Research Institute