;ID   ATENSPM9    DNA   ; ATH   ; 9233 BP
;XX
;DE   ATENSPM9, an autonomous DNA transposon - a consensus.
;XX
;AC   .
;XX
;DT   01-FEB-2001 (Rel. 6.1, Created)
;DT   01-FEB-2001 (Rel. 6.1, Last updated, Version 1)
;XX
;KW   autonomous DNA transposon; En/Spm superfamily; TIR;
;KW   transposase; ATENSPM9.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 9233)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (February 2001)
;XX
;CC   ATENSPM9 is an autonomous DNA transposon from En/Spm superfamily.
;CC   Its individual copies are ~96% identical to the consensus 
;CC   sequence. They are bordered by 3 bp-long target site duplications.
;CC   ATENSPM9 has perfect 13-bp TIRs. The consensus sequence has been
;CC   reconstructed based on 5 copies. These copies form two minor
;CC   subfamilies. ATENSPM9 encodes two proteins, ATENSPM9p1 and
;CC   ATENSPMp2. ATENSPMp1 is a 1147-aa transposase encoded by six 
;CC   exons (673-2848, 2917-3265, 3340-3519, 3608-3707, 3924-4191,
;CC   4232-4602).
;CC   ATENSPMp1: 
;CC   MAGNYNYPSSGGFNRDWMYKRFDEMTGNLSAEYVAGVEEFLTFANSQPIVQSCRGKFHCP
;CC   CAVCKNKKHIVSGRKVSSHLFSQGFMPDYYVWYMHGEDFNMNVGTSNYVDSTYLRENYES
;CC   VGNVVEDPYVDVGNVVEDPYVDMVNDAFRYNVGFDDNYHQDGTNQNVEEPVRNHSKKFYD
;CC   LLEGAQNPLYDGCRQGQSQLSLAARVMQNKADHNMSERCVDSVCQMLTDFLPEGNQATDS
;CC   HYKTEKLMRNLGLPYYTIDVCINNCMIFWKEDEKEDKCRFCSAQRWKPMDDYRRRTKVPY
;CC   SRMWYLPIGDRLKRMYQSHKTAAAMRWHAEHQSKEEEMNHPSDAAEWRYFQELHPMFAEE
;CC   PRNVYLGLCTDGFNPFGMSRNHSLWPVILTPYNLPPGMCMNTEYLFLTILNSGPNHPRGS
;CC   LDVFLQPLIEELKELWSTGIDAYDVSLNQNFNLKAVLLWTISDFPAYSMLSGWTTHGKLS
;CC   CPICMESTNSFYLPNGRKTCWFDCHRRFLSHGHPLRKNKKDFRKGKDASTEYPPESLTGE
;CC   QIYYERLSGVNPPRTKDVGGNGHEKKMPGYGKEHNWHKESILWELPYWKDLNLRHCIDVM
;CC   HTEKNFLDNIMNTLMSVKGKSKDNIMARMDIERFCSRPDLHIDSKGKAPFPAYTLTNEAK
;CC   MSLLQCVKHAIKFPDGYSSDLSSCVDMENGKLSGMKSHDCHVFMERLLPFIFAELLDRNV
;CC   HLALSGVGAFFRDLCSRTLQKSRVQILKQNIVLIICNLEKIFPPSFFDVMEHLPIHLPYE
;CC   AELGGPVQYRWMYPFERFFKKLKGKAKNKRYAAGSIVESYINDEISYFSEHYFADNIQTK
;CC   SRLTRFNEGEVPVYHVPGVPTIFSSVGRPSGEIREVWLSEEDYQCAHGYVIRNCDYFQVI
;CC   ESMFEDFLSIKYPGLNEKELFVKRNEEYHVWVKDYSHGAGRKTCNYGVCVKGENYTDASD
;CC   AADFYGNLTDIIELEYEGVVSLKITLFKCSWYDPKLGRGTRRSNSGVVDVLSSRKYNKYE
;CC   PFILVHIQISNSPNMVHLQTTSQAEQVCFIPYPYTKKPKREWLNVLKVNPRGNILGEYEN
;CC   KDPSLLQTENDDAVLITTIEDLVLDNLTINRNPINLDLDAGDADPEDEFRCNLSSSDDEE
;CC   QQDEEQY
;CC   ATENSPM9p2 is similar to the PttA, Tnp1 and gene 1 proteins
;CC   from petunia, snapdragon and maize Em/Spm-like transposons.
;CC   It is 605 aa long and is encoded by six exons (5352-5911,
;CC   6554-6713, 7088-7375, 7463-7690, 7887-8084, 8166-8549)
;CC   ATENSPM9:
;CC   MFGRGKKKRTSPNLAQRASTSTAGRRPRSLPSQYDFTPAAERSPQLQTPASEDAGPPLQA
;CC   TAAHVRNYPPPLQLFQHSGSRQPEVQRSASVEVQNNPANQAATTQQVPPAPQQDPPPSQQ
;CC   DPPPSAVQESRAHSHPSSQGNNFEEYPPLSPDLQEDTLQSLNDLLMLPERDKFVTVLSPI
;CC   PRPNTTCLVLCWSLVFDILMFFAATIYCNVFEEPCRLGLYSTCCLREQLLVAAKYRVCHC
;CC   KTHHWDPLITGTVQFYFNEICLRRMKGMVSTVRTSRKKPKWIGKTLWKEMTAYWDTEEAQ
;CC   ERSQIYSNARMSDRNGLGPHIHFSGSKSYHQIRDELEDQLGKTVSIGDVFIKTHTKPDGT
;CC   YVDRKAEKIAELYQKNLQLRQSELEAEASAVSDGTSRVRELTAEECTTIFLQNNVFIKTH
;CC   TKPDGTYVDRKAEKIAELYQKNLQLRQSELEAEASAVSDGTSRARELTAEECTTIFLQST
;CC   ERDSRGVPYGVGSLKESLVNGKRKQAGDSTSFVALQEQLLEAQRKIEEQVSYNQRRESEI
;CC   ALREAENSRAADEQKKKLEHLSLVEKFLRENDPRFLNFLESHSAKETTTDPISPSPAASP
;CC   SSSAS
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 9233 BP; 2705 A; 1668 C; 1878 G; 2981 T; 1 other;
ATENSPM9
cactacaagaaaacagttaaatacccactacacccggcgactacatacatagtcgctatatttggtgact
atttggctacaattatacgactaatttacaactaacaaaaatatagacgttagttagtcgccaaattacc
actgtagaggcgagacgccttggtagtcgctaaagggcgactattatatgacacttcttaatkagtcgcc
aaatgtagtcgtaaattagtcgctattttgcgattaatatacgacacgttgtaatagtcgttaaacagcg
actattatatgacaactgcggcgtgtcattaaatatagtcgcaaaatagacgtaatttgccgactaatat
acgacagctcgtaatagtcgtctaaatagcgactattggatgactacagcggcgtgtcattaaatatagt
cgcaaaatagacgtaattttcgattaaaaaaatacatgtaattcgttttgtaagttgttgtacctgtttt
aaacgactacactatttagtcgtgttttagtcactctaaaattgcctatataatgaaaatttctcgtaca
caatattcattcacaacacaaacacaatacaaatattacctttaaaaaaaataaacgtttttgaagaaaa
aaaacgttaaaaaatatatttagttttattattggtcttataatggcgggaaactataattatccgagta
gcggtggttttaatcgagattggatgtacaagaggttcgatgaaatgacggggaatttgtcagcagaata
cgtcgcaggagtggaggagttcttgacatttgctaatagccagcctatagtacaaagttgtcgaggtaaa
ttccactgtccttgtgctgtgtgcaagaataaaaaacatatcgtctcgggtagaaaagttagtagtcatt
tgtttagtcaaggatttatgccagattattatgtttggtatatgcatggggaagatttcaacatgaatgt
aggaacgagtaattatgttgatagtacgtatctaagagagaattatgaaagtgtgggtaatgttgtagaa
gatccatatgtggatgtgggtaatgttgtagaagatccatatgtggatatggtgaacgatgcatttcgtt
ataacgtggggtttgatgataactatcatcaagacggtactaatcagaatgtggaggaaccggtacgtaa
ccattctaaaaaattctacgacttgttagaaggtgctcaaaatccattgtacgatggttgtcgacaaggc
cagtctcaattatcgttagcagctcgagtcatgcagaacaaggcggatcataatatgagtgaaagatgtg
tggattcggtatgtcaaatgttgacagattttttaccagaaggaaaccaagctactgattcgcattacaa
gacagaaaaattgatgcgcaatttaggccttccttattatacaattgatgtttgtattaacaattgtatg
attttctggaaagaagacgaaaaggaagataaatgtcggttttgtagtgctcaaagatggaagcctatgg
atgactaccgtcgaagaaccaaagtgccatatagtcgtatgtggtatctacctattggtgaccgattgaa
gagaatgtatcagagccacaagacggcagctgcaatgcgttggcatgcagagcaccaatcaaaagaagaa
gaaatgaatcatccttcagatgcggcggagtggagatattttcaagagctacatcccatgtttgccgaag
aaccccgtaacgtttatctcggattatgtaccgatggattcaatccatttggcatgtcgcgaaatcattc
tttgtggcctgtgatcctgactccatacaatttaccaccgggtatgtgcatgaatacagagtacttgttt
cttacaattctgaattctgggccaaatcacccgcgaggtagtctcgatgtcttcctccaacctcttattg
aggagctaaaagagttatggtctactggaatcgatgcatatgatgtgtctttgaatcaaaattttaatct
aaaagcagtgttgctgtggacgataagcgactttccggcgtacagcatgttatcaggatggacaacccac
ggtaaactgtcttgtccaatttgcatggaaagtactaactctttttatctacctaatggaaggaagacgt
gctggtttgattgtcaccgaagatttctttctcatggtcatccattacggaagaacaaaaaagacttccg
aaaaggaaaagacgcttctaccgagtatccacctgagtctttaaccggtgagcaaatttattacgagcgg
ttgtctggtgtaaatccaccaagaaccaaggatgttggtggcaacggtcacgaaaagaagatgccaggct
atgggaaggaacataactggcacaaggaaagcatattatgggagcttccatattggaaggatctgaatct
ccgacattgtatcgatgtgatgcatacagagaagaattttttggataacataatgaatactcttatgagc
gtgaagggtaaatcaaaggacaacataatggcaagaatggatatagaacgattttgttctcggcctgact
tacatattgatagtaagggaaaagctccatttccagcttatacattgacaaatgaagccaaaatgagttt
attgcaatgtgttaaacatgcaatcaaattccctgatggttattcgtctgatttgagtagttgtgttgat
atggagaatgggaagttatcaggtatgaagagtcatgattgccatgtttttatggagcggttacttccat
ttatctttgcagaactcctcgaccggaacgttcaccttgcattatcaggtaaaataagaatataatttac
tctttatatcatatatgcacttgagatgatgattttttttttccaggggttggcgcattttttagggact
tatgttcgagaactttgcagaaaagtcgcgttcaaattcttaagcagaacattgttttgatcatatgcaa
cttagagaagatcttcccaccatcattttttgatgttatggagcacctgcctatacatctcccctacgaa
gcagaattgggcggtcctgtccaatataggtggatgtatccttttgagaggtttttcaaaaagttgaaag
gaaaagcaaaaaataaaagatatgcggccggatcaattgttgagtcatatatcaatgacgagatttctta
tttctcagagcactactttgccgataatatacaaacaaaatcaaggtaaccatgaattatcgatgtgatg
caagttttcaatgcgcaaaaataaccatgaattacctcttttggatcaggttaacaagattcaatgaggg
tgaagttcctgtatatcatgttcctggagtacctactatatttagttctgttggtcgtccaagtggagaa
atacgtgaagtatggctatcagaggaagactatcaatgcgcacatggatatgttatacggaattgtgatt
attttcaagtaattgagaggtatataacaaagtcatactcccattttaagttaattatataaatgtttgt
ttactaacccttatatgttgtttttgtctttatatagtatgttcgaagattttctttctattaaatatcc
aggattgaacgaaaaagaactcttcgtgaaaagaaacgaggaatatcatgtgtgggtgaaagattatgta
tgtctaaatagtttataatatatatgatttgcatttatttattgttattcttatctctataatgatgttt
atttctaaatttgcatacaggttacatattggaactctagtaatccttttccaacttgggttcaagagat
agtcaatggacctttgcacaaagtcaaaacatggccaatgtattttacaagaggctatttgtttcatacg
cagagtcatggagcaggacgtaagacttgtaactatggtgtatgtgtgaaaggtgaaaattatacggatg
catctgacgcagcagatttttacggcaacttaactgatatcatagaacttgagtatgagggggtggtcag
tttgaaaatcacactttttaaatgttcgtggtatgaccctaagctcggaagaggtactcggaggagcaat
agtggtgttgtcgacgttctttcatcgaggaaatataacaaatacgaaccctttattttaggtacgtatg
gtatatatatatggtcatttagtttttctagttcatatacaaatatctaattctccaaatatggtgcatt
tacaaacaacatctcaagcggaacaagtgtgctttattccttatccatacacgaaaaaaccaaagcggga
gtggctcaatgttttaaaagtaaatccaaggggaaacatattaggagaatatgaaaataaagacccgagt
ttattgcaaacagaaaatgatgatgctgttttaataacaacaatagaagatcttgtactcgacaatttga
caatcaatcgcaaccccataaacctcgatttagatgccggagatgctgatccagaagatgaatttcgatg
taatttatcgtcttctgatgatgaagaacaacaagacgaagaacaatattagttttgtgtttatgatatg
tacttaagttatttctagtaattatgatatgtacttaagttattttgaaatactatgcaaacatttttaa
attattttgaaaaactatatttgattaagtatgttatgtttatttatatttttatagataattactattt
ttgaatatctatggcccaaaaaaacacaaggcccaaacccaattattaaaaattaaacaaagcgactaat
ttgtgacaatttagcgactataaccattctattgggccaaaataactcgaggcccaaatacggcgactaa
tttgtgacaatttagcgactacaaaaaaattattgggccaaaaatttcgaggcccaaaattggtgactaa
tatgtgacaatatagcgactaatgtttaaatttcgagaactttttaggcccaaacccattaggcccaatc
cctttcaatgtcaaaaccctaagttccatctctttctcttcgaaacatccgcagccttcttcttcgattt
ctctcttcctcttcgatttcctctgcgaaaccctagcaaatttctctcttcctcttcgattttcctctta
atctctctaccaaacatctgcagtcttcttcttcttcaatctctctcttcgatttcctctttgaaaccct
accaaatcaatctttcctcttcgtttttctcttaaatctcttcttcttcttcgatttcgtcgatttgaac
ctctgtctcatctagatctctccttaatatcatgtttgggcgaggaaaaaagaagcgaaccagtcctaac
cttgctcagagagcctccacatccacggctggtcgacgacctcgttctcttccgtcccagtacgatttca
cgccggcagcagaacggtcgcctcagctacagacgcctgcatcagaagacgccggtcctcctttacaggc
aacggcagctcatgttcggaactatccgccgcctctgcagttgttccagcactctggcagtcgacaacct
gaagtgcaaaggtctgcctccgtggaagtgcagaacaatcctgcgaatcaagccgcgactactcagcagg
ttcctccggctcctcaacaagaccctccgccttctcagcaagaccctccgccttctgcagttcaagagtc
tcgggctcacagtcacccatcctctcaaggcaacaacttcgaagaatatccacctctgtcgccggatctc
caggaggacacacttcaatccctaaacgatcttcttatgttgccggagagggataagttcgtcaccgtcc
tctctcccattcctcgaccgaataccacctggtatgatcttcttcttctcttgtttgtgatcccatgttg
tgttcaattagggtttagttgagaaatgagctttgtgagattgattagattaagaaaatatctgaattga
ttctttgttgtgttggtctgaaagtccttgatattatagatgttactttgcagccacaatctattctcat
gtttgaagagtcttgttgtgttggtcttattttgttgtttgcgagaacaagtagtttaagttgtgttttg
atcttagggtttagttgagtaatgagctttgtgagattgattagattgagaaaatgtctgaaatgattct
ttgttgtgtctcgtgttgtgttggtctgaaagtctgaaagttttgttcttaggatttaagttatgtgttt
tgttattagggtttagttgggtaatgatctttgtgacgtgttgtgttggtctgaaagtctttgatattat
agatgttactttgcagccacaatctattctgttgtttgaagagtcttgttgtgttggtcttattttgttg
tttgcgagaacatgttttgttaagttgtgttttgatcttagggtttagttgagtaatgagctttgtgaga
ttgattagattgagaaaatgtctgaattgattctttgttttagtctcgtgttgtgttggtctttggtctt
tgatattttgatgttctttgcagccacaatctattgcaatgtgtttgaagagccttgtcgtctaggtctc
tactctacttgttgtttgcgagaacaactgcttgtggctgcaaagtatcgtgtctgtcattgtgtaagtt
tagttctagatgatactttgcagccacaatctatgttctgctctatatgatctgtttgtactttgcagcc
acaatttatgttttgctagtctttggtctgatatttagatattttctgccatataataactgcttgtttt
tcttgtgtatggaggtttactcgggacacaaactcacgacttgttaggaatatcactagagtgtttacaa
acaagtttgatggtccctactacagctggacatgtgtgcctcaagagagacaagagaaatacttcctcga
gtttgctgttaagtgttctttcagcccttcttttatatagttgattacacttcttttttttactgacatt
accttttctatttgtagaaaacacaccattgggatcctttgatcacagggactgttcagttttacttcaa
cgagatctgtttaaggcgaatgaaaggcatggttagcactgtaagaactagtcgaaagaaacctaaatgg
attgggaaaactctatggaaggaaatgactgcgtactgggacactgaagaagctcaggaaagaagtcaaa
tctattcaaatgcccgtatgtctgaccgtaacggtctaggtcctcacatacacttctcagggtctaagtc
atatcatcaaatccgggacgaattggtaagtcttctctctgtttactcctttgcaactttaactcgatgt
ttcaatgtacttttctatgacttgttttttttttcattacaggaagaccaattgggcaaaactgtcagta
ttggtgacgttttcatcaaaacacatacaaaacctgatgggacgtatgttgatcgaaaggcagagaagat
tgcagagttatatcagaagaatttgcagctgaggcagtctgagctcgaggctgaagcttctgctgtttca
gatggcacttcgcgggtacgggagctcacagctgaggaatgtacaaccatatttcttcaggtaacgtttt
atgtttcccaatttagtgtttttgagttgtctttttcagtctctaatctcagcttttatcatactatgca
gtccactgagagggattcgagaggcgttccttatggagtaggaagcctcaaagagtctcttgtcaatggc
aagcggaagcaagcaggtgactcaacttcttttgtggctttgcaagaacaacgttttcatcaaaacacat
acaaaacctgatgggacgtatgttgatcgaaaggcggagaagattgcagagttatatcagaagaatttgc
aactgaggcagtctgagctcgaggctgaagcttctgctgtttcagatggcacttcgcgggcacgggagct
cacagctgaggaatgtacaaccatatttcttcaggtaacgttttatgtttcccaatttagtgttttcgag
ttgtctttttcattctctaatctcagcttttatcatactttgcagtccactgagagggattcgagaggcg
ttccttatggagtaggaagcctcaaagagtctcttgtcaatggcaagcggaagcaagcaggtgactcaac
ttcttttgtggctttgcaagaacaattactggaagctcaacgcaagatagaagagcaggtctcttacaat
cagaggcgtgaatctgagattgctttgcgtgaagctgagaattcccgagctgcagatgagcagaagaaga
agcttgagcacttgtccttagtggagaagtttttgcgcgaaaatgatcctcggttcctcaatttcctcga
atctcattcagctaaggagacaaccacagatcctatctcaccctctccagctgcctctccctcttcatct
gcttcataggtctgaatcttaactccttcctcaagactcaatatcacggctcaaagtttttatatatgtg
tacttgtgttgcttgtacttatgctgaacaaagttctaagttgtagtgaatcgaacatgtttttgtaatt
tgggttttgtacttttctgaaatgacaataaatttgtgttcatgcttcttgtattgtgtttatgcttctt
gtaatgttttgttttctcaggaaggtttccatctttagaacaatcatataggtgttctaataacaacaat
agtcgcaatttagtcatcaaaaatgtaaccaagcagtcggtttttgataacaatttagtcactaaaattt
ccgactaaatgatatttcagtcgcaactcagtcttaaatatagtgacaacatagtcgtaaattttaacga
ctaattacctacaacgactacataacccttaagttaatcgttaatttgtcgtctaatagtcgctaaattt
aacgactaaaaggtcactaaaaaagcgattacttaaacttagtcgtaactttgtcgctctttggcgacta
aaatacgactatcgtatttaccgactctcgattagcgactaaatttaatagtcgtttatttgtcgtaaag
aggtgtttaacgactacagagtgactaatactgaagtcggtaaatggcaattttcttgtagtg1