;ID ATENSPM9 DNA ; ATH ; 9233 BP
;XX
;DE ATENSPM9, an autonomous DNA transposon - a consensus.
;XX
;AC .
;XX
;DT 01-FEB-2001 (Rel. 6.1, Created)
;DT 01-FEB-2001 (Rel. 6.1, Last updated, Version 1)
;XX
;KW autonomous DNA transposon; En/Spm superfamily; TIR;
;KW transposase; ATENSPM9.
;XX
;OS consensus
;XX
;OC Arabidopsis thaliana
;OC Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC Dilleniidae; Capparales; Brassicaceae.
;XX
;RN [1] (bases 1 to 9233)
;RA Kapitonov,V.V. and Jurka,J.
;RL Direct submission (February 2001)
;XX
;CC ATENSPM9 is an autonomous DNA transposon from the En/Spm superfamily.
;CC Its individual copies are ~96% identical to the consensus
;CC sequence. They are bordered by 3-bp target site duplications.
;CC ATENSPM9 has perfect 13-bp TIRs. The consensus sequence has been
;CC reconstructed based on 5 copies. These copies form two minor
;CC subfamilies. ATENSPM9 encodes two proteins, ATENSPM9p1 and
;CC ATENSPM9p2. ATENSPM9p1 is a 1147-aa transposase encoded by six
;CC exons (673-2848, 2917-3265, 3340-3519, 3608-3707, 3924-4191,
;CC 4232-4602).
;CC ATENSPM9p1:
;CC MAGNYNYPSSGGFNRDWMYKRFDEMTGNLSAEYVAGVEEFLTFANSQPIVQSCRGKFHCP
;CC CAVCKNKKHIVSGRKVSSHLFSQGFMPDYYVWYMHGEDFNMNVGTSNYVDSTYLRENYES
;CC VGNVVEDPYVDVGNVVEDPYVDMVNDAFRYNVGFDDNYHQDGTNQNVEEPVRNHSKKFYD
;CC LLEGAQNPLYDGCRQGQSQLSLAARVMQNKADHNMSERCVDSVCQMLTDFLPEGNQATDS
;CC HYKTEKLMRNLGLPYYTIDVCINNCMIFWKEDEKEDKCRFCSAQRWKPMDDYRRRTKVPY
;CC SRMWYLPIGDRLKRMYQSHKTAAAMRWHAEHQSKEEEMNHPSDAAEWRYFQELHPMFAEE
;CC PRNVYLGLCTDGFNPFGMSRNHSLWPVILTPYNLPPGMCMNTEYLFLTILNSGPNHPRGS
;CC LDVFLQPLIEELKELWSTGIDAYDVSLNQNFNLKAVLLWTISDFPAYSMLSGWTTHGKLS
;CC CPICMESTNSFYLPNGRKTCWFDCHRRFLSHGHPLRKNKKDFRKGKDASTEYPPESLTGE
;CC QIYYERLSGVNPPRTKDVGGNGHEKKMPGYGKEHNWHKESILWELPYWKDLNLRHCIDVM
;CC HTEKNFLDNIMNTLMSVKGKSKDNIMARMDIERFCSRPDLHIDSKGKAPFPAYTLTNEAK
;CC MSLLQCVKHAIKFPDGYSSDLSSCVDMENGKLSGMKSHDCHVFMERLLPFIFAELLDRNV
;CC HLALSGVGAFFRDLCSRTLQKSRVQILKQNIVLIICNLEKIFPPSFFDVMEHLPIHLPYE
;CC AELGGPVQYRWMYPFERFFKKLKGKAKNKRYAAGSIVESYINDEISYFSEHYFADNIQTK
;CC SRLTRFNEGEVPVYHVPGVPTIFSSVGRPSGEIREVWLSEEDYQCAHGYVIRNCDYFQVI
;CC ESMFEDFLSIKYPGLNEKELFVKRNEEYHVWVKDYSHGAGRKTCNYGVCVKGENYTDASD
;CC AADFYGNLTDIIELEYEGVVSLKITLFKCSWYDPKLGRGTRRSNSGVVDVLSSRKYNKYE
;CC PFILVHIQISNSPNMVHLQTTSQAEQVCFIPYPYTKKPKREWLNVLKVNPRGNILGEYEN
;CC KDPSLLQTENDDAVLITTIEDLVLDNLTINRNPINLDLDAGDADPEDEFRCNLSSSDDEE
;CC QQDEEQY
;CC ATENSPM9p2 is similar to the PttA, Tnp1 and gene 1 proteins
;CC from petunia, snapdragon and maize En/Spm-like transposons.
;CC It is 605 aa long and is encoded by six exons (5352-5911,
;CC 6554-6713, 7088-7375, 7463-7690, 7887-8084, 8166-8549).
;CC ATENSPM9p2:
;CC MFGRGKKKRTSPNLAQRASTSTAGRRPRSLPSQYDFTPAAERSPQLQTPASEDAGPPLQA
;CC TAAHVRNYPPPLQLFQHSGSRQPEVQRSASVEVQNNPANQAATTQQVPPAPQQDPPPSQQ
;CC DPPPSAVQESRAHSHPSSQGNNFEEYPPLSPDLQEDTLQSLNDLLMLPERDKFVTVLSPI
;CC PRPNTTCLVLCWSLVFDILMFFAATIYCNVFEEPCRLGLYSTCCLREQLLVAAKYRVCHC
;CC KTHHWDPLITGTVQFYFNEICLRRMKGMVSTVRTSRKKPKWIGKTLWKEMTAYWDTEEAQ
;CC ERSQIYSNARMSDRNGLGPHIHFSGSKSYHQIRDELEDQLGKTVSIGDVFIKTHTKPDGT
;CC YVDRKAEKIAELYQKNLQLRQSELEAEASAVSDGTSRVRELTAEECTTIFLQNNVFIKTH
;CC TKPDGTYVDRKAEKIAELYQKNLQLRQSELEAEASAVSDGTSRARELTAEECTTIFLQST
;CC ERDSRGVPYGVGSLKESLVNGKRKQAGDSTSFVALQEQLLEAQRKIEEQVSYNQRRESEI
;CC ALREAENSRAADEQKKKLEHLSLVEKFLRENDPRFLNFLESHSAKETTTDPISPSPAASP
;CC SSSAS
;XX
;DR [1] (Consensus)
;XX
;SQ Sequence 9233 BP; 2705 A; 1668 C; 1878 G; 2981 T; 1 other;
ATENSPM9
cactacaagaaaacagttaaatacccactacacccggcgactacatacatagtcgctatatttggtgact atttggctacaattatacgactaatttacaactaacaaaaatatagacgttagttagtcgccaaattacc actgtagaggcgagacgccttggtagtcgctaaagggcgactattatatgacacttcttaatkagtcgcc aaatgtagtcgtaaattagtcgctattttgcgattaatatacgacacgttgtaatagtcgttaaacagcg actattatatgacaactgcggcgtgtcattaaatatagtcgcaaaatagacgtaatttgccgactaatat acgacagctcgtaatagtcgtctaaatagcgactattggatgactacagcggcgtgtcattaaatatagt cgcaaaatagacgtaattttcgattaaaaaaatacatgtaattcgttttgtaagttgttgtacctgtttt aaacgactacactatttagtcgtgttttagtcactctaaaattgcctatataatgaaaatttctcgtaca caatattcattcacaacacaaacacaatacaaatattacctttaaaaaaaataaacgtttttgaagaaaa aaaacgttaaaaaatatatttagttttattattggtcttataatggcgggaaactataattatccgagta gcggtggttttaatcgagattggatgtacaagaggttcgatgaaatgacggggaatttgtcagcagaata cgtcgcaggagtggaggagttcttgacatttgctaatagccagcctatagtacaaagttgtcgaggtaaa ttccactgtccttgtgctgtgtgcaagaataaaaaacatatcgtctcgggtagaaaagttagtagtcatt tgtttagtcaaggatttatgccagattattatgtttggtatatgcatggggaagatttcaacatgaatgt aggaacgagtaattatgttgatagtacgtatctaagagagaattatgaaagtgtgggtaatgttgtagaa
gatccatatgtggatgtgggtaatgttgtagaagatccatatgtggatatggtgaacgatgcatttcgtt ataacgtggggtttgatgataactatcatcaagacggtactaatcagaatgtggaggaaccggtacgtaa ccattctaaaaaattctacgacttgttagaaggtgctcaaaatccattgtacgatggttgtcgacaaggc cagtctcaattatcgttagcagctcgagtcatgcagaacaaggcggatcataatatgagtgaaagatgtg tggattcggtatgtcaaatgttgacagattttttaccagaaggaaaccaagctactgattcgcattacaa gacagaaaaattgatgcgcaatttaggccttccttattatacaattgatgtttgtattaacaattgtatg attttctggaaagaagacgaaaaggaagataaatgtcggttttgtagtgctcaaagatggaagcctatgg atgactaccgtcgaagaaccaaagtgccatatagtcgtatgtggtatctacctattggtgaccgattgaa gagaatgtatcagagccacaagacggcagctgcaatgcgttggcatgcagagcaccaatcaaaagaagaa gaaatgaatcatccttcagatgcggcggagtggagatattttcaagagctacatcccatgtttgccgaag aaccccgtaacgtttatctcggattatgtaccgatggattcaatccatttggcatgtcgcgaaatcattc tttgtggcctgtgatcctgactccatacaatttaccaccgggtatgtgcatgaatacagagtacttgttt cttacaattctgaattctgggccaaatcacccgcgaggtagtctcgatgtcttcctccaacctcttattg aggagctaaaagagttatggtctactggaatcgatgcatatgatgtgtctttgaatcaaaattttaatct aaaagcagtgttgctgtggacgataagcgactttccggcgtacagcatgttatcaggatggacaacccac ggtaaactgtcttgtccaatttgcatggaaagtactaactctttttatctacctaatggaaggaagacgt gctggtttgattgtcaccgaagatttctttctcatggtcatccattacggaagaacaaaaaagacttccg aaaaggaaaagacgcttctaccgagtatccacctgagtctttaaccggtgagcaaatttattacgagcgg ttgtctggtgtaaatccaccaagaaccaaggatgttggtggcaacggtcacgaaaagaagatgccaggct atgggaaggaacataactggcacaaggaaagcatattatgggagcttccatattggaaggatctgaatct ccgacattgtatcgatgtgatgcatacagagaagaattttttggataacataatgaatactcttatgagc gtgaagggtaaatcaaaggacaacataatggcaagaatggatatagaacgattttgttctcggcctgact tacatattgatagtaagggaaaagctccatttccagcttatacattgacaaatgaagccaaaatgagttt attgcaatgtgttaaacatgcaatcaaattccctgatggttattcgtctgatttgagtagttgtgttgat atggagaatgggaagttatcaggtatgaagagtcatgattgccatgtttttatggagcggttacttccat ttatctttgcagaactcctcgaccggaacgttcaccttgcattatcaggtaaaataagaatataatttac tctttatatcatatatgcacttgagatgatgattttttttttccaggggttggcgcattttttagggact tatgttcgagaactttgcagaaaagtcgcgttcaaattcttaagcagaacattgttttgatcatatgcaa 
cttagagaagatcttcccaccatcattttttgatgttatggagcacctgcctatacatctcccctacgaa gcagaattgggcggtcctgtccaatataggtggatgtatccttttgagaggtttttcaaaaagttgaaag gaaaagcaaaaaataaaagatatgcggccggatcaattgttgagtcatatatcaatgacgagatttctta tttctcagagcactactttgccgataatatacaaacaaaatcaaggtaaccatgaattatcgatgtgatg caagttttcaatgcgcaaaaataaccatgaattacctcttttggatcaggttaacaagattcaatgaggg tgaagttcctgtatatcatgttcctggagtacctactatatttagttctgttggtcgtccaagtggagaa atacgtgaagtatggctatcagaggaagactatcaatgcgcacatggatatgttatacggaattgtgatt attttcaagtaattgagaggtatataacaaagtcatactcccattttaagttaattatataaatgtttgt ttactaacccttatatgttgtttttgtctttatatagtatgttcgaagattttctttctattaaatatcc aggattgaacgaaaaagaactcttcgtgaaaagaaacgaggaatatcatgtgtgggtgaaagattatgta tgtctaaatagtttataatatatatgatttgcatttatttattgttattcttatctctataatgatgttt atttctaaatttgcatacaggttacatattggaactctagtaatccttttccaacttgggttcaagagat agtcaatggacctttgcacaaagtcaaaacatggccaatgtattttacaagaggctatttgtttcatacg cagagtcatggagcaggacgtaagacttgtaactatggtgtatgtgtgaaaggtgaaaattatacggatg catctgacgcagcagatttttacggcaacttaactgatatcatagaacttgagtatgagggggtggtcag tttgaaaatcacactttttaaatgttcgtggtatgaccctaagctcggaagaggtactcggaggagcaat agtggtgttgtcgacgttctttcatcgaggaaatataacaaatacgaaccctttattttaggtacgtatg gtatatatatatggtcatttagtttttctagttcatatacaaatatctaattctccaaatatggtgcatt tacaaacaacatctcaagcggaacaagtgtgctttattccttatccatacacgaaaaaaccaaagcggga gtggctcaatgttttaaaagtaaatccaaggggaaacatattaggagaatatgaaaataaagacccgagt ttattgcaaacagaaaatgatgatgctgttttaataacaacaatagaagatcttgtactcgacaatttga caatcaatcgcaaccccataaacctcgatttagatgccggagatgctgatccagaagatgaatttcgatg taatttatcgtcttctgatgatgaagaacaacaagacgaagaacaatattagttttgtgtttatgatatg tacttaagttatttctagtaattatgatatgtacttaagttattttgaaatactatgcaaacatttttaa attattttgaaaaactatatttgattaagtatgttatgtttatttatatttttatagataattactattt ttgaatatctatggcccaaaaaaacacaaggcccaaacccaattattaaaaattaaacaaagcgactaat ttgtgacaatttagcgactataaccattctattgggccaaaataactcgaggcccaaatacggcgactaa tttgtgacaatttagcgactacaaaaaaattattgggccaaaaatttcgaggcccaaaattggtgactaa 
tatgtgacaatatagcgactaatgtttaaatttcgagaactttttaggcccaaacccattaggcccaatc cctttcaatgtcaaaaccctaagttccatctctttctcttcgaaacatccgcagccttcttcttcgattt ctctcttcctcttcgatttcctctgcgaaaccctagcaaatttctctcttcctcttcgattttcctctta atctctctaccaaacatctgcagtcttcttcttcttcaatctctctcttcgatttcctctttgaaaccct accaaatcaatctttcctcttcgtttttctcttaaatctcttcttcttcttcgatttcgtcgatttgaac ctctgtctcatctagatctctccttaatatcatgtttgggcgaggaaaaaagaagcgaaccagtcctaac cttgctcagagagcctccacatccacggctggtcgacgacctcgttctcttccgtcccagtacgatttca cgccggcagcagaacggtcgcctcagctacagacgcctgcatcagaagacgccggtcctcctttacaggc aacggcagctcatgttcggaactatccgccgcctctgcagttgttccagcactctggcagtcgacaacct gaagtgcaaaggtctgcctccgtggaagtgcagaacaatcctgcgaatcaagccgcgactactcagcagg ttcctccggctcctcaacaagaccctccgccttctcagcaagaccctccgccttctgcagttcaagagtc tcgggctcacagtcacccatcctctcaaggcaacaacttcgaagaatatccacctctgtcgccggatctc caggaggacacacttcaatccctaaacgatcttcttatgttgccggagagggataagttcgtcaccgtcc tctctcccattcctcgaccgaataccacctggtatgatcttcttcttctcttgtttgtgatcccatgttg tgttcaattagggtttagttgagaaatgagctttgtgagattgattagattaagaaaatatctgaattga ttctttgttgtgttggtctgaaagtccttgatattatagatgttactttgcagccacaatctattctcat gtttgaagagtcttgttgtgttggtcttattttgttgtttgcgagaacaagtagtttaagttgtgttttg atcttagggtttagttgagtaatgagctttgtgagattgattagattgagaaaatgtctgaaatgattct ttgttgtgtctcgtgttgtgttggtctgaaagtctgaaagttttgttcttaggatttaagttatgtgttt tgttattagggtttagttgggtaatgatctttgtgacgtgttgtgttggtctgaaagtctttgatattat agatgttactttgcagccacaatctattctgttgtttgaagagtcttgttgtgttggtcttattttgttg tttgcgagaacatgttttgttaagttgtgttttgatcttagggtttagttgagtaatgagctttgtgaga ttgattagattgagaaaatgtctgaattgattctttgttttagtctcgtgttgtgttggtctttggtctt tgatattttgatgttctttgcagccacaatctattgcaatgtgtttgaagagccttgtcgtctaggtctc tactctacttgttgtttgcgagaacaactgcttgtggctgcaaagtatcgtgtctgtcattgtgtaagtt tagttctagatgatactttgcagccacaatctatgttctgctctatatgatctgtttgtactttgcagcc acaatttatgttttgctagtctttggtctgatatttagatattttctgccatataataactgcttgtttt tcttgtgtatggaggtttactcgggacacaaactcacgacttgttaggaatatcactagagtgtttacaa 
acaagtttgatggtccctactacagctggacatgtgtgcctcaagagagacaagagaaatacttcctcga gtttgctgttaagtgttctttcagcccttcttttatatagttgattacacttcttttttttactgacatt accttttctatttgtagaaaacacaccattgggatcctttgatcacagggactgttcagttttacttcaa cgagatctgtttaaggcgaatgaaaggcatggttagcactgtaagaactagtcgaaagaaacctaaatgg attgggaaaactctatggaaggaaatgactgcgtactgggacactgaagaagctcaggaaagaagtcaaa tctattcaaatgcccgtatgtctgaccgtaacggtctaggtcctcacatacacttctcagggtctaagtc atatcatcaaatccgggacgaattggtaagtcttctctctgtttactcctttgcaactttaactcgatgt ttcaatgtacttttctatgacttgttttttttttcattacaggaagaccaattgggcaaaactgtcagta ttggtgacgttttcatcaaaacacatacaaaacctgatgggacgtatgttgatcgaaaggcagagaagat tgcagagttatatcagaagaatttgcagctgaggcagtctgagctcgaggctgaagcttctgctgtttca gatggcacttcgcgggtacgggagctcacagctgaggaatgtacaaccatatttcttcaggtaacgtttt atgtttcccaatttagtgtttttgagttgtctttttcagtctctaatctcagcttttatcatactatgca gtccactgagagggattcgagaggcgttccttatggagtaggaagcctcaaagagtctcttgtcaatggc aagcggaagcaagcaggtgactcaacttcttttgtggctttgcaagaacaacgttttcatcaaaacacat acaaaacctgatgggacgtatgttgatcgaaaggcggagaagattgcagagttatatcagaagaatttgc aactgaggcagtctgagctcgaggctgaagcttctgctgtttcagatggcacttcgcgggcacgggagct cacagctgaggaatgtacaaccatatttcttcaggtaacgttttatgtttcccaatttagtgttttcgag ttgtctttttcattctctaatctcagcttttatcatactttgcagtccactgagagggattcgagaggcg ttccttatggagtaggaagcctcaaagagtctcttgtcaatggcaagcggaagcaagcaggtgactcaac ttcttttgtggctttgcaagaacaattactggaagctcaacgcaagatagaagagcaggtctcttacaat cagaggcgtgaatctgagattgctttgcgtgaagctgagaattcccgagctgcagatgagcagaagaaga agcttgagcacttgtccttagtggagaagtttttgcgcgaaaatgatcctcggttcctcaatttcctcga atctcattcagctaaggagacaaccacagatcctatctcaccctctccagctgcctctccctcttcatct gcttcataggtctgaatcttaactccttcctcaagactcaatatcacggctcaaagtttttatatatgtg tacttgtgttgcttgtacttatgctgaacaaagttctaagttgtagtgaatcgaacatgtttttgtaatt tgggttttgtacttttctgaaatgacaataaatttgtgttcatgcttcttgtattgtgtttatgcttctt gtaatgttttgttttctcaggaaggtttccatctttagaacaatcatataggtgttctaataacaacaat agtcgcaatttagtcatcaaaaatgtaaccaagcagtcggtttttgataacaatttagtcactaaaattt 
ccgactaaatgatatttcagtcgcaactcagtcttaaatatagtgacaacatagtcgtaaattttaacga ctaattacctacaacgactacataacccttaagttaatcgttaatttgtcgtctaatagtcgctaaattt aacgactaaaaggtcactaaaaaagcgattacttaaacttagtcgtaactttgtcgctctttggcgacta aaatacgactatcgtatttaccgactctcgattagcgactaaatttaatagtcgtttatttgtcgtaaag aggtgtttaacgactacagagtgactaatactgaagtcggtaaatggcaattttcttgtagtg1
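
The exon coordinates, protein lengths and base counts quoted in the comment lines of this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that coordinates are 1-based and inclusive, and that the final exon of each gene includes the stop codon, so the number of codons should be one more than the stated protein length. The names `EXONS`, `PROTEIN_LENGTHS`, `BASE_COUNTS`, `coding_length` and `check_record` are hypothetical helpers introduced here.

```python
# Values copied from the CC and SQ lines of the ATENSPM9 record.
# Exon coordinates are assumed to be 1-based and inclusive.
EXONS = {
    "ATENSPM9p1": [(673, 2848), (2917, 3265), (3340, 3519),
                   (3608, 3707), (3924, 4191), (4232, 4602)],
    "ATENSPM9p2": [(5352, 5911), (6554, 6713), (7088, 7375),
                   (7463, 7690), (7887, 8084), (8166, 8549)],
}
PROTEIN_LENGTHS = {"ATENSPM9p1": 1147, "ATENSPM9p2": 605}
BASE_COUNTS = {"A": 2705, "C": 1668, "G": 1878, "T": 2981, "other": 1}
RECORD_LENGTH = 9233


def coding_length(exons):
    """Total number of bases covered by a list of inclusive exon ranges."""
    return sum(end - start + 1 for start, end in exons)


def check_record():
    """Return a dict of consistency checks for the values above."""
    report = {}
    for name, exons in EXONS.items():
        length = coding_length(exons)
        # Assumption: the last exon ends with a stop codon, so the
        # coding length should equal 3 * (protein length + 1).
        report[name] = length == 3 * (PROTEIN_LENGTHS[name] + 1)
    # The per-base counts on the SQ line should add up to the record length.
    report["composition"] = sum(BASE_COUNTS.values()) == RECORD_LENGTH
    return report


if __name__ == "__main__":
    for check, ok in check_record().items():
        print(check, "consistent" if ok else "inconsistent")
```

Under these assumptions the two exon sets span 3444 and 1818 bases, which matches 1147 and 605 residues plus one stop codon each.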