;ID ATMU8 DNA ; ATH ; 4433 BP
;XX
;DE A molecular fossil of the ATMU8 autonomous DNA transposon.
;XX
;AC AF262040
;XX
;DT 09-FEB-2001 (Rel. 6.1, Created)
;DT 09-FEB-2001 (Rel. 6.1, Last updated, Version 1)
;XX
;KW autonomous DNA transposon; MUDR superfamily; TIR;
;KW transposase; DNA-binding protein; ATMU8.
;XX
;OS Arabidopsis thaliana
;XX
;OC Arabidopsis thaliana
;OC Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC Dilleniidae; Capparales; Brassicaceae.
;XX
;RN [1] (bases 1 to 4433)
;RA Kapitonov,V. and Jurka,J.
;RL Direct submission (February 2001)
;XX
;CC ATMU8 is an autonomous DNA transposon from the MUDR superfamily.
;CC There is only one copy of ATMU8 in the genome.
;CC It is flanked by a 10-bp target site duplication.
;CC ATMU8 is 77% and 74% identical to ATMU5 and ATMU4, respectively.
;CC ATMU8 has 230-bp terminal inverted repeats, which are 93% identical
;CC with each other. ATMU8 encodes two proteins, ATMU8p1 and ATMU8p2.
;CC ATMU8p1 is a transposase encoded by 4 exons (357-1733,
;CC 1965-2194, 2266-2629, 3041-3325):
;CC MPLVRLNGIISTIGSPMEDDELESSVPVVAPPAEVISESGGGDGGVEAAAVDNRRVSGRR
;CC RTGSRVRVVEDDEPEIDPELEIDPEPDLEDDCAVYGDDDCDVVDDAVAGEGNEDNVVEED
;CC ANLNIADDFPEAYRADEEAGSDSDTGDDIWDDEKIPDPLSSDDEDEVVRVGEEAVCGDED
;CC DPEVLLAIEKTFNSPDDFKRAVLMYSLKTRYNINFYRSESLMVAAKCCYVNELGVNCPWR
;CC VLCSYEKKKHKMQIRIYFNEHICVRSGYTKMLKRSTIAALFEERLRVNPKMTKYEMVAEI
;CC KREYKLEVTPDQCAKAKTKVLKARNASHDTHFSRIWDYQAEVLNRNPNSDFDIETTARTF
;CC IGSKQRFFRLYICFNSQKVSWKQHCRPVIGIDGAFLKWDIKGHLLAAVGRDGDNRIVPLA
;CC WAVVEIENDDNWDWFLKKLSESLGLCEMVNLALISDKQSGLVKAIHNVLPQAEHRQCSKH
;CC IMDNWKRDSHDMELQRLFWKISRSYTIEEFNTHMANLKSYNPQAYASLQLTSPMTWTIRQ
;CC ARRKPLLDMLEDIRRQCMVRTAKRFIIAERLKSRFTPRAHAEIEKMIAGSAGCERHLARN
;CC NLHEIYVNDVGYFVDMDKKTCGCRKWEMVGIPCVHTPCVIIGRKEKVEDYVSDYYTKEHN
;CC LIKNKQFQVMDHKNKQLRVMDHKDMVHKDMDHKNKQLLVMDHKDMGHKDQELMDRRDKGH
;CC HKDKDQELMLDQKHKHNLNHKRKPKNKDLLG
;CC ATMU8p2 is a putative DNA-binding protein encoded on the
;CC complementary strand (exons 4138-3974 and 3910-3605):
;CC MSCKSGNSYPSILDGGCWGRGLASKCHCGLEVVIYTSASKSNPGRPFFRCPTKQDDHLFK
;CC WVEYGVYEEVVEALPKISSIDSEIMKAKCEVAIEIEQLKTMIKEVKEEAMCSEREIKNWK
;CC RMIKCCLVCLGFIVIVIVVGMIMFGNTKEQKLVLGY
;CC ATMU8p2 includes a conserved motif (aa positions 26-63)
;CC CHCGLEVVIYTSASKSNPGRPFFRCPTKQDDHLFKWVE present at the C-terminus
;CC of DNA topoisomerase III in human, mouse and Drosophila, and
;CC at the C-terminus of polyproteins encoded by ORF3 in banana streak
;CC and sugarcane bacilliform retroid viruses.
;XX
;DR Positions 28426 23994 Accession No AF262040 GenBank (rel. 119.0)
;XX
;SQ Sequence 4433 BP; 1484 A; 792 C; 948 G; 1209 T; 0 other;
ATMU8
ggggaaaatgtcgtttaattgccgaactttaccaaaatgccattttatacaccaacttcgtataagatcg ttttaaacattaactttgcgttgactttcataattttgttgaccgggcctaaatatactcgtcgttaaat ccgataacggacccattaacagccgtttagatccgttaacatctctttgttagtgacaaaactgcgtcgt tttcgacttaatcggcacttatactcaaattagtcgacaaaaaatccccaattctcaaacttgatatccc taattcccagaaacctaaatcgaattggtaattacaaatctttcacaaatgcctttggttcgaaacaaag gcgttgtatctaggattgggacaacaatgtgagtctctaattcggcagctgaagttgaagatccagcgag atgtgaaattggaggtccaccggcggtagatggaagtgtagtagcctcactgtctgtagaagatggaggt ttagctcaacccaatgttcttggaggtcgagtctctagacagagtcgaaaacgaagaagagtcgttgaaa aatatgaggaaaagcctgaattggaagttgctcaagatatcgaagatgactgtgccgtatatggtgatga tgactgcaatggtgttgacgatgcaatacgtggtggagataacgatgctgtggaggaagatgctaacgat gctaacgatgctgatggagaagatgctaacgatgctgatggagaagatgctaacgatgcttttgaagaag atgcaaaccttaacattgaagaagattttcatgaggcttttagagttgatgaagacggttctgaccacga cagcggtgatgatatatgggatgaagacaggattcctgatccgttgtcatctgatgacgaagatgaagtt agaggagaagaagagacaactcgtagagatgaaaacgagccagagggtttgctagcattagagaaaacct acaactctccgaatgactcttcaagcgtgcattattgatgtactctttgaagacaaggtatggcataaag ctttataggtcagcatctatgatcattgcagctaagtgaagttatattgatgaagatggtgttaaatgtc cgtggagaatttattgttcttatgagaagaaaaaacataagatgcaaataaaagtttatgtcaatgagca ttcatgtgtgaggtcagggtattttaagatgctgaagcggtcttcgattgctgccttgtttgaggaaagg gtgagattgaatccaaagctgacaaaacacgagatggtggctgagataaagagagagtacaagttggaag 
taactccaaaccagtgtgctaaagcgaagaccaaggttatgaaagcaagaagtgctagtcatgaaaccca ctttgcaagaatttggtattatcaagcagaggttataaaccggaatccaggtactgagtttgatatcgag acagtaccagggccattgattgggagcaaacagaggttttttcgatgctatatttgttttaaatctcaaa aggagacttggaaacaaaattgtagaccagtcataggcatagatggagcttttcttaaatgggacattaa gggtcatcttctagcgtgggcagttgtagagatagaaaatgatgataactgggactggtttttgagaaaa ctcgctggaagtttggggcttattgaaatgactcatctagctatcatttcagacaaacaatcagtatcat tatagaatcttatagttatattttgttgtgaaataaatgtctcatacatatgtttttatgttttggttgc agggtcttatcaaagcaatccacaccatactgccacaagctgagcatcgtcaatgttgtaaacacataat ggataattggaagagagacagccatgatctagagctacaacgccttttttggaagatagcccggagctac accatagaagagttcaataatcatatggctgagctcaaggcgtacaatccacatgcttatgattccctac aacttactagtctgttgacttggtctagagccttctttagaatcggaacatgttgcaatgataacctcaa caatctgagtgagtctttcaataggactattagacaagcaagaagaaaaccattggttgatatgctagag gacattaggaggcaatgcatggttagaactacaaagaggtatctcatagttgacaagttgaagacaaggt tcactaagagagctcatgatgagaaagagaaaatggttgatggggcttcatggtgcgagagacatttggc cagaaacaatttgcatgagatatttgtgaataatgttgggtactttgtagatatggaccttaaaacttgt ggttgcaggaagtgggaaatggttgggatcccatgtatacacgcagcatctgtgataataggaagaagac aaaaggttgaagattatgtaagtgactattacataaagagaaagtggctagaaacgtacaaggatggtat tatgcatgtccaagggatgccattgtggcctagattgaatatgctgcatgtgttgcccaccttggagaag aggcaattccggaaggccaagtaactatcctaggaggaaaggaagaaatgaaactgcctcttctgcaaat cagaccaagatgtcacgggcaaggaaaataatgacctgttctaactgccatcaagaaggacacaacaaaa aaaggtgtaataatcctactgttctgctgtcaccatcgagaccaagaggtagaccaaggataaatcaggt ttagacttcttttaaaaaaactcattgaatcattaaatcactactgtttctgattctcatcaacatacca ctaaattaggaacctcaaggatcacaaccgcaatacgtacaaggatcacaaggatcacaaccgcaacaag tacaaggatcacaaggatcacaactgcaacaagcataaggatcacaaggatcacaaccgcaacaagcaca aggatcacaaggtgcaagagctcaaggatcacaaagacaaagagcatcatcaagagctcatgttcgatca caatcacaaccacaagcacatcctcaaccacaatcatcggcagcacaaactcaaccacaatcacaaggac ttgctggatggggatcatggtttaaatgttcaaggcgaagttcgtaagactctccggttttttgttatta tctttctctcttgttttcggtttttcaagctttgacttggaccaatgaatgtttgttgtttgctttttgt 
tgtaagacatgttttgttgtttagctatctaacatgtgtcttagattttgctataacgataaatctttgt tttcattaccgtttcattacaagatacttcataaaacaagacataacactaagacaaacatcaaaacaag acacaacactaagacaaacatcaaaagaaaagacataacactaagacaaacatcaaagacacaacaaaca aagaccagctaatgcacgctagtaaccaagaacaagtttctcttttttgatattaccgaacattaagatg ccaagaacaagaaccatgacaataaaaccaagacacaccaaagacaatgtgctcatccatttccatttct tgatttcccttttggtccacattccttcttccttgagttcttcgaccatagcctttaacacttcaatctc gattgtaacctcagatttagcattgtttatctcactgtcaatgatggagaattttggtaaagcgtctgca accttttcatacacaccatcttcgacccatttaaacaaatggtcctacattatcaatcgaaaattttaag atgaaaccaaaaaagaccaaacttaatttcaaatctacttacatcttgtttcgttggacagcggaagaat ggtcttccagggttagtcttcgatgccgaggtatagattacgacgtctaatccacattcacactttgaag gaaacccacgttcccaacgacctccgtcttgaatacgaggatacgaattcccagaaatataactcattgt tgtcgctcaagatggagaattcgggtatgaagaagaattggagttccaatcgcgtttagaaatgataaag aagaagaagattcagagaattaggatcaaaaattgattttagcgattttattggatttggggtttcgatt aagtcgaaaacgatgcagttttggtcactaacagagagatgtttacgggtctaaacggctgttaatgggt ctgatatcagatttaacgacgagtatatttaggtctggtcaacaaaattatcaagctttatatgaaagtc aacgcaaagttaaggtttaaaacgaccttatacgaagttggtgtataaaatggcgtttttggtaaagttc ggcagttaaacgacattttcccc1
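
The exon coordinates and base counts quoted in the CC and SQ comment lines can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record: the constant and function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the record does not state whether the stop codon is included in the exon ranges, so the codon counts below are only the summed exon length divided by three.

```python
# Exon coordinates copied from the CC comment lines of the ATMU8 record.
# Assumption: coordinates are 1-based and inclusive. ATMU8p2 is on the
# complementary strand, so its exons are written with start > end.
ATMU8P1_EXONS = [(357, 1733), (1965, 2194), (2266, 2629), (3041, 3325)]
ATMU8P2_EXONS = [(4138, 3974), (3910, 3605)]

# Sequence length and base counts copied from the SQ line.
SQ_LENGTH = 4433
SQ_COUNTS = {"A": 1484, "C": 792, "G": 948, "T": 1209, "other": 0}


def exon_length(start, end):
    """Length of an inclusive interval, regardless of strand orientation."""
    return abs(end - start) + 1


def coding_length(exons):
    """Total number of bases covered by a list of exons."""
    return sum(exon_length(start, end) for start, end in exons)


def summarize(name, exons):
    """Report summed exon length and whether it is a whole number of codons."""
    total = coding_length(exons)
    codons, remainder = divmod(total, 3)
    return {"name": name, "bases": total, "codons": codons,
            "in_frame": remainder == 0}


if __name__ == "__main__":
    for name, exons in (("ATMU8p1", ATMU8P1_EXONS), ("ATMU8p2", ATMU8P2_EXONS)):
        print(summarize(name, exons))
    # The per-base counts on the SQ line should add up to the stated length.
    print("SQ counts consistent:", sum(SQ_COUNTS.values()) == SQ_LENGTH)
```

Both exon sets sum to a multiple of three, and the base counts add up to the stated sequence length, which is the kind of quick sanity check worth running before using the coordinates to extract coding sequence.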