;ID   ATMU8       DNA   ; ATH   ; 4433 BP
;XX
;DE   A molecular fossil of the ATMU8 autonomous DNA transposon.
;XX
;AC   AF262040
;XX
;DT   09-FEB-2001 (Rel. 6.1, Created)
;DT   09-FEB-2001 (Rel. 6.1, Last updated, Version 1)
;XX
;KW   autonomous DNA transposon; MUDR superfamily; TIR;
;KW   transposase; DNA-binding protein; ATMU8.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 4433)
;RA   Kapitonov,V. and Jurka,J.
;RL   Direct submission (February 2001)
;XX
;CC   ATMU8 is an autonomous DNA transposon from the MUDR superfamily.
;CC   There is only one copy of ATMU8 in the genome.
;CC   It is flanked by a 10-bp target site duplication.
;CC   ATMU8 is 77 and 74% identical with ATMU5 and ATMU4 respectively.
;CC   ATMU8 has 230-bp terminal inverted repeats, which are 93% identical
;CC   with each other. ATMU8 encodes two proteins, ATMU8p1 and ATMU8p2. 
;CC   ATMU8p1 is a transposase encoded by 4 exons (357-1733, 
;CC   1965-2194, 2266-2629, 3041-3325): 
;CC   MPLVRLNGIISTIGSPMEDDELESSVPVVAPPAEVISESGGGDGGVEAAAVDNRRVSGRR
;CC   RTGSRVRVVEDDEPEIDPELEIDPEPDLEDDCAVYGDDDCDVVDDAVAGEGNEDNVVEED
;CC   ANLNIADDFPEAYRADEEAGSDSDTGDDIWDDEKIPDPLSSDDEDEVVRVGEEAVCGDED
;CC   DPEVLLAIEKTFNSPDDFKRAVLMYSLKTRYNINFYRSESLMVAAKCCYVNELGVNCPWR
;CC   VLCSYEKKKHKMQIRIYFNEHICVRSGYTKMLKRSTIAALFEERLRVNPKMTKYEMVAEI
;CC   KREYKLEVTPDQCAKAKTKVLKARNASHDTHFSRIWDYQAEVLNRNPNSDFDIETTARTF
;CC   IGSKQRFFRLYICFNSQKVSWKQHCRPVIGIDGAFLKWDIKGHLLAAVGRDGDNRIVPLA
;CC   WAVVEIENDDNWDWFLKKLSESLGLCEMVNLALISDKQSGLVKAIHNVLPQAEHRQCSKH
;CC   IMDNWKRDSHDMELQRLFWKISRSYTIEEFNTHMANLKSYNPQAYASLQLTSPMTWTIRQ
;CC   ARRKPLLDMLEDIRRQCMVRTAKRFIIAERLKSRFTPRAHAEIEKMIAGSAGCERHLARN
;CC   NLHEIYVNDVGYFVDMDKKTCGCRKWEMVGIPCVHTPCVIIGRKEKVEDYVSDYYTKEHN
;CC   LIKNKQFQVMDHKNKQLRVMDHKDMVHKDMDHKNKQLLVMDHKDMGHKDQELMDRRDKGH
;CC   HKDKDQELMLDQKHKHNLNHKRKPKNKDLLG
;CC   ATMU8p2 is a putative DNA-binding protein encoded by the second 
;CC   strand (exons 4138-3974 and 3910-3605):   
;CC   MSCKSGNSYPSILDGGCWGRGLASKCHCGLEVVIYTSASKSNPGRPFFRCPTKQDDHLFK
;CC   WVEYGVYEEVVEALPKISSIDSEIMKAKCEVAIEIEQLKTMIKEVKEEAMCSEREIKNWK
;CC   RMIKCCLVCLGFIVIVIVVGMIMFGNTKEQKLVLGY
;CC   ATMU8p2 includes a conservative motif (aa positions 26-63) 
;CC   CHCGLEVVIYTSASKSNPGRPFFRCPTKQDDHLFKWVE present at C-terminus
;CC   of DNA topoisomerase III in human, mouse and drosophila, and
;CC   at C-terminus of polyproteins encoded by ORF3 in banana streak 
;CC   and sugarcane bacilliform retroid viruses.
;XX
;DR   Positions 28426    23994  Accession No AF262040   GenBank (rel. 119.0)
;XX
;SQ   Sequence 4433 BP; 1484 A; 792 C; 948 G; 1209 T; 0 other;
ATMU8
ggggaaaatgtcgtttaattgccgaactttaccaaaatgccattttatacaccaacttcgtataagatcg
ttttaaacattaactttgcgttgactttcataattttgttgaccgggcctaaatatactcgtcgttaaat
ccgataacggacccattaacagccgtttagatccgttaacatctctttgttagtgacaaaactgcgtcgt
tttcgacttaatcggcacttatactcaaattagtcgacaaaaaatccccaattctcaaacttgatatccc
taattcccagaaacctaaatcgaattggtaattacaaatctttcacaaatgcctttggttcgaaacaaag
gcgttgtatctaggattgggacaacaatgtgagtctctaattcggcagctgaagttgaagatccagcgag
atgtgaaattggaggtccaccggcggtagatggaagtgtagtagcctcactgtctgtagaagatggaggt
ttagctcaacccaatgttcttggaggtcgagtctctagacagagtcgaaaacgaagaagagtcgttgaaa
aatatgaggaaaagcctgaattggaagttgctcaagatatcgaagatgactgtgccgtatatggtgatga
tgactgcaatggtgttgacgatgcaatacgtggtggagataacgatgctgtggaggaagatgctaacgat
gctaacgatgctgatggagaagatgctaacgatgctgatggagaagatgctaacgatgcttttgaagaag
atgcaaaccttaacattgaagaagattttcatgaggcttttagagttgatgaagacggttctgaccacga
cagcggtgatgatatatgggatgaagacaggattcctgatccgttgtcatctgatgacgaagatgaagtt
agaggagaagaagagacaactcgtagagatgaaaacgagccagagggtttgctagcattagagaaaacct
acaactctccgaatgactcttcaagcgtgcattattgatgtactctttgaagacaaggtatggcataaag
ctttataggtcagcatctatgatcattgcagctaagtgaagttatattgatgaagatggtgttaaatgtc
cgtggagaatttattgttcttatgagaagaaaaaacataagatgcaaataaaagtttatgtcaatgagca
ttcatgtgtgaggtcagggtattttaagatgctgaagcggtcttcgattgctgccttgtttgaggaaagg
gtgagattgaatccaaagctgacaaaacacgagatggtggctgagataaagagagagtacaagttggaag
taactccaaaccagtgtgctaaagcgaagaccaaggttatgaaagcaagaagtgctagtcatgaaaccca
ctttgcaagaatttggtattatcaagcagaggttataaaccggaatccaggtactgagtttgatatcgag
acagtaccagggccattgattgggagcaaacagaggttttttcgatgctatatttgttttaaatctcaaa
aggagacttggaaacaaaattgtagaccagtcataggcatagatggagcttttcttaaatgggacattaa
gggtcatcttctagcgtgggcagttgtagagatagaaaatgatgataactgggactggtttttgagaaaa
ctcgctggaagtttggggcttattgaaatgactcatctagctatcatttcagacaaacaatcagtatcat
tatagaatcttatagttatattttgttgtgaaataaatgtctcatacatatgtttttatgttttggttgc
agggtcttatcaaagcaatccacaccatactgccacaagctgagcatcgtcaatgttgtaaacacataat
ggataattggaagagagacagccatgatctagagctacaacgccttttttggaagatagcccggagctac
accatagaagagttcaataatcatatggctgagctcaaggcgtacaatccacatgcttatgattccctac
aacttactagtctgttgacttggtctagagccttctttagaatcggaacatgttgcaatgataacctcaa
caatctgagtgagtctttcaataggactattagacaagcaagaagaaaaccattggttgatatgctagag
gacattaggaggcaatgcatggttagaactacaaagaggtatctcatagttgacaagttgaagacaaggt
tcactaagagagctcatgatgagaaagagaaaatggttgatggggcttcatggtgcgagagacatttggc
cagaaacaatttgcatgagatatttgtgaataatgttgggtactttgtagatatggaccttaaaacttgt
ggttgcaggaagtgggaaatggttgggatcccatgtatacacgcagcatctgtgataataggaagaagac
aaaaggttgaagattatgtaagtgactattacataaagagaaagtggctagaaacgtacaaggatggtat
tatgcatgtccaagggatgccattgtggcctagattgaatatgctgcatgtgttgcccaccttggagaag
aggcaattccggaaggccaagtaactatcctaggaggaaaggaagaaatgaaactgcctcttctgcaaat
cagaccaagatgtcacgggcaaggaaaataatgacctgttctaactgccatcaagaaggacacaacaaaa
aaaggtgtaataatcctactgttctgctgtcaccatcgagaccaagaggtagaccaaggataaatcaggt
ttagacttcttttaaaaaaactcattgaatcattaaatcactactgtttctgattctcatcaacatacca
ctaaattaggaacctcaaggatcacaaccgcaatacgtacaaggatcacaaggatcacaaccgcaacaag
tacaaggatcacaaggatcacaactgcaacaagcataaggatcacaaggatcacaaccgcaacaagcaca
aggatcacaaggtgcaagagctcaaggatcacaaagacaaagagcatcatcaagagctcatgttcgatca
caatcacaaccacaagcacatcctcaaccacaatcatcggcagcacaaactcaaccacaatcacaaggac
ttgctggatggggatcatggtttaaatgttcaaggcgaagttcgtaagactctccggttttttgttatta
tctttctctcttgttttcggtttttcaagctttgacttggaccaatgaatgtttgttgtttgctttttgt
tgtaagacatgttttgttgtttagctatctaacatgtgtcttagattttgctataacgataaatctttgt
tttcattaccgtttcattacaagatacttcataaaacaagacataacactaagacaaacatcaaaacaag
acacaacactaagacaaacatcaaaagaaaagacataacactaagacaaacatcaaagacacaacaaaca
aagaccagctaatgcacgctagtaaccaagaacaagtttctcttttttgatattaccgaacattaagatg
ccaagaacaagaaccatgacaataaaaccaagacacaccaaagacaatgtgctcatccatttccatttct
tgatttcccttttggtccacattccttcttccttgagttcttcgaccatagcctttaacacttcaatctc
gattgtaacctcagatttagcattgtttatctcactgtcaatgatggagaattttggtaaagcgtctgca
accttttcatacacaccatcttcgacccatttaaacaaatggtcctacattatcaatcgaaaattttaag
atgaaaccaaaaaagaccaaacttaatttcaaatctacttacatcttgtttcgttggacagcggaagaat
ggtcttccagggttagtcttcgatgccgaggtatagattacgacgtctaatccacattcacactttgaag
gaaacccacgttcccaacgacctccgtcttgaatacgaggatacgaattcccagaaatataactcattgt
tgtcgctcaagatggagaattcgggtatgaagaagaattggagttccaatcgcgtttagaaatgataaag
aagaagaagattcagagaattaggatcaaaaattgattttagcgattttattggatttggggtttcgatt
aagtcgaaaacgatgcagttttggtcactaacagagagatgtttacgggtctaaacggctgttaatgggt
ctgatatcagatttaacgacgagtatatttaggtctggtcaacaaaattatcaagctttatatgaaagtc
aacgcaaagttaaggtttaaaacgaccttatacgaagttggtgtataaaatggcgtttttggtaaagttc
ggcagttaaacgacattttcccc1