;ID   ATMU7       DNA   ; ATH   ; 3134 BP
;XX
;DE   ATMU7, autonomous DNA transposon - a partial consensus sequence.
;XX
;AC   .
;XX
;DT   01-FEB-2001 (Rel. 6.1, Created)
;DT   01-FEB-2001 (Rel. 6.1, Last updated, Version 1)
;XX
;KW   autonomous DNA transposon; MUDR superfamily; TIR;
;KW   transposase; DNA-binding protein; ATMU7.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 3134)
;RA   Kapitonov,V. and Jurka,J.
;RL   Direct submission (January 2001)
;XX
;CC   ATMU7 is an autonomous DNA transposon from MUDR superfamily.
;CC   Its copies are flanked by 10-bp target site duplications.
;CC   There are 2 copies of ATMU7 in the genome, they are 98%
;CC   identical to each other. ATMU7 was active recently.
;CC   However, both copies have lost some portions of the
;CC   transposase and DNA-binding protein.  
;CC   ATMU7 has ~320-bp terminal inverted repeats, 92% identical
;CC   with each other. ATMU7 encodes two proteins, ATMU7p1 and 
;CC   ATMU7p2. 
;CC   ATMU7p1 is a truncated 357-aa transposase encoded by 3 exons 
;CC   (358-843, 899-1267 and 1530-1748). 
;CC   ATMU7p1:
;CC   MRKKLPADVDPPPPVVEPPHVVVEPPSASIEPPSASIEPPSVAIEPPSVAELRRASVEAP
;CC   AIHEPSTRPSHSVVDNRRRKKSKQNGNRLDGDNRRKKSKSKQNGNESDGEGLSENDCNVT
;CC   GDDDFDEVHYADEEDGNRLGVTEENCEDFEAHSGPAARDDGDGLVKAIHNRIPAAEHRQC
;CC   AKHIMDNWKRNSHDMELQRLFWKIARSYTIGEYTANLEELKTYNPGAAASLMNTKPMEWS
;CC   RAFFRIGSCCNDNLNNLSESFNRTIRQARRKPLLDMLEDIRSQCMKVEDYVSDWYTTRMW
;CC   QLTYNDGIAPVQGQLLWPRVNRLGVLPPPWRRGTPGRPSNYARRKGRRSWFECSNLT
;CC   ATMU7p2 is a truncated 143-aa DNA-binding protein encoded by 
;CC   the second strand (exons 2786-2575 and 2482-2264):   
;CC   XLKSISKIGSSYAMSSSSASSGVVQNRGFPVKCWCGDDVTIFTSKSVDNPGRPFFRCETK
;CC   RDPKTWTNKRDSHLFKWVEDAVYEEVEDVLPKFVIIANELNKAKSEANELNVMIHELKEE
;CC   AMLSKQEICKWKVCLKICFFGFV
;CC   ATMU7 is only ~65% identical with ATMU3-ATMU6.
;CC   The ATMU7 segment (2107-2750) that encodes ATMU7p2 is most 
;CC   close (81% identity) to the one present in VANDAL7 (6516-7144). 
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 3134 BP; 1009 A; 607 C; 616 G; 902 T; 0 other;
ATMU7
gggaaaaatgtcattaaaatcccgaactttgctaaaacattgatttaaatcccaaactttagttaaacga
aaaaaacatcgaacttttgttgtcttttctaataaatcttaaacttgtgttgaccgcgccattttagtct
tgccgttatctgggttaacagacttaattgacagggttaattaggcgttaacagagacttaatttgcaaa
cgacacagtttcgtttttagtaaacgacgtcgttttatcaaaaagactgaacttccccaattcgatttga
aaaatcacgagccttaaaaaacaaaaattctcaattctcttttgtcgatcgaggaaggaaaattcgaaac
tgtaatgatgagaaagaagctgcctgctgatgtcgacccgcctcctcctgttgttgagccgcctcatgtt
gttgtcgagcctccgtctgcttctatcgagcctccgtctgcttccatagagcctccgtctgttgctatcg
agcctccgtctgttgctgagctacgacgtgcttctgtcgaggctcctgctattcacgagccttctactag
gccgtctcattctgtcgttgacaataggagaagaaaaaaatcgaaacagaatggtaacaggttagatggt
gacaatagaaggaaaaaatcgaaatccaaacaaaatggtaacgaatcagatggtgagggcttgagtgaaa
atgactgtaatgttaccggtgatgatgatttcgatgaagttcactatgctgacgaagaagatggtaatcg
gttaggtgttactgaagaaaattgtgaagactttgaagcacactctggaccagcagctagagatgatgga
gatgtgagtgatgcggacagtggagacgatatttaacttatgtccaattactttgcagggtttagttaaa
gctatacacaaccggattccagcagctgaacatcgtcaatgtgctaagcacataatggacaactggaaga
gaaacagtcacgacatggaactccagcgtctgttctggaagattgcaaggagctacaccattggagaata
tacagctaatctggaggaattaaagacttacaatccaggtgcagctgcttctctaatgaacactaaacca
atggaatggtctagagcatttttcagaattggaagctgctgcaatgataacttgaataatctcagtgaat
ccttcaaccggacaatccgacaagcaagaagaaaacctctactggatatgttggaggatataaggtctca
gtgtatggtacgcaatgagaagaggtacattattgctggaaggtggaaaagcagattcacaaagagggca
catgaggagatagagaagatgattgcggggtctcaattttgtgaaagaagcatggcaaggcataataagc
atgagatatcacattttggtagaaaatattctgtggatatgaatgacaatacatgtggttgcaggaagtg
gcaaatgacaggtataccttgtgttcatgcagcctctgttataattgggaaaaaaacagaaagtagaaga
ctatgtgagtgactggtacacgacgaggatgtggcagctaacttacaatgatggtattgcgccggtccaa
gggcagttgttgtggcctagagtgaataggttaggtgtcttgccaccaccatggagaagaggtactcctg
gaagaccaagcaattatgctagaagaaaaggaaggagatcatggtttgagtgttcaaacttgacatgaga
acatcgattggtctttggttgttggtatttggattgtttcataacttgagatgtctcttttgttgtctct
tttgttgtttgatttttggttcggcaagttttaagacatgttggttaactacttttatatgaaacgacct
cttcctcaacaaaattatctcgcctgcaccgtatcactcatagttttaagacaagtttttgtttgagtga
aattattccatctctttactccgattctcatatcaagaccaaacactcacaccgaccaaacactcacacc
ttatatcaagaccaaacactcacattcacccaagaaaaataaagaccaaacactcacaagttttattaag
acagcaacaaagccaacattacaaagtcataaaagtcacaaatactacattgtcattacaagcttggcga
agacaacaagataaatagacttacactattaagcaacaccaaattcatttctttttgccttaccaagcat
catgtaaagaatgaaaatgctgatcaaacaaagccaaaaaaacatattttcaaacacactttccacttgc
agatttcttgtttactcaacattgcttcttcttttagttcatgtatcatcacatttagctcatttgcctc
ggatttggctttgttaagctcatttgcaatgatcacaaattttggtaaaacatcttcaacctcttcgtat
acagcatcctcaacccatttaaacaaatgactctgttacacaatcaaaccaaatccgatatgattagctg
ctaagttacataacataaacacaaactcaaactcaaactcaaactcaaacttacatctcttttgtttgtc
cacgtctttggatctcttttcgtttcacaacgaaaaaaaggtctgcccggattatcaacactcttcgatg
tgaagattgtcacatcatctccacaccaacacttcaccggaaaaccacggttttggacaacacccgatga
cgctgaacttgaactcattgcgtaggaagatccaattttcgaaatcgatttcaattctgtcgtagaagaa
gagaactgagaaattttgtttttttagggcttgtgatttcttcgatttctgatcgaattgggggaaaaca
tagcaagtcgttttgaaaaaacgacgtcgtttactaaaaacgaaactgtgtcgtttgcaaattatgtctc
tgttaacgcctaattaaccctgtcaattaagtctgttaacccagataacggcaagactaaaatggcgcgg
tcaacacaagtttaagatttattagaaaagtcaacaaaagttcgatgtttttttcgtttcatctaaaatt
cgggatttaaatcaacgttttagcaaaattcgggattttaatgaaatttttccc1