;ID   VANDAL21    DNA   ; ATH   ; 8244 BP
;XX
;DE   Autonomous DNA transposon VANDAL21 - a consensus.
;XX
;AC   .
;XX
;DT   13-OCT-2000 (Rel. 5.9, Created)
;DT   13-OCT-2000 (Rel. 5.9, Last updated, Version 1)
;XX
;KW   Autonomous DNA transposon; VANDAL superfamily; MuDR-like transposase; 
;KW   VANDAL21.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Charophyta/Embryophyta group;
;OC   Embryophyta; Tracheophyta; euphyllophytes; Spermatophyta;
;OC   Magnoliophyta; eudicotyledons; Rosidae; Capparales; Brassicaceae;
;OC   Arabidopsis.
;XX
;RN   [1]  (bases 1 to 8244)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (October 2000)
;XX
;CC   VANDAL21 is a consensus sequence of a VANDAL-like autonomous DNA
;CC   transposon. It was reconstructed based on six copies of the
;CC   transposon. These copies are flanked by 9 bp duplications of different
;CC   target sites, and they are ~98% identical to the consensus sequence.
;CC   VANDAL21 encodes three proteins, VANDAL21p1, VANDAL21p2 and
;CC   VANDAL21p3. VANDAL21p1 is a 778-aa MuDrA-like transposase.
;CC   Functions of 132-aa VANDAL21p2 and 718-aa VANDAL21p3 are 
;CC   unknown.
;CC   VANDAL21p1 (position 790-3126):
;CC   MKTLGEKITLSMLEDRIMTKLGLDANKVKLHMRYNPRLFGVEEEMNVCDDEDVFVYVTSA
;CC   KNNRRSVLVVEEISKPPEPEQLPEQLSRVGKSSVGKNYTELNSEEDEMRVDDGALIVLLE
;CC   EEQGTQHQLEAIVEDHGTQHQLEAIVEDHGTQEDETRYDESMDNSDRGEQYVESPPAVEP
;CC   GMFKKEWEDGIGLTLRQEFPNKAALHEVVDRAAFANSFGYVIKKSDKERYVLKCAKESCS
;CC   WRLRASNISNTDIFSIRRYNKMHSCTRLSKGSSRLRKRKGNPQLVAALLHDHFPGQLETP
;CC   VPRIIMELVQTKLGVKVSYSTALRGKYHAIYDLKGSPEESYKDINCYLYMLKKVNDGTVT
;CC   YLKLDENDKFQYVFVALGASIEGFRVMRKVLIVDATHLKNGYGGVLVFASAQDPNRHHYI
;CC   IAFAVLDGENDASWEWFFEKLKTVVPDTSELVFMTDRNASLIKAIRNVYTAAHHGYCIWH
;CC   LSQNVKGHATHTNRDVLAWKFQELSRVYVVADFNRAYDGFKLRYPKATKYLEDTTVKEKW
;CC   ARCCFPGERYNLDTSNCVESLNNVFKNARKYSLIPMLDAIIKKISVWFNEHRMEAASGSL
;CC   ENKMVPLVENYLHDLWVFAEKLKVVELNSFEREYVVTCDKGIDYTVSLLLKTCSCKVFDI
;CC   QKYPCIHALAAFINIMDDEDRRRGLELHDLVTKYYWAELWALAYYRTIYLVPDRSQWEVP
;CC   DEVKALKIVPLSKKPKKGRKKMLRFPSTGEKRPKRQRTQNKRRPRQSCQWLLFGNTPI
;CC   VANDAL21p2 (exons 4025-4204 and 4382-4600): 
;CC   MDRLCERDPYYDDMKVAKRAIEQMEMVAMMEGIPKFCPCGGSIVDTRKDEKRYYQCEKFK
;CC   DDRTDCMHIRKLWDKAMEEEVSSLRESVDYNRKKVLSHEYLIEEMQKELKAHRAEIVNVS
;CC   KVVFRNPMAPKK
;CC   VANDAL21p3 (exons 7869-7090, 6673-5933, 5858-5697, 5619-5260
;CC   and 5176-5063): 
;CC   YFRSISDSKSVMPPKTRGGGQGKRKEIEASAPAKTEKVKAPAEKVKEKVPAKKAKVQAPA
;CC   KKAKVQAPAKKAKVQAPAKKDKEKVPAEEQSPAQTTATAMATNAAPTTAAPTTTAPTTAP
;CC   TTESPMLDDSTFYDALKHIPAEEIQENMQTDEVEDENEKEEASEEEESGSSSRTLGSDSD
;CC   SEETETNKELACANPVEEAERQDDGLAVIEEEEERSSASDEDVNVEKSVEDEGDEDERDE
;CC   DVIVEKPVEERTIDEDIANVDMEEAMAMQPLGMYFPASEYTKKMKLATRCYISEVLKTFA
;CC   DLEHPLTDVEKNYFMEHPSFKHIYHLPSGYTHKLMGMWMLFLRTASIEKKKEVWFVVNGV
;CC   PIRYGIREHALISGFNCKAYPANYQSAGNMNFANRYFKTGVIRREDVKTKLMEMEPARSK
;CC   DRLRMAVLYFLTSIIAVPTKTGERASPIDDFCVRAASDLTFCKTFPWGRYSFEYMLKSIS
;CC   HTLDHFNGVVPNTQSPWPVPGFCVPLEFLAFEAIPSLRERFIEEKEGAHAGCPRMCKVNF
;CC   KRTEMKGFTLEQINHVLGTTEVIESIIREKAEEVPLLAEITGVEDDVDKHDVVVDSWMKR
;CC   LGQGREIRFEEVYNEDVQARMEAPNEEEVPTAVGPGDPTLVDVMEKLHSINDKLNEALLA
;CC   LMEMEEKQATFEAFMDEMKAKMSQNPPDEEESTIKENAAAPVVPKRVTRSTRAKSSNV
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 8244 BP; 2417 A; 1611 C; 1645 G; 2571 T; 0 other;
VANDAL21
gaatgatcatctcttgtgtcccttttgtgggacaactaatctgcaaaaccactttagagtcttcatatcc
cctttgagttaatgaattgtgttaagacgaatttacccctgtaatttattttgaatgaataattaaaata
tttctagaaaacattttgaaaaaaaaaaaaacggaaaaagaaaaaaaaaaagctcgagctcgagctcgac
ttcttcttcgggcttcttcctccggcgagctccggcgagccgcattgtcgtttttctcctcctgcaagct
gcatcaatctactaagtccaggtatggagagagcttcaaaagctttgaattacatattttgaggattttg
agattcaatccaattagatttgggttattgtgttttagcacctacattagaagtttaagtatagatttag
gcttcatttggtctgttaatgggtaagcaattgatttcacatgctgattcatactaaattgaactcaaat
cgaatatttgttcttgtcgcgaaaataatactggtcgaactagctatagtagtaccgtaatttatataga
tagtatgttttaatagcaccagtattacttagacacgtagtatttcatatatgtaatagtattactcaca
catgtaatatttcagaattagtactagtattactgaaaatgtactagaatcggtaatactagtaatattg
actgagtatgtgtattgttttggcaggttacaatgaaagagaacgtgatcatatattttaaattccaagg
tcgcatgtataatgtgatgatgaagacattaggggagaagattactctctcaatgttagaagataggata
atgacgaagcttggattagatgcaaataaggtaaaattgcatatgaggtacaatccacggttgttcggag
tagaggaagaaatgaacgtttgtgatgatgaggatgtctttgtttatgtaacatccgcaaaaaataaccg
gagaagtgttttggttgtggaggagatctctaaaccgcccgagccggagcaattgcccgagcaattgtct
agagttggtaaaagttctgttggtaagaactatacggagttgaattcagaggaggatgaaatgagagtgg
atgatggtgcactcatcgtcttattagaagaggaacaaggaactcaacatcaacttgaggcaatagtgga
ggatcacgggactcaacatcaacttgaggcaatagtggaggatcacgggactcaagaagatgaaacacgc
tatgatgagtctatggataattctgataggggggaacagtatgttgagtcgccacctgctgtagaaccgg
gtatgtttaaaaaagaatgggaagacggaattgggttgaccttacgtcaagaatttccaaacaaggcggc
attgcacgaggtggtggatagagctgcatttgctaacagttttggttatgtgattaagaagtcggataag
gagcgctatgtcctaaagtgtgccaaagagagctgttcttggcgtttacgagcgtccaatatcagtaata
ctgatatattctcgattagaaggtacaataagatgcatagttgcactcggctaagtaaaggtagtagtag
gctcaggaaaagaaaaggcaacccacaattagtcgcagctctccttcatgatcattttccgggacagttg
gaaactccggttccaagaattatcatggagctagttcagacgaaattaggtgtgaaagtatcatactcga
cagcgctaagggggaaatatcatgcgatttatgatttaaaaggtagcccggaagaaagctacaaggatat
caattgttatttatacatgttgaagaaggtaaatgatggtacagttacttatctgaaattggatgagaat
gataaatttcagtacgtattcgtagctttgggagctagcattgaaggttttagagtgatgaggaaagttt
taattgtggatgcaacacatttgaagaacggatatggcggagtgctagtgtttgcctcggctcaagatcc
taaccgtcaccattacatcatagcgtttgccgtactcgacggtgagaatgatgctagttgggagtggttt
ttcgagaagctaaaaacggttgtacccgatacttcagaattggttttcatgacggacagaaatgcaagcc
tcataaaggccatacggaacgtgtataccgcggctcatcacgggtattgtatttggcatttgtcccaaaa
tgtgaaaggtcatgctactcacaccaaccgagacgtactcgcatggaagtttcaggagttaagtcgggtc
tacgtcgtggcggacttcaaccgagcgtatgacgggtttaagttgagatatcctaaggcgaccaagtatt
tggaggatacaaccgtgaaagaaaaatgggcaaggtgttgttttcccggagaaagatacaacttagacac
aagcaattgtgtggaatctttgaacaatgtgtttaaaaacgcaaggaaatactcgttaataccaatgctt
gatgcgatcatcaaaaaaatctccgtttggtttaatgaacatcggatggaagccgcgtctggatccttag
aaaataagatggtgcctttggtcgagaattatttgcatgatttgtgggtttttgccgagaagctgaaagt
ggtggagctaaactcattcgagcgtgaatatgtagtcacatgcgacaaaggaatagattatacggtgagc
ttgcttttgaaaacttgcagttgcaaggttttcgatatccaaaaatatccttgtattcatgcattagccg
ctttcattaacattatggatgatgaagatcggagaagaggtttggagttacatgatttggttacaaaata
ttattgggcggagttgtgggcattggcctattataggactatttatcttgttccggataggtcgcagtgg
gaagtaccagatgaagtaaaggcgttgaagatagttccgctgtctaaaaaaccgaagaaaggaaggaaaa
aaatgctaaggtttccatcaaccggggaaaagcggccaaaacgacaaaggacgcaaaacaaaaggcgtcc
aaggcaatcgtgtcaatggttattatttgggaatacgcctatctgagttttttactttgtttctgcagtg
atttgttgtttttatggtatggacttactatgtaatactgtatttccccttctgtaatactatgtctgtt
tctattgtttttttgtaatactggttataacaagtaatatcatgtctgtttctgtcgaatttgtggtcat
attagtactactggtaatactaagttggtgttatggagttcgaatttgtttacctcaaaactcaatgaaa
atgaatacagtgagttattaacagataaaaatgtattattaatcatttatacttaattaagttagaaatt
ttaatattaaatcgttttaacttatctttagtagactcaatacaacatatcccttatacttttaagctca
cttttaatattatcaccaacttaaaatgaaaaattcatattgctacatatatttaatttataattgcaca
aaatatttaaacctctatatttataaagttcaatccagtgtactactaaataaacaaattgtaaatatag
gaaatactgctaaacatgaaaaacatggaaagtaatactatttgagaaattggcactaccaaaaccttag
gaaatactatgctgtaaaaaggctgcgcacatgtcaaaaatctccaaaaaacggcgcacacgtggggaat
taatttttttggttttcccccaaaatttttggcgcctaacaaattttgagattccctccaaaaccaaaaa
tttcacttttttctttttcctctttctttttcgatctcttcttacaactttctttcatctttcatcttct
ttcctctttttctttcatctctcatctctcttctcccacatctctcatctctctcttcgactcgaattct
ctcacatctctcatctttctttcatctttctaccatggatagactgtgtgagagagacccctactacgat
gatatgaaagtggcgaagagagccattgagcaaatggaaatggttgcgatgatggaagggattcctaagt
tttgtccatgtggtggtagcattgtcgacactcgaaaggatgaaaagagatactatcaatgcgagaagtt
taaggtatgttgatgtagagcacaagtttttcaagtttgtgtagatctagatctagatatatttggtatt
tcagagtatttcccgttttggtaatactgcctagaacaagtaatactgcacaagtttttctagtttttgt
agatctagatctatatttgataaatatttggtactttgtaggatgatagaactgattgtatgcacatccg
taaactttgggataaggctatggaagaagaggtgagtagcttaagggagagtgttgattacaatcggaag
aaagttctaagtcatgagtatctcatagaagaaatgcaaaaagaattgaaagcccaccgtgcagagattg
tgaacgtgagcaaagtggtattccgtaatcctatggctccaaagaagtaatgtgttatcctattgttcct
taatcttattgttaatttgcttgtaagactttccctatgctttgtaagactttccgtatgctttgtaaga
ctttcccttcgattgtaagacttctctttgattggtaatactatggttgttcagtttgatatttccagta
tatctcttaatagtagtagcatgttgtatttccagtatatctcttagttgtagtagccttttgtatttcc
agtatatctcttagtcgttgtagtacgaggtatttccagtatttcagtaaaattcaatatcttagtagta
gtagcacgaggtatttcccgtatttcagttaacaatatcataaaacatcctaatacattaattgaatcca
aacatagttttacgagtacttgcgaaatcctaaaacaaagtcaaagtacaagaaatcaaaatacaaatcc
ggagaagttacacgaatagttttcacacgttactagactttgcacgggttgaacgggtcactcgtttggg
cacaacaggagcagccgcattttccttgattgttgattcctcctcatctggtgggttctgagacatctgc
attgttaatttagttaaaccttggtattacgcgatattaccaattatggaacagtatatccggtattacc
cagatttaccttggctttcatctcatccataaatgcttcgaacgttgcttgtttttcctccatttccata
agcgccaacaaagcttcattcagcttgtcattgatggaatggagcttctccattacatcaactaacgtcg
ggtctcccggtccaacagctgtgggaacttcctcttcatttggggcttccattcgtgcctgcacatcctc
gttgtacacctcctcaaatctaatttcacgcccttgaccaagacgcttcatccaactatcaactacaaca
tcgtgcttatcaacatcatcctctactcccgtgatttcagccaaaagaggtacttcttcagccttctctc
taattatactctcaatgacctatggaaataatagacaatgaaatattagaaagaaataaaatcatatttg
ctcaaggaataataggaaatactaacctcagttgttccgagtacatggttgatttgctcaagggtaaacc
ctttcatctcggtcctcttgaaattcaccttgcacatccttggacaaccggcatgtgcaccctctttttc
ttctatgaatctttccctaagtgatggaatggcctcaaatgctaaaaactacacaaaacacatgaaattt
aattgttagcttcaattacgaaaatgtaacaaggataaaacatgtgacttacctcaagcggcacacagaa
tccgggaacaggccatggtgactgagtgttcgggacgacaccgttaaaatgatccaatgtgtgtgagatc
gattttaacatgtactcaaatgaatatctcccccatggaaatgtcttacaaaacgtaagatcacttgcag
ccctaacacagaaatcatcaattggactagccctttctccggtcttagtaggcacggcaatgatgcttgt
tagaaaatagaggaccgccatccgcaacctatcctttgatctagccggttccatctccattagcttggtt
ttcacgtcttcacgtctaattaccccggttttgaagtacctgttggcgaagttcatattaccagcactct
gataattagctggataggccttgcagttgaaaccagagatcaaagcatgctccctaataccatagcggat
gggaactccattaacaacgaaccaaacctccttcttcttctcaatagatgccgttcgaagaaaaagcatc
cacatccccattaacttgtgggtatatccggaaggcaggtggtagatgtgcttgaaactcggatgctcca
taaaataattcttctcaacatccgttagaggatgttccaagtcagcgaaggtcttcaacacctccgagat
ataacacctcgttgctaacttcatcttcttggtatactccgacgccgggaagtacataccaagtggctgc
attgccatcgcttcctccatatcctgaaagaaacaaacattcattcatatatcactcaattgtaaatagt
actgccacagtattactctgtatttccaaaaagtattccccgaaattaccagtattacacaaatcctact
attcatatatcactcaattgtaaatagtactgccacagtattactctgtatttctcagaagtattccccg
taattaccagtattacacagatactcctattttactatgtagtatttctcaaagttactagtactaacta
tcgataaatcacaattgtagtattactctaaggtattacccacatccgccaatattacctaaatatcaac
atagcatttcccaatgttactagtactgtccacagttcacaattgcattattacccaattagatcggtat
tacgaaatggaatacttaccacatttgcgatgtcctcgtctatggtcctctcctccacgggtttttccac
tatcacgtcctcgtctctctcgtcctcgtctccctcatcctccacggacttttccacattgacgtcctcg
tcgctcgcggagctcctctcctcctcctcctcgataacggctaggccgtcatcttgtctctccgcctctt
ccacgggattagcacacgcgagttctttgtttgtctccgtctcctctgaatcggaatcggagcccaaggt
tcggctactactccctgattcttcttcttcacttgcttcttctttttcattttcatcttccacctcatca
gtttgcatattctcttgaatttcttcagcaggaatatgcttcagagcatcataaaacgtagaatcatcaa
gcatcgggctctccgtcgttggagccgtcgttggagccgttgtcgttggagccgctgtcgttggagccgc
attggttgccatcgccgtcgcagtcgtttgtgccggcgactgctcctcagccggcactttctctttatct
ttcttcgccggcgcctgaaccttcgctttcttcgccggcgcctgaaccttcgctttcttcgccggcgcct
gaaccttcgctttcttcgccggcaccttctccttcactttctccgccggcgccttaactttctctgtctt
cgccggcgccgacgcttctatctcctttcttttcccttgacctcctccacgcgtcttaggcggcatgact
gatttcgaatcggaaatagaacggaaatactggtgaaaggaagaagaacgatcgattgaatttaaagaaa
tagggaatgttgcggttaatttcagcggttactaaggagaagaagaagaaaaatcgaaaacggttgtaat
gagaaaaagaatgaaattcgaattgggaaatccctggaaatattagggttttttttatttgattttcaaa
taaagagtaaaggtggttcgggtggtttggttcggaaacgaaccatggtttttctggtaattaataggtt
cagatggggatatactagtaatatccatatatttcgaattaaaaaaaaaaaaaattatttctgtttttta
caaggattttttggcttcctctttgcaaaaaagggactggccagacgatttttc1