;ID   VANDAL18    DNA   ; ATH   ; 11260 BP
;XX
;DE   DNA transposon VANDAL18.
;XX
;AC   AL161500
;XX
;DT   06-SEP-2000 (Rel. 5.8, Created)
;DT   06-SEP-2000 (Rel. 5.8, Last updated, Version 1)
;XX
;KW   DNA transposon; VANDAL superfamily; MUDR transposase; protease;
;KW   TIR; VANDAL18.
;XX
;OS   thale cress
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Charophyta/Embryophyta group;
;OC   Embryophyta; Tracheophyta; euphyllophytes; Spermatophyta;
;OC   Magnoliophyta; eudicotyledons; Rosidae; Capparales; Brassicaceae;
;OC   Arabidopsis.
;XX
;RN   [1]  (bases 1 to 11260)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (September 2000)
;XX
;CC   VANDAL18 is a fossil of an autonomous DNA transposon. 
;CC   It is flanked by a 9 bp target site duplication.
;CC   Its non-autonomous derivate is NVANDAL18A.
;CC   It preserves short subterminal inverted repeats.
;CC   It encodes MUDR like transposase (~position 400-3600), which 
;CC   is most similar to the transposase encoded by VANDAL2 and VANDAL1 
;CC   transposons. 
;CC   VANDAL18 transposase (7 exons, predicted by GENSCAN):
;CC   QLTESTTFSYTWKFVDEITLKFGSSICLFGCDKCNYLIFQNGIFVLFEEQKRMDEGVMVL
;CC   VGGWECKDNGEWRFKMSDNKYGKWVDLSFWFEGGDTVYTHQKMSPVSVDNESSLKKFKEK
;CC   RQEKGGLNMYLSIDDVDVAEISSPEIRGHKDNNVVEVEVAGYMGGRMDDGNSTHAIPLTY
;CC   EEDGFLEEIILGQSRMRAELPDTQATTLEESLDKDCPRCLDNVHGQSLQGVGIVDVEVTT
;CC   IPDADAQETLDTCLVPESGEYVGPVDVVVTSLEASASEEEGQSDVEDNAAGVASDKANMN
;CC   LRKGGDAIYIGWVFNNKVELHKTLTMYSMKRLFNFRIKASDKTRVIAVCDDKKCDWRVYA
;CC   TFHENSEKVEIRTATLKHTCDVEARSKYGMKATRSMLGELLKTKYTHGKKGPRACELPEI
;CC   VLAELPYMKAWHAAIYVGLRTVHSLAKHACCTVHLFRNVVHNFQCEGLVKMVSMAARSYT
;CC   VGDLRYWFEEIQKRNIQCAKYLVEIKLSHRTLAYFPGMRYIVMRSNISESLNAAIQKAID
;CC   FPVVTMVEFIRTMLMRWFCERRQTAAKTKTRCTPKIEDLLIDHLKLATDCAVIAADEWIY
;CC   QVNDSFGMIFTVDLEKKTCTCRVFDVLMVPCCHALAAVGVRNVDIYSLIGDYAFVTEWRK
;CC   L
;CC   
;CC   Another protein is encoded by the second strand, position
;CC   10975-6714. It is similar to the proteins encoded by
;CC   other VANDAL-like transposon, including the SUMO-like proteases.
;CC   This protein is encoded by 15 exons, predicted by GENSCAN:
;CC   MSPMRSSTSKAKQKSALGGARRSVSFEDDDDPFLSAKSTAEQSSDPPIIPRIPEFNRERL
;CC   PKRLYATDCYPLNGRINTYSKPKYLLYLVDILDGTKELNYICAPCFSPFFQFEVRKCSFS
;CC   GKLVHQMLCRGLYTRKKHEIWFVFAGQPFRFLLREFAILSGLNCGPYPSRRDILKSQQPI
;CC   NLRHPYWNVLIVRSKLSISPLLPIDDPPYPNINDYWIDEVEDPTLDYMERVIGSRYKFQQ
;CC   SDWRGGCRTWPKIVVDENHPKEDNARKAKKVPPHRPSDIPDSFQAGPSAKGKGKVHEEFS
;CC   TPPPRTDGHDILHGLIMEQVNSKFRELKDELKVELFGDLTENSLLKDELLDEIISNLRHG
;CC   DKFPSPKSGGKSPKSTSGDGIVTPPPKAASQKDIADEGQISDPAKNLDPKIVPERQSNMP
;CC   SRIADSSSKNPAPSGQTHAQLLNALQQQAWSNYGGLSEVIKNINQSHASSSGPPQAGKFG
;CC   STCSDPSGGPSSSKSSSDHNPKDFQTPLQTIPEDTEAEDQDAALGDAAIGDEEMDGKSDN
;CC   VNITEDHNPNDFLTPQDEAVLVSDSESGDDFDDGANQEDEGDKDPQRGDEGDDTHQGDGD
;CC   PKQGDEGDDESEGSKPFVITVAIHPEELLQSSKVDTEQLSDQEKGKKKPQRFCETVTVSV
;CC   VKGPHDDLHPIAKVNDEVRKSFLSTIKAYRQRKYVIEGHVVPKTFFKDMHTPQAAVHQEA
;CC   WFADVDILYTPLHLTAGHCVSIVINQKEGTIGVLDPIADSKTAKQMDDLRTNEVPLEFAY
;CC   DRLPEVFQTEHADDSGPLVVKFIELHFQAISHDSITEESVEDLRMRFAVDIYEEFIEALR
;CC   V
;XX
;DR   Positions	167349	178608  Accession No AL161500	GenBank (rel. 116.0)
;XX
;SQ   Sequence 11260 BP; 3607 A; 2039 C; 2645 G; 2969 T; 0 other;
VANDAL18
gagaatcttgcaaaacctgccatattcccctcaaaatcttacaatccctatcatattcacagtcagctta
cagaatctaccacatttagctacacgtggaaattcgtggacgaaattacccttaaggtgggtcagtgctc
tgacttagatggcgacagcttcgtccagcgaaaagaggcagcccctgttccgtaataaatgtttccgttt
ccgattggatgttatgtggatttattttttaaaaatggtaatataggggttagtcactttaattgtttat
taatgttgcagaggaaagatgttgtgttgaacagaacaaagtggtaaactggggaatttgggaggtgttg
ttgaagacacaaccatttattgcggacgaggtaaaatttattatctggaatttctgtccttcgattcgct
ttagtttggtagttctatttgtttgtttggttgtgataaatgcaactacctcatattccaaaatggtata
tttgttctgtttgaagaacagaaaaggatggacgaaggagttatggtattggtcggtggctgggaatgca
aagacaatggggagtggcgtttcaagatgtcggataataagtacgggaagtgggttgacgtaagcgaatg
agatatgttgactgaaattgtatcaaaagttgcaacaacgtttggactagacaggaagaccaaatttcag
ctgagtttttggttcgaaggaggagacactgtctacacgcatcagaaaatgtcgcctgtctctgttgaca
acgaaagctcgttgaagaaattcaaagagaaaaggcaagaaaaaggaggtttgaacatgtatctatctat
agacgatgtcgacgtggctgagatttcgtcccctgaaataaggggacacaaagacaataatgttgtagag
gtagaagtggcgggatacatgggcgggcgaatggatgatggtaatagcacgcatgcaattccattgactt
atgaggaagacggcttcctagaagaaataattgtgtgtgaagaaacttaaagaggaaggctagtttacta
gagttggcctgcgcaaagcggggacgagaaaaggaagtggatgtagagttggcaactgagacaagaggat
gggcatcagactcagaagacgaatcgttaagggaggtatcagattcagaggattatgattttgataatat
gagggatatgattgatacagattacctgccggattgggatccttggaaggattggaggaacagcacattg
acaagttttgacggttagccggggacacaaatgtgtggaacaactttgggggataagtcatctaggccca
ataatatgggggacgatatactgctttgtggcgggggacaaagccgggttgaccttgtcgaagtggaagt
tattgagatgaacggaatgggcgatgagagtgcaaaatgtcgattcagctggggcaaagtagaatgagag
ctgaattaccggatacgcaagcgacaacattggaagaatcgttagacaaggactgtccaagatgtttgga
caacgtgcatggacaaagcctgcaaggggtaggtatagtggacgttgaggttacaacaatcccagacgct
gatgcacaaggtgatagtgagaatggaattgaggataatgatgaagaaaatacgagaatgtctatataat
tggggcagagggcagtcccaaatatgaggacagattcaatgacagagacgttggatacgtgtttggtccc
tgaatctggagaatatgtaggaccagtggacgttgtagtgacatcgttggaagcaagtgcatctgaagag
gaaggacaaagtgatgtagaggttggtaatacaacagtgggggaaatgataatacagcaagggcactggc
agaggacatgtcgagaagattaacagacggagagactatatttactgaattagcaggggacgagatgttg
atgtgtagagatgccgttccgttcaaggacaacgcagccggggttgcttccgacaaagctaatatgaatc
ttaggaagggtggcgatgcaatatatatcgggtgggtattcaacaacaaagtcgagttgcataaaacatt
aactatgtactctatgaaaaggctattcaatttccgtataaaagcgtccgacaagacacgagtcattgcg
gtgtgcgatgataaaaagtgtgattggagggtatatgcaacgtttcacgagaattctgaaaaggtggaaa
ttcgtacggcgactctaaagcatacttgtgatgtcgaggcgcggtctaagtatgggatgaaagcaacacg
atcgatgttaggtgagttgcttaagacaaaatatacgcatggtaagaagggaccaagggcctgtgaacta
ccagaaatagtattggctgaactaccatacatgaaggcttggtacgcaaaagaaatagcaatgaaaaaag
ctcgtggtagtgaggaagaaggttataagtttttcgcagacatgtctccacctgttgaggacaactaatc
taggtacgttgtccacagtccatacggattatacggaggaagggaatattcgtttcaaatatcttttttt
tttgcgtttgggcttcgattgttgggtaccagtatttgcggaaagttattgtgatagatggtgcacagac
caaagggaaatacaaaagctgcttagttgctgcaagtggacaagatgaaaattaccaaatattcccatta
gcgtttggtattatagacaatgaaaacatagcaggttggcagtggtttttcgaatagttgtcacagttta
tccctgatgaggaagatctagttttcgtctctgacaggcatgcggccatttatgttgggcttaggacagt
tcactctcttgcgaagcatgcgtgttgcacagtacatttgttcaggaacgttgtccataatttccagtgt
gagggacttgtaaaaatggtctccatggcagcgagatcttacactgtgggagatttacgatattggtttg
aagaaatacagaagagaaacattcaatgtgccaagtatttagtagagataaagctatctcacaggacact
ggcgtacttcccgggtatgcgatacattgtgatgagaagcaatatctcggagtccctaaatgctgcaatt
caaaaagccatagatttcccggtagtgaccatggtggaatttataaggacaatgttgatgcgttggtttt
gcgagagacgacaaactgcagctaaaacaaagacaagatgtacgccaaaaatcgaggatttgctaataga
tcatcttaaattagccacagattgtgcggtcatagcagcagacgaatggatctatcaagtaaacgacagt
tttggcatgatttttactgttgaccttgagaaaaagacatgtacttgcagagtatttgatgtgctaatgg
tcccgtgttgtcacgctttagcagcggtgggtgtaaggaacgtggacatatactccttgatcggagacta
tgccttcgtgacagagtggcgtaaactatagcgtgaacatattctacccccgccaaaggaaaaggacaca
gaggtcccgaacgatattagtctagtggttgtatatccaccaaatacaagaagaccagttgggagaccaa
ggaccgttcagataccatcgaggggggagccggtaaataactaaactcttagcatagttttaagtcctcc
gaatctatcataaccaaacgtgtcatctacttgtagggcaaaggatggaaaaagaagaagacaagacaat
gcagcacatgtgggaaggatggccacaacagaggctgggggaaaaagtgcatcgagcgcgtgttaacgtt
tttttgtagtttttgatgtttcgaaaagacgttttttggtccatataagtttttggttttggtctccaga
ttttaatacagatcgggggtgtctttcgggaaggggatgtaattcagtcgaacgattgctttttgtaatg
ttaacttatgatgttgactaccaactagtttattgtaattgctacatatagaacaataattattatatta
gacagtggaagagtaaaaacgcagaaagatgcattaaaagtgctggaagcaaagtaaacgtgaagactga
aagcgctggaagataatcacagtacccaccaatcaatacaaagaggacaggggttgcaataattgcattt
cttatattctgagtatcgacgagggggatcaatcatgtaacacggccagcttatccagttcaagccaagt
tagtgtggcgcaattatattgatattggacataacactcatagccacacaatcggcgttgcggcaagaga
caacattgagtttaaatagattcatggcggtaatggcatcatttggtctattaaccgcaatatggtagag
acccttcgtaaaaagtattttgggggacggatcgggtatataggcgatcatgtccattgcttctttgatt
aacctttcaaatgaagccagcctaatgctctctaagtagcaagcaaccggattaccagcctttacacatc
gtaacagaacaagtaaatgggtgaaccactggagactaattgagaatgcgcttcaatgagggccagggat
gtctcccgcgaaacggatggatcaagtgctagacgcaagccagcgttgccagagacaatgaggggaccta
agtgttggtaagataactttccaacaagaaggacaattgaaaccctaagatgttgaggaagcgattgcca
gtttaccaaattgtgatgtttgaagtatggtttagtttggatttcacacagaaagaagaactatatatat
agttgggattaaaggagaaaataaaatggatgcaattagtaggtatggacaatgaatattagatgtgaca
aaaagctataaaagctacaccattaggacaaagcaaactctgcaaagtacaaaatttaattaaacggttt
atagaatttagaatattagttgaggaccataaatatgaaaactaaatgaatgatgaagtgacaagacaat
tagccattattaggcaaatggggacatggaaaagtaaatatatagactaaatattgaaaaatcataaaac
gaaatatataaaaaacttatcaaatccgaaaaaaatggacacatactcatatggtccacatcttacagta
cggttacataattagtggacaaaataatcaaagataaaacaatagttctgtggaaagaaaaagatttaat
agagctctagcaatgaaattatcgctacatataagtgccaaggacgaaagcgacaatggggacaaggatc
acgaggacgagaattatatgcgggggacaagaatggacttggtcgatgatggtacgaggggagcgttcga
gaagaccaacttgatgctgaagagatttgatttcttgctggtagaggtctaggcgatcccacatgtacgg
aggatcaatgcgttcaccattagacccggtaactacgaacttaatgtgttcatcctgttccctgacagag
cacttgagcgctgcaatcttgtccaccaatgactcgtccaaccatttgacaagatgacgcccaggtcggg
ccttgaaaaagatagcaaaagcaggttttatacgtgtggagatagaaatgaaaaatgggggttaatgggc
atacctctctcttctcgcactggtagaactgtctgttgtcctcagaggacgaactcgcaagaaccacaat
cccaccacaataacagcgacgtggaataccaagaatgggcgtcgacggttgggaagggtaaactagtcca
tatgacgtcggagattctgaatgctgagaagattccgatggttgtgtataactatgaaggcccatgttac
agtggctgtggataattttgaacaagttaaaaggaaagtgaataattaatggggttggaaaattgagaaa
gtggacaagttatccttcaaaaggtttgaaagatttgaaggataattaatttagagagataaaagggggg
ggggagaatttagagagataaaaggccggggggggggggggaagaagaacttcagtaacctaggcaccta
ataatataataatgttaaaaatattaagttgatgtgatgtgacataaaggtaaaatcatagaaaagcgta
agggataaagattagtagaaaaaattgtggagacagttgactgagactgttcttagtgggaagatacccc
acaatagtccacgaatgtgaagtgtattgcatttgcatacttacaaagccacaacataattgttgtcccc
tctaaatatataatgtccctccccatccgttgataagtgattttacctccacataaaatattttacataa
agataagagttatatatggaataaaaggtagtaaaaagttaagtctccaaaaaacatacaagttcggaat
aaaagtagagtcactaaatattaaaacagacataaattctaacatggacatcaaacacgaagagcttcaa
taaactcctcatagatgtctacagcaaatctcatcctgagatcttcaacggactcctcagttatggagtc
atggctaatggcctgaaaatgaagctcaataaacttcacaacaagcgggccactgtcatctgcatgctca
gtttggaagacctcagggagacgatcataagcaaactcaaggggcacttcatttgtcctcaggtcttggg
ggacaagcgactttataataagcggaatcaggacaataaaaggtttcatcaaaaccgtccatctgtttgg
cagtcttcgagtcagcgatagggtctagcacacctatggtaccctctttctgatttatgacaatggaaac
gcagtgaccggcagtgagatgtaaaggagtgtacaagatgtcgacatcagcgaaccaagcctatcggatg
ctacggccagggacgatacctttcacatagtcccagatcttactgttccacgaccagtcagatttacttt
tggccttcttataacgtggatagagctcaacaaagttaagtgcgaacgaagaatctagtataacagccat
actctgtaccaactcatcgccatacctccgcgccaacaggcttagcatagcctcgatgtattggattaca
taaagacaaccagttagatctgagtaaattttaacatattcccaggtttagcattgatttggtacatacc
tcctggtggacagcagcctgaggagtgtgcatgtccttgaaaaatgttttagggacaacatgtccctcga
taacgtacttcctgaccattcattcataataatgattaatagatacgtagaagaatattagttaccaaaa
caataattcagcggaaaacttaattacctttggcgataagctttgatagtggacaaaaagcttttccgaa
cctcatcgtttactttggcaattgggtgcaaatcatcatggggccctttgactactgagacagtaactgt
ctcacaaaaacgctggggttttttcttgcccttctgggaggaagcatagacttagaagtatccccttctc
caacctctctttttcttttaatcctagagggaaaagtctctgaaattgggacaccagcatgtgcattacc
ctataagacaaagatttatgtcagatccataaaagacaacaaaagattgaaaaaacatagatttatacct
cttgatcggatagttgttcagtatcaacttttgaggattgaagaagctcttctggatgtatagcaacagt
aataacaaaaggtttgctaatcgcaataaaagatataataaggattaaatgaggaatacttcgttcatgg
tctgatcatcttgttgatcaccggagtcagaccttccttcactctcatcatcaccctcgtccccttgttt
tggatctccatccccttgatgagtatcatcaccctcgtcccctcgttggggatctttgtcccctggatta
gtatcatcaccctcgtccttttatttgggatcttcgtccccttgaagagaatcatcaccctcgtcctctt
ggtttgctccgtcatcaaaatcgtctccggattcagagtcactaactaagacagcctcatcttgaggcgt
taggaaatcgttgggctacaagtaaaaagaaagaattaaaggtaaataagattctcaagaccaaaataat
tatttataaaaaaaaacacaactatctagaaacttacgttgtgatcttcagtaatattcacattatcaga
cttcccatccatttcctatttaaaaatgttatgcaaacatttacaatgttatatgataccaaaaaagaaa
taaagggacaatactcacctcgtctccaatagcagcgtctccaagagcagcgtcctggtcttccgcttct
gtgtcctctggtatggtctgcagcggcgtttgaaagtcttttggctaaataagaagtaagcacaattggt
taaagtaaaaggaaattctaacgataagtaatagatgccaaaccattaaacttacattgtgatcactcga
ggatttcgaggacgatggaccaccagaaggatccgaacaagtagaaccaaactttccagcctgcggagga
ccagaggaagatgcatgagattggttgatgttcttaataacttcagacaatcccccataattagaccagg
cttgctgctgcaatgcattcaagagttgagcatgagtctgtccacttggggcagggttcttgcttgacga
atcagcgattcgcgagggcatattactttgtctctcagggacaatcttggggtcgaggttcttcgcgggg
tcactaatttgtccctcatctgctatgtccttttgggacgcagccttagggggtggagtgacaattccat
caccactagtggactttggagactttccaccagatttgggacttggaaacttgtcaccatggcgcaaatt
gcttattatctcgtctaatagttcgtcctttaacaaagaattctctaagataaacataccagtaaggtcc
ccaaataattcaaccttcagttcatccttcaattcacgaaatttagaattaacctgttccatgatcagac
catggagaatatcatgtccatcagtcctcggtggtggagtactgaactcttcatgcaccttccctttacc
tttagcagagggcccttttggggaccagatggaccagcctggaaagaatcagggatatccgatggtctgt
gtggaggcaccttctttgcctttcgagcattgtcttctttaggatggttctcatcaacgacgatctttgg
ccaggttcgacaaccacctctccagtcagattgttggaacttgtatcggctaccaatgaccctttccata
tagtcaagcgtcgggtcttcaacttcatcgatccagtagtcgttgatatttgggtatggtggatcatcaa
tgggcaacaacggagaaatactcaactacattgaagccgaaacaatttatcaactaaacaaatcatcaaa
ggcaaactagtgagagactaataagaatctaataaaattaggtccacagattaccctaggaattagctct
gtctccattatgttggagttgtggtaagtcttcaacggaggcaaactggtgagagacgtctcactgatag
tctgcgtatcatcggggtttggaaggtatctttccaattcagggataacgttaaagacaaaaagttggaa
gacaagagggaatccatgtgttgaattgttagactgattgaacttcgtgattagtcctgggagattgctg
accttggatctaactttgtgagaaggaaggactctcgtccccagggatacgcctcaaaagcctccaagtc
cttcaccatttcaacgacttatttagaaacttttacaggttgggaactacagatcaagatcccctcaaca
ataactataacagcgagacaaagccttctccaaggttccatcctctcttctttaggcttcagcttatctg
tcttcaaccatttgaatatatccttgatgaggacgactttgtgttccttacctattaggacattccagta
aggatgtctcagatttatgggttgctgcgacttcaatatgtctctccgagaggggtaaggaccacaattc
aaaccactaagaatggcaaactcccttaacaaaaaccggaagggctgaccggcaaaaacaaaccaaatct
cgtgttttttccttgtgtagaggcctctacacaacatctggtggaccagtttcccggagaaagaacattt
acggacttcaaattgaaaaaagggactaaaacaaggggcgcaaatgtagtttaactccttcgtcccatct
agaatatcgacgagataaagaaggtacttaggcttagagtatgtatttatacgcccattcagcggatagc
agtcagttgcgtaaagtctcttgggaagtctttcccgattgaattcagggatgcgaggaattattggcgg
atcagagctctgctcagcggttgatttggcagagagaaatggatcgtcgtcgtcttcaaaggacaccgac
cgcctggcaccgccaagcgctgacttctgtttcgctttgctggtggaagacctcatctgcgttggggtaa
aatcaaacaattatcaaatcagttattcgatgtgataaaataattaaacacacaggaggacgaaaaaccc
taaaagctgtaatgggggacatttcgattaatcggaacaaactcaatcaaccggcttaattgctaaaggt
tttcgaactttctgggtaaatgaatcgcacaggagaggacgaactatctaaaatcataatgggagacaaa
ctcgatttttcataacagaccagtaacaagggatttgctaaagattttccaaaaatacttcactgaataa
aatatgaagacatgattgaagatagcaacatgcagaaacttgaaatgaccttaccggagacataatccga
tgaaagttatcggtgaaggaaagggaaagtatgagagagatagacgaaagagagagagttaactgttgtt
gtttttccgcacggtttacaatgaaaaagagataaaaggtattgtaaaaagacgcagttgttgagcgcga
aaactgctcagtaaaccgtgggaacaaaatggacatttcacaaccaaatgtggtatgagttgtaatgccc
gaagaaatatggcagattacataacgattgggaaaaatatggcagattttataattttcc1