;ID   ATIS112A    DNA   ; ATH   ; 5333 BP
;XX
;DE   ATIS112A is an autonomous DNA transposon.
;XX
;AC   AC018660
;XX
;DT   01-FEB-2001 (Rel. 6.1, Created)
;DT   01-FEB-2001 (Rel. 6.1, Last updated, Version 1)
;XX
;KW   autonomous DNA transposon; 5 bp-long target-site duplication; 
;KW   transposase; TIR; IS112 superfamily; ATIS112A.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 5333)
;RA   Kapitonov,V. and Jurka,J.
;RL   Direct submission (January 2001)
;XX
;CC   ATIS112A is a autonomous DNA transposon.
;CC   There are several copies of the transposon in the A.thaliana
;CC   genome which are more than 90% identical to each other.
;CC   They are bordered by 5 bp-long target site duplications.
;CC   ATIS112A has 25 bp-long terminal inverted repeat.
;CC   ATIS112A encodes two proteins, ATIS112Ap1 and ATIS112Ap2.
;CC   Function of 320-aa ATIS112Ap1 is unclear. It is encoded by
;CC   3 exons (834-922, 1262-1444 and 1498-2188).
;CC   ATIS112Ap1:
;CC   VIWRICNKVCCFLIEYNPNAKVVGLRSNASFINLLNSQDDSHILPPNPYECFELGSANVP
;CC   VYSTEWSDDDPSEDEAPIAGKKGRKGRKGRNGWLNTSKDPVVGNEQKGAAFWERIAAYYN
;CC   SSPKLKGVEKRGHICCKQRWSKVNDVVNKFVGSYLAASKQQTSGQNDDDVVSLAHQIFSK
;CC   DYGCKFTCEHAWRELRYDQKWIAQSTHGKAKRRKCEDDSEPVGLEDKEARPIGVKAAKAA
;CC   AKAKGKGKLSPDEGEETNALKEIKSIWEIKEKDHAAKEKLIIIKEKKNRTKLLERLLGKT
;CC   EPLSDIEIELKNKLINELLA
;CC   ATIS112Ap2 is a 279-aa DNA transposase, which is related to the
;CC   transposase encoded by the bacterial IS112 transposon.
;CC   ATIS112Ap2 is encoded putatively by 3 exons (3752-3902,
;CC   4200-4319 and 4458-5026).
;CC   ATIS112Ap2:
;CC   MAGSSSNYNLDDMFDDKFDQCFDQALESYGNRQRVKPRKKKTYIERNREEEGTINLFGDE
;CC   YLRRPTRDDLIRLLHIGEQRGFPGMIGSIDCTLNDINVFDRSPVFDDILQGRAPKVKYVV
;CC   NGKDYNLAYYLTDGIYPKWATFIQSISIPQGNKASLFATTQEACRKDVERAFGVLQARFA
;CC   IIKHPALFHDKVKIGNIMRACIILHNMIVEDERDGYTQFDVSEFVHPESASSSQVDFTYA
;CC   TDMPSNLGNMMATRARVRDRIKHEELKADLVEHVWQHYN
;CC   There are several highly divergent members of the IS112 superfamily
;CC   present in the A. thaliana genome (including HARBINGER).
;XX
;DR   Positions   25609   30941  Accession No AC018660   GenBank (rel. 119.0)
;XX
;SQ   Sequence 5333 BP; 1638 A; 879 C; 1028 G; 1788 T; 0 other;
ATIS112A
agcaagtctagcaagggtttttagtgggtttctatgttgggttttttaaaaaaaatgggcccggatgaaa
aaaaacagaacaatccatttaggatcgttcctaaaattgggtttttcgcgacccaatagaaactcatacg
tggcttcattttattgggtaaacgttttgactaaacaaaaaaaaaaaacacgcgtttttttgttttcttc
ttctctctgatttcttcgtggtcgccgacggagaccccaaatcgattctgcttctaaccaaatcgaaacc
ctcatcatggttcgtatctcttcttcttcgaatcgatttgattttatgagctttgaaattgaattttgaa
agagattatgatctgattgcattggtatgtatattgaattacgaattctgggaatttttagttcgtgtac
taagcttgagctgcatatgttgttattagcttgttcttagttttctatatgtttgcttcattctcagtga
agtcatattgttgtttctatattaacatagaaatttcagatctttatgtgtttgttctgtagctttgttt
gtacctatatccttcaaagaaaaacttacggattcaggtatatgttttgttgtggtagatagaggagcat
tatgtggctcaaggtttaagtattgttggatactttgatttggttgtctgaaaagtttagatgtcgtgta
ttcattgcataaactggcttttgcttgttaatctttcaagccttcaaagcatctggacaacgattttgca
ttatgtatttgatcttttggtagttcgttttaattaagtttttgtttctatattttttctgaggttattt
ggaggatttgcaacaaagtttgctgcttcttaattgaatataatccaaatgcaaaggtggttggcttgag
atcgaatgcaaggtaatttatcattttgtatttatttatgtgatcaatgctttgctgttatttactgtga
taaatgcttctttgttgatataaaactgtagttagataaacgtcaattgcttttgtaggtaaacgtcaaa
tgctagttcttatagcatttaaaactgtaaaactgtgatgattgcttggtaggttaattttagtcaatag
tatttagtatggcattgattgtgagttgtctaacaaagccataatttgcctttaaaacacaaaaccgtaa
cctttgatttttaaaacacacaaccacaaacaataatatgattcaccgaatccatatcatcatacttcaa
gctttataaaccttttgaacagtcaagatgacagtcacattctcccaccaaatccatatgaatgctttga
acttggatcagcaaatgtccctgtgtatagcactgagtggtctgatgatgatccaagtgaagatgaagca
ccaattgctgggaaaaagggaagaaagggaagaaagggaagaaagtcaagaaaccaaagaggaattggtc
atcaactgaagatgttgttctcataagtgggtggctgaacacttccaaggatccggtggtgggcaatgag
caaaagggggcagctttctgggaacgtatagcagcgtactacaactcaagtcccaagctcaaaggtgtgg
aaaagagaggccacatatgttgtaaacaaaggtggtcaaaagtgaatgatgttgtgaacaagtttgtagg
gagctacttggctgcaagcaaacaacaaacaagtggtcaaaatgatgatgatgtggtgtctttggcacat
cagatcttctcaaaagattatggctgcaagtttacttgtgagcatgcttggagggagcttagatatgatc
aaaagtggattgctcagtctactcatgggaaagcaaagagaagaaaatgtgaagatgactctgaacctgt
tggccttgaagacaaggaagctcgtcccattggtgttaaggcggccaaagccgcagccaaagccaaagga
aaaggcaagctaagtccggatgaaggtgaagaaaccaatgctttgaaggagataaaaagtatatgggaga
taaaagagaaagatcacgctgccaaagagaagctaatcattattaaagagaagaaaaataggactaagtt
gcttgagcgtctgttagggaaaacagaacctctatctgacatagagattgaactgaagaacaagttaatc
aatgagttgttggcttgagtttcagttgtaggttaagtgtctgtttctagattatggttctctaagtgtc
tgttttaactctgtttttaagtttagtgtctgtttcaagtgtgtttttatatttagtgtctgttttttgt
ctatgttttatgtttaagtgtatgactatgttaatcaatgagttgtttcagtttatgagtttgagttgtt
tcactttggcactttcagtttcagtttatgagtttgagttgtttgactttcactttcacttcagttctgt
tctatttttgatctagtctccaagcttacttggtattttactgatgcagaacatgtgagaaagaagaaaa
tgaagagccaacaaaacaaaagttaggtcttgttccagacgaccagtttcttgctcaaaatccggttaag
ttttcttctgactcttaccttttttttttttttttggttgtatcttattagcagtcatacatttgcagaa
taacagaacagtaggattatggtttctattttagtcaactggttaacgagcttggttgtgctttggatct
tcagtgtggagttttattattagttttctcgcacaaatttcttcatactgtatctgttctaaaaaattat
ggctatgtaggaatagaatttgggctacgaagaagtaaaaccagatacattttttcgatctgcaatagat
gtatgattgaaggtatcaatgattacaagctttaaacatcaatctcctatgtatttttgatagaatttgc
agacaatgaaacttaaacatgcacctttattagcttcacgctcactaatctgtagaatcattttctgcct
tggagctttgagtgtgtatatataacagaatcatatcgggtgaatcctgcagggacaattgagcttcact
acctgttaggtcttgcaccacacatggagcatctaactgaggaaacagagccaacaaaacaaaagttagg
tcttgtatcttactagcaacaaaaacgtttcagaaacgttgaaaaagttataagtccgtgctctttggaa
tgtattatatcattggcaaccaatttgtttgtttgtgaacattttcaatgattttggaccgcatttaatt
ggagtcacatgtggacgcagaccgcatttaaccggtttgacaaaatatgtgtttcttttaacgatgacaa
accactactgaatcacagaatcataagccttgttttaatggagaagaaagaatatcaagtccaagtgttt
gccattctttagtgccgacatcaagtactttagtatgtcaccaaggaatgtactttagttttgtcaccac
aaacttgattgttcttttcatttgaaccattgattattgtaccattcatttgcaccattcactactcctt
tatattgtatgaatgaatgtacaccgaataaaaagaattccgaacacactactcctttattgttctatat
ataatccgaacactacttcaatatttctatcaactctctttatcataataataaacactcttttttatca
aatcatttttatctcttctctctttctataattttgctaaaatggcaggttcttcttcaaactataatct
cgacgacatgttcgatgataagtttgatcaatgctttgatcaagcattagagagttatggtaatcgccaa
cgagtcaagccaaggaaaaaaaaaacatatatcgaaagaaatcgtgaagaaggtcacatccaattagtga
atgattattttacagagaatccaacatatcctccccatatattctgacgccggtttagaatgaacagacc
gttgttcatgcgtatcgttgagaaattctccaatgaagtcccatactttaaacaaagaagagatgccacc
ggaagactcggtttctcggcactacagaagtctacagcagcaattcgtatgctcgcatatggtattgcgg
ctgatgcggttgatgaatacctctgaattggtgaaagcacgtcgctgttatgtttggaacattttgcaga
aggaaccataaatctgttcggggatgagtatctaagaagacccacacgagatgatcttatacgacttctc
catattggagagcaacgtggatttcccgggatgataggaagcatcgactgtatgcattgggagtggaaaa
attgtcccaccgcttggaaaggacaatatacacgttgatcaggaaaaccaacaatcgttttagaggccgt
tgcttcacaagatctctggatatggcacgcgttttttggacccccaggtaccttaaacgatatcaatgtt
tttgaccgctcaccagtttttgatgatatattacaaggtcgagctccgaaagtgaaatacgtagtcaacg
ggaaagactataatttggcttactacctcacagatggaatttatcctaaatgggctacatttatccaatc
aatttcaattccacaaggtaataaagcttctttatttgctacaactcaagaagcttgtcgtaaagatgtt
gagcgtgctttcggagttttgcaagctcggtttgccattatcaaacatccagctctttttcatgataagg
tgaaaattggcaatattatgcgagcatgtataatacttcacaatatgatagtagaggacgaacgagatgg
ttacactcagtttgatgtatctgaattcgtacatccagaatcagctagctcttcacaagtggatttcact
tatgctacagatatgccttcaaatctcggtaatatgatggccactagagctcgagttcgtgatcggatta
aacatgaagaattgaaagctgatttagttgaacatgtatggcaacattataactagaatcaatcttaaaa
tgtaatcgtttttttttccgcttttaaatatgtttttttgcttcttctttctatatcatgtatgtttaaa
atttgataattcttaatcataatcatatttgctcaaataaaaatatttaacaaattttctttttttcaaa
atttttttgttttagaaacccacttaaggattccaccaatggaccaaaaaatttccaaaaactcgtaaaa
gtttctaaactacaaaactatccaaatttattaataaaaaatacatagaaactatactttagaaactaat
cactggatttgct1