;ID   ATCOPIA4I   DNA   ; ATH   ; 4493 BP
;XX
;DE   Internal region of ATCOPIA4 LTR-retrotransposon.
;XX
;AC   Z97342
;XX
;DT   12-APR-1999 (Rel. 3.3, Created)
;DT   12-APR-1999 (Rel. 3.3, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; ATCOPIA4;
;KW   ATCOPIA4I.
;XX
;OS   thale cress
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryotae; Viridiplantae; Charophyta/Embryophyta group;
;OC   Embryophyta; Tracheophyta; seed plants; Magnoliophyta;
;OC   eudicotyledons; Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1]  (bases 1 to 4493)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (March 1999)
;XX
;RN   [2]
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Molecular paleontology of transposable elements from
;RT   Arabidopsis thaliana.
;RL   Genetica 107 (1-3), 27-37 (1999)
;XX
;CC   ATCOPIA4I is an internal region of a copia-like LTR-retrotransposon
;CC   ATCOPIA4. ATCOPIA4 can be a currently active element since its LTRs, 
;CC   ATCOPIA4LTR, are identical. ATCOPIA4 has 5 bp-long target site 
;CC   duplication. Its left and right LTRs have been wrongly identified by 
;CC   the authors deposited Z02342 locus in GenBank as two solo LTRs.
;CC   Also, the copia-like polyprotein encoded by ATCOPIA4I was wrongly
;CC   annotated as a part of 2000 aa-long protein (see GenBank, Z97342).
;XX
;DR   Positions  56668   52176  Accession No Z97342     GenBank (rel. 109.0)
;XX
;SQ   Sequence 4493 BP; 1238 A; 1295 C; 791 G; 1169 T; 0 other;
ATCOPIA4I
tggtatcagagcaaacgataccctaattttttttttcaaaacacaaacctagccgcctaacatgggctcc
tccgcaaacggtctcccagccaccactgatgaagcaattgtcttcactccgcaaacaatcttcaacatta
acacgtctaatgtcacgaaactcacctccaacaattacctcatgtggagccttcagatccacgccttgct
tgatggatatgaactcgcaggacatcttgatggttctatcgagactcctgctccaacactcactacaaac
aatgttgtctccgctaatccacaatacacgttgtggaagagacaagacaggctcatcttcagtgccttga
ttggcgccatctctccaccggtgcaaccattagtgtctcgtgcaaccaaagcctctcaaatctggaaaac
cttaaccaacacgtatgctaagtctagctacgaccacatcaaacagctccggactcaaattaagcaactc
aagaagggaaccaaaaccattgacgaatacgttctgagtcacacaactctccttgatcaattggctattc
tcggcaaaccaatggaacacgaagaacaggtggaacgtatccttgaaggtcttcctgaagactacaaaac
tgttgttgatcagatcgaaggcaaagacaacactccctctattacggagattcatgaacgactcattaat
catgaggccaagcttttgtccactgctgctctgtcatcctcgtcgcttcccatgtcagctaacgttgctc
aacaacgccatcacaacaacaatcgtaacaataaccaaaacaagaatcggactcaaggcaacacctacac
caacaattggcagccctctgcaaataacaagtcaggtcagcgccctttcaaaccttacttggggaaatgc
cagatttgcaatgttcaaggacacagtgcgcgtcgatgcccacagctgcaggcaatgcaaccgtcttcga
gctcctcggcctccacgttcacaccatggcagccacgagctaacttagcgatgggagcgccatacacagc
aaataactggcttctcgatagtggagctacccatcatatcacgtccgatctgaacgctcttgcccttcac
cagccctacaatggtgatgatgtcatgatcgctgatggcacaagtcttaagattacaaaaactggttcca
ctttcttaccttctaatgcccgtgaccttactttgaataaagtgttatatgtacccgatatacagaagaa
tttggtctcagtgtaccgcctatgcaatactaatcaagtgtccgttgaatttttccctgcctcttttcag
gtgaaggacctcaacacggggaccctgttgctccaagggagaactaaagacgagctctatgaatggccag
tgactaatcctaaagctacagctctgttcacaacaccaagtccaaagaccactctttcttcctggcattc
tcgcctaggccatccttcttcttctattctaaacactttaatttcaaagttttcacttcccgtttcagtt
tctgcttcaaataaacttgcttgttcggattgtttcattaataagagccataaactcccattttctatct
catccattaaatccacctcaccgcttgaatatatattttctgatgtctggatgtctcccatattgtcacc
agataactacaaatattaccttgttcttgttgatcatcacacacgatatacatggctttaccctttgcag
caaaagtctcaagtaaaatccacttttattgcgtttaaagcgttggtcgagaacaggtttcaagcaaaaa
tccgaacactttactcggacaatggcggagaatttatcgcactacgagagtttctcgtttccaatggtat
ctctcatctcacctctccaccacacactcccgagcacaatggcctatccgaacgcaagcacaggcacatc
gttgaaacaggactcaccttactcactcaagcttcggttccacgagaatactggccatacgcattcgccg
cagctgtttatctcattaaccgaatgccgactccggtgctatccatggagtcaccgtttcagaagctgtt
cggatccaagccgaattatgagcgtctacgagtattcggttgtctgtgctttccatggctcagaccttac
actcacaacaaattagaagaacgatcgagacggtgtgtgttcctcggttactctttaactcaaacagcct
acctctgtttcgatgttgaacataagcgactttacacatctcgccatgtcgtgtttgatgaagcctcctt
tcccttctccaacctcacatcccaaaattctctccccaccgtaacctttgaacagagctcctcgccgtta
gttacgcccatactctcatcatcgtcggttctcccatcttgtttgtcttccccgtgtacggtccttcacc
aacaacaaccgccggtgactacgccgaactcaccacattcatcacagccgacaacctcaccggctcctct
gtctcctcaccggtcaaccacaatggactttcaagtcccacaggtacgctcttcgtcacccttattatct
tcttcttcatctttaaattctgagcccactgctccaaatgaaaatgggcctgaacctgaggcccagtcac
cacctataggcccactgtcgaatccaacccatgaagcctttattggtccactcccaaacccaaaccgaaa
cccaaccaatgaaattgaaccaacacctgcgcctcaccctaaaccggtcaaacccacaaccaccactacc
actccaaatcgaaccaccgtctccgacgcctctcaccaaccaactgcaccacaacaaaatcaacacaaca
tgaaaacccgagctaaaaacaatatcaaaaagccaaacacaaaatttagcctcactgctactctcccaaa
tcgttctccatccgagccgaccaatgtcactcaagcccttaaagacaaaaagtggcgttttgccatgtcc
gatgagtttgacgcccaacaacgaaatcatacatgggatctcgttccccatgaatctcagcttcttgtcg
gttgcaagtgggtcttcaaactcaagtatctcccaaatggtgccattgacaaatacaaagcacgcttagt
ggccaaggggttcaatcaacaatatggtgtcgactatgcggaaacgtttagtccagtcattaaatctaca
acaattcggcttgttcttgatgtcgcagttaagaaagattgggagattaaacaactagatgtcaacaatg
ctttcttacaaggaactctcaccgaagaagtatatatggctcagcccccgggtttcatcgacaaagatcg
tcccactcatgtttgtcgccttcgcaaagctatatatggactgaaacaggccccccgagcgtggtatatg
gagctgaagcaacacctattcaacatcggcttcgtcaactcactctccgatgcgtctttatttatctact
gtcatggcaccactttcgtctatgtacttgtctatgttgatgatattattgtcacagggagcgacaagtc
atccatcgatgcggtgctgacttcccttgcggaacgtttctccatcaaagatcccacagatcttcactac
ttccttggtatagaagcaacccgaacaaaacaaggtttgcaccttatgcaaaggaagtatatcaaggatc
ttctcgcaaagcacaacatggctgacgcaaaaccggtgttaacacctttacccacctcaccaaagctcac
tctccatggtggtacaaaactcaacgatgcatctgaatatcgatcggtggtgggtagcttgcaatactta
gcgtttacacgtcctgacattgcgtatgccgtcaaccgattatctcagctcatgcctcaacccacagaag
atcattggcaagcagctaaaagagttcttcgatatcttgccggcacatcaacgcatggtattttcctaga
cactacctcaccattgaatctccatgccttttcggatgcagattgggccggggattccgatgattatgtt
tctaccaatgcatatgtcatctatctgggcaagaatccgatctcttggtcctctaagaagcagcgtggtg
ttgcccgctcctccacagaatccgaatatcgagctgttgcaaacgctgcatctgaagttaagtggctttg
ctcacttctctctaagttacacatccggttaccaattcgcccttctatattctgtgacaacattggagct
acctacttgtgtgctaatccggttttccactctcgtatgaagcacatagccatcgactaccatttcgttc
gcaacatgattcagtccggtgctcttcgagtctcacatgtatcaacacgagatcaactagcggatgccct
caccaaacctctctctcgagctcactttcagtccgcacgtttcaagattggagttcgtcaactccctcca
tcttgagggagcg1