;ID   ATCOPIA10_I DNA   ; ATH   ; 4220 BP
;XX
;DE   Internal region of ATCOPIA10 copia-like LTR-retrotransposon -
;DE   a consensus sequence.
;XX
;AC   .
;XX
;DT   03-MAY-1999 (Rel. 3.3, Created)
;DT   16-DEC-1999 (Rel. 4.0, Last updated, Version 2)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   ATSASHA4LTR; ATSASHA4I; ATCOPIA10_LTR; ATCOPIA10_I.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4220)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (April 1999)
;XX
;RN   [2]
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Molecular paleontology of transposable elements from
;RT   Arabidopsis thaliana.
;RL   Genetica 107 (1-3), 27-37 (1999)
;XX
;CC   ATCOPIA10_I is a consensus sequence of internal region of a recently 
;CC   active ATCOPIA10 copia-like retroelement. The consensus sequence has
;CC   been reconstructed based on two provirus sequences found on 
;CC   chromosomes 4 and 5. Internal sequences of these proviruses are
;CC   94% identical to each other and they are flanked by 5 bp-long 
;CC   target-site duplications. Interestingly, there is only 85% identity
;CC   between LTRs flanking different proviruses although every provirus
;CC   is flanked by 97% identical LTRs.
;CC   ATCOPIA10_I ORF (position 53-4138) codes for 1361 aa-long 
;CC   copia-like polyprotein.
;CC   The name has been changed from ATSASHA4I to ATCOPIA10_I (Dec.16, 1999).
;XX
;DR   [1] (consensus)
;XX
;SQ   Sequence 4220 BP; 1393 A; 649 C; 1055 G; 1123 T; 0 other;
ATCOPIA10_I
gattggtatcagagcccaggctttgggtgtttcgttcacatcgcatcggtcgatggcaacgacgacgaga
gtggagatcaaagccttcgacggcgataacaatttctcgttatggaagatcaggataatggcacagctcg
gagttcttggtttgaaaggaactctaactgactttgctttgacaaagactgagacattaacaaagagtga
ggagaagcaagtagcttctggagatgaatcatcggattcaagtgctgtgttgactaaagaagttccagat
ccgatcaagattgaaaaatcagaacaagcgatgaacattatcatcaatcacataagtgacacagtcttga
gaaaagtaaatcactgtaagactgcagctacattgtgggaattgttgaatgaattgtatatggagacgtt
gttacctaaccggatctatgcacaattgaaattctactcgtttagaatgatgacttcaaagacgattgat
caaaacgttgatgattttctgaggatagttgcagaattaggaagtcttgatatcaaggttgcagaagagg
ttcaagcgatcctaatcttgaattctttgcctgttacctatgatcagttgaagcacaccttgaagtatgg
aaataagaccttgtctgtgaaagatgtagtatcttcttcaaagtctcttgaaagagaaatggctgagctt
aaagaaaacactaaggtggtgaatacaactctatacactgcagaaagaggcagaccacaaacccgaaatc
aaaatggtagtcaaggcaacaatcaaggtaataaccaaggtaaaaatcaaggaaaaggcaaaagcaggtc
gaattccaaatctcgtgtaacctgctggttctgcaagaaagagggacacgtaaagaaggattgttttgct
aggaagaagaagttcgaaaatgaagaacaaggtgaggcaggtgtgattactgagaagttggtgtactcag
aggcacttagcatgcatgaccaagaagctaaagagaagtgggttattgactctggatgtacctaccatat
aacttcaagaatggactggttcacagatttcaatgaaaatgagtcaacactaatcttgttgggtgatgat
cacactgttgaatcaagaggttctggcatagttaagatcaacactcatggtggaaccataagaatgttaa
agaatgtcagattcgttctaaatctgagaaggaacttgatctctacaggtactttggataagttgggctt
taagcatgaaggtggagacggtaagatcaggttttacaaagaaaacaaaacagctttgcgtggaaatttg
gttaatggactgtatgtccttgatggtcacacagttctgaatgaaagctgtaatgctgaagggtcaacaa
aaaggacaagtttgtggcattgcagacttggtcacatgagtgtgaataacatgaagattctgactgagaa
aggtttgattgaaaagaaggatatcaaagagctgggtttctgtgaacactgtgttatgggaaagtcaaag
aagttgagttttaatgtgggaaagcacaatactgaggatctactaggatacctacatgcagatctttggg
gatctccaaatgtcaccccatctctctctggtaagcagtactttctgtctataatagatgacaagactcg
taaggtgtggttgatgttcttgaaaaccaaagatgagacatttgataagttctgtgaatggaaagagttg
gtggagaatcaggttaacaagaagatcaaagtgttgagaacagacaacggcttagaattctgcaatctga
agtttgatgagtattgtaagaagaatggtattgagagacatcgaacttgtacctatacaccacagcaaaa
tggtgttgcagaaaggatgaatcgaactctcatggaaaaggtgagatgtcttcttaatgaatcaggtcta
gatgaaagtttttgggctgaagcagcctcaactacagcctatttggtgaaccgatcacctgcatcagtag
tagaccacaacgttcctgaagaattgtggctaggcaagaaaccaggttataaacacctaagaaggtttgg
ctcgattgcttatgttcatcaagaccaagggaaattaaagcctagagctttaaaaggcgtttttctgggt
tatccacaaggagttaaaggctacaaagtgtggttgttggatgaagagaaatgcgtcattagtcgaaatg
ttgtatttgatgaagattcagtctacaagagtctgctacctgaaagtgataaagaacagattgatgggaa
actcagtaaagagactaccgttactgtgaatgacagtgttaaagaaaaaggagaaagttctgcttcaggt
ggagctattgaggaaatcagtgacagtagtgactcagaggttgctgctacagaagaagactcacccatac
agactgtaaatctcgaaaactaccagctagctcgagacagaacccgaagggttactagaccacctactaa
gctgtcagactatacccattttgcttatgcgttagtaatggcagaagaacttggtgaagaagaagaacct
caatgctatcatgatgcacaaaatgataaagactgggagaaatggaatggtgggatgtctgaggaaatgg
attcattactgaaaaatgaaacctgggatattgttgataggccaaaggatcaacatgttattagctgcag
atggctatacaagataaaaccaggaattctaggtgttgagtcaaagagatacaaggccaggctagttgca
agaggtttcactcagaagaaagggattgactatgaagaggtatttgctccagtggttaaacacatttcca
ttagaattctaatgtctattgtagttgcagatgacatggaattagagcaaatggatgttaagacggcttt
actgcatggagagcttgatcaagtgctatatatggagcagcctgagggatttgaagcagatccaaacaaa
gatcaagtgtgtttgttgaagaagtcactctatggcctgaaacaagcacctagacaatggaacaagaagt
tcaatgctttcatgatggatcaaggctttactaggagcttacatgattcgtgtgtatacgtcaaagaggt
catccctgatcagtttgtgtatctactgttttatgtagacgatatgttgatagcaggaaagagtatggct
gaggtcaataaggtcaaagaaggattgagtttacattttgagatgaaagatatgggtgcagcgagtagga
tactgggaattgatattgaaaggaatagagaggaaggaactttgtgcttatctcagtcaaagtacttaga
gaaggtcattcaacgttttagaatggcagatgcaaaaggtgtgagtactcctattggtgctcatttcaag
ttgtcagcagtcaggaacaatgatgagagtgttgacacagaagtttgtccttactcaagtgtagtaggaa
gtgttatgtatgctatgatagggaataggccagatgtagcgtatgctctcggattggtgagcaggtttat
gagtaacccaggtcatatgcattgggaatcagttaagtggttactaagatatcttaaaaggtcaatggac
ctgaagttggtttacacaaaaggaaaagacatgaagatacatggtttctgtgactcggattatgctgcag
acctcgataagagaaggtccataagtggatatgttttcacggttggtgggaacactgtgagttggaagtc
aagcttgcagcatgttgtagcattgtccactacagaagctgagtttatggcacttactgaagcagtaaaa
gaagccatttggatccgaggtctcttggatgatatgggattgaagccagaagcagcttcagtgtggtgtg
actctcagtcagcgatttgtctgtcgaagaacaatgcatttcatgaaagaacaaagcatattgctgtgaa
tttctatttcattcgagacataatcgaagctggtgatgtagaggttgagaaaatccacacttcaaggaat
cctgcagatatgcttactaaagtcatacttgtgcacaagtttgaagcagctttagatcatctaaagctcc
tcaagtgatacttggaattatgattgccgaaggttaagttcaagtagtgcaattcactggagaatgttga
aagattgaatcaaggtggag1