;ID   ATCOPIA54_I DNA   ; ATH   ; 4117 BP
;XX
;DE   Internal region of ATCOPIA54 copia-like LTR-retrotransposon.
;XX
;AC   AL161507
;XX
;DT   05-NOV-2001 (Rel. 6.2, Created)
;DT   05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; ATCOPIA54LTR; 
;KW   ATCOPIA54_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4117)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal region of ATCOPIA54 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(2) p. 6 (2001)
;XX
;CC   ATCOPIA54_I is an internal region of the ATCOPIA54 copia-like 
;CC   endogenous retrovirus flanked by the 7% divergent ATCOPIA54LTR 
;CC   long terminal repeats, and by a 5-bp target-site duplication
;CC   (AATTT). 
;CC   ATCOPIA54_I encodes remnants of the ATCOPIA54p copia-like 
;CC   polyprotein. A long fragment of ATCOPIA54_I (positions 133-3949)
;CC   is inverted in the genome and is flanked by an 80-bp inverted
;CC   repeat (positions 53-132 and 3950-4029). The ATCOPIA54_I sequence
;CC   reported here represents a reconstructed prototype of the ATCOPIA54
;CC   provirus followed by the inversion. The reconstruction is defined
;CC   below in DR lines. 
;XX
;DR   Positions  151204  151255  Accession No AL161507   GenBank (rel. 124.0)
;DR   Positions  155152  151336  Accession No AL161507   GenBank (rel. 124.0)
;DR   Positions  155153  155400  Accession No AL161507   GenBank (rel. 124.0)
;XX
;SQ   Sequence 4117 BP; 1471 A; 656 C; 959 G; 1031 T; 0 other;
ATCOPIA54_I
tttggtatcagagcggttataatcattttgtgatcaagagagattagaagagagtagagatcaacgtttc
aagctagaaacatgggtgacatagttgtggcaaaaccaaaggagaatatctcatcatcaataacatgtcc
tatgctcaatgctacgaactacacgggttgggccatacgtatggagattacgcttagtatacataaggtg
tgggaggtaataaatccaggatctgatgatgttgacaagaatctcatggctaggggtttcatattgcaac
ctataccagagactttgacactacaagtcgggaatcttaatacaacaaaaaaagtatgggaatcaataaa
aactcgacatgtaggagtggagagggtcaaagaagcaaggttacaaaccttgatggcagagtttgagaaa
ataaagatgaaggaaagaacatattgataacttcgttggaagactttcggaactctctacaaaatctgcg
gaactaggagttgagattgaagtaccaagactcgttaagaaatttcttaacggtttgccaagaaagagat
atatacaaacaagttcttgaccttaataatacaagatttgaggatattgtgggccgtatgaaagtatatg
aagaataagttggtgatgtaggagatgagcaagatgacataagaaaactcatgtaagttaatactaattc
acaatcctatcaagataactgctagtagaggaagaggtcaaagaggacgatttggtggaagaggaagagg
acgtggtcgtaatacaagagataagtcaaagatcatgtgttacaggtgtgataagatagggcattatgct
tctaattgtccagatagattacttaagcttcaagaagcatgtgcaaacaagaagaagaaactcaagaagc
ggatgagctcatgatacatgaggtagtctatttaaatgaaaagaatgtcaaacttttgaaacacaatcag
atggagataatgtgtggtatcttgacaacggggcaaggaatcacatgacagaaaaccgttcttatttctc
taaaatcgacgagtcaatcacagggaaagtgagatttggagataactctcgtattgatatcaaagggaag
ggctcaatactctttgtaagtagaagtgtactacataccggatctaaagagcaatatcataagtcgtggt
caagccaccaaagcaggatgcgatgtgaggatgaaagaaaactatctaacattgtatgatcgtgatggaa
agttgttggtgaaagcgataaggtcaaagaatcggctttacaaagttaccatggaaaccgaagctaagaa
gtgtttacaactaaatcttatcgacgattcatcaatatggcactcaaggttaggacatgttgggttaaac
actatgaggtgaatgatgaacaaagagttagttgtcgggttaccaaagatcacagtcgaaaaggaaacat
tggcctcatgttcgcttgggaaataagtaagaagaatattccctcaagctacttcttttcgagcctcacg
actacttgaactcatacatgcggatctctgcggacctatcacacctatgacagcagcacaaaataggtat
atctttgttcttatcgacgatcactctcgttatatgtggacagtgctattgaaggaaaagagtcaatcat
tcgacaaattcaaaaaatttaaagcactagttgaagaagaaacaggagcaaagatcaaaacacttcgtat
agatagaggtggtgagttcacttcacatgaatttcaagatttttgtgataaatccggaatcacaagatat
ttgtgataaatccagaatcacaagacacataactgcaccttactcaccacaacaaaacggagttgttgaa
aggaggaatagaaatttgctagagatgaccataagcatcatgaagcacatggatgtaccaaactatctat
ggggagaaccagtgaggcatgctaccaatctttttaatagagtcgcaataagatcactggttaaacaaac
tccatatgaggtattcaagggaagaaggccaaatattgaacatttacgtgtgttcgggtgtatcggatat
gcatagactgagagtccacagttaaagaagctagatgacaggtcgagaaggttagttcatctgggaacag
aacctggctctaaagcttatcgcttgttggatccatctaggcggagaattattgtgagtagggatgtcgt
ttttgatgagagtaaaaactggtcttggaatgagacaaaaaacgagacaagtgagagcccatgaacgttt
aaagtcagctttggaaacaatggtattgaaaatgaggactcagtacaagaaacagaggagaacggagccg
atgagaataacgagggttcagttgaagaggaagaagacattccaaacgataacgatcaagatgaacagac
taatgaggtcatcttaaggagatcagagagacaacgtcatagacctaatcatcaagatgactatattttg
tttgctgaacttgaagtcgaaaaactcttgatgacaatcagtgaagaaccatgggattacattgaagcaa
aagagctaaaggtatggagagactcgtgtagaagaaatcatgtctattaccaaaaataaaacatgggacc
tagtagaacttccagtcggagtcaaggctataggactaaagtgggtgtttaaactaaagcaaaattctga
tggtagtattaacaagcataaagcaaggttgtagcaaaatgttacatacaaagacatggaatagattatg
acgaagtcttcactccggtagcaagaaaagaaaccattcgccttatgcttgttcttgctgcttcacatgg
atagcaagttcaccacctcgatgtcaaaacggcgtttctacatggggagctgaaagaagaagtttatgtt
atacaaccggagggttttgttacaagagggagtgaggagaaagtttataagttaaacaaagcattgtgtg
gcctcaaacaagcgcctagggcctagaatcataagcttaactcgatacttaatgagttaaagtttgtcaa
gtgtcctaaggaaccttcattgtatcagaaacaagacaaagataaagttcttctagttgcagtctatgtg
gatgatctattaatctcggggtttagcttgaagttgattctcgagttcaagaaggaaatggcgaaaaaat
tcgggatgagtgaccttggtttgttaacatactatctcggtcttgagtatgtcaacacgaaggaggtatt
acgttgaagcaagaaaagtatgcatcaaaaattctaagtgaaactcaaatggaagaatgcaatgttgtag
acataccaatgaacgcgaacttaaagctaagtaaagcacatgatgagaaaaacatcgatgagaaggagta
tagaagaaatatcgggtgccttcgatatttacttcatacaagccctgatctttcttatagtgttggagtc
ttgagcaggtacatgcatgaaccaaaggagtctcatggtgcagctctaaaacaaatacttaggtactgac
aaggtacacgggcttatggtctctccttcactcagaaaaacgaagccaagttgataggcttcagtgatag
cagtcacaacgttgatgaggacgatggaaggaaaacaatatgtcacattttctatctcaacaagtgtctg
atcacttggtgctcgcaaaagcaagataatgtggctttatcatcatgtgaggccgagtttatggccgcta
ctgaggcagcaaaataagcactatgacatcaagagcttcttggagagatcaatggaaaaccatgcgagaa
gatgctgattttacttgacaacaaatctgcaattgcactcaccaagaacccggtgtttcacggacgaagt
aagcatatacacaaaaggtatcattttattcgtgagtttgtgacgaatgaacaagtggaggtagagcacg
ttcctagaaaaagaccaaaggcagatattctaaccaaggctctaggaaggatcaagtttaaagaaatgag
ggagctagttggagttcaagatgtgtcgaagtatggcttcaaacttaagagggtgaa