;ID ATCOPIA54_I DNA ; ATH ; 4117 BP
;XX
;DE Internal region of ATCOPIA54 copia-like LTR-retrotransposon.
;XX
;AC AL161507
;XX
;DT 05-NOV-2001 (Rel. 6.2, Created)
;DT 05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW LTR-retrotransposon; COPIA superfamily; internal region;
;KW copia-like polyprotein; reverse transcriptase; ATCOPIA54LTR;
;KW ATCOPIA54_I.
;XX
;OS Arabidopsis thaliana
;XX
;OC Arabidopsis thaliana
;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN [1] (bases 1 to 4117)
;RA Kapitonov,V.V. and Jurka,J.
;RT Internal region of ATCOPIA54 copia-like LTR-retrotransposon.
;RL Repbase Reports 1:(2) p. 6 (2001)
;XX
;CC ATCOPIA54_I is an internal region of the ATCOPIA54 copia-like
;CC endogenous retrovirus flanked by the 7% divergent ATCOPIA54LTR
;CC long terminal repeats, and by a 5-bp target-site duplication
;CC (AATTT).
;CC ATCOPIA54_I encodes remnants of the ATCOPIA54p copia-like
;CC polyprotein. A long fragment of ATCOPIA54_I (positions 133-3949)
;CC is inverted in the genome and is flanked by an 80-bp inverted
;CC repeat (positions 53-132 and 3950-4029). The ATCOPIA54_I sequence
;CC reported here represents a reconstructed prototype of the ATCOPIA54
;CC provirus followed by the inversion. The reconstruction is defined
;CC below in DR lines.
;XX
;DR Positions 151204 151255 Accession No AL161507 GenBank (rel. 124.0)
;DR Positions 155152 151336 Accession No AL161507 GenBank (rel. 124.0)
;DR Positions 155153 155400 Accession No AL161507 GenBank (rel.
124.0) ;XX ;SQ Sequence 4117 BP; 1471 A; 656 C; 959 G; 1031 T; 0 other; ATCOPIA54_I tttggtatcagagcggttataatcattttgtgatcaagagagattagaagagagtagagatcaacgtttc aagctagaaacatgggtgacatagttgtggcaaaaccaaaggagaatatctcatcatcaataacatgtcc tatgctcaatgctacgaactacacgggttgggccatacgtatggagattacgcttagtatacataaggtg tgggaggtaataaatccaggatctgatgatgttgacaagaatctcatggctaggggtttcatattgcaac ctataccagagactttgacactacaagtcgggaatcttaatacaacaaaaaaagtatgggaatcaataaa aactcgacatgtaggagtggagagggtcaaagaagcaaggttacaaaccttgatggcagagtttgagaaa ataaagatgaaggaaagaacatattgataacttcgttggaagactttcggaactctctacaaaatctgcg gaactaggagttgagattgaagtaccaagactcgttaagaaatttcttaacggtttgccaagaaagagat atatacaaacaagttcttgaccttaataatacaagatttgaggatattgtgggccgtatgaaagtatatg aagaataagttggtgatgtaggagatgagcaagatgacataagaaaactcatgtaagttaatactaattc acaatcctatcaagataactgctagtagaggaagaggtcaaagaggacgatttggtggaagaggaagagg acgtggtcgtaatacaagagataagtcaaagatcatgtgttacaggtgtgataagatagggcattatgct tctaattgtccagatagattacttaagcttcaagaagcatgtgcaaacaagaagaagaaactcaagaagc ggatgagctcatgatacatgaggtagtctatttaaatgaaaagaatgtcaaacttttgaaacacaatcag atggagataatgtgtggtatcttgacaacggggcaaggaatcacatgacagaaaaccgttcttatttctc taaaatcgacgagtcaatcacagggaaagtgagatttggagataactctcgtattgatatcaaagggaag ggctcaatactctttgtaagtagaagtgtactacataccggatctaaagagcaatatcataagtcgtggt caagccaccaaagcaggatgcgatgtgaggatgaaagaaaactatctaacattgtatgatcgtgatggaa agttgttggtgaaagcgataaggtcaaagaatcggctttacaaagttaccatggaaaccgaagctaagaa gtgtttacaactaaatcttatcgacgattcatcaatatggcactcaaggttaggacatgttgggttaaac actatgaggtgaatgatgaacaaagagttagttgtcgggttaccaaagatcacagtcgaaaaggaaacat tggcctcatgttcgcttgggaaataagtaagaagaatattccctcaagctacttcttttcgagcctcacg actacttgaactcatacatgcggatctctgcggacctatcacacctatgacagcagcacaaaataggtat atctttgttcttatcgacgatcactctcgttatatgtggacagtgctattgaaggaaaagagtcaatcat tcgacaaattcaaaaaatttaaagcactagttgaagaagaaacaggagcaaagatcaaaacacttcgtat agatagaggtggtgagttcacttcacatgaatttcaagatttttgtgataaatccggaatcacaagatat 
ttgtgataaatccagaatcacaagacacataactgcaccttactcaccacaacaaaacggagttgttgaa aggaggaatagaaatttgctagagatgaccataagcatcatgaagcacatggatgtaccaaactatctat ggggagaaccagtgaggcatgctaccaatctttttaatagagtcgcaataagatcactggttaaacaaac tccatatgaggtattcaagggaagaaggccaaatattgaacatttacgtgtgttcgggtgtatcggatat gcatagactgagagtccacagttaaagaagctagatgacaggtcgagaaggttagttcatctgggaacag aacctggctctaaagcttatcgcttgttggatccatctaggcggagaattattgtgagtagggatgtcgt ttttgatgagagtaaaaactggtcttggaatgagacaaaaaacgagacaagtgagagcccatgaacgttt aaagtcagctttggaaacaatggtattgaaaatgaggactcagtacaagaaacagaggagaacggagccg atgagaataacgagggttcagttgaagaggaagaagacattccaaacgataacgatcaagatgaacagac taatgaggtcatcttaaggagatcagagagacaacgtcatagacctaatcatcaagatgactatattttg tttgctgaacttgaagtcgaaaaactcttgatgacaatcagtgaagaaccatgggattacattgaagcaa aagagctaaaggtatggagagactcgtgtagaagaaatcatgtctattaccaaaaataaaacatgggacc tagtagaacttccagtcggagtcaaggctataggactaaagtgggtgtttaaactaaagcaaaattctga tggtagtattaacaagcataaagcaaggttgtagcaaaatgttacatacaaagacatggaatagattatg acgaagtcttcactccggtagcaagaaaagaaaccattcgccttatgcttgttcttgctgcttcacatgg atagcaagttcaccacctcgatgtcaaaacggcgtttctacatggggagctgaaagaagaagtttatgtt atacaaccggagggttttgttacaagagggagtgaggagaaagtttataagttaaacaaagcattgtgtg gcctcaaacaagcgcctagggcctagaatcataagcttaactcgatacttaatgagttaaagtttgtcaa gtgtcctaaggaaccttcattgtatcagaaacaagacaaagataaagttcttctagttgcagtctatgtg gatgatctattaatctcggggtttagcttgaagttgattctcgagttcaagaaggaaatggcgaaaaaat tcgggatgagtgaccttggtttgttaacatactatctcggtcttgagtatgtcaacacgaaggaggtatt acgttgaagcaagaaaagtatgcatcaaaaattctaagtgaaactcaaatggaagaatgcaatgttgtag acataccaatgaacgcgaacttaaagctaagtaaagcacatgatgagaaaaacatcgatgagaaggagta tagaagaaatatcgggtgccttcgatatttacttcatacaagccctgatctttcttatagtgttggagtc ttgagcaggtacatgcatgaaccaaaggagtctcatggtgcagctctaaaacaaatacttaggtactgac aaggtacacgggcttatggtctctccttcactcagaaaaacgaagccaagttgataggcttcagtgatag cagtcacaacgttgatgaggacgatggaaggaaaacaatatgtcacattttctatctcaacaagtgtctg atcacttggtgctcgcaaaagcaagataatgtggctttatcatcatgtgaggccgagtttatggccgcta 
ctgaggcagcaaaataagcactatgacatcaagagcttcttggagagatcaatggaaaaccatgcgagaa gatgctgattttacttgacaacaaatctgcaattgcactcaccaagaacccggtgtttcacggacgaagt aagcatatacacaaaaggtatcattttattcgtgagtttgtgacgaatgaacaagtggaggtagagcacg ttcctagaaaaagaccaaaggcagatattctaaccaaggctctaggaaggatcaagtttaaagaaatgag ggagctagttggagttcaagatgtgtcgaagtatggcttcaaacttaagagggtgaa1