;ID ATCOPIA4I DNA ; ATH ; 4493 BP ;XX ;DE Internal region of ATCOPIA4 LTR-retrotransposon. ;XX ;AC Z97342 ;XX ;DT 12-APR-1999 (Rel. 3.3, Created) ;DT 12-APR-1999 (Rel. 3.3, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ATCOPIA4; ;KW ATCOPIA4I. ;XX ;OS thale cress ;XX ;OC Arabidopsis thaliana ;OC Eukaryotae; Viridiplantae; Charophyta/Embryophyta group; ;OC Embryophyta; Tracheophyta; seed plants; Magnoliophyta; ;OC eudicotyledons; Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4493) ;RA Kapitonov,V.V. and Jurka,J. ;RL Direct submission (March 1999) ;XX ;RN [2] ;RA Kapitonov,V.V. and Jurka,J. ;RT Molecular paleontology of transposable elements from ;RT Arabidopsis thaliana. ;RL Genetica 107 (1-3), 27-37 (1999) ;XX ;CC ATCOPIA4I is the internal region of the copia-like LTR-retrotransposon ;CC ATCOPIA4. ATCOPIA4 may be a currently active element, since its LTRs, ;CC ATCOPIA4LTR, are identical. ATCOPIA4 has a 5-bp target site ;CC duplication. Its left and right LTRs were wrongly identified as two ;CC solo LTRs by the authors who deposited the Z02342 locus in GenBank. ;CC Also, the copia-like polyprotein encoded by ATCOPIA4I was wrongly ;CC annotated as part of a 2000-aa protein (see GenBank, Z97342). ;XX ;DR Positions 56668 52176 Accession No Z97342 GenBank (rel.
109.0) ;XX ;SQ Sequence 4493 BP; 1238 A; 1295 C; 791 G; 1169 T; 0 other; ATCOPIA4I tggtatcagagcaaacgataccctaattttttttttcaaaacacaaacctagccgcctaacatgggctcc tccgcaaacggtctcccagccaccactgatgaagcaattgtcttcactccgcaaacaatcttcaacatta acacgtctaatgtcacgaaactcacctccaacaattacctcatgtggagccttcagatccacgccttgct tgatggatatgaactcgcaggacatcttgatggttctatcgagactcctgctccaacactcactacaaac aatgttgtctccgctaatccacaatacacgttgtggaagagacaagacaggctcatcttcagtgccttga ttggcgccatctctccaccggtgcaaccattagtgtctcgtgcaaccaaagcctctcaaatctggaaaac cttaaccaacacgtatgctaagtctagctacgaccacatcaaacagctccggactcaaattaagcaactc aagaagggaaccaaaaccattgacgaatacgttctgagtcacacaactctccttgatcaattggctattc tcggcaaaccaatggaacacgaagaacaggtggaacgtatccttgaaggtcttcctgaagactacaaaac tgttgttgatcagatcgaaggcaaagacaacactccctctattacggagattcatgaacgactcattaat catgaggccaagcttttgtccactgctgctctgtcatcctcgtcgcttcccatgtcagctaacgttgctc aacaacgccatcacaacaacaatcgtaacaataaccaaaacaagaatcggactcaaggcaacacctacac caacaattggcagccctctgcaaataacaagtcaggtcagcgccctttcaaaccttacttggggaaatgc cagatttgcaatgttcaaggacacagtgcgcgtcgatgcccacagctgcaggcaatgcaaccgtcttcga gctcctcggcctccacgttcacaccatggcagccacgagctaacttagcgatgggagcgccatacacagc aaataactggcttctcgatagtggagctacccatcatatcacgtccgatctgaacgctcttgcccttcac cagccctacaatggtgatgatgtcatgatcgctgatggcacaagtcttaagattacaaaaactggttcca ctttcttaccttctaatgcccgtgaccttactttgaataaagtgttatatgtacccgatatacagaagaa tttggtctcagtgtaccgcctatgcaatactaatcaagtgtccgttgaatttttccctgcctcttttcag gtgaaggacctcaacacggggaccctgttgctccaagggagaactaaagacgagctctatgaatggccag tgactaatcctaaagctacagctctgttcacaacaccaagtccaaagaccactctttcttcctggcattc tcgcctaggccatccttcttcttctattctaaacactttaatttcaaagttttcacttcccgtttcagtt tctgcttcaaataaacttgcttgttcggattgtttcattaataagagccataaactcccattttctatct catccattaaatccacctcaccgcttgaatatatattttctgatgtctggatgtctcccatattgtcacc agataactacaaatattaccttgttcttgttgatcatcacacacgatatacatggctttaccctttgcag caaaagtctcaagtaaaatccacttttattgcgtttaaagcgttggtcgagaacaggtttcaagcaaaaa tccgaacactttactcggacaatggcggagaatttatcgcactacgagagtttctcgtttccaatggtat 
ctctcatctcacctctccaccacacactcccgagcacaatggcctatccgaacgcaagcacaggcacatc gttgaaacaggactcaccttactcactcaagcttcggttccacgagaatactggccatacgcattcgccg cagctgtttatctcattaaccgaatgccgactccggtgctatccatggagtcaccgtttcagaagctgtt cggatccaagccgaattatgagcgtctacgagtattcggttgtctgtgctttccatggctcagaccttac actcacaacaaattagaagaacgatcgagacggtgtgtgttcctcggttactctttaactcaaacagcct acctctgtttcgatgttgaacataagcgactttacacatctcgccatgtcgtgtttgatgaagcctcctt tcccttctccaacctcacatcccaaaattctctccccaccgtaacctttgaacagagctcctcgccgtta gttacgcccatactctcatcatcgtcggttctcccatcttgtttgtcttccccgtgtacggtccttcacc aacaacaaccgccggtgactacgccgaactcaccacattcatcacagccgacaacctcaccggctcctct gtctcctcaccggtcaaccacaatggactttcaagtcccacaggtacgctcttcgtcacccttattatct tcttcttcatctttaaattctgagcccactgctccaaatgaaaatgggcctgaacctgaggcccagtcac cacctataggcccactgtcgaatccaacccatgaagcctttattggtccactcccaaacccaaaccgaaa cccaaccaatgaaattgaaccaacacctgcgcctcaccctaaaccggtcaaacccacaaccaccactacc actccaaatcgaaccaccgtctccgacgcctctcaccaaccaactgcaccacaacaaaatcaacacaaca tgaaaacccgagctaaaaacaatatcaaaaagccaaacacaaaatttagcctcactgctactctcccaaa tcgttctccatccgagccgaccaatgtcactcaagcccttaaagacaaaaagtggcgttttgccatgtcc gatgagtttgacgcccaacaacgaaatcatacatgggatctcgttccccatgaatctcagcttcttgtcg gttgcaagtgggtcttcaaactcaagtatctcccaaatggtgccattgacaaatacaaagcacgcttagt ggccaaggggttcaatcaacaatatggtgtcgactatgcggaaacgtttagtccagtcattaaatctaca acaattcggcttgttcttgatgtcgcagttaagaaagattgggagattaaacaactagatgtcaacaatg ctttcttacaaggaactctcaccgaagaagtatatatggctcagcccccgggtttcatcgacaaagatcg tcccactcatgtttgtcgccttcgcaaagctatatatggactgaaacaggccccccgagcgtggtatatg gagctgaagcaacacctattcaacatcggcttcgtcaactcactctccgatgcgtctttatttatctact gtcatggcaccactttcgtctatgtacttgtctatgttgatgatattattgtcacagggagcgacaagtc atccatcgatgcggtgctgacttcccttgcggaacgtttctccatcaaagatcccacagatcttcactac ttccttggtatagaagcaacccgaacaaaacaaggtttgcaccttatgcaaaggaagtatatcaaggatc ttctcgcaaagcacaacatggctgacgcaaaaccggtgttaacacctttacccacctcaccaaagctcac tctccatggtggtacaaaactcaacgatgcatctgaatatcgatcggtggtgggtagcttgcaatactta 
gcgtttacacgtcctgacattgcgtatgccgtcaaccgattatctcagctcatgcctcaacccacagaag atcattggcaagcagctaaaagagttcttcgatatcttgccggcacatcaacgcatggtattttcctaga cactacctcaccattgaatctccatgccttttcggatgcagattgggccggggattccgatgattatgtt tctaccaatgcatatgtcatctatctgggcaagaatccgatctcttggtcctctaagaagcagcgtggtg ttgcccgctcctccacagaatccgaatatcgagctgttgcaaacgctgcatctgaagttaagtggctttg ctcacttctctctaagttacacatccggttaccaattcgcccttctatattctgtgacaacattggagct acctacttgtgtgctaatccggttttccactctcgtatgaagcacatagccatcgactaccatttcgttc gcaacatgattcagtccggtgctcttcgagtctcacatgtatcaacacgagatcaactagcggatgccct caccaaacctctctctcgagctcactttcagtccgcacgtttcaagattggagttcgtcaactccctcca tcttgagggagcg1