;ID ATCOPIA8AI DNA ; ATH ; 4487 BP ;XX ;DE Internal region of ATCOPIA8A LTR-retrotransposon. ;XX ;AC AC005171 ;XX ;DT 12-APR-1999 (Rel. 3.2, Created) ;DT 12-APR-1999 (Rel. 3.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW ATCOPIA8ALTR; ATCOPIA8AI. ;XX ;OS thale cress ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Charophyta/Embryophyta group; ;OC Embryophyta; Tracheophyta; euphyllophytes; Spermatophyta; ;OC Magnoliophyta; eudicotyledons; Rosidae; Capparales; Brassicaceae; ;OC Arabidopsis. ;XX ;RN [1] ;RA Rounsley,S.D., Lin,X., Kaul,S., Shea,T.P., Fujii,C.Y., Mason,T.M., ;RA Shen,M., Ronning,C.M., Fraser,C.M., Somerville,C.R. and Venter,J.C. ;RT Arabidopsis thaliana chromosome II BAC T4E14 genomic sequence ;RL Unpublished ;XX ;RN [2] ;RA Rounsley,S.D. and Lin,X. ;RT Direct Submission ;RL Submitted (23-JUN-1998) The Institute for Genomic Research, 9712 ;RL Medical Center Dr, Rockville, MD 20850, USA, rounsley@tigr.org ;XX ;RN [3] (bases 1 to 4487) ;RA Kapitonov,V.V. and Jurka,J. ;RL Direct submission (March 1999) ;XX ;RN [4] ;RA Kapitonov,V.V. and Jurka,J. ;RT Molecular paleontology of transposable elements from ;RT Arabidopsis thaliana. ;RL Genetica 107 (1-3), 27-37 (1999) ;XX ;CC ATCOPIA8A is a copia-like retrovirus; its ORF encoding copia-like ;CC polyprotein is not interrupted by any stop-codons; its LTRs are ;CC identical. We presume that ATCOPIA8 may be an active retroelement [3]. ;CC ATCOPIA8A is 80% identical on the DNA level with ATCOPIA8B element. ;CC Since ATCOPIA8B is flanked by 99% identical LTRs and has preserved ;CC almost perfect ORF, we may say that both divergent forms, ATCOPIA8A ;CC and ATCOPIA8B have infected A.thaliana approximately at the same ;CC time [3]. Originally reported [1-2] LTR, internal region and 6 bp- ;CC long target site are not correct [3]. ATCOPIA8, as a copia-like ;CC retroelement, has 5 bp-long target site [3]. ;CC 343 bp-long 3' tail of ATCOPIA8AI is a perfect copy of the sequence ;CC followed by the ATCOPIA8A provirus. Based on the known LTRs from ;CC other ATCOPIAs, we consider this sequence as a part of the internal ;CC region [3]. ;XX ;DR Positions 64373 59887 Accession No AC005171 GenBank (rel. 109.0) ;XX ;SQ Sequence 4487 BP; 1239 A; 1046 C; 876 G; 1326 T; 0 other; ATCOPIA8AI tggtatcagagctcatggaacatagatccatggaactctactctgttccttctctcaatatttcaaattg tgtcaccgtcactcttactgccaagaactatattctctggaaatctcagttcgaatcttttcttgatggt caagggcttctgggttttgtcacaggctcgattcctgcaccaagccaaaccagcgttgtttcagacattg atgggtcaacatcggcttcacccaatcctgagtactacacctggttcaagacagatcgagttgtcaaatc ctggctccttggttctttcttagaagatattctcagtgttgtggtcaactgcaacacttctcatgaggta tggatctctgtagcaaaccactttaatagggtttcctcctctaggttatttgagctgcaaagacgcctcc aaaatgttagcaaacgtgataaatctatggatgaatatcttaaggaccttaagactatttgtgaccaact agcctctgtaggaagtcctgtaacagaaaagatgaagatttttgctgctttgaatgggttagggcgagag tatgagcctatcaaaaccactattgaaaactccatggatgctctgcctggtcctagtctagaggatgtta ttcctaagcttacaggctatgacgatcgacttcagggttatcttgaagagacagcagtttcacctcatgt cgccttcaacatcaccacttcagatgactctaatgcttctggttacttcaatgcttacaatcgtggcaaa gggaaatctaacagaggcagaaactcatttagtactcgtggtcgtggtttccatcagcagatatcttcaa caaacagttcttcaggttctcagtctggtggtacttcggttgtctgtcagatttgcgggaaaatgggtca tccagctcttaagtgttggcatcgcttcaacaacagctaccaatacgaagagcttcctcgtgccttagct gcaatgcgcatcactgatattacagatcaacatggcaatgaatggcttccagattctgctgcaactgctc atgtcacaaacagtcctcgatccctgcaacaatctcaaccataccatggatctgatgctgtcatggttgc tgatggtaattttctgcccattacacacactggctcaacaaacttggcttcctcatcaggtaatgtccct ctcactgatgttcttgtttgcccaagcataacgaaatctctcttgtctgtgtctaagcttactcaagatt atccatgtactgttgagtttgactctgatggtgtgcgtatcaatgataaggcaaccaagaagcttctcat aatgggaagcacttgtgatggtttatattgtctgaaggatgactctcagttcaaggctttcttctccact cgtcagcagtccgcaagtgatgaagtgtggcacagacgccttggacatcctcatcctcaagtcctgcagc aactggtcaagaccaactctatctctatcaataagacttctaagtcactctgtgaagcatgtcagcttgg gaagagcactaggctgccatttgtttcttcttcatttacttcaaatagacctcttgagagggttcactgt gacttgtggggaccctctccaattacttctgtacaaggctttagatattatgcagtctttattgatcatt attctcgattcagttggatttatcctttgaagctcaagtcagatttctacaatatctttgttgcatttca caagctagttgaaaaccaacttaatcataagattagtgtgtttcaatgtgatggtggtggagagtttgtc aatcataagtttctgcaacatcttcagaatcatgggatacaacaacacatctcatatcctcacactcctc aacaaaatggactagcagaaaggaagcacaggcatttggtggaattaggcttatctatgctgtttcagag taaagtccctcttaagttttgggttgaagccttcttcactgccaactttctgattaatcttctccctaca tctgctgttgaggatgctatttcaccttatgaaaagctgcatcagacgactccggactatacagctctca gatctttcggttgtgcttgttttccaactatgcgtgactatgctatgaacaaatttgaccctcgctctct taagtgcgttttcctggggtacaatgacaaatacaaggggtataggtgtttatatccacctacaggacgg gtttatataagcaggcatgtgatttttgatgaaacagcttatcctttctctcatcactataaacaccttc attcgcagcctacaactccattacttgcagcatggttcaaggggtttgaatcctccgtgtctcaggcacc accaaaagtgtctccagcacaaccaccacagagaaaggcaacactacccacgcctcctctttttactgct gctgattttcctcctttaccacggagaagccctcagttgtctcagaattctgctgctgcacttgtgtctc aaccttcaacaacaacaatcaattcaactcatccacctgctgtggtgaatgagagttctgagcgtacgat aaacttcgattctgcttctattggcgacagctctcactcatcccagcttttggtggatgacactgtagaa gatctcatggcagctccagttcctactcaacaagctccacctcctactaacactcacccaatgatcacaa gagctaaggtgggaatcacaaaaccaaatcctcgttatgtttttctgtctcacaaggttacttatcctga gccaaagacagtaactgcggctttaaagcatccgggttggacaggcgccatgacagaagaaatgggcaac tgttctgaaactaatacatggtccctggtgccatacacacctaacatgcatgttcttggaagcaaatggg tcttccggactaaactccacgctgatggaaccttaaataagctcaaggctcggatagttgcaaaatgttt tcttcaagaagaagggattggctatcttgagacctacagtcctgtagtaagaacacctacagtccaattg gttctccatttggctactgctttgaactgggagttgaagcaaatggatgttaaaaacgccttcttacatg gcgatctaaatgagactgtttacatgactcagcctgctggttttgttgataagagtaaaccaactcatgt ttgcttgcttcacaaatctatctatggtttaaaacagtccccaagagcttggtttgacaagttcagcacc tttctcttggagtttggatttttctgcagtaaatctgacccttcactattcatctatgctcataataata acctcattctgcttcttctttatgttgatgacatggtgattacaggaaatagttcacagacattatccag tcttctagcagctctcaacaaagaattcagaatgactgatatgggacaactccactacttcctgggaatt caggttcagcgaaatcaacacggcctgtttatgtctcagcagaagtatgctgaagatctcttggtggctt ctgcaatggagaactgcactcctctgccaactcctctacccgttcagcttgacagagttccacaccaaga agaacctttcactgatccaacttatttcaggagtattgctggaaagctccaatatctcaccttgactcgc cctgacatacattttgctgtaaacttcgtgtgccaaaagatgcaccagccaacaatgtcagactttcatc ttctgaagcggattctaaggtacataaaaggtaccatcactatgggaatctcttacaatcaaaattctcc tactcttttgcaagcttacagtgacagtgactggggcaactgtaagctcacaagacgctctgttggtggc ctctgcacctttatggccacgaacctggtgtcatggtcgtcaaagaaacatccaactgtctcccgaagct ccacagaagctgaataccgcaccttatctgatgctgcctctgagatcctctggctgagcactcttctccg tgagcttggcattcctctcccagatactcctgaattgttttgtgacaacttgtctgcagtctaccataca gctaatccagcgtttcatgcaatgattgataaagcaacaaaaagtaccaactttcgagtgctatatccaa atggtgacagtattttgacgtggcatgaaaatgcataacaagacagcacagaaaaattaatatttatact cagagacaaagaagatgatgcaaggttcatgagtttacccggagtaaccggaacatcagagaatatgaga tggatctttggaggagaatcgaaattaaatctacgaaatgaaaagagtcaaaaatgtcagaattttgtga atatataatatcagaaaggaacatttttcatttctttacagtaaagaagatgaatcatgaaacaagactt tgggaat1