;ID ATCOPIA10_I DNA ; ATH ; 4220 BP
;XX
;DE Internal region of ATCOPIA10 copia-like LTR-retrotransposon -
;DE a consensus sequence.
;XX
;AC .
;XX
;DT 03-MAY-1999 (Rel. 3.3, Created)
;DT 16-DEC-1999 (Rel. 4.0, Last updated, Version 2)
;XX
;KW LTR-retrotransposon; COPIA superfamily; internal region;
;KW ATSASHA4LTR; ATSASHA4I; ATCOPIA10_LTR; ATCOPIA10_I.
;XX
;OS consensus
;XX
;OC Arabidopsis thaliana
;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN [1] (bases 1 to 4220)
;RA Kapitonov,V.V. and Jurka,J.
;RL Direct submission (April 1999)
;XX
;RN [2]
;RA Kapitonov,V.V. and Jurka,J.
;RT Molecular paleontology of transposable elements from
;RT Arabidopsis thaliana.
;RL Genetica 107 (1-3), 27-37 (1999)
;XX
;CC ATCOPIA10_I is a consensus sequence of internal region of a recently
;CC active ATCOPIA10 copia-like retroelement. The consensus sequence has
;CC been reconstructed based on two provirus sequences found on
;CC chromosomes 4 and 5. Internal sequences of these proviruses are
;CC 94% identical to each other and they are flanked by 5 bp-long
;CC target-site duplications. Interestingly, there is only 85% identity
;CC between LTRs flanking different proviruses although every provirus
;CC is flanked by 97% identical LTRs.
;CC ATCOPIA10_I ORF (position 53-4138) codes for 1361 aa-long
;CC copia-like polyprotein.
;CC The name has been changed from ATSASHA4I to ATCOPIA10_I (Dec.16, 1999).
;XX ;DR [1] (consensus) ;XX ;SQ Sequence 4220 BP; 1393 A; 649 C; 1055 G; 1123 T; 0 other; ATCOPIA10_I gattggtatcagagcccaggctttgggtgtttcgttcacatcgcatcggtcgatggcaacgacgacgaga gtggagatcaaagccttcgacggcgataacaatttctcgttatggaagatcaggataatggcacagctcg gagttcttggtttgaaaggaactctaactgactttgctttgacaaagactgagacattaacaaagagtga ggagaagcaagtagcttctggagatgaatcatcggattcaagtgctgtgttgactaaagaagttccagat ccgatcaagattgaaaaatcagaacaagcgatgaacattatcatcaatcacataagtgacacagtcttga gaaaagtaaatcactgtaagactgcagctacattgtgggaattgttgaatgaattgtatatggagacgtt gttacctaaccggatctatgcacaattgaaattctactcgtttagaatgatgacttcaaagacgattgat caaaacgttgatgattttctgaggatagttgcagaattaggaagtcttgatatcaaggttgcagaagagg ttcaagcgatcctaatcttgaattctttgcctgttacctatgatcagttgaagcacaccttgaagtatgg aaataagaccttgtctgtgaaagatgtagtatcttcttcaaagtctcttgaaagagaaatggctgagctt aaagaaaacactaaggtggtgaatacaactctatacactgcagaaagaggcagaccacaaacccgaaatc aaaatggtagtcaaggcaacaatcaaggtaataaccaaggtaaaaatcaaggaaaaggcaaaagcaggtc gaattccaaatctcgtgtaacctgctggttctgcaagaaagagggacacgtaaagaaggattgttttgct aggaagaagaagttcgaaaatgaagaacaaggtgaggcaggtgtgattactgagaagttggtgtactcag aggcacttagcatgcatgaccaagaagctaaagagaagtgggttattgactctggatgtacctaccatat aacttcaagaatggactggttcacagatttcaatgaaaatgagtcaacactaatcttgttgggtgatgat cacactgttgaatcaagaggttctggcatagttaagatcaacactcatggtggaaccataagaatgttaa agaatgtcagattcgttctaaatctgagaaggaacttgatctctacaggtactttggataagttgggctt taagcatgaaggtggagacggtaagatcaggttttacaaagaaaacaaaacagctttgcgtggaaatttg gttaatggactgtatgtccttgatggtcacacagttctgaatgaaagctgtaatgctgaagggtcaacaa aaaggacaagtttgtggcattgcagacttggtcacatgagtgtgaataacatgaagattctgactgagaa aggtttgattgaaaagaaggatatcaaagagctgggtttctgtgaacactgtgttatgggaaagtcaaag aagttgagttttaatgtgggaaagcacaatactgaggatctactaggatacctacatgcagatctttggg gatctccaaatgtcaccccatctctctctggtaagcagtactttctgtctataatagatgacaagactcg taaggtgtggttgatgttcttgaaaaccaaagatgagacatttgataagttctgtgaatggaaagagttg gtggagaatcaggttaacaagaagatcaaagtgttgagaacagacaacggcttagaattctgcaatctga 
agtttgatgagtattgtaagaagaatggtattgagagacatcgaacttgtacctatacaccacagcaaaa tggtgttgcagaaaggatgaatcgaactctcatggaaaaggtgagatgtcttcttaatgaatcaggtcta gatgaaagtttttgggctgaagcagcctcaactacagcctatttggtgaaccgatcacctgcatcagtag tagaccacaacgttcctgaagaattgtggctaggcaagaaaccaggttataaacacctaagaaggtttgg ctcgattgcttatgttcatcaagaccaagggaaattaaagcctagagctttaaaaggcgtttttctgggt tatccacaaggagttaaaggctacaaagtgtggttgttggatgaagagaaatgcgtcattagtcgaaatg ttgtatttgatgaagattcagtctacaagagtctgctacctgaaagtgataaagaacagattgatgggaa actcagtaaagagactaccgttactgtgaatgacagtgttaaagaaaaaggagaaagttctgcttcaggt ggagctattgaggaaatcagtgacagtagtgactcagaggttgctgctacagaagaagactcacccatac agactgtaaatctcgaaaactaccagctagctcgagacagaacccgaagggttactagaccacctactaa gctgtcagactatacccattttgcttatgcgttagtaatggcagaagaacttggtgaagaagaagaacct caatgctatcatgatgcacaaaatgataaagactgggagaaatggaatggtgggatgtctgaggaaatgg attcattactgaaaaatgaaacctgggatattgttgataggccaaaggatcaacatgttattagctgcag atggctatacaagataaaaccaggaattctaggtgttgagtcaaagagatacaaggccaggctagttgca agaggtttcactcagaagaaagggattgactatgaagaggtatttgctccagtggttaaacacatttcca ttagaattctaatgtctattgtagttgcagatgacatggaattagagcaaatggatgttaagacggcttt actgcatggagagcttgatcaagtgctatatatggagcagcctgagggatttgaagcagatccaaacaaa gatcaagtgtgtttgttgaagaagtcactctatggcctgaaacaagcacctagacaatggaacaagaagt tcaatgctttcatgatggatcaaggctttactaggagcttacatgattcgtgtgtatacgtcaaagaggt catccctgatcagtttgtgtatctactgttttatgtagacgatatgttgatagcaggaaagagtatggct gaggtcaataaggtcaaagaaggattgagtttacattttgagatgaaagatatgggtgcagcgagtagga tactgggaattgatattgaaaggaatagagaggaaggaactttgtgcttatctcagtcaaagtacttaga gaaggtcattcaacgttttagaatggcagatgcaaaaggtgtgagtactcctattggtgctcatttcaag ttgtcagcagtcaggaacaatgatgagagtgttgacacagaagtttgtccttactcaagtgtagtaggaa gtgttatgtatgctatgatagggaataggccagatgtagcgtatgctctcggattggtgagcaggtttat gagtaacccaggtcatatgcattgggaatcagttaagtggttactaagatatcttaaaaggtcaatggac ctgaagttggtttacacaaaaggaaaagacatgaagatacatggtttctgtgactcggattatgctgcag acctcgataagagaaggtccataagtggatatgttttcacggttggtgggaacactgtgagttggaagtc 
aagcttgcagcatgttgtagcattgtccactacagaagctgagtttatggcacttactgaagcagtaaaa gaagccatttggatccgaggtctcttggatgatatgggattgaagccagaagcagcttcagtgtggtgtg actctcagtcagcgatttgtctgtcgaagaacaatgcatttcatgaaagaacaaagcatattgctgtgaa tttctatttcattcgagacataatcgaagctggtgatgtagaggttgagaaaatccacacttcaaggaat cctgcagatatgcttactaaagtcatacttgtgcacaagtttgaagcagctttagatcatctaaagctcc tcaagtgatacttggaattatgattgccgaaggttaagttcaagtagtgcaattcactggagaatgttga aagattgaatcaaggtggag1