;ID ATCOPIA55_I DNA ; ATH ; 4424 BP ;XX ;DE Internal region of ATCOPIA55 copia-like LTR-retrotransposon. ;XX ;AC AL161511 ;XX ;DT 01-OCT-2001 (Rel. 6.2, Created) ;DT 01-OCT-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; ATCOPIA55LTR; ;KW ATCOPIA55_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4424) ;RA Kapitonov,V.V. and Jurka,J. ;RL Repbase Reports 1:(1) p. 12 (2001) ;XX ;CC ATCOPIA55_I is an internal region of the ATCOPIA55 copia-like ;CC endogenous retrovirus flanked by 1% divergent ATCOPIA55LTR ;CC LTRs and a 5-bp target-site duplication. ;CC ATCOPIA55_I encodes the 1421-aa ATCOPIA55p copia-like polyprotein. ;CC ATCOPIA55p: ;CC MEQPIELYSQPLLNISNCVTVKLNGRNYLLWKTQFESFLSGQGLLGFVTGALKPPDPVLATPLTAEAAAV ;CC ETVNPAYLSWVKSDQVVRSWLLGSLSEDILSEVVNTTTSQEVWLALAKHFNRVSSSRLFELQRKLQTIEK ;CC RDRSMSDYLKEIKSICEQLASVGSPVNEKMKIFAALHGLGREYEPIKTSIEGSMDTVPTTFEDISPRLTG ;CC FDDRLLAYTDAASITPHLAFNTQRYDSTTYYNKGRGSSSQKSKGRGGYTTQGRGFHQQISSGSSVSSGQS ;CC VERPVCQICGKIGHPALKCWHRFDNAYQHEDMPTALAALRITDVTDQAGSEWCADSAATAHVTSSPHHLQ ;CC QSRAYSGSDTVMVGDGNFLPITHTGSALLPTTSGTLPLLDVLVVPDIAKSLLSVSKLTTDYPCTLEFDAN ;CC GVIVKDKVTKRLLTLGQNKNGLYTLKDPPVQAFYSSRQQAASDEVWHRRLGHPNSKILQQLVSTKAIIIN ;CC KSTNRMCESCQIGKSSRLSFSDSQFVATRLLERVHCDLWGPSPVLSNQGFKYYVIFIDHWSRYCWFYPLK ;CC CKADFYITFCKFQKFVETQFNQKISTFQCDGGGEFISHRFLKHLEESGIQQSISCPYTPQQNRLAERKHR ;CC HITELGLSMLFSAKLPQKVWVEAFFTSNFLSNILPTTTLPNQMSPFERLHGHQPEYSALRTFGCSCFPTL ;CC RNYASNKFDPRSLKCVFLGYNDRYKGYRCIYPPTGRVYISRHVIFDESSFPFQDTYLHLQNLGSTKLLEA ;CC WQQNFMPSQKNQSETQAASVFSEDDFPPLPVTRVQVSPPNVTPQAAQSTVQREEQPADTDIQSNSPRNQA ;CC ESPALVDRECIERTTGSDPASIGDNALSPQDSATQRSPVQSTETAGTSDQNQRTEAAVDPVQQVHPMVTR ;CC SKKGVVKPNPRYVLLTQKASHPEPKTVTQALKHEGWKGAMGEEIDTCVETNTFSLVPYTPDMNVLGSKWV ;CC FRTKINADGSLNKLKARLVAKGYHQEEGIDYLETYSPVVRTATVRLVLHIATVMEWNLKQLDVKNAFLHG ;CC DLNETVFMHQPAGFVDKTKPNHVWHLHKSIYGLKQSPRAWYDKFTNYLLEFGFVCSIQDPSLFFYEQGRD ;CC VLILLLYVDDIVLTGSNNILMDRLLQEMSKEFRMTDMGSLQYFLGIQAQNSDQGLFLSQQKYAEDLLQVA ;CC GMIDCAPMPTPLPVQLHKVPKQNELFSNSTYFRSLAGKLQYLTLTRPDIQFSVNFVCQKMHAPTTADYNL ;CC LKRILRYVKGTITMGLLFNKNTDFTLRTYTDGDYSQHSKQKKSATNNDAVFKLRAFSDSDEKQDVLQEDS ;CC VPFLATISSPGRRRSNQLSPRAQQKPSIKPCQIQLLKSSGSITCSEISTFHNLIHRSSMETTFPPSILLQ ;CC TRYFTHALNTFKLTIILLEKG ;XX ;DR Positions 190949 195248 Accession No AL161511 GenBank (rel. 124.0) ;XX ;SQ Sequence 4424 BP; 1304 A; 1013 C; 898 G; 1209 T; 0 other; ATCOPIA55_I tggtatcagagccatggagcaacctattgagctctattctcaaccactcttaaacatttcaaattgtgtt actgtcaaactaaatggaaggaactatcttctgtggaaaacacagtttgaatcgtttctctccggccaag gtttactgggtttcgtcaccggcgctctcaaaccaccagatcctgttcttgcgactccactcaccgctga agctgcagctgtggagacagtgaaccctgcgtatctctcttgggtgaaatctgatcaagtggtccggtca tggcttcttggatctctgtctgaagacattctctctgaagtcgtcaacacaaccacgtctcaggaggtat ggctagctctagcaaaacatttcaatcgtgtttcttcttcacgcttgtttgaactacaaagaaagttaca aaccattgaaaagcgtgacagatccatgagtgattatttgaaagagattaagtctatctgtgagcaactt gcttctgttggcagtccagtgaatgaaaagatgaaaatttttgctgccttacatggtctaggcagagagt acgaaccgattaagacatctattgaggggtccatggatactgttcctacaacctttgaagacatctctcc tcgtcttactggttttgatgatcgtcttttggcttacactgacgctgcaagcatcactcctcatcttgca ttcaatacacagcgttatgactcaacaacctactacaacaaaggcagaggcagttcatctcaaaagtcca aagggcgtggaggctatacaacacaaggaaggggatttcatcagcaaatctcctctggttcttctgtgtc ttcgggtcagtctgttgaaagaccagtgtgtcagatttgtggaaaaataggacatccggctctaaagtgt tggcatcgctttgacaatgcatatcagcatgaagatatgccaactgctctcgctgctctccgaatcactg atgtcacagatcaagcaggcagtgaatggtgtgcagactctgcagctactgctcatgttacaagctcacc tcatcacctgcagcagagtagagcttattcaggatctgacacggtcatggtaggagatgggaacttctta ccaatcactcacacagggtctgctctcttaccaacgacatcaggtactctccctcttcttgatgttttag ttgtccctgatattgcaaagtctctgttatcagtttcaaaactcacaaccgattacccatgtactcttga atttgatgctaatggggtcattgtaaaggacaaggtaacaaagaggcttctcactctgggtcaaaataag aatggtctgtacacgctgaaggatccacctgttcaagccttctattcatctagacagcaagcagcctcag atgaagtgtggcatagacgtcttggacatccgaatagtaagatcctgcagcagttagtcagtactaaagc tatcatcatcaataagagcaccaataggatgtgtgaatcatgtcagattgggaagagtagtagactttct ttttcagattctcagtttgttgcaactagactactagagagagttcattgtgatctttggggaccctctc cagttttgtcaaatcaggggtttaagtactatgtaatcttcattgaccattggtctcgttattgctggtt ttatcctttgaaatgcaaggctgatttctacattactttctgcaagttccaaaagtttgttgaaacacag tttaatcaaaagatcagtacctttcaatgtgatggagggggtgaatttataagccatagatttctcaaac atttagaggaaagtggtatacaacagtcaatatcgtgtccttacacgcctcagcaaaatagacttgctga gaggaagcacagacacatcacagagcttgggctgtcaatgctgttctcagctaagctgccacaaaaagtt tgggtggaagcgttcttcacttcaaatttcctgagcaacattcttcctacaactactctaccaaatcaga tgagtccatttgagagattacatggccatcaaccggaatattcagctttaagaacctttggctgcagttg ttttcccactctaagaaactatgcatcaaataagtttgaccctcgttctcttaagtgcgtgttcttgggc tacaatgatcgctataaaggctatagatgcatctatcctccaacaggaagagtttatattagccgccatg tgatcttcgatgagtcttcttttcctttccaagatacctatcttcacctgcagaacttgggatcaacaaa gcttcttgaagcgtggcaacagaatttcatgccttctcaaaagaatcaaagtgaaactcaagctgcttct gtgttctctgaagacgactttcctcctctaccagtcacacgggttcaagtttcaccaccaaatgtcacac ctcaagctgctcagtccacagtacaacgagaagaacaacctgcagatacagacattcaatcaaactcacc aagaaatcaagccgagtcaccggctcttgtggacagagagtgcattgagcgtacgacaggctcagatcct gcttctataggcgacaacgctctcagtccacaagacagtgccactcaacgttctcctgttcagtcaacag aaacagctggaacttcagatcaaaatcagaggacagaagctgcagttgatccggttcagcaagttcaccc aatggtaacaagatcaaagaagggagtagtcaaaccaaaccccagatacgtccttctaacacagaaagca tcacatccagaaccaaaaactgtgacacaagcactgaaacatgaaggctggaaaggtgctatgggcgaag aaattgacacttgtgttgaaaccaacactttttctttagtcccatacacacctgacatgaatgttttagg aagtaaatgggtgttcagaaccaaaataaatgctgatggcagtttgaacaagttgaaagctagactagtg gctaaaggatatcaccaagaagaaggaatagactacttggagacctacagtccagttgtgagaacagcca cagtgagacttgtcttacatatagcaacagtgatggaatggaatctgaaacagttggatgtgaagaatgc tttcttacatggagacttaaatgaaacagtctttatgcatcaaccagctggatttgtggataagacaaaa ccaaatcatgtttggcatctccacaaatctatatacgggttaaaacaatctccccgagcctggtatgata agtttactaactacttgttggagtttggttttgtttgcagcatacaagatccatcactattcttctatga acaaggacgagatgtgctcattctacttttgtatgtagatgatatagtcctaaccggtagcaacaacatt ctcatggatagacttctgcaggaaatgagcaaggagtttcgaatgactgacatgggatctctgcaatact ttctcgggattcaagcacagaactctgaccaaggcttgttcttatctcaacagaagtatgctgaggatct tctacaagtcgcaggaatgatcgattgtgcaccaatgcctactcctttgccagttcaacttcacaaagtt cctaaacaaaatgagctattctcaaactccacttacttccgcagtttggctggcaagcttcagtatctga cattgactaggccagatattcagttttcagtaaacttcgtatgtcaaaagatgcacgctccaacaacagc tgattacaatctgcttaagaggatccttaggtatgtaaagggaaccataaccatggggttactcttcaac aagaacacagacttcactcttcgaacctacactgacggtgactatagtcaacactcaaagcaaaagaagt ctgctacaaataatgatgcagtcttcaagcttcgagccttcagtgatagtgatgagaaacaagacgttct acaggaggattctgtacctttcttggcaacaatatcatctcctggtcgtcgaagaagcaaccaactgtct ccaagagctcaacagaagccgagtataaagccttgtcagatacaacttctgaaatcatctggctcaataa catgctcagagatctccacattccacaacctgatccaccggagctctatggagacaacctttcctccatc tatcttgctgcaaacccggtacttcacacacgctctaaacactttcaaactcactatcattttgttagag aaagggtagcgttgggttcgttgattgtcaagcatgtgccatcccaccagcagttggctgatatattcac caagccattgcccttcgatgctttcacttcgctaaggtacaaactgggtgtagatttgccacccacacca agtttgcgggggag1