;ID ATCOPIA45_I DNA ; ATH ; 4211 BP
;XX
;DE Internal region of the ATCOPIA45 copia-like LTR-retrotransposon.
;XX
;AC AL161592
;XX
;DT 01-OCT-2001 (Rel. 6.2, Created)
;DT 01-OCT-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW LTR-retrotransposon; COPIA superfamily; internal region;
;KW copia-like polyprotein; ATCOPIA45LTR; ATCOPIA45_I.
;XX
;OS Arabidopsis thaliana
;XX
;OC Arabidopsis thaliana
;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN [1] (bases 1 to 4211)
;RA Kapitonov,V.V. and Jurka,J.
;RL Repbase Reports 1:(1) p. 2 (2001)
;XX
;CC ATCOPIA45_I is an internal region of the ATCOPIA45
;CC copia-like endogenous retrovirus flanked by 1% divergent
;CC ATCOPIA45LTR LTRs and a 5-bp target-site duplication.
;CC ATCOPIA45_I encodes the 1367-aa ATCOPIA45p copia-like polyprotein
;CC (it is disrupted by two stop codons that emerged after integration of
;CC ATCOPIA45 in the genome).
;CC ATCOPIA45p: ;CC MGDIVPVTNNTKEGSSSSSIQCPMLTATNYTFWTIRMTMALKVHKVWETIEEGLDDIDKNNMASALLLQS ;CC IPEALTLRVGKLKTAKKIWDAIKARNLGADRVKDARLQTLMGEFERIKMKETEKIDDFAGRLSELSTKSA ;CC DLGNDIEEPKLVKKFLNSLPRKRYIHIIAALEQVLDLNTTSFEDIVGRLKAYEERICDEEDNQDDQGKLL ;CC YANSEEKSAQNNWNPNKGKSQGGRGYGRGRGRDRFGNAQGNRDMTNVICYRCDKLGHYASDCPDRLLKLQ ;CC ETQDVKNDDTQKADALMMHEVVFLNEKNVMPNKLEASLDVDNVWYLDNGASNHMTGNLAYFGEIDERVTG ;CC KVRFGDDSRIDIKGKGSITFIAKNEERKILADVYYIPDLRSNIVSLGQATESGCDVRMRDDHLTLYDRDG ;CC KLLIKATRSRNRLYKVIMEVDDTKCLQLESLSETTKWHARLGHIGTDNLKRMMQKELVIGIPNIKVEKEM ;CC CGSCLLGKQARKPFPQATPYRATSILELLHGDLCGPITPSTVA*NRYIFVLIDDYSRYMWSLLLKEKSEA ;CC FNKFKSFKACVEQETGATIKTFRTDRGGEFVSQEFQAFCDASGIKRHLTAPYSPQQNGVVERRNRTLMEM ;CC TRSILKHMSVPNYLWGEAVRHSTYLINRVATRTLVDQTPYKVLKSKKPNVEHLRVFGCIGYAKAEAVHLR ;CC KLDDRSRMLVHLGTEPGSKGYRLLDPTRRKVIVSRDVVFDEEKRWKWNNSEDEINNVPGMFSLSFEEFGN ;CC NGIREEDDITEETEINDGENHDREAEIPTQAIETVEQ*VEPQVTLRKSVRVISKPSYLDDYVLLASIECE ;CC RLLLMINEEPWDYNEAKELQEWKKACVEEIASITKNHTWDLVDLPIGAKPIGLKWVFKLKRNSDGSVNKH ;CC KARLVAKGYVQRHGIDFDEVFAPVARIETVRLIIALAASNGWEIHHLDVKTAFLHGELKEVVYVSQPEGF ;CC VIGGSEDKVYKLNKALYGLKQAPRAWNNKLNKILMELKFTKCSKEPSLYCRRDKDELLVVVVYVDDLLVT ;CC GSNLQVILEFKEEMAKKFEMSDLGKLTYYLGIEVFQHEGGIMLKQERYANKILEETKMDDCNAVQIPMDA ;CC NLKLSKAQEEKNIDEKEYRRNIGCLRYLLHTRPDLSYCVGVLSRYMHEPKESHGAALKQILRYLRGTQSF ;CC GLCFKRMNKTELVGFSDSSHNVDEDDGRSTTGHIFYLNDCPITWCSQKQETVALSSCEAEFMAATEAAKQ ;CC AVWLQELLEEIVGKTCKQVLILIDNKSAIALTKNPVFHGRSKHIHKRYHFIRECVANEQVEVEHVPGTEQ ;CC RADILTKALGRIKFKEMRDLVGVQDMTKCSFKLKGEH ;XX ;DR Positions 14885 19095 Accession No AL161592 GenBank (rel. 
124.0) ;XX ;SQ Sequence 4211 BP; 1448 A; 686 C; 1019 G; 1058 T; 0 other; ATCOPIA45_I attggtatcagagcataatattgattcaagcatctttacgatcaaacaaaggcaatcaacgaagttagcg ttgattccgtacggagaaagaaggtaacttgtgaagaagacatgggcgatatagtaccagtaacgaacaa caccaaagaaggtagcagctcttcctctattcaatgtccgatgcttacggcaacaaactatacgttttgg acaatacgtatgacgatggctctaaaagttcacaaggtatgggaaacgattgaagaaggattagatgata tcgataaaaacaatatggcaagcgctctccttctccaatctatccctgaagcattaaccttgcgagttgg aaaacttaaaaccgcaaagaagatatgggatgcgataaaggctagaaatctaggagctgatagggtcaaa gatgcaagacttcagacattgatgggcgagtttgaaaggataaagatgaaagaaactgagaagatagatg acttcgctggaagactctcagaattgtccaccaaatcagcagatcttgggaatgatattgaagaacctaa gttggtgaaaaagtttctcaatagcctaccacggaaacgttatatacatatcattgctgcactagaacaa gttcttgatcttaatacaacaagtttcgaggatatagtcggcagactaaaggcatatgaagagagaatct gcgatgaagaagataaccaagatgatcagggaaagcttttgtatgcgaactctgaagaaaagtcagctca gaacaattggaatcccaataaaggaaaaagtcaaggtggtcgaggatacggacgaggaagaggcagagat cggtttgggaatgcacaaggcaacagagacatgacaaacgtgatttgctataggtgtgataagctaggac actatgcttccgactgtcccgatagattgcttaaacttcaagaaacacaagatgttaagaatgatgatac acaaaaggctgacgcattaatgatgcacgaagtagtgtttcttaacgagaagaatgtaatgccaaataaa cttgaggcgagcttggatgttgataatgtgtggtacttggataatggcgcgagcaaccatatgacgggaa acttggcttactttggtgagattgacgaaagagttacgggaaaggttcgttttggtgacgattctcgtat tgatatcaaaggaaaaggctcgattacgtttatagctaagaatgaagagagaaaaatcttggctgatgtc tattacattcccgatttaagaagcaacatcgtgagtcttggtcaggctaccgagtccggatgtgatgtta gaatgcgagatgaccatctaactctatatgatagagatggaaagctgctaataaaagcgacaagatccag aaatcgactttacaaggtgatcatggaagtcgatgatacaaagtgtttgcaacttgagagcttgagtgaa acgacaaagtggcatgcaagattgggtcatattggaactgacaacttgaaaagaatgatgcaaaaggaat tggtcattggcattcctaatatcaaagtcgagaaagaaatgtgtggctcatgcttgcttggtaagcaagc tagaaaaccatttccacaagcaactccatatcgtgcaactagcatactcgagctcttacatggagatctt tgtggaccaattacgccttctacagtagcataaaataggtatatctttgtcctaattgatgattattcac gttatatgtggtcacttcttctcaaggaaaagagcgaagcgttcaataagtttaaaagctttaaggcatg 
tgtggagcaagaaactggtgctaccattaaaacgtttcgaacggatagagggggagagtttgtttctcaa gagtttcaagcgttttgtgacgcctccgggattaaaaggcacttaactgcaccgtattctccacaacaaa acggagtggtcgagaggcgcaatagaacgttaatggaaatgactagaagcatcttgaaacacatgagtgt cccaaattatctatggggagaagcagtgagacattcgacttatcttataaacagagtagcgacgagaact ttggtagatcaaaccccatacaaagtcttaaagagtaagaagccaaatgtggagcatttacgggtctttg gttgcattggttacgctaaagcagaggctgtacatttgagaaagttagacgatcgctcacgaatgctagt acacctaggaacagagcctggatcaaaaggctatcgtttattagatccaacaaggagaaaggtgatagtt agcagagatgttgtatttgatgaagaaaaaaggtggaagtggaataatagtgaagatgaaattaataatg ttccaggaatgttcagtcttagttttgaagaatttggcaacaatggtataagagaggaggatgatatcac agaagaaacagagatcaatgatggagagaatcatgatcgtgaagctgaaattcctactcaagcaatagag actgtagaacaataagtagagccgcaagtcacactaagaaaatccgtgagagtgatctcaaaaccaagtt acttagacgattatgttttgttggcctcaatcgaatgtgaacggctcttacttatgataaatgaagaacc atgggactacaatgaggcaaaggaattgcaagaatggaagaaagcgtgtgtagaagagattgcgtcgata actaagaaccacacttgggatttggtggatcttccgatcggagctaaaccaattggactcaaatgggtct ttaagttaaaacggaactctgatggaagcgttaacaaacacaaagcaaggcttgtagctaagggttatgt tcaacgacacggtatcgactttgatgaagtatttgctccggttgcccgaattgaaaccgttcgacttatc attgctttagcagcttctaatggatgggaaatacatcatcttgatgtgaaaactgcatttcttcatggag aattaaaagaagtggtctacgtttcacaacctgaaggatttgtgataggaggaagtgaagataaagtgta caagctgaacaaagctctttacggactcaaacaagctcctagagcatggaataacaagttgaacaagatt ctcatggaacttaagtttactaagtgctctaaagaaccgtcattatattgtcgaagagacaaggatgagc tacttgttgtggtggtctatgtggatgacctacttgtcacgggatctaacttacaagtcatacttgagtt taaagaagagatggcaaagaagtttgaaatgagtgatcttggaaagcttacgtattacttaggtattgaa gtgtttcaacatgaaggcggcattatgcttaaacaagagaggtatgcaaacaagatcctagaagaaacaa aaatggatgattgtaatgcggttcagattccgatggatgcaaacctaaagctaagtaaagcacaagaaga gaaaaacatcgatgagaaggaatatagaagaaatattggatgtcttcgatatttgctgcacacacgtcct gacctttcttattgtgttggagtgcttagtagatatatgcacgagccaaaggaatcacatggagcagcct tgaaacagattcttaggtacttgcgaggaactcaatcttttggtctctgtttcaaacgaatgaataaaac agagctagtaggatttagtgatagcagtcacaatgttgatgaggatgatggaagaagtacgacgggtcat 
attttctatcttaatgactgtcctatcacttggtgctcgcaaaagcaagaaactgtggctctatcgtctt gtgaagctgagtttatggcggctacagaggcagcaaaacaagcggtttggcttcaagagttacttgaaga gattgttggaaaaacgtgtaagcaagtgttgatattaattgacaacaagtcggctatagcactcacgaaa aacccggttttccatggacgaagcaaacacattcacaagaggtatcactttattcgtgagtgcgttgcga atgaacaagttgaggtggagcacgtccccggaaccgaacagagagccgacattctaacaaaggcactcgg gagaatcaagttcaaagagatgagagatcttgtgggagttcaagatatgacaaagtgtagcttcaagctt aagggggaaaa