;ID ATCOPIA58_I DNA ; ATH ; 4801 BP ;XX ;DE Internal region of the ATCOPIA58 copia-like LTR-retrotransposon - ;DE a consensus sequence. ;XX ;AC . ;XX ;DT 01-OCT-2001 (Rel. 6.2, Created) ;DT 01-OCT-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; gag, reverse transcriptase; integrase; ;KW ribonuclease H; ATCOPIA58LTR; ATCOPIA58_I. ;XX ;OS consensus ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4801) ;RA Kapitonov,V.V. and Jurka,J. ;RL Repbase Reports 1:(1) p. 18 (2001) ;XX ;CC ATCOPIA58_I is a consensus sequence of an internal region of the ;CC ATCOPIA58 copia-like endogenous retrovirus. There are 4 copies of ;CC ATCOPIA58_I present in the genome; they are 98% identical to the ;CC consensus sequence, and are flanked by ~1% divergent ATCOPIA58LTR ;CC LTRs and 5-bp target-site duplications. ;CC ATCOPIA58_I encodes the 1021-aa ATCOPIA58p copia-like polyprotein. ;CC ATCOPIA58p: ;CC MDYPKEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGEDVLKTEDQWNDA ;CC EEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKRSRIDMLASQFENLSMEET ;CC ENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFESKRTAMGTSLDTDSIDFEEVVGMLQAYE ;CC LEITSGKGGYSKGLALAASAKKNEIQELKDTMSMKNEIQELKDTISMMAKDFSRAMRRVEKKGFGRNQGT ;CC DRYRDRSSKRDEIQCHECQGYGHIKAECPSLKRKDLKCSECKGLGHTKFDCVGSKSKPDRSCSSESESDS ;CC NDGDSEDYIKGFVSFVGIIEEKDESSDSEADGEDEDNSADEDSDIEKDVNINEEFRKLYDSWLMLSKEKV ;CC AWLEEKLKVQELTEKLKGELTAANQKNSELTQKCSVAEEKNRELSQELSDTRKKIHMLNSGTKDLDSILA ;CC AGRVGKSNFGLGYNGAGSGTKTNFVRSEAAAPTKSQTGFRSNYDAVPARRVYQNHDHYHSRRTVTGYECY ;CC YCGRHGHIQRYCYRYAARLNKLKRQGKLYPYQGRTSKMYVRREDLYCHVAYTSIEEGIKKPWYFDSGASR ;CC HMTGSQSNLENYTSVKESKVTFGGGDKGKIKGKGDLTKAEKPQLTNVYFVEGLTANLISVSQLCDEGLTV ;CC SFNSVKCWATNEKNQNTLTGVRTGNNCYMWEEPKECLRAEKEDPVVWHQRLGHMNARSMSEIVSKEMVRG ;CC VQELKHIEKIVYDAYNQGKQIRVQHKRVVGVVERKNQTFQEMARAMIHGHGVPEKFWTEAISTACYVINH ;CC VYVRIGGTFDKLVKAFVKTMTTEFRLSMVGELKYFLGLQINQIDEGIAISQSTYAQNLVKRFDMCSSNPV ;CC ETPMSTTNLCSCCTKILWMKHMGLDYGMSFSDPLLVKCDNESAIAISKNPVQHSITKHIAIRHHFVRELV ;CC EEKQITVEHVPTEIQLADIFTKPLDLNMFVNLQKSLGIGEV ;CC Presumably, ATCOPIA58 was a semi-autonomous endogenous retrovirus. ;CC It is ~85% identical to ENDOVIR1. ;XX ;DR [1] (Consensus) ;XX ;SQ Sequence 4801 BP; 1598 A; 755 C; 1186 G; 1262 T; 0 other; ATCOPIA58_I tttggtatcagagcgggcatctgaaccaagttgtacttaacaacaggtgcagatcctgcggagaggatgg actaccccaaagagttcgttgcggtcggtaaagcaatcatgttggaaaaaggaaattacggacactggaa agtgaagatgagagctctcatacgtggtctaggaaaggaagcctggattgctacgagcattggatggaag gctccggtcatcaagggagaagatggagaagatgtgctaaaaactgaagatcaatggaatgatgcagaag aggcaaaggccacagctaattcaagggcactgtctttgatattcaactccgtgaatcaaaatcaattcaa gcggattcaaaactgtgaatcagctaaagaagcatgggataaacttgctaaagcatatgaagggacaagt agtgtcaaaagatccagaatcgacatgttagcatctcagtttgaaaatctcagtatggaagaaacagaga acattgaggagttcagtgggaaaatcagtgccatagcgagtgaagcacacaatctaggaaagaaatacaa agataagaaactggtcaagaaactgttgaggtgtctcccatcaaggtttgaaagtaagagaacggccatg ggaacgtcgttggacactgactcaatcgattttgaagaagtagtgggaatgctccaagcatatgaattag aaatcacttccggaaagggaggttactccaaaggacttgctttagctgcatcggcaaagaagaatgagat acaggaattgaaggatacaatgagcatgaagaatgagatacaggaattgaaggatacaataagcatgatg gcaaaagacttcagtagagcaatgaggagagttgagaagaaaggattcggaagaaatcagggaactgata gatatcgagaccgaagttcaaaaagggatgagattcaatgtcatgaatgtcaaggatacggacatattaa agctgaatgtccctccttaaagagaaaagatctcaagtgctctgagtgtaagggtcttggacacactaag ttcgactgtgttggatcaaagtctaagcctgatagatcctgcagttctgaaagtgaaagtgactcaaatg atggagactcggaagattatataaaaggtttcgtgtcttttgtaggaatcattgaagaaaaagatgaaag ttcagacagtgaagcagatggtgaggatgaggacaactcagctgatgaggattctgacatcgaaaaggac gttaacatcaatgaagagttcaggaaactgtatgacagctggttgatgctgagtaaagagaaagttgcct ggctggaagagaagctaaaagttcaagaactgacagaaaagctgaaaggagagttaactgctgcaaatca gaagaactctgagctgactcagaaatgcagtgtggctgaagagaaaaacagagaactttctcaagagctt agtgacactcgcaagaagatccacatgctgaacagtggaacaaaagatttggatagtatacttgctgctg gaagagtgggaaaatcaaattttggtttaggatacaatggtgctggatcaggtacaaagacgaattttgt acgaagcgaagctgctgctccaacaaaaagtcaaacaggttttcgaagcaactatgatgctgttccagca agacgcgtgtaccagaatcacgatcactatcattcccggagaactgtgacaggttacgaatgttactact gtggaagacatggtcatattcagagatattgctacaggtatgctgctaggttgaataagctgaagagaca aggaaaactatatccatatcaaggaagaacctccaagatgtatgtcagaagggaggatctctattgtcat gtagcatacacctcgattgaagaaggaataaagaaaccatggtattttgacagtggagcatccagacata tgacaggaagtcaatccaatcttgaaaattacacctctgtcaaggaaagtaaagttacttttggaggtgg ggataaaggaaaaatcaagggaaaaggtgatttgactaaagcagaaaagcctcagcttacaaatgtgtac tttgtcgaagggcttactgcaaatctgattagtgtgagtcagctatgtgatgaagggctgactgtgagtt tcaatagtgtaaaatgctgggctacaaacgagaagaaccaaaacactctcactggagttagaactgggaa caattgctacatgtgggaagaacctaaagagtgtcttagagctgaaaaagaggatccagtggtatggcat caacgtcttggtcacatgaatgcgaggagcatgtcagaaatagtgagcaaggaaatggttagaggagtac aagagctgaaacacatagagaaaattgtgtacgatgcctacaatcaaggtaaacagattagagtccaaca caagagagttgtaggtgttgttgagagaaagaaccagacttttcaagagatggccagagccatgattcat ggacatggagttcctgaaaaattctggacagaggctatcagtacagcatgttatgtgataaatcatgttt atgtgaggattggaggcacattcgacaagttggtcaaagcgtttgtgaagacaatgacaactgagttcag gttgagtatggtaggcgaacttaagtactttctggggttgcaaatcaatcagattgatgaagggattgct atctcgcaaagcacctatgctcagaacctggtgaaacgcttcgatatgtgttccagcaatccagttgaaa ctcctatgagcactaccaacctctgcagctgctgtactaaaattctttggatgaaacacatgggtttgga ttacggtatgtcattttctgaccctttacttgttaaatgtgataatgaaagtgctattgccatatctaag aatccggtacaacactcaatcactaaacacatagctataagacatcattttgttagagaattagttgaag aaaaacaaattaccgtagaacatgtgcctactgaaattcaacttgctgatattttcactaagcctttgga cttgaacatgtttgtgaacttgcaaaagtccctgggtattggtgaagtctaactatcttcttgatgagtg tttgtgctgaaacagggttggttgtacagtgagaagcaagttttcatatctattttacaaagtgtcataa ccggttttcaaatcttctgctgtaaaaagttcaccaagagaaagagctacccttcatggttcggagatgc aaagtcagtaaagtgtgtgacagacatagtgtgtagagcattaaggaagggatgaaatgcagaaaggggc agaaaagaagcatttcatttcataagatgaagcatgagcttatctaagacacgggggacaaaggagaaat cttgtcactaacccgatgggaacaaaggagaaatcttgttcacctacactttggagagaagggatgtcaa acaaaaatattgagagaacaagtaagaaagagcatcactacactctgcaacaagacaagagaaaaaaaaa aaaaataaaaaaaaaaaaggggggtgcaaaaacaagttctctcactgacatctcgtctctctgcagaaag gagcagtaagaagcaacaataagttctgaatacaaaagatcagacttgtaatttatcaatagcctgaatg ttgctgcacagtctgtcacactccttgttaaaaggactcgctaacactctgagatatgacggtgtgaaga ctactcttggagaatcaacattctcttttgtgttgtgtcagcagattatgttctgaatttttttgaatca agttacaatgagcttagtattgtacaatcacctctgggacagttctggacctcagtatgagggagagatg gttattactgtgctgatactaatggggacttgttctgtgtatatcaagatggcttcagaaaaatgagttg actttcagaggtacttagttcaattctataagctaattgggttgttcggttttgttttatggtggttatt gtgttttatgagatttgaccggaccaaaatccggttagcactgatttttggttttgagtttctatggttt gcataaagggatctatcgaatgaaacaagtgagaaaacccagaaaattgagctctcacgccgaaacacta gcgatgggagaacacgcagcgatgcccggcgtatgcgaaagatgtttaatccagagtatgtgtttggcac atccttacctgtagatttgagatatgtctctaccttgctgacgacaataacgacacatatggaaggtatg gtgtcttggtcctgtgctcagtgtctggtttaattggtagagtgtctagatagagtttttgatcaattgt gtcatgcaagcagggggagattatggactggtccctgagaactctaattcatttgtaaactctctagatg ttgataacgaagaaggctttgaagatactcaatcttcttagtgtgtttagatctagttgtgggggagttt aggattaagggggagtttttaagctgtgtttgttaatcatgtccttttggttttagtaggtctttgtttc taagtactggatgatgtgttttgaatcttaaaaacttatttatgcaatctatgatgagaccctctattgg ttatttccgctgcttattatctctatgacttgatgtgtgcttgtgagttttctttgaggggtttcttgca ggattgcttgaattgacaaattgtatcaaaaagggggagat1