;ID ATCOPIA91_I DNA ; ATH ; 4988 BP ;XX ;DE Internal region of the ATCOPIA91 copia-like LTR-retrotransposon. ;XX ;AC AB026643 ;XX ;DT 30-NOV-2001 (Rel. 6.3, Created) ;DT 30-NOV-2001 (Rel. 6.3, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; the ATCOPIA91 ;KW family; ATCOPIA91LTR; ATCOPIA91_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4988) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal portion of the ATCOPIA91 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(3) p. 26 (2001) ;XX ;CC ATCOPIA91_I is an internal region of the ATCOPIA91 copia-like ;CC endogenous retrovirus flanked by the 93% identical ATCOPIA91LTR ;CC long terminal repeats and a 5-bp target-site duplication (TTTTT). ;CC ATCOPIA91 forms a separate family of copia-like retroviruses ;CC present in the A. thaliana genome since members of other families ;CC are less than 75% identical to ATCOPIA91_I. ;CC ATCOPIA91_I (positions 357-4973) encodes remnants of the ~1540-aa ;CC ATCOPIA91p copia-like polyprotein. The ORF which encodes ;CC ATCOPIA91p is damaged by two indels that produce false frame shifts ;CC at positions 735 and 2162 (marked by Xs in the ATCOPIA91p ;CC sequence). ;CC ATCOPIA91p: ;CC MTTSSNTNVDSSRQSIDQYENPFFLHSSDHAGLVLVFDRLTTGADFHSWRRSVRMTLNVRNKLGFIDGNI ;CC TKPSPDHHDFGSWSRCNDMVATWLMNSVCKKIGQSLLFISTAEGIWKNLLARFKQDXXAPRVYEIEQRLS ;CC IIQQGAMDVSSYYTELVTLWEEYRNYVELPVCTCGRSECNAAVLWERLQQRSRVTKFLMGLNESFESTRR ;CC QILMLKPIPTIEDAFNMVTQDERQRSIKTPSSKTVVFQASGPNQSSGQCYQDVSSYQGQMDNTAFAVQNE ;CC YRPRPPRPVCTQCGQSGHVVQKCFKIIGYPPGYIPRFKSTISNYQSQRSPAPSTFQPRGYSANAASKPHS ;CC VANIMTNPPSLYIPPPATEVNNLDINKLSGDQIQTLIQQLSGRIQTSEPLAPSPSTSAPSTVTEHGIMAV ;CC QSSAGTIPFPSSSLRFENDQLTFQHQCLSSLYTNLPHGSWIIDSGATTHVCSDLAMFNETNTVFGVTVSL ;CC PNDTRVQITHTGTIPLSHSLILQDVLHVPSFKFNLISVSSLLKTNHCSSHYYVDSCIIQEFIQGLTIGRG ;CC ILLHNLYILRLDAPSTHDQFAGSLVVDGILWHQRLGHPFADKLKCIXMHRISGTLSSSKSSFMHPFHCSI ;CC CHLAKQKRLSYESHNHLSSLSFDLVHLDVWGPFSIESLEGYKYFLTIVDDCTRVTWVYMMRNKSEVTQHF ;CC SDFIKHVLTQYKAVIKMIRTDNAPELAFKSIVKEHGMVHQFSCAYTPQQNSVVERKHQHLLNVARSLLFQ ;CC SNVPIAYWSDCLLTVVFLINRIPSVLLKNVSPYELLTKRKPAYDFLRSFGCLCYVSTLQKDRNKFSPRSE ;CC KCIFLGYSSSYKGYKVLHIDLNCVSVSRNVVFHENIFPFHDNPISPVSDVFSHAILPLPIHVNDEIHTSS ;CC SPNAHNEHSHASSASSSSTTSSQPSTSSSSIIPVLPEAVTTETACVSLSVARPKRQGKAPSYLSDYHCSF ;CC TQISEPSPQNSFPHLKVYSTPYLISSVLSYSSLKSPFQSFVLSYSAETEPKTFKQAIISVQWTKAMDEEL ;CC GAMELNKTWSVVSLPPGKNVIGCKWVYTIKYNPDGTIERYKARLVAKEFTQQEGVDYFDTFSHVAKLASV ;CC KLVLGCAAKKGWSLTQIDVSNAFLHSELDEEIYMSLPQGYTPSSGFLPSHHVCRLHKSIYGLKQTSRQWY ;CC KCLSKTLLDAAFIQSQSDNTLFVQLQGTSFIAILVYVDDILIASNDSNQVTLIKAYMADHFKIKDLGPAR ;CC FFLGLEIARNSEGIAICQRKYCLDLLTDAGLLGCKPSTVPMDPKVNLTADMGTLLDNAKPYRKLVGRLLY ;CC LCVTRPDITFAVHRLSRFLSCDTDVHMQAAQRVLKYLKGNPGQGLFYSTNTSLCLNGFADADWGTCLDSR ;CC RSVSGVCIFLGTFLISWKYKKQDVCSSSSTEAEYRSMAVATKDLLWFSYMLKDLHIKVETKAKLFCDNKS ;CC AMHIANNPVFHERTKHVEIDCHTTRDQMKFGFLQVHHIGTENQLADILTKPLHPGPFKSLLNRLGVSNLF ;CC LPKE ;XX ;DR Positions 28125 23138 Accession No AB026643 GenBank (rel. 124.0) ;XX ;SQ Sequence 4988 BP; 1419 A; 964 C; 912 G; 1693 T; 0 other; ATCOPIA91_I tggtagcagagcacaaaagacattttctctcactataacagaaatctcagagtttcttcttccgccaaaa ctcatcttcttcttcgatttcatccaatttctatggaatcaccatcgttatcaccctaaatcttccattt cttttctttactgagctccgattcatcaatagaggtctcaaaatcaccaatcgcaatcacattcgcgatt ttcactaacagttgaagctcaagctcgaatctccttcatctttcagatcgattcgttctacatcacgatc gattcaactctaatcgtttttcggttcgtcgttctaatcaatttcagtatcatctctcgattcttgattt gtttcaatgacgacatcatccaatacaaatgtggatagctcgcgacagtctattgatcagtatgagaatc cattctttcttcacagctccgatcatgccggattggttctagttttcgatcgtctcactactggagcaga ttttcattcttggagaagatccgttcgtatgacactcaatgtacgtaacaagcttggttttattgatggt aatatcactaaaccttcacctgatcatcatgattttggatcttggtctcgttgcaatgatatggtagcaa catggttaatgaattctgtatgcaagaagattggtcaaagcttgctgtttatatctactgctgagggaat ttggaaaaatcttttagctaggttcaaacaagatggcacctagagtctatgaaatagaacagcgtttgag tattattcaacaaggtgctatggatgttagttcatattatacagaacttgtgacattatgggaagaatat agaaattatgtggaattgcctgtttgcacttgtggaagatctgagtgcaatgctgcagtgttgtgggaaa gattgcagcagaggagccgagtcactaagtttcttatgggactgaatgagtcatttgagtctacaagaag acaaattttgatgctgaagccgattccaactattgaggatgcattcaatatggtaactcaagatgaaagg cagagaagtattaagacaccttcatcaaaaactgtagtgtttcaagcttcaggacctaatcaatcttctg gtcaatgttatcaagatgtctcttcatatcaaggtcagatggataatacagcttttgcagtacaaaatga atatcgtccgagaccacctagaccagtttgcacccaatgtggtcaatctggacatgtagtacagaaatgt ttcaagattattggttatcctccagggtatattccaaggtttaagagcaccatctctaactatcaatctc agagatctcctgctccatcaacctttcagcctagaggatattctgcaaatgctgcatctaaacctcattc agttgcaaatatcatgactaatcctcctagtctatacattccaccaccagctacagaagttaacaacctt gatattaacaagctgagtggagatcaaattcagacattgattcaacaactctcaggtcgtattcaaactt cagagcctttggctccttctccatcaacatcggcaccatcaacagttacagaacatggtataatggctgt tcaatcgtctgctggtacaattccttttccttcttcatctcttcgttttgaaaatgatcaacttactttc caacatcaatgtttatcttccttatacacaaatcttcctcatggtagctggataattgatagtggtgcca ctactcatgtgtgttctgatctagcaatgtttaatgaaactaatacagtttttggggtaactgtttcttt accaaatgatactagagttcagatcacacacacaggcacaattccattatctcattcactcattttgcaa gatgtgttgcatgttccttcttttaaattcaatttgatttctgtgagtagtcttttaaaaacaaatcatt gctcatctcactactatgttgattcttgtattattcaggagtttattcagggattgacgattggtagagg catcctacttcataatctctatattttgcgacttgatgcaccatctacacatgatcaatttgctggatcc ttggtagttgatggaattctttggcaccaacgccttggacatccatttgctgacaagttaaaatgcatcg catttcgggtactctttcttcgtctaagtctagtttcatgcatccatttcattgttctatttgtcattta gcaaaacagaaaagattgtcatatgagtctcataatcatttgtcttcactttcatttgatcttgttcatt tagatgtgtggggtcctttttctattgagtctttagaaggatacaaatattttcttactatagtagatga ttgtactcgtgtaacatgggtttatatgatgagaaataaaagtgaagttacacaacatttttcagatttt attaaacatgttcttacacaatataaagctgttattaaaatgattagaactgataatgctcctgaacttg cttttaaatctattgttaaagaacatggaatggtacatcagttttcttgtgcatacacacctcaacagaa ttcagtagtagaacgtaaacatcaacatctactgaatgttgcaagatcattgttgtttcagtctaatgtc ccgattgcatattggagtgattgcttgttaacagtcgtgtttcttattaataggattccatcagttttgt tgaaaaatgtatcaccttatgaattacttacaaagagaaaacctgcttatgattttttaagatcttttgg ttgtctttgttatgtttctactttgcaaaaggataggaataagttctctcctaggtctgaaaagtgtatt ttcttgggttattcatcaagttataagggatataaagtcttgcatattgatttaaattgtgtttcagttt ctaggaatgtagtatttcatgagaacatctttcctttccatgataatccaatctcacctgtttctgatgt ttttagtcatgctatcttaccattacctattcatgttaatgatgaaatccatacatcatcatcacctaat gcacataatgaacattctcatgcatcatctgcatcatcgtcatccacaacatcatcacaaccatccactt caagctctagtatcataccggttttgccagaagcagttacaacagagacagcatgtgtttcactttctgt tgctaggccaaaacgccaaggtaaggctcctagttacttatctgactatcactgttctttcacacaaatt tctgagccttcaccacaaaattcttttccacatctcaaagtttattcaacaccatatcttatatcttcag ttctctcttattctagcttaaaatctccatttcagtcttttgttttatcttattctgctgaaacagaacc taagacttttaaacaagccattatatctgttcaatggactaaagccatggatgaggaacttggggctatg gagcttaataagacttggagtgtggtttctttgcctccgggaaagaatgtaataggttgtaaatgggttt ataccattaagtacaatccggatggtactattgagagatacaaagcaagattggtagcaaaagaatttac acaacaagaaggtgtggattattttgatacattctctcatgttgctaaactagctagtgttaagctggtt cttggttgtgctgccaagaagggttggagtttaactcaaattgacgtttctaatgccttcttgcatagtg agcttgatgaagagatctacatgagtttgccacagggatatacaccttcttctggatttcttccttctca tcatgtgtgcagattacataagtccatctatgggttgaagcaaacatcacgccaatggtacaaatgtctg tcgaagactctcttggatgccgcttttattcagtctcaatctgataacaccttgttcgtgcagctacaag gtacctctttcattgctatccttgtttatgttgatgacattctcattgctagtaatgacagtaatcaagt cactttgataaaagcatatatggctgaccatttcaaaatcaaagatctaggaccggctagattttttctt ggtcttgagattgctaggaattcagaaggaattgccatttgtcaaaggaaatattgcttggatcttctca ctgatgcaggcttgttgggttgcaaaccgagtactgttccaatggatccaaaggtgaatcttactgcaga tatgggaactctcttggataatgcaaagccatatagaaaattggttggtcgcttgttatatctctgtgtt acaaggcctgatatcacatttgcggttcatcgacttagtcgtttcctgtcatgtgatactgatgtgcata tgcaagcagctcagagagtattgaagtatcttaaaggtaatccagggcaaggtttgttttactctacaaa cacttctctttgccttaatggatttgctgatgctgattggggaacatgcttagactctcgacgttctgtt tctggtgtttgcatttttctgggaacttttttgatttcctggaaatataagaagcaggatgtgtgtagta gtagtagcacagaggctgagtatcgcagtatggcagtggctacaaaggatcttttatggttcagttatat gttgaaggatttgcatattaaagttgaaacgaaggcaaagcttttctgcgataataagtctgcgatgcat atagcaaacaatccagtttttcacgagaggaccaaacatgttgaaatcgattgtcacacaacgagagatc aaatgaagtttggattcttgcaagtgcatcacattggaacagagaatcagctggccgatatactgaccaa gccattgcatcctggccctttcaaatcacttcttaatcgtcttggagtttcaaacctcttccttcctaaa gagtgatttgtggggggc1