;ID ATCOPIA81_I DNA ; ATH ; 4502 BP ;XX ;DE Internal region of the ATCOPIA81 copia-like LTR-retrotransposon. ;XX ;AC AL157735 ;XX ;DT 30-NOV-2001 (Rel. 6.3, Created) ;DT 30-NOV-2001 (Rel. 6.3, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; the ATCOPIA81 ;KW family; ATCOPIA81LTR; ATCOPIA81_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4502) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal portion of the ATCOPIA81 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(3) p. 6 (2001) ;XX ;CC ATCOPIA81_I is an internal region of the ATCOPIA81 copia-like ;CC endogenous retrovirus flanked by the 98% identical ATCOPIA81LTR ;CC long terminal repeats, and a 5-bp target-site duplication (AACAT). ;CC ATCOPIA81 forms a separate family of copia-like retroviruses ;CC present in the A. thaliana genome. ;CC ATCOPIA81_I encodes remnants of the ATCOPIA81p copia-like ;CC polyprotein. The ORF which encodes ATCOPIA81p is slightly damaged ;CC by false stop codons. ;CC ATCOPIA81p: ;CC MWLKICGKHLKKRFSVMNGSRLQQLKSELACCKQRGLAIESYFGKLTRIWDNLATHRPLRVCRCGKCTCN ;CC LGAAQEADREEDKTHEFLNGFDEQFRAVRSTLVARTPIQPLEEVYNIIRQEEDLRVPVEESPSMTAFAVQ ;CC SKPRLPTDEKEKSFFCKLCNRSGHTAERCYAVIGYPEWWGDRPKGRTLQGKGRGGSTSRGGRGRGASQEV ;CC ANRVYVPNVENVTTEQANHVLTDDDRSGAHGLNDTQWKMIKSILNAGKQPSTEQQTSMSSLSPWIMDTGA ;CC SHHLTGRFETLTNVREMPPVLIIMADGREQVSYKEGSISLGSHLVMKSVYYVEELKTDLMSLGQLMDENK ;CC CVVQLADRFLVVQDRISRMVIGCGKRVGGTFHFRSTKIAASAATKEIKSFELWHNRMGHPSTQVVGKLPF ;CC VSASVSSSSLNKPCDICLRSKQTRDSFPLSMNKSSNCFELIHVDLWGPYRTPSHSGARYFLTIVDDYSRG ;CC VWLYLLTTKSEAPEQLKKFCALTERQFNTKIKRIRSDNGTEFLCLTKYFLTNGIIHETSCVATPQQNARA ;CC ERKHRHILNIARALRFQASLPIEFWGECVLTAAYLINRTPSSVLDFATPFERLFNKAPTYDHIRVFGSLC ;CC YAHDQNKSGDKFASRSKCCVFVGYPYGKKGWRLYDLEKLEFFVSRDVVFSETQFPFAPINHLQASDESKA ;CC LWAPISEFLENDDCGLRKPNPIRSVALGPVSSSQLSTPISSANDTRRSENSDGDNGGARQLPEPTAGDKI ;CC LPSTAPNPIPVAPTSRLTPAAVVMPPVPTEELLGKGKRNRTPSVRLKDFVVPHAPKPKQQEEINLVCAEN ;CC LTHNVDVHRFSETHVAYVAAVLSNLEPRSFKQAMQEEKWRNAVGSEYGTLEENNTWTIEDLPPNKKAIGS ;CC QWIFKVKFKSDGTIERYKARLVPMGNKQIEGEDYGETFSPVVKMGTVRLFLDIAVKKGWIIHQMDVHNAF ;CC LHGDLEEEVYMKLPPGFESVDKNKVCRLRKSLYGLKQAPRCWFAKLSSALLEYGFQQLRSDYSLFTYAQG ;CC TTRLNILVYVDDLVIAGSSLKATESFKAYLSSCFHMKDLGELKYFLGIEVARNASGIYLCQRKYALDIIT ;CC ETDQLGAKPAHFPLEANHKLALSESVLLHDPKPYRRLLGRLIYLGVTRPDLAFSVHVLAQFMQNPRLDHW ;CC LATLRLVRYLKSDPGQGILLRADGNFQVTGWCNADWDNCPITRCSVTGYLCS ;XX ;DR Positions 64171 59670 Accession No AL157735 GenBank (rel. 124.0) ;XX ;SQ Sequence 4502 BP; 1285 A; 900 C; 1021 G; 1296 T; 0 other; ATCOPIA81_I tggtatcagagcctgaaaagcaaaactcttgattttttttttctcgacaatgtcaactgaaagcaacacc tccgctgcgactgaagtaaggagaacaatctctccttatgacctgacttctgctgataatcctggagccg taatctctcatcctctcttgaagggaagcaactatgaagagtgggcttgtggttttcgaaccacgttgat atcacggaaaaagtatggatttctcgatggttcaatttccaaaccagaagagacttcccctgattttgaa gactggacgacgattcaagctcttcttgtgtcatggatcaaaatgacaattgaaccaactcttcgatcca ccatttctcatagagatgtggctcaagatctgtgggaaacatctgaaaaagaggttctctgtcatgaatg ggtcacgactacaacaactgaaatctgagctagcttgctgcaaacagagaggtttggctattgaatctta ctttggcaagttgactaggatctgggataacctcgccacacatcgtccgttgcgtgtttgtcgttgtggt aagtgtacctgcaatcttggtgctgctcaagaggcagatcgggaggaagacaagactcatgaatttctca atggttttgatgagcagtttcgggcggtccgatcaactctagtggctcggactcccattcaaccacttga agaggtgtacaacatcattcgtcaggaagaggacttgcgtgttcccgtagaagagtcaccgtcaatgaca gcttttgcggtgcaatccaaaccacgtcttccaactgacgaaaaggagaaaagtttcttctgcaaactgt gtaatcgatctggtcacactgcagagcggtgctatgctgtcatcgggtatccagagtggtggggtgatag acccaagggtcgtactctacaagggaagggacgtggtggcagtacgtctcgtggtggcaggggacgcgga gcctcccaagaggttgctaatcgggtttatgtgccgaatgtggaaaacgtgacaactgaacaagctaatc atgttctcacagacgacgaccgatctggagctcatggactgaatgatacccagtggaaaatgataaaatc aattctcaatgcgggaaaacaaccttcaactgaacaacaaacgagtatgtcttctctctctccttggatt atggatacaggggcgtcacatcatttaactggaaggtttgagaccttaacaaatgtgcgagaaatgcccc ctgttttgatcattatggcagacggtagagagcaagtttcttacaaagaaggttctattagtcttggtag tcatttggtaatgaaatctgtttactatgttgaagagttgaaaactgatttgatgtctttggggcagtta atggatgaaaacaagtgtgttgttcagttggctgatcggtttctcgtggttcaggaccgcatttcgagga tggtgattgggtgtggtaaaagagtgggtggtacctttcactttcgtagtacaaagatcgcagcttcagc cgcaacaaaggaaatcaagtcatttgagttgtggcataatcggatgggtcatccgtcgacgcaagtggtg ggaaagcttccgtttgtttctgcttctgtttcttcttctagtttgaataagccttgtgatatttgtcttc gttcaaaacagacaagggatagttttccattgagtatgaataaaagttcaaattgttttgaattaataca tgttgatttatggggtccttatagaactccatctcattctggagcaagatacttcttaactattgttgat gactactccagaggagtgtggctgtatttgttaaccactaaaagtgaagctccagagcagttgaaaaagt tttgtgctttaacagagagacaattcaataccaaaataaagaggatccggagtgacaacgggactgagtt tctatgcttaactaagtattttctcaccaatggtatcatccatgaaacctcgtgtgtagctactcctcaa cagaatgcaagagctgaaaggaaacatcgccatatcttaaatattgctagggcattgagatttcaagcgt ctttacccattgaattttggggtgaatgtgtgttaaccgcagcgtatttgataaaccggactcctagttc agttctcgacttcgcaacaccattcgaacgcttatttaacaaagctccaacttatgatcatatcagagtt ttcggctccctgtgctatgcccatgatcagaataaaagtggtgataaatttgcttcccgaagtaaatgtt gtgtttttgtgggttatccgtatgggaaaaaaggctggagattatatgacttagaaaaattggaattttt tgtctcaagagatgttgttttctctgagacacagtttccttttgctccgattaatcacttgcaggcttca gacgaatctaaagctttgtgggctccaatatctgaatttttggaaaatgatgattgtgggcttcgtaagc ccaatcctattaggtctgttgcattagggcctgtatcatccagccagctgtcgactcccatctcatctgc aaatgatacacgccggtctgaaaattcagacggtgacaacggtggtgcccgacaacttcctgaaccaacc gccggtgacaaaattctgccttctacggcacccaatccaattccagttgctccaactagccgattaacac cggctgctgtggttatgcctccagttccgacagaagagctattaggtaaagggaaacgcaacaggacccc gtcggttcgtttaaaagattttgttgtcccacacgctcccaaaccgaaacaacaagaagaaattaatctg gtttgtgctgaaaatctgactcacaacgtggatgttcaccgtttctccgaaacacatgttgcttatgttg ctgcagttctctccaacttggaaccaagatcattcaaacaagctatgcaagaagaaaagtggcgaaatgc tgttggttcagaatatggaacattagaagaaaacaacacatggaccattgaagacttacctcctaacaaa aaggcaatagggagtcaatggatcttcaaggtcaaatttaaatcagatggtacaatagagagatataaag ccagattggttccaatgggaaataaacagattgagggagaagattatggagagactttctctcctgtggt aaagatgggaacagtgagattgtttcttgacattgctgttaaaaaagggtggattatacatcaaatggac gtccacaatgccttcttacatggtgatttagaagaagaggtatatatgaagttgcctcctggttttgagt ctgttgataagaataaagtttgccgattgcgcaaatctctgtatggccttaagcaggcacctcgatgctg gtttgcaaagctctcttcagctcttcttgagtatgggtttcagcaattacgcagtgactactcattattc acttatgcacaaggtactactcgtctaaatattctggtttatgttgacgatttagttatcgctggaagta gcctgaaagctacagaatcatttaaagcctatctctcttcatgcttccatatgaaagatctaggagaact taaatattttctgggaatagaggtggctagaaatgcatctggcatatatctttgtcagagaaaatatgca ctagacatcattactgaaacagatcaactaggagcaaagccagcacactttcctttggaagcaaatcaca agctagcactctccgagtctgttctattacatgatccgaaaccatatcgtcgtttgttaggacgtcttat ttacttgggagttacacgtcctgatctcgcgttctcggttcatgttcttgctcaatttatgcagaatcca cgattggatcattggctcgcaactttacgactcgtacggtatttaaaatctgatccaggacaaggtatct tgttgcgagccgatgggaattttcaagtcactggctggtgtaatgcagactgggacaactgcccgataac acgttgctcagtcacaggctatttgtgcagctaggagactctccgatcagttggaaaacgaaaaagcaga aaacggttagtctttcatcagctgaagcagaatatcgggcacttgctaaacttgttcaagagcttatttg gatcaagatgatgcttaaaactcttggggttgttcatacttaaccaatgttagtgcaatgtgatagtaaa tctgcaatctacatcgccacaaatccggtcttctacgaacatacgaaacatatcgagattgatctacatt ttgtcagagacgaagttctaaaacgagagattcagctttgtcatgtggattctttttcacagcttgcaga catcttaacaaaacctattgggaaagatggttttcgttacttcaagtccaagctgggaacactaaatctg tattctccagcttgagggaggg1