;ID ATCOPIA87_I DNA ; ATH ; 4087 BP ;XX ;DE Internal region of the ATCOPIA87 copia-like LTR-retrotransposon. ;XX ;AC AB028606 ;XX ;DT 30-NOV-2001 (Rel. 6.3, Created) ;DT 30-NOV-2001 (Rel. 6.3, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; the ATCOPIA87 ;KW family; ATCOPIA87LTR; ATCOPIA87_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4087) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal portion of the ATCOPIA87 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(3) p. 18 (2001) ;XX ;CC ATCOPIA87_I is an internal region of the ATCOPIA87 copia-like ;CC endogenous retrovirus flanked by the 99% identical ATCOPIA87LTR ;CC long terminal repeats, and a 5-bp target-site duplication (TAGAC). ;CC ATCOPIA87 forms a young family of copia-like endogenous retroviruses ;CC present today in the A. thaliana genome. ;CC ATCOPIA87_I encodes well preserved remnants of the 1338-aa ;CC ATCOPIA87p copia-like polyprotein. ;CC ATCOPIA87p: ;CC MVSELQVKEETSQEPRFERFDGRGDYTLWKRKLLAQLEVMGISDALKEKEEKKEAVETERVKVVSSSSER ;CC RREEHKKDHSREEKENKARSVIILSVADNILRRIRTEETAAGMISVLDKLYLSDPLSSRISLKRKLFEFK ;CC MSENKAVEENIEDFFRIVEDLEKLDVYVSDEDKAFMLLLSLPRKLEQLKYSLDYCEEPLTLGRVMTAIYK ;CC KELEVAQIERQTEEEEKRLSLRERERSDYREEQAKGKEKVRSEAREKKGPCWRCGQKGHVKTECFQEKKN ;CC KSRKKSVRYEESSAQSIVSGGSVFMVSEAAARASKGSSEEWICDTGCTSHMSSRKEWFEDLVFSESGNVS ;CC MANDTTLQVKGIGSVRILNDDGTTVLLTNVMYIPGMSKNLISLGTLENKGCWFKSKNGILKVIKGCITLM ;CC KAEKVGTLYMLKGKAVTARRRAVQGPKEETKMEHIKPAHMSQTSLEIPVKKGCIRKQKIGEVKVCEDRAQ ;CC KEVKRIKFDSEKIVTEMKPKIVHSKIWSFASTPRKIKEGLRIKGITAKETGKSVCDTILLKSCTGGVDIA ;CC VNLKKSLFSSSFEVSRDKETSSASNCRKLVISQDEKFQRDIICECCQEHECERRKCVSEKGIFSLHDKNE ;CC LVYNESVIQGGASSSENSQQHSSLNSHQVMATFMRSRKEEMAKDGTYHHSTSHQVSMKRSGVTVYAPSSQ ;CC RKPSPRERNIISVGCYDGEIFFKILLPEDKEDCFIRKNAEFLEGYLIKVLNLKFQDLEDQRRKSVRVSLK ;CC SLQDYETSSFQVQGGAYSAKGSFISEGQESLNTSLQGRDAASETSEEETDLQGYLLTRINVKREVQPSKK ;CC DEETKLVGSVQFIIEDAGKTEPTSFEEEKRDPSWSKVEEIQFLHNRDTWDLADKLMNQEASICKGIYKKK ;CC PEFSGIGKPSFKVNLRVKGHSKEKVVLKLMEMKQKTDFLHENSEKNIRMLQPAVYEEKSRKEKAFLLENF ;CC LYGLKNLPRSQNQRNDTLLIQDGYMRSKYVLWVYFKVSKMEIHQNRGFESPRLSQEAYVIKVIRLFKEDQ ;CC CEFVFTSSETLFEFQALEEEKQGDQADYMKLISCLSDVDSVVVAIIGTRVNIAWFRGVVFRSKKEHWKSM ;CC LRRMRGIKGTTESVQVHRKQEDFELTRVNDSKVAVGADQKRVLTCRSFTTAGSSISCKYMFQEVDFVSTK ;CC KGERNRWTGAYKRVVWVRDFKDLIESSLVFKKQEESMLRGCYSSKFAAGSVLKVNFQQCVFSFAGNTLRR ;CC ESGLLKVGVSSTTEAEKRVQEMDIQETVLSRELLDMDTGGLKKTTKLGMLSSAAGVVTKGLTRSMFQDGG ;CC PMLRFTKN ;XX ;DR Positions 25401 29487 Accession No AB028606 GenBank (rel. 124.0) ;XX ;SQ Sequence 4087 BP; 1479 A; 582 C; 1018 G; 1008 T; 0 other; ATCOPIA87_I aaaatggtatcagagctccaggttaaggaggaaacctctcaagaacctagattcgaaagattcgacggga gaggagattacacactgtggaaaagaaagcttcttgctcagcttgaagttatgggtatctcagatgctct aaaggagaaagaagaaaagaaggaagctgttgagacagaaagggtgaaggtcgtgtcttcaagctcagaa agaagaagagaagaacacaagaaagaccattcacgagaagagaaggagaacaaggccagatctgttatca tacttagtgttgcagacaatatcctaagaagaatcagaacagaggagactgctgcaggtatgataagtgt tctagataaactctacttgtctgatccactatcaagtcgtatatctcttaaaaggaagctctttgagttt aagatgagtgaaaacaaggctgtagaagagaatatagaagatttcttcaggatagttgaagatctagaaa agttagatgtttatgtgtctgatgaagacaaagcattcatgttgcttttgtctctccctagaaagcttga acagcttaagtactctttagattattgcgaggaacccttaactttgggtagagtaatgactgcaatatac aagaaagagcttgaggttgctcaaatagaaagacaaacagaggaagaagagaagcgtttgtctttaaggg aaagagaaaggtcagattacagagaagaacaagctaagggaaaagagaaggtcagatctgaggcaagaga aaagaaaggtccatgttggagatgtggacaaaagggacatgtcaagacagagtgcttccaagagaaaaag aacaagtcaaggaagaagtcagttagatatgaagaatcatcagcgcaaagtattgtttctggtgggtctg tgttcatggtttcagaagctgctgctcgagcaagtaaaggaagctctgaagaatggatttgtgacacagg ttgtacttctcatatgagctcaagaaaagaatggtttgaggacttggtattttctgaatctggaaatgtg tcaatggcaaatgacactactttacaggttaaagggattggaagtgtgaggatcttgaatgacgatggaa caacagtcttgttgactaacgtcatgtatattccgggcatgtcaaagaatctcatatccttagggacact tgagaacaagggatgctggtttaaatccaagaatgggattttaaaggtcataaagggatgcataacattg atgaaggctgaaaaagttggtacactttacatgctaaagggtaaagcagtaacagcgagacgaagagctg tacaaggaccaaaagaggagactaagatggagcatatcaagcctgctcacatgagtcaaacgagtctcga gattccggtcaagaaagggtgtatcagaaaacagaaaatcggtgaagtgaaagtctgtgaagacagggca cagaaagaagtaaaaagaatcaagtttgattctgaaaagattgtcactgaaatgaagcctaagattgtgc attcaaagatatggagtttcgcatctactccgagaaaaatcaaagaaggattgagaataaaaggcatcac tgcaaaagaaacagggaaatcggtttgtgacactattctgttaaagtcatgtacaggaggtgttgatatt gcagtgaatctcaagaaaagtttattctcatcaagttttgaagtctcaagagataaagagacaagttctg catcaaactgcaggaaattggtcataagtcaagatgagaaatttcagagagacatcatttgtgaatgttg tcaagaacatgaatgtgaaaggcgaaagtgtgtctcagaaaagggaattttcagtctacatgacaagaat gagttagtctacaacgaatctgttattcaaggtggagcaagctcaagtgaaaattcacagcaacattcaa gtttgaattctcatcaagtcatggcaacgttcatgagatcaagaaaagaagaaatggcaaaagatgggac atatcatcactcaacaagtcatcaagtgagtatgaaacgttctggagttacagtatatgcaccatcaagc caaaggaagccaagtccaagagaaagaaatattatttctgtcggatgttatgatggagaaatctttttca agatcttgttgcctgaagataaagaggactgcttcataaggaaaaatgcagagtttctagaaggttattt gatcaaagtgttgaatcttaagtttcaggatttagaggatcaaagaagaaaatctgtgagagtaagtctg aaatctctacaagattatgaaaccagtagctttcaagttcaaggtggagcttattcagcaaaaggcagtt tcataagtgaaggacaagagtctttaaatacgtcgttgcaaggtagagatgcagcatcagaaaccagtga agaagaaacggatctgcagggttatctattgacaagaattaatgtcaaaagagaagttcagccaagtaag aaagatgaagaaacaaagttagttggatcagttcagtttatcattgaagatgcaggaaaaacagaaccaa ctagctttgaggaggaaaagagagatccaagctggtcaaaagttgaagaaatccagtttttgcataatag ggatacatgggatctggctgataaactcatgaatcaagaggcaagtatatgcaaagggatctacaagaag aaaccagaattttctggaattggaaaaccaagttttaaagtcaacttgagggttaaaggtcactcaaaag aaaaagtggttctgaagctaatggagatgaagcaaaagacagatttcttacatgaaaattctgagaagaa cattcggatgcttcaacctgcagtttatgaagaaaaatcaagaaaagagaaagcatttctcttggagaat tttctgtatggtttgaagaatttaccaagatcacagaatcagagaaatgacacactcttgattcaagatg gttatatgagaagcaagtacgttttgtgggtatacttcaaggtgtcaaagatggagatacatcaaaacag aggatttgaaagtccaaggctgtctcaggaagcttatgtgataaaggtaataaggttgttcaaggaagat caatgtgagttcgtttttacttcatcggaaactctatttgagttccaagcactcgaagaagaaaaacaag gagatcaagcggattatatgaagttaatatcatgcttaagtgatgttgatagtgtggtagtcgccatcat tggcacaagagttaatattgcttggtttagaggagtggtgtttcgttctaagaaagaacattggaagtca atgttacggaggatgcgaggcattaaaggaaccacggaatcagttcaagttcaccggaaacaagaagatt ttgagcttacaagagttaatgactcaaaagttgcagttggtgctgatcaaaaacgagttttgacgtgtcg gagtttcacgactgcaggcagctcaataagctgcaaatatatgtttcaagaagtagattttgtgtctaca aagaagggtgaacgcaacagatggactggggcttataagcgagttgtttgggtgagagactttaaggatt tgatagaatcaagtctggttttcaagaaacaagaagagtctatgttaagaggatgctacagctcaaagtt tgcagcaggatcagttttaaaggtaaactttcaacaatgtgtgttcagttttgcaggaaacacattaagg agagaatctggtctgctgaaagtgggtgtttcatctacaacagaggctgaaaagagggtccaggaaatgg acatacaagagacagttttgtcaagagaattgcttgacatggacacaggagggctgaagaaaactaccaa gttgggtatgttgtctagtgcagccggtgttgtcaccaaagggttaacaaggagcatgttccaagatggc gggccgatgctcaggtttaccaagaactgaccaaggtcagggtccggggaaggaatccaagaactagaat catcaactcaaagacaaggtggagttt1