;ID ATCOPIA76_I DNA ; ATH ; 4272 BP ;XX ;DE Internal region of ATCOPIA76 copia-like LTR-retrotransposon. ;XX ;AC AC006841 ;XX ;DT 05-NOV-2001 (Rel. 6.2, Created) ;DT 05-NOV-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; ATCOPIA76LTR; ATCOPIA76_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4272) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal region of ATCOPIA76 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(2) p. 34 (2001) ;XX ;CC ATCOPIA76_I is an internal region of the ATCOPIA76 copia-like ;CC endogenous retrovirus flanked by the 99% identical ATCOPIA76LTR ;CC long terminal repeats, and a 5-bp target-site duplication (GGTTT). ;CC ATCOPIA76_I encodes the 1361-aa ATCOPIA76p copia-like polyprotein ;CC A corresponding CDS is interrupted by a copy of ATGP2NLTR, which ;CC is flanked by the GACAT target site duplication. The ATGP2NLTR and ;CC one the GACAT repeat were removed from the ATCOPIA76_I nucleotide ;CC sequence. ;CC ATCOPIA76p: ;CC MSAARIEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRRELLEEK ;CC RRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFKMSENLSIEGNIDEF ;CC LRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVTLSLDEVVAAIYSKELELGSNKKSI ;CC KGQAEGLFVKEKTETRGRTEQRGNNNNNKKSRSKSRSKKGCWICGEEGHFKSSCPNKNKTQPQTKTTNNN ;CC NKGESSNGSSNYSEANGLYVSEALSSTDIHLEDEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTV ;CC SKVRGIGTIRVKNEAGMVVRLTNVRYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCY ;CC TLYLLQWRPVTEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRI ;CC GFNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFDKFVEWANLVE ;CC NQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGVAERMNRTLMEKVRSMLSDSGLPK ;CC KFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPIYSYLRRFGCIAFVHTDDGKLNPRAKKGILVGYP ;CC IGVKGYKIWLLEEKKCVVSRNVIFQENASYKDMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPI ;CC VEAQSPFNPSPATTQTYSEGVNSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDEDYLAEALYTTEDSGEI ;CC EPADYSEAKRSMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEGRFKARL ;CC VAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGELKEKIYMVPPEGYEEMF ;CC KEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCAYIKELSDGSRVYLLLYVDDMLVAAKN ;CC KEDISQLKEELSQRFDMKDLGAAKRILGMEIIRNREENTLWLSQNGYLNKILETYNMAESKHVVTPLGAH ;CC LKMRAATVEKQEQDEDYMKSIPYSSAVGSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWVLRYI ;CC KGSLGTKLQYKRSSDFKVVGYCDADHAACKDRRRSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSL ;CC TEAVKEAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIANGDGDVVKI ;CC DTEKNPADIFTKIVPVNKFQAALTLLQVKPE ;XX ;DR Positions 95983 89644 Accession No AC006841 GenBank (rel. 124.0) ;XX ;SQ Sequence 4272 BP; 1402 A; 707 C; 1069 G; 1094 T; 0 other; ATCOPIA76_I gagtggtatcagagccataggttcattgttctgtggatctctaaagattcaaagatcctgaactatgtct gcagcaagaatagaggttgaaaagtttgatggtcgtggggattacacgatgtggaaagagaagctcatgg cgcatctagacattctaggcctgagtgtggctcttaaggaagaagacgatttggtggagaaagttgcaga gatgcagcttactgaagaagaagagaaggaagaggtactgagacgagagctcttggaagaaaaaaggaga aaggcgagaagcgccattgtgctgagtgttactgatagagtcttgaggaagattaagaaggagcagtctg cagctgctatgctaggcgtgcttgacaaactgtacatgtctaaggcgctaccaaacagaatctaccaaaa acagaagctttacagcttcaagatgtcagaaaacctcagcatagaaggtaatatagatgagttcttgcgt attatagctgatttagagaacacgaacgtgttagtttctgatgaagaccaagctatattactactcatgt cattacctaagccgtttgatcaacttagagatactctgaagtatgggttaggaagagtcactttatcgtt agatgaggttgtagcagctatctactctaaggaactagagttagggtctaacaagaagagtattaagggt caggctgaaggtctctttgtcaaggaaaagactgagacaagaggaaggactgaacagcgaggcaacaaca acaacaacaagaagtctagatccaaatctagatctaagaagggttgttggatttgcggtgaagagggaca cttcaagagttcatgtcctaacaagaacaaaacacaacctcagacaaagacaactaacaacaacaacaaa ggtgaatcatcaaacggtagcagtaactactctgaagctaatgggctttatgtttctgaagctttgtctt ctacagacatacatcttgaggacgaatgggtcatggacactggctgtagttaccacatgacatacaagcg tgaatggtttgaagatttgaatgaggatgctggtgggtctgtgaggatgggaaacaagactgtttcaaag gtcagaggaataggcacaatccgggttaagaatgaagcaggaatggtggttcgtcttacaaatgtgagat acattccagaaatggataggaatcttttatctctgggaacatttgagaagtctggctacagtttcaagtt agaaaatggaacactgagcatcattgcgggagacagtgttctactcacagtaagaaggtgttatacactc tacctgttacagtggagaccagtaacagaagagtctctctctgtggtgaagagacaagatgacacaatct tgtggcatcgaagattgggacacatgagtcaaaagaacatggatttattgctgaaaaaaggtcttttgga caagaaaaaagtgtccaagctggagacatgtgaggactgcatatacgggaaagctaagaggattggattc aacttagctcaacatgatacaagagagaagctggagtatgtgcattcggatttatggggagctccatcag tgccattctctctaggtaaatgtcaatacttcatatcgtttattgatgattacactagaaaagttaggat ttatttcctgaagactaaggatgaggcatttgataaatttgttgaatgggctaaccttgttgagaaccaa actgacaagaggataaaaactcttagaacagacaatggtcttgagttttgtaacaggtcatttgatgagt tctgctcacagaaagggattctatggcatagaacgtgtgcatacacgccacaacagaatggtgttgcaga gcgcatgaacaggaccttaatggagaaagtcaggagtatgcttagtgattctggtcttccaaagaagttt tgggcagaggctactcatacgacagctatactcatcaacaaaaccccatcatcagctctaaactatgaag taccagacaagagatggtcagggaagtcaccaatctacagctacttaagaagattcgggtgcattgcgtt tgttcacactgacgatggaaagctcaatccgagagctaagaaaggaatactagtaggatatcctattggt gttaagggttacaagatttggttgttagaagagaagaagtgtgtggtgagcagaaatgtgatttttcaag aaaatgcatcttacaaggacatgatgcagagtaaagatgctgagaaagatgaaaacgaggcaccaccaag ctcttatttggatttggatcttgatcatgaagaagttatcacctcaggtggagatgatccgattgtcgaa gctcagtctccattcaatccaagtccggcaaccactcaaacctacagtgaaggagttaactcagaaactg atataattcagtcaccactgagttatcaactggtgagagatcgagatagaagaacaatcagagctccagt gaggtttgatgatgaagactatctcgctgaagctctctatactacagaagacagtggagaaatagaacct gcagattacagtgaagctaaaagaagcatgaattggaataagtggaaacttgctatgaatgaagaaatgg agtcacagatcaaaaaccatacttggacagtggtcaaaagacctcaacatcagaaggttattggtagtag gtggatctacaagtttaaacttgggattcctggagttgaagaaggtagattcaaggcaaggcttgttgcc aaagggtatgctcaacgtaaaggaatcgattaccatgagatctttgctcctgttgtgaaacatgtctcca ttagaatactgatgtctattgttgctcaagaagacttggagctggaacaacttgatgtaaagacagcatt tctacacggtgagctgaaagagaagatttacatggtacctcctgaaggttatgaagaaatgtttaaagaa gatgaagtttgtcttcttaataagtcactgtatggactcaagcaagctccaaagcaatggaatgagaagt ttaatgcttacatgtctgagattggctttataaggagtttgtatgacagttgcgcatacattaaggaatt gagtgatggttcaagggtttatctgcttctgtatgtggacgatatgctagtggcagctaagaacaaagaa gatatatctcagcttaaagaagaactcagtcagagattcgacatgaaggatttgggggctgctaaaagaa tcctcggtatggagattatcagaaatagagaagagaacactctgtggctgtcacagaatggctatctgaa taaaattcttgagacttacaacatggcagagtcaaaacatgtggtgacaccacttggagctcacttgaag atgcgagcagccacagttgagaagcaagagcaagacgaggactacatgaagtcaattccctactcaagtg cagtaggaagtatcatgtacgcaatgataggtactcgccctgatctagcttatcctgttggaatcattag tcgctacatgagtcaaccggctagagaacactggcttggagtcaaatgggtcttgaggtacatcaaaggc tcactgggaactaagttgcaatacaagagaagcagtgactttaaggttgtgggatactgcgatgctgacc acgctgcatgtaaagatcggagaagatcaattacagggcttgtgtttactcttggaggaagcactatcag ttggaaatcaggtcaacagagagttgtagctctctcaactacagaggcagagtacatgtctctaactgaa gctgtgaaagaagcagtgtggatgaagggtttgttgaaggaatttggttatgaacaaaagagcgtggaga tcttttgtgattctcaaagtgctattgcactctccaagaataatgttcatcatgaaagaacgaagcatat agatgttcgatatcaatatattcgggacataattgctaatggtgatggtgatgtggtgaagattgacact gaaaagaatccagctgatatcttcaccaagatcgtgcctgtaaacaagtttcaggcggctttgaccttgt tacaggtcaagcctgagtagtaaaactcaggaggaatccgagtatggaatcctacaactaggttctcggg gtactctcttatctctctcattgacagtttgtgcaagtttttggttttatcatcaagtttcaggtggaga tt1