;ID ATCOPIA82_I DNA ; ATH ; 4279 BP ;XX ;DE Internal region of the ATCOPIA82 copia-like LTR-retrotransposon. ;XX ;AC AP001298 ;XX ;DT 30-NOV-2001 (Rel. 6.3, Created) ;DT 30-NOV-2001 (Rel. 6.3, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; the ATCOPIA82 ;KW family; ATCOPIA82LTR; ATCOPIA82_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4279) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal portion of the ATCOPIA82 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(3) p. 8 (2001) ;XX ;CC ATCOPIA82_I is an internal region of the ATCOPIA82 copia-like ;CC endogenous retrovirus flanked by the 98% identical ATCOPIA82LTR ;CC long terminal repeats, and a 5-bp target-site duplication (TTGGT). ;CC ATCOPIA82 forms a separate family of copia-like retroviruses ;CC present in the A. thaliana genome. ;CC ATCOPIA82_I encodes (positions 14-4279) well preserved remnants ;CC of the ~1422-aa ATCOPIA82p copia-like polyprotein. A false stop ;CC codons at position 1756 corresponds to H (based on polyproteins ;CC most close to ATCOPIA82p). ;CC ATCOPIA82p: ;CC MDSQIQLYSHPVIHISNYVTVQLTERNYLLWKTQFESFLSGQNLLGFVNGAIKPPPAVNTLTQINGLTTE ;CC VQNPDYQAWQRSDQVVRAWLLGSLSEDILREVVHTITAQEVWTALAQHFNKVSSSRLFKLQRKLQTIEKL ;CC DKSMEDYVREIKRICEQLASIGNPVSQKMKIFAALHDLGRDYEPIKTSIEGSMDLHPPPTFESVIPRLTG ;CC FADRMAGYNAGNEVSPHLAFNITTTNGSHYYSSQGRGNGKPGNNNKGRLNFTTKGRGFHQQISSGSSGGD ;CC RIICQICGKPGHPALKCWHRFNNSYQHEELPSALAALRITDVTETAGHDWFTDSAATAHVTNSTNRLQQS ;CC QPYSGSDAVMVGNGEFLPITHTGSTSLQSTSGNLPLTDVLVCPDINKSLISVSKLTSDYPCCVEFDCDTV ;CC RITDKATKRLLTMGHHNKGLYMLKNHSPLEVYYSSRQQAASDAVWHRRLGHPNAQILQHLSTTKAISVNK ;CC NTKMVCEACQLGKSLKLPFSASSFVASRPLQRIHCDLWGPSPIMSVQGFRYYAVLIDNYSRFSWFYPLKL ;CC KSDFALIFPVFQAMVENQFQ*KIGTFQCDGGGEFISKDFIAHLQKHGIQQLMSCPHTPQQNGLAERKHRH ;CC IIELGLSMIFQSSMPQKYWVEAFYTANFLINLLPSSVLEKKCSPYEVLMGKPPNYTSLRVFGCACYPTLR ;CC DYATTKFDPRSLKCVFLGYNDKYKGYRCLLPTTGRVYISRHVIFDESLFPFSSMAYIHLQPANVTPLMSA ;CC WLKGCSVQEQNQTGTSTENQDHNDQNSVPSGRVLLREEESTGCTAGFDHVPIGNSSSSSTQQITTSEDSP ;CC IIQPLPSTTEQSTQNQSSQSASSNSESSQVQTAQPSQSTHPMTTRLKDGIRKPNPRYGLHTQRVSYPEPK ;CC TVTAALKDEGWTDAMHEEMDNCSEAKTWSLVPYTPNMHVLGSKWVFRTKLNADGSLDKLKARLVAKGFNQ ;CC EEGIDYLETYSLVVRTPTVRSVLHLATIMQWDIKQMDVKNAFLHGDLTETVYMMQPAGFVEKSKPDHVCL ;CC LHKSIYGLKQFPRAWFDKFSTYLIEFGFECSKPDPSLFVYIKNKSIILLLLYVDDMIITGNSSDAMSKLL ;CC DSLNTEFRMKDMGRLHYFLGIQVQFHSEGMFLSQQKYAEDLLAVAVMSDCAPMPTHLPLQLTAIPAQDEI ;CC FDNPTYFRSLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTVSDFNLLKRILRYIKGTITMGISFNKNTYCR ;CC LRAYCDSDYGNCIDSRRSIGGYCTFLGTNIISWSSQKQDSVSKSSTEAEYRTLSDTASEVTWLGSVLKEL ;CC GIPLLDTPEIYCDNLSSVYLSANPAFHKRSKHFQLHYHYVRERVALGALIVKHIPGHQQIADIFTKSLPI ;CC KPFCDLRYKLGVDVPPTPSLRG ;XX ;DR Positions 60791 56513 Accession No AP001298 GenBank (rel. 124.0) ;XX ;SQ Sequence 4279 BP; 1277 A; 940 C; 852 G; 1210 T; 0 other; ATCOPIA82_I tggtatcagagccatggattctcaaatccagctctactcgcatcctgtcattcacatttccaactacgtt acggttcaacttactgagagaaactatcttctctggaagactcagtttgagtcctttctctctggacaaa accttctagggtttgtcaatggtgctatcaagcctcctccagctgtcaacactctcacacagatcaatgg tctcaccacagaagtccaaaatcctgactatcaagcatggcagagatctgatcaagttgttcgagcatgg cttctgggatctctatcagaagatattctcagagaggttgttcataccatcacagcacaagaggtttgga cagctttagctcagcacttcaataaggtatcttcatcccgtctctttaagctgcaaagaaaactgcaaac catagaaaagttagataaatctatggaagactatgttagagagatcaagagaatctgtgaacaacttgca tctattggtaatccggttagtcagaaaatgaagatttttgctgcattacatgacctaggaagagattatg aaccaatcaagacttctatagaaggatctatggacttacatcctccacctactttcgagtctgtgattcc taggttgactggttttgctgatagaatggctggttacaatgctggaaatgaggtgtctccccacttggca ttcaacattaccacaactaatggttcccattactatagtagccaaggtcgtggaaatggaaaacctggaa acaacaacaaagggagattaaattttacaacaaaaggaagaggctttcaccagcaaatctcatcaggttc ttcaggaggtgacagaataatatgtcagatatgtggcaaacctggacatcctgctctgaaatgctggcac cgcttcaacaacagctaccaacacgaggaactgccaagtgctctagctgcgttaaggattacagatgtta cagagactgctggtcatgattggtttactgattcggctgcaacagctcatgtcacaaactcaactaatag gcttcagcagtctcagccttactcaggatctgatgcagtaatggttggtaatggtgagtttctccccata actcacactggatcaactagtcttcagtcaacctcaggtaatcttcctttaactgatgttctagtttgtc ctgatattaataaatccttgatatctgtttctaagctcacatcagactatccctgttgtgttgaatttga ctgtgacactgtgcgtattactgataaggcaacaaagaggttgttaacaatggggcatcacaataagggg ttgtacatgttgaagaatcactcacctcttgaagtctactattcctcaagacagcaagctgcaagtgatg ctgtttggcatagaagactcggtcatcctaatgctcagattcttcagcacctgtcaacaactaaagctat ttcagtcaacaaaaacaccaagatggtatgtgaagcctgtcagcttgggaagagtcttaagttacctttt tctgcttcttcgtttgtagcctctagacctttgcaaagaatacattgtgatctttggggtccttcaccaa taatgtcagtacaaggttttcgatactatgctgttctcattgacaattactcacgcttcagctggtttta tcccctcaagttgaaatcagactttgctttgatatttcctgtgtttcaagcaatggttgagaatcagttt caatagaaaattggaacctttcaatgtgatggtgggggtgagtttataagcaaagacttcatagctcatc tacaaaaacatggcattcaacaactgatgtcatgtccacacacacctcaacaaaacggtttagcagaaag aaaacacagacacattattgaattaggactctcaatgatattccaaagtagtatgccacagaagtattgg gttgaggcgttctacactgcaaattttcttatcaatctactaccaagttcagtattggagaagaaatgca gtccctatgaagttctaatgggaaagcctccaaattacacatcactacgtgtctttgggtgtgcctgcta tcctaccctaagagattatgcaaccacaaagttcgaccctagatcactcaagtgtgtgttcctaggatac aatgacaagtataagggctacagatgcttacttccaaccacagggcgtgtgtatatcagccgtcatgtca tctttgatgagtcactctttcctttttcatctatggcttacattcacttgcaacctgcaaatgttacacc attgatgtctgcttggttaaagggttgctcagtacaagaacagaatcagactggtacttcaacagaaaat caagatcacaatgatcaaaattcagtaccttctggtagggttttgttgagagaagaggagagtactgggt gtacggcaggctttgatcatgttcctataggcaacagctcttcttcttctactcagcaaatcactacctc agaagactctcctatcattcaaccgctgccatcaactacagaacagtctactcagaatcaatcttctcag agtgcatcttccaacagtgagtcatctcaggtccaaacagctcagccatctcagtctacccaccctatga caacaagattaaaagatggaatcagaaaaccaaatccgaggtatggtttacacactcaaagagtatctta cccggaacctaaaacagtgacagctgctttaaaagatgaaggctggactgatgcaatgcacgaagaaatg gacaattgctctgaagctaaaacctggtccttggtgccctatacaccaaatatgcatgtattaggcagca agtgggttttcagaactaaactcaatgctgatgggtcccttgacaagctgaaagccagactggttgcaaa gggatttaatcaagaagaaggcattgattacctggaaacttatagcctagtggtgagaactccgacagtt agatcagttcttcacctagcaaccatcatgcagtgggacataaagcagatggatgtgaagaatgccttcc tccatggtgatctcacagagacagtgtacatgatgcagcctgcaggttttgtagagaaatcaaagccaga tcatgtctgtcttttgcataagtctatctacggattaaaacaattcccacgtgcttggtttgataaattt agcacttatctgattgagtttggctttgaatgtagcaagccggatccatccttgtttgtatacatcaaga acaagagtatcatcctgctgctgctttacgttgatgacatgataataacaggaaacagttcagatgcgat gtcaaagctattagacagcttgaatactgaattcagaatgaaagatatgggaagactacattactttttg gggattcaagtgcagtttcattcagaaggaatgtttctctcccaacagaagtatgcagaagatcttcttg ctgtggctgtgatgagtgattgtgctccaatgcctactcatctgccacttcagttgactgctatacctgc acaagatgaaatctttgacaatccaacatatttcagaagtcttgcaggtaaacttcaatatcttacctta acaagacctgatatacagtttgctgttaactttgtttgccagaagatgcatgctccaacagtgtctgact tcaatctgctcaaacgaattctgaggtacatcaagggaactataacaatggggatctcctttaataagaa cacatattgcagattgagagcttactgtgatagtgattatggaaattgcatagactcacgaagatccatt ggaggttactgtaccttccttggtacaaatatcatctcatggtcctcacaaaagcaagattcagtctcca aaagctcaaccgaagctgaatatcgaactctgtctgacactgcctcggaggtaacctggttaggatctgt tctcaaggaattgggcattcctctacttgacactccagaaatctattgcgacaatctctcctccgtgtat ctctctgcgaatccagctttccacaaaagaagtaaacactttcagctacactatcattatgttcgagaga gagtagcacttggagctctgattgtgaagcacatacctggtcatcaacagatcgcggacatcttcaccaa atctctccccatcaagccgttctgcgatcttcgttacaaactaggcgtcgatgttccacctacgccgagt ttgcggggg1