;ID ATCOPIA72_I DNA ; ATH ; 4483 BP ;XX ;DE Internal region of ATCOPIA72 copia-like LTR-retrotransposon. ;XX ;AC AC007109 ;XX ;DT 05-NOV-2001 (Rel. 6.2, Created) ;DT 05-NOV-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; ATCOPIA72LTR; ATCOPIA72_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4483) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal region of ATCOPIA72 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(2) p. 28 (2001) ;XX ;CC ATCOPIA72_I is an internal region of the ATCOPIA72 copia-like ;CC endogenous retrovirus flanked by the identical ATCOPIA72LTRs ;CC and a 5-bp target-site duplication (GGTGA). ATCOPIA72_I encodes ;CC the 1471-aa ATCOPIA72p copia-like polyprotein. ;CC ATCOPIA72p: ;CC MVPGIRVTRKSARSKVSTGSVARKSSKSTGVLDSASDSPPMARSQTAGASRGVFSSGFDDPTQSPFFLHS ;CC ADHPGLSIISHRLDETTYGDWSVAMRISLDAKNKLGFVDGSLPRPLESDPNFRLWSRCNSMVKSWLLNSV ;CC SPQIYRSILRLNDATDIWRDLFDRFNLTNLPRTYNLTQEIQDLRQGTMSLSEYYTLLKTLWDQLDSTEAL ;CC DDPCTCGKAVRLYQKAEKAKIMKFLAGLNESYAIVRRQIIAKKALPSLAEVYHILDQDNSQKGFFNVVAP ;CC PAAFQVSEVSHSPITSPEIMYVQSGPNKGRPTCSFCNRVGHIAERCYKKHGFPPGFTPKGKSSDKPPKPQ ;CC AVAAQVTLSPDKMTGQLETLAGNFSPDQIQNLIALFSSQLQPQIVSPQTASSQHEASSSQSVAPSGILFS ;CC PSTYCFIGILAVSHNSLSSDTWVIDSGATHHVSHDRKLFQTLDTSIVSFVNLPTGPNVRISGVGTVLINK ;CC DIILQNVLFIPEFRLNLISISSLTTDLGTRVIFDPSCCQIQDLTKGLTLGEGKRIGNLYVLDTQSPAISV ;CC NAVVDVSVWHKRLGHPSFSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQKKLSFPSANNICNSTFELLHI ;CC DVWGPFSVETVEGYKYFLTIVDDHSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSVRSDNAKELA ;CC FTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAVFLINRTPSALL ;CC SNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLGYPFGFKGYKLLDLESNVVHIS ;CC RNVEFHEELFPLASSQQSATTASDVFTPMDPLSSGNSITSHLPSPQISPSTQISKRRITKFPAHLQDYHC ;CC YFVNKDDSHPISSSLSYSQISPSHMLYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITS ;CC LPPGKKAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWY ;CC LNQLDISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKFSNSLLALGF ;CC EKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKLRELGPLKYFLGLEVARTSE ;CC GISLSQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGLLLEDKEMYRRLVGKLMYLTITRPDITFAV ;CC NKLCQFSSAPRTAHLAAVYKVLQYIKGTVGQGLFYSAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGS ;CC SLISWRSKKQPTVSRSSAEAEYRALALASCEMAWLSTLLLALRVHSGVPILYSDSTAAVYIATNPVFHER ;CC TKHIEIDCHTVREKLDNGQLKLLHVKTKDQVADILTKPLFPYQFAHLLSKMSIQNIFVFS ;XX ;DR Positions 16028 11546 Accession No AC007109 GenBank (rel. 124.0) ;XX ;SQ Sequence 4483 BP; 1153 A; 911 C; 940 G; 1479 T; 0 other; ATCOPIA72_I tggtatcagagccatcgagctcaattttttcttgattttgttcgatttcttcttcttttgagctagtttt cttcgtttcttcacgatccaatggttcctggaattcgtgttactcgaaaatcagctcgctcgaaggtgtc gacgggttctgttgctcggaaatcatcaaagtccaccggtgttctcgattctgcttccgattctcctcca atggctcgttctcagaccgctggagcttcgcgaggtgttttctcatcgggatttgatgatccgacgcagt ctcctttcttccttcatagtgcagatcatccaggtttgagcatcatttctcatcgtttagatgaaacaac ttatggtgactggagtgtggctatgaggatctcgttggatgctaagaacaaactaggatttgtagacgga tctttacctcgtcctttagaatcggatccaaatttccgtttatggtctagatgcaacagtatggtgaaat cctggttgcttaactctgtttctcctcagatctatcgtagcatcttacgtctgaatgatgctacagatat ttggcgtgatctctttgacaggtttaacctgacgaatcttccacgtacctacaatctgacacaggagatt caggatcttcgtcaaggaacaatgtctctatctgagtactatactcttttaaagactctctgggatcagc ttgacagtacagaggctttggatgatccttgtacttgtggaaaagctgttcgtctgtatcagaaggcaga gaaagctaagataatgaaatttcttgcaggattgaatgagtcttatgccattgttcgtagacagatcatt gcaaagaaggcacttcctagtttggcagaagtttatcatatcttggatcaggataatagccagaagggat tctttaatgttgttgctccacctgcagcttttcaagtctctgaggtatctcattctcctatcacttctcc tgagataatgtatgttcagagtggaccaaacaaaggtcgtcctacgtgttcattctgcaacagagttggt catatagctgaaagatgctataagaagcatggttttccaccaggtttcactcctaaagggaagtcctctg ataaacctccaaaacctcaggcagtggcagctcaggttactctttctccggataagatgacaggacaact tgagactcttgctggtaacttctcccctgatcagatacagaatttgattgccttgttcagttctcagttg cagccacagattgtttctcctcagactgcttcttctcagcatgaagcaagttcttctcagtctgttgctc cttctggtatcttattctctccttccacatattgctttattggcatcttggcagtttcacataactcttt gtccagtgacacttgggttattgactctggggctacacatcatgtgtcccatgacagaaaattgtttcag actttagatacttctattgtgagttttgtgaatcttccaacaggtccaaatgtcagaatcagtggagtgg gaacagttttgataaacaaagacattattctccagaatgttttgtttattcctgaattcagattgaattt gatcagtatcagctctttgactactgaccttggtactagagtgatctttgatccttcttgctgtcaaata caggatcttaccaaggggttgacgcttggagaaggtaaaaggattgggaatctctatgtgttggacacac aatctcctgctatctcggtgaatgcagttgtggatgtgagcgtgtggcacaagagacttggacacccatc tttttcaagactggattctctttctgaagttttgggaactactagacataagaataagaaatcagcttat tgtcatgtttgtcatttagccaaacaaaagaagttgtcatttccttctgcgaacaacatttgtaattcaa catttgagctgttacacattgatgtttggggacccttttcagtggagacagttgaaggatacaaatattt cttaactatagttgatgatcattctagagcaacgtggatttatttgcttaagtctaagagtgacgtcctc acagtgtttcctgccttcattgacttagttgagaatcagtatgatacaagagttaaatctgtgagatctg ataatgctaaagagttggctttcacagaattttacaaagcaaagggaatcgtttcttttcattcttgtcc tgagacaccagaacaaaattcagtggttgagaggaagcatcagcatattcttaatgtggctcgggctttg atgtttcagtctaacatgtctttgccatattggggtgactgtgttttaactgctgtcttcttgattaaca ggacaccttctgctttgttatcaaacaagactccttttgaggttctcactggaaagctaccagattactc tcagctcaagacatttggttgcctttgctacagctctacttcatcgaaacagcgacacaagttccttcca aggtcaagagcgtgtgttttcttgggctatccgtttggttttaaaggctacaagttgttggatttagaga gcaacgtggttcatatatcgaggaatgtggagtttcatgaggagttgtttccattagcgagttctcaaca gtctgctactacagcttcagatgttttcacaccaatggatcctttgtcctcaggtaattccatcacttct catcttccatcaccacaaatttctccatcaacacaaatttctaaacgtaggattactaaattccctgctc atctccaagactatcactgttattttgtcaataaagatgactcacatcctatttcatcttctctttctta ctctcaaatctcaccatctcatatgttatacatcaataacatttccaaaattccaatccctcaatcttat catgaggcaaaggattccaaagaatggtgtggtgctattgatcaggaaattggtgcaatggaaaggactg atacttgggagattacaagtttacctcctgggaagaaggcagttggatgtaagtgggtatttacagtgaa gtttcacgcagatggcagtttggaaagattcaaggccagaattgttgctaagggttatactcagaaggaa ggtttggattacactgagactttctctcctgttgctaagatggccacagtaaagttacttttgaaagttt cagcttctaagaagtggtatttgaatcagctggatatatctaatgcttttctcaatggagatttagagga aaccatatatatgaagctgcctgatggttatgcagatattaagggaacttctctgccacctaatgttgtt tgtcgtttgaagaagtccatttatggtcttaaacaggcatctcgtcaatggtttttgaagttttctaact ctctgttggctctgggtttcgaaaaacagcatggtgatcatactctctttgttcgctgtattggttctga gttcattgtcctcttagtttatgttgatgacatagtgattgcgagtactacagaacaagcagcacagtcg ttgacagaggctttaaaagctagctttaagctgagggaacttggtccactgaagtatttcttgggtttag aggttgctcgcacttctgaaggtatttccctatctcaaaggaagtatgctttagaattgctcacttctgc agatatgttggactgtaaaccatcctccatacctatgactccgaatattagattatctaagaatgatggt ctactcttggaggacaaagaaatgtatcgaagacttgttggcaagttgatgtatctgaccataactcgcc ctgatatcacatttgcggtgaacaagttatgtcagttctcttctgctcctcgtactgcacatcttgcagc tgtctacaaagtcttacaatacattaaaggtacagtgggtcaaggtctgttttattctgctgaggatgat ctgactttaaaaggctatactgatgcggattggggtacttgcccagatagtcgtcgatcaaccacaggtt tcactatgtttgttggttcctctctgatatcctggcgctccaagaaacagcctactgtctcacggtcgtc tgcagaggcagagtatcgagcattggctttggcttcttgtgaaatggcgtggctgtctacactgttattg gctttgcgtgttcattcaggtgtgcctattttatactctgacagtaccgccgctgtgtatatagccacta acccagtgtttcacgaacgaacgaaacacatcgaaatcgattgtcacaccgttcgtgagaagctggataa tggtcagttgaagctgcttcatgtcaagactaaagatcaggttgctgatatccttactaaaccactcttc ccttatcaatttgctcatttattgtccaagatgagtatccaaaacatctttgtattctcatcttgagggg gac1