;ID ATCOPIA62_I DNA ; ATH ; 4639 BP ;XX ;DE Internal region of ATCOPIA62 copia-like LTR-retrotransposon. ;XX ;AC AL163975 ;XX ;DT 26-OCT-2001 (Rel. 6.2, Created) ;DT 26-OCT-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; ATCOPIA62LTR; ATCOPIA62_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4639) ;RA Jordan,N., Bangert,S., Wiedelmann,R., Voss,H., Unseld,M., Mewes,H.W., ;RA Rudd,S., Lemcke,K., Mayer,K.F.X., Quetier,F. and Salanoubat,M. ;RL Direct submission to GenBank (April 2000) ;XX ;RN [2] (bases 1 to 4639) ;RA Kapitonov,V.V. and Jurka,J. ;RL Direct submission (August 23, 2001) ;XX ;CC ATCOPIA62 has been found by [1] and slightly modified by [2]. ;CC ATCOPIA62_I is an internal region of the ATCOPIA62 copia-like ;CC endogenous retrovirus flanked by the 1% divergent ATCOPIA62LTRs ;CC and a 5-bp target-site duplication [2]. ;CC ATCOPIA62_I encodes the 1481-aa ATCOPIA62p copia-like polyprotein. ;CC ATCOPIA62p ;CC MAPGRKISTRRTIRVPVSARRSANRDDSSPEGSPEPRARTRNPVTESHDSIHSPYYLTNSDNPGASITSE ;CC VFDGTNYDDWKISIKIALDAKNKLVFIDGSVPRPPESDPMFRIWSRCNSLVKSWLLNSVSKPIYKSILRF ;CC DDASEIWNDLSTRYHITNLPRSYPLTQQIWSLQQGTMDLTTYYTTLRTLWNELDGSDCVTLCKRCDCCKA ;CC MDKKAEHARVIKFLAGLNESYAVIRSQIIMKKHVPSLAEIYNLLDQDHSQRSFTPVPSNAAAFIVSAPEQ ;CC VQPSVNATFNNAKPQKVICSHCGYTGHTVDRCYKIHGYPLGFKHKNKNQSDKSVSLEKSVSTVKPVVAHM ;CC ALTDSTTNDLINGLTKVLTKDQINGVVAYFNSQMQNSSIASSSGATITALPGIAFSSSTLGFIGVLKATV ;CC NVLSSETWIIDSGATHHVCHDKNLLMRLSETMNSSVTLPTGFGVKITCIGTVKLNEFLVLNNVLYIPDFR ;CC LNLLSVSQLTKDLGYRVTFDEDYCLIQDHVKGLMIGRGEQINNLYVLDVPRIKDFPTKEISFHANIVVDS ;CC SLWHSRLGHPSVTTSDIVTDVLGFKQRNERSFHCTICPLAKQKRLPFVSKNHVCDSAFDLVHIDVWGPFN ;CC VPTPDGFRYFLTIVDDHTRVTWLYLMKNKNEVLTIFPDFLKMIETQYKSQVKGVRSDNAPELKFVKLFKE ;CC KGIIHYFSCPETPEQNSVVERKHQHILNVARSLMFQAQVPVEYWGECVLTAVFLINRLPTPLLHDKSPFE ;CC VLTNKMPDFNGLRVFGCLCYSSTSTKNRDKFQPRAKACVFLGYPPGVKGYRLLDLETNIIYVSRNVVFHE ;CC DIFPFAKSGSTVLPDYFATETSNVDASSTEASTSEAPAVVNDSVTPSNINPVVVSESPTDTNDIVDSTIP ;CC AVSSTDKTSKGRTSKTPAYLQDYYCNLSTNGVEHPISNFLNYDGLADSHRAYICSITKYAEPTSFTQARK ;CC SDDWLKAMNDELKALEGTATWKICSLPPDKHAIGCRWVYKVKLNADGSLERYKARLVAKGYTQQEGVDFV ;CC DTFSPMAKMTIVKTLLVVAAAKKWSLHQLDISNAFLYGDLEEEIYMTLPPGYTTKEGETLPPNAVCKLQK ;CC SLYGLKQASRQWFLKFSTTLMLLGFQRSQADHTLFVRNVNGKYIAVLVYVDDIIIASNDDAEVVELKADL ;CC ERAFKLRDLGTLKYFLGLEIARNASGISVCQRKYALGLLEETGLLACKPSNIPMEPSIKLISDGDEPPME ;CC DPASYRRLVGKMMYLTITRPDITYAVNRLCQFTSAPKESHMKAAHKVLHYVKGTVGTGLFYSADCDMTLQ ;CC AYTDADWASCRDTRRSTSGFCMFLGTSLISWKSKKQQTASHSSAESEYRAMEFAVREVAWLVNLLREFQA ;CC PQLKSVAFFCDSTAAIHIANNAVFHERTKHVELDCHILRDKVMSGLIKTLHLKTDQQVADVFTKPLFPTQ ;CC FKALVGKMALQ ;XX ;DR Positions 63260 58622 Accession No AL163975 GenBank (rel. 124.0) ;XX ;SQ Sequence 4639 BP; 1313 A; 859 C; 945 G; 1522 T; 0 other; ATCOPIA62_I tggtatcagagcatgaatggtgaatcattcatacgattcttgacctcgttttcttcttttctttttcatt tcgatcagtttcttcagtctcgtcactgagaaactgttgtatcgaccaaaaatttcaacaaaaatttctt cctttgccgtgaaattggagcttctctaatggcgcctggacgtaaaatctctactcgtcgcacgattcgc gttcctgtttcagctcgtagatcggcaaatcgcgatgattcttcacctgaaggttcacctgaacctcgag ctcgtactcgaaatccggtaactgaatctcatgatagtatacattcaccctattatcttacgaatagtga taatcctggagcttctattacttctgaagtgtttgatggaacgaattatgatgattggaaaatttcgatc aagattgctttagatgcgaagaataagcttgttttcattgatggatctgttcctcgacctcctgaatcag atcctatgtttcgaatttggtcccgatgtaacagcttggttaagtcttggctcttgaattcggtgtcaaa accaatatataagagtatccttcgtttcgatgatgcctcagagatatggaatgatctttcaactcgttat cacattactaatcttccaagatcttatccgttaactcaacagatttggtcacttcaacaagggactatgg atcttactacttactatacgacattgaggactctctggaatgaattggatggttctgattgtgtgacttt gtgtaaacgttgtgattgttgcaaagctatggataagaaagctgaacatgctcgtgtgataaagtttttg gctggcttgaatgaatcatatgccgtcattagaagccaaatcatcatgaagaaacatgtgccttccttag ctgagatttacaatttgttggatcaagatcacagtcaacgcagcttcacaccggttccttctaatgcagc tgcatttattgtatctgcgccagaacaagttcaaccttctgtgaatgccacattcaacaatgcgaaacca cagaaagtcatatgttctcattgtggttacacaggacatactgttgatcgttgttacaagattcatggat atccacttggttttaaacacaagaataagaaccaatctgataagagtgtttctttggaaaaatcagtttc tacagttaaacctgttgttgctcatatggctttgacagatagtactacaaatgatcttattaatggtctg actaaggttcttaccaaggatcaaattaatggagttgttgcatacttcaattctcaaatgcagaattcct ctattgcttcctcgtctggtgctactattaccgcattacctggtattgctttctcctcctctactcttgg ttttattggtgttttgaaagctactgttaatgttttatcctcggaaacttggataatagacagtggagca actcatcatgtttgtcatgataagaatttgcttatgagattatctgaaactatgaatagttcagttacct tacctactggttttggagttaagatcacatgtataggtacagtgaagctgaatgagttcctcgtcttgaa taatgtgctttacattccggattttcgccttaatcttctgagtgtcagtcagctgactaaagatctggga tatagagtgacatttgatgaggattattgccttatacaggatcatgtcaaggggctgatgattggtagag gtgagcagatcaacaatctatacgtcctggatgttccgagaattaaggattttcctactaaggaaataag ttttcatgcaaacattgttgttgattctagtctttggcatagtagactaggtcatccatctgtaactact tctgatatagttactgatgtacttggatttaaacaaaggaatgaaagatcttttcattgcaccatttgtc ctcttgcaaaacagaagcgtcttccctttgtttccaagaatcatgtttgcgactcagcttttgatttagt tcatatcgacgtctggggtccattcaatgttcctactccagatggttttcgatattttctaaccattgtt gatgatcatacacgggtcacttggttgtatcttatgaagaacaagaatgaagtgttgactatcttcccag attttctgaaaatgatagagactcagtacaagagtcaggtgaaaggtgttagatcagacaatgcaccaga attgaagtttgtgaagttgtttaaagaaaagggcatcattcattatttctcttgtccagaaacaccagaa caaaactcggtggtggaaaggaaacatcaacacatattgaatgttgctcgttctcttatgtttcaagctc aagtgcctgtggaatattggggagagtgtgtgttaactgcagtctttctcatcaatcgattgcctacacc attgcttcatgacaaatctccttttgaagtgcttactaacaaaatgcctgattttaatggtcttcgtgtg tttggctgtctttgttacagttctacatcaaccaaaaatcgagataagtttcaaccaagagctaaggcgt gtgtgtttcttggttatccaccaggtgttaagggttatcgacttttggatttggaaaccaatatcatata cgtctcacgcaatgttgtttttcatgaagacatttttccatttgctaaaagtggatctactgttcttcct gattattttgctactgaaacatctaatgttgatgcatcttctactgaagcatctacttctgaagcacctg cagttgtgaatgattctgtcactccatctaatatcaatcctgtagttgtgagtgaatctcctacggatac taatgatattgttgacagtactattcctgcagtttcttctacggataagacaagtaaaggccgaacaagt aagactcctgcttacctccaagactattattgtaatttgtctactaatggagtggagcacccaatttcaa atttcttgaactatgatggtttagctgattcacaccgagcatatatttgttctataacaaaatatgcaga gcctacttctttcactcaagccaggaaatctgatgattggttaaaggcaatgaatgatgaattgaaggct ctagaaggaacagcaacttggaagatatgttctttaccacctgataaacatgccataggctgcagatggg tttataaagtgaagttaaacgcagatggaagtttagagcgttacaaggcacggttagttgccaagggtta cacacagcaggagggtgttgactttgttgacactttttccccaatggcaaagatgactattgtcaaaaca ttgttggttgttgcagcagcaaagaaatggagtttgcatcagttggatatatcgaatgcctttttatatg gcgaccttgaagaagaaatttatatgactcttcctccgggttacacaactaaagaaggcgagactcttcc acctaatgcagtctgtaagttgcaaaaatctctctatggtttaaaacaagcttcaagacagtggtttctg aagtttagtaccactttaatgctattaggattccaaagatcacaggctgatcacactttgtttgtgagaa atgtgaatgggaaatatatagcagtacttgtgtatgttgatgatattatcatcgcaagtaatgatgatgc agaggttgttgaactaaaagcagacttggaaagagcttttaaactgagagatttgggtactttgaagtat tttttgggcttggagatagctcgtaatgcttcaggtatttcagtttgtcaacgtaagtatgcgttgggat tacttgaagaaacaggtttattggcttgtaagccatctaatattcctatggaaccaagtataaagttgat atcggatggagatgagcctccgatggaagatccagcttcttacagacgcttagtgggtaaaatgatgtat cttaccatcactagacctgacattacatatgcagtgaatagactttgtcagtttacttcagctccaaaag aatcacatatgaaggcagctcacaaggttttacactatgttaaagggactgttggaacaggtctcttcta ttcagctgattgtgatatgacattacaggcatatactgatgcagattgggcttcatgtcgtgatacaaga cgttccacttctggcttctgtatgtttctaggcacatctttgatctcatggaagtcaaagaagcagcaga ctgcatctcattcttcagctgagtctgagtatcgagcaatggaatttgcagttcgtgaggttgcttggct tgttaatcttctcagagagtttcaagcacctcagctaaagtccgttgctttcttctgtgattcaactgca gcaatacatattgcaaataatgcagtatttcacgaaagaaccaaacatgtggaacttgattgccacatcc ttagagacaaggttatgagtggtttgattaagactttgcaccttaaaactgatcaacaggttgcagatgt ttttaccaagcccttatttccgactcaattcaaggctcttgttggcaagatggctctccaatgaatatac ttgccatcttgagggaggc1