;ID ATCOPIA51_I DNA ; ATH ; 4214 BP ;XX ;DE Internal region of ATCOPIA51 copia-like LTR-retrotransposon. ;XX ;AC AL161515 ;XX ;DT 02-SEP-2001 (Rel. 6.2, Created) ;DT 02-SEP-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; ATCOPIA51LTR; ATCOPIA51_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] ;RA Spiegel,L.A., Huang,E.N., Nascimento,L.U., de la Bastide,M., ;RA Vil,D.M., Preston,R.R., Matero,A., Shah,R., O'Shaughnessy,A., ;RA Rodriguez,M., Shekher,M., Schutz,K., See,L.H., Swaby,I., ;RA Habermann,K., Dedhia,N.N., Mewes,H.W., Lemcke,K. and Mayer,K.F.X ;RL Direct submission (March 2000) ;XX ;RN [2] ;RA Terol,J., Castillo,M.C., Bargues,M., Perez-Alonso,M. ;RA and de Frutos,R. ;RT Structural and evolutionary analysis of the copia-like elements in ;RT the Arabidopsis thaliana genome ;RL Mol. Biol. Evol. 18, 882-892 (2001) ;XX ;CC ATCOPIA51 was found by [1], minor modifications of the sequence ;CC coordinates were made by [2]. ;CC ATCOPIA51_I is an internal region of the ATCOPIA51 copia-like ;CC endogenous retrovirus flanked by 1% divergent ATCOPIA51LTR ;CC LTRs. ATCOPIA51_I encodes the 1392-aa ATCOPIA51p copia-like ;CC polyprotein. ;CC ATCOPIA51p: ;CC MAPAYPFPDNVHVSSSVTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPN ;CC PQYEDWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKDKS ;CC LSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLSKLPAPTFNDVISEVQGFDS ;CC KLQSYDDTVSVNPHLAFNTERSNSGAPQYNSNSRGRGRSGQNRGRGGYSTRGRGFSQHQSASPSSGQRPV ;CC CQICGRIGHTAIKCYNRFDNNYQSEVPTQAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGN ;CC DAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLT ;CC TQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVC ;CC EPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKSKFIS ;CC VFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGL ;CC SMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKF ;CC DPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTP ;CC PSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNAL ;CC GQGSLENLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLV ;CC PATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEW ;CC PLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECS ;CC TSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPRYFLGIEIESYNNGLF ;CC LHQHAYASDILHQAGMTECNPMPTPLPQHLEDLNSEPFEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQ ;CC RMHAPTNSDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISW ;CC SAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSANPALHKRSKHF ;CC DKDFHYIRERVALGLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGVSASPVSPTPSLKE ;XX ;DR Positions 4999 9212 Accession No AL161515 GenBank (rel. 124.0) ;XX ;SQ Sequence 4214 BP; 1202 A; 1030 C; 791 G; 1191 T; 0 other; ATCOPIA51_I tggtatcagagctattataccttaacaggtgattcaatggcccctgcttacccttttccagacaatgtcc atgtctctagttccgttaccttaaagctcaacgatagtaactacttgttgtggaagacacagtttgagtc ccttctatcgagccaaaagctcataggttttgtcaatggagtcgtcactcctccagctcagactcgtctt gttgttaatgatgatgtcaccagcgaagttccgaatcctcaatatgaagactggttttgcacagaccagc tcgtccggtcgtggttgtttggtacgctttcagaggaagtgcttggtcatgtccacaacctcactacatc tcgtcagatttggatctctctagctgaaaatttcaacaaaagtagcatcgccagagagttttctcttcgt cgtaatcttcaacttctgacaaaaaaagataagtctctatctgtttactgtcgtgattttaaaataatat gcgactctctaagctccattggcaaaccagtagaggaatccatgaaaatctttggctttctcaatggact cggcagagagtacgatcctatcaccacagttatccaaagctccctaagcaagctccctgctccgacgttt aacgacgtcatctccgaagttcaagggtttgacagtaagctgcaatcttatgacgacactgtctctgtta atcctcatcttgcgttcaatactgaaagatctaactctggcgctcctcaatacaattccaattcccgtgg tcgtggtcgttctgggcaaaacagaggacgcggtggctactctacacgcggcagaggattttctcaacat caatccgcttcaccatcatcaggacaaagaccagtttgtcaaatttgtggtcgcataggacacactgcta tcaaatgctacaaccgatttgacaacaactaccaaagtgaagtccctactcaagcattttctgctctccg tgtctctgatgaaaccggcaaggaatggtaccccgattctgcagccacagcccacataacagcctcaaca tctggtctgcaaaacgcaacaacatatgagggaaacgatgcagtcttggttggagatggaacatacctcc ctattacacatgttggatccaccacaatttcctcatccaaaggtactattccgttgaatgaagtcttagt gtgccctgctatacaaaaatctcttctatctgtgtccaaactttgcgatgattatccatgcggtgtttat tttgatgctaataaggtttgcataattgatttaaccactcagaaagtggtgtccaagggtccacgaaata atgggctctacatgctggagaattcagagtttgtagcactctattcaaatcgtcaatgtgcagctagcat ggaaacatggcatcatcgacttggccactcaaactcaaagattcttcagcaacttttaacccgcaaggaa atccaagtgaataaaagcagaacttctcccgtttgtgagccttgccaaatgggaaagagcactagattac agtttttctcttctgattttcgagctttaaaacctttagatcgagttcattgtgatctttggggaccatc accggttgtatcaaaccaaggattcaaatactatgcagtttttgttgatgatttctcaagattctcttgg ttttttcctttgcgcatgaagtcaaagtttatttcagtgtttattgcatatcagaaattggttgagaatc aacttggtacaaaaatcaaagagtttcaaagcgatggagggggagaatttacaagcaacaaattaaaaga acactttagagagcatggcattcatcatcgtatatcttgtccatatacaccgcaacaaaacggtgttgcc gaaaggaagcacagacatttggtagagcttgggctttcaatgttatatcacagtcatacacctctcaagt tctgggtagaagctttcttcactgccaactatctcagtaatctcttgccttcttctgtcctcaaggaaat aagtccctatgaaactttgtttcaacaaaaagttgattatacacctctccgagtgtttggtacagcctgc tacccctgcttgagaccgttagcaaagaacaagtttgatccacgctcgttgcaatgcgtgtttcttggct atcacaaccaatacaagggataccgctgtttgtatcctcctaccggtaaagtctacatctctagacatgt catttttgatgaagctcaattcccatttaaagaaaagtaccacagtctggttccaaaataccagacgacc ttactacaggcttggcaacatactgatctcacaccaccttcagtgccttcttctcaattacaacctcttg caagacaaatgactcctatggcaacaagtgagaatcagccaatgatgaattatgagacagaggaagccgt caatgttaatatggaaactagctctgatgaggaaactgaatcaaatgatgaatttgaccacgaagtagct cccgtactaaatgatcaaaatgaagacaatgcactaggacaaggctcattagaaaatctccatcccatga ttacaagatcaaaagatggaattcagaagccaaacccccggtatgctctcattgtctctaaatcctcttt tgatgaaccaaaaactattactactgctatgaagcatcctagctggaacgctgcagttatggatgagata gatcgcattcacatgctaaacacttggtctctagttcctgcaacagaggacatgaatattctgacatcca aatgggttttcaagactaaactcaaacctgatggcaccatagataagttgaaagctcgtctagttgccaa agggtttgatcaagaagaaggagtcgactatcttgagacattcagtccggttgttcgaactgcaactata cgtcttgttctcgataccgctactgcaaatgagtggcctctcaaacagcttgatgtgtccaacgcgtttc tccatggagaattacaagaaccggtgtttatgttccaaccctctggttttgttgatcctaacaagcctaa tcacgtttgtcggctcaccaaagctctttatggtctaaaacaagcgcctagagcctggtttgacaccttt agcaactttcttcttgactttggctttgagtgcagcacatctgatccttccctcttcgtttgtcatcaaa atgggcaaagtctcatactcctcttatatgtcgacgatatactcctcacaggaagtgatcaactgctcat ggataaacttcttcaagctctcaacaaccgcttttcgatgaaagatcttgggcctcctcgctattttttg ggtatagaaattgaatcttacaacaatggtctatttttacatcaacacgcatacgcttccgacattcttc atcaagcaggcatgacagaatgcaaccctatgcctacccctctgccacaacacttggaagacctcaattc agaaccctttgaagagccaacatactttcggagtttagctggcaagttacaatacttaacaatcacaaga ccggatattcaatatgccgtgaacttcatctgccaaagaatgcacgctccgaccaactctgattttggcc ttctcaaacgcatactcaggtatgtgaaaggaactatcaacatggggcttccaatcagaaaacaccacaa ccctgttctttcgggattttgcgatagtgattacgctggctgcaaggacactagacgctccactactggt ttctgcatcctcttgggatctactctgatatcttggtctgcaaagagacaacccactatctctcactcct caacagaagccgaatatagagctctttccgatacagctcgagaaatcacttggatttcctctcttctccg agatcttggaatctctcaacatcaacctacacgagtgttctgtgataacctatctgctgtctacctctct gcaaatcctgctcttcataaacgatctaaacacttcgataaagactttcactacatcagggaacgtgtgg ctctcggtctcatagaaacgcaacacatcccagcaactattcaacttgctgatgtcttcaccaagtcact accgcgacggccctttatcacgcttagagccaaactcggcgtgtctgcgtcaccggtctcacccacgcca agtttgaaggaggg1