;ID ATCOPIA89_I DNA ; ATH ; 4576 BP ;XX ;DE Internal region of the ATCOPIA89 copia-like LTR-retrotransposon. ;XX ;AC AC068809 ;XX ;DT 30-NOV-2001 (Rel. 6.3, Created) ;DT 30-NOV-2001 (Rel. 6.3, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; the ATCOPIA89 ;KW family; ATCOPIA89LTR; ATCOPIA89_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4576) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal portion of the ATCOPIA89 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(3) p. 22 (2001) ;XX ;CC ATCOPIA89_I is an internal region of the ATCOPIA89 copia-like ;CC endogenous retrovirus flanked by the 99% identical ATCOPIA89LTR ;CC long terminal repeats. and a 5-bp target-site duplication (AACAT). ;CC ATCOPIA89 forms a separate family of copia-like retroviruses ;CC present in the A. thaliana genome since members of other families ;CC are less than 75% identical to ATCOPIA89_I and ATCOPIA89LTR. ;CC ATCOPIA89_I (positions 91-4566) encodes remnants of the 1492-aa ;CC ATCOPIA89p copia-like polyprotein. The ORF which encodes ;CC ATCOPIA89p is damaged by one false stop codon at position ;CC 1690-1692 (marked by X in the ATCOPIA89p sequence). ;CC ATCOPIA89p: ;CC MGVPVRKSTRRLGRSVGAGSSGTKLKAPMSSNSPDPAPVVPIRSISEIDAVDSPHSPFFLHSADHPGLTL ;CC VADRLDGTNYTQWSSAMKISLDAKNKIAFIDGSLPRPAEGTPLSRIWSRCNSMVKSWLLNAVSKQIYGSI ;CC LNLDDATMIWNDLHDRFHMTNLPRTFHPIQQIQDLRQGSMDLSSYYTALKTLWDQLDGSEPTESCLCCHS ;CC FNCVSKRHYRGKVDRGRIIKFLAGLNETYSIIRGQIIMKKPLPDIAEVYHILDQDDSQRKFGNSIVPAAF ;CC QVGIAGSHPGVVNADASSPSAGSLIAAYQSFKKDKPTCSYCGFTGHVVDRCYKKHGYPPGWKPRKQQVNA ;CC SPTPQSSSPAITAQVSATVGSGDKQDGGLDSLIGNLSKEQLQHFIAYFSSHLTTQFPASVPLNEASTSGI ;CC SFSPSTYSFIGILTVTQSVTNKRSWIIDSGATHHVSHDKSLFSDLDASVSQHVNLPNGSVVMVAGVGTVI ;CC INTSISLKNVLYIPDFRMNLLSISSLTTDLGSRVIFDPASCLIXDPTKGLTIGRGRRIANLYLLDVEEPA ;CC DSRQLSSYSLNDVIDSAVWHKRLGHTSFSRIDMLTDVLGISKQRNKGVIHCDICQRAKQKKLSYPNRNNL ;CC CSASFDLLHIDVWGPFSEPTVEGFRYFLTIVDDHTRVTWVYLLRLKSDVLTVFPEFLQMVETQFDKRVRC ;CC VRSDNAPELKFTELYRRLGIIPYHSCPETPEQNSVVERKHQHILNVARALLFQSNLPLSLWGDCILTAVF ;CC LINRTPSPLLENKTPFEKLTNTTPEYTDLRTFGCLCYASTSPKQRTKFEDRAKACVFLGYPAGYKGYKLM ;CC DIESNVVFISRNVKFFEDIFLFQNSQASDEVDVTGFFPQISTRVADPGESSGTRTEGEYSSTMRQGESNS ;CC ESVIPNTEDTSRRRVSRPPGYLQDYQCYSVKESVKESTEHPISQVFSTDNLSSSYCAYINALTKYPTPTS ;CC YTQASKVKEFCDAIKDEIGALERTNTWIVCAIPPGKTVVGCKWIYTIKLNADGSLERYKARLVAKGYTQK ;CC EGLDYVETFSPVARMATVKFLLSVAAPRKWFLDQLDISNAFLNGDLHEEIFMALLPGYVDKDGKPFPPNS ;CC VCKLQKSLYGLKQASRQWFLKLSHCLMSMGFRNGTGDDTLFLRRTDDTYMAILVYVDDIIIASSSSTATA ;CC SFTAALKESFKLRDLGPLKYFLGLEVARTSAGISICQRKYVLDLLDETGLLGCKPSSIPMDPSQKLNLET ;CC GDLLTDVEMYRRLVGKLMYLTFTRPDITFAVHKLCQFTSAPRQPHLTAVYKVLHYLKGTIGQGMFYSANS ;CC DLKLKSFSDADWGTCTDSRISVTGFCMFLGPSLVSWKSNKQETVSMSSAESEYKAMSTAVKEMLWLRKLM ;CC NDLWIDASEASVLYCDNTAAIHIANNSVFHERTKYLDLACHLVRERVLMGQIKTLHVQTEHQLADALTKP ;CC LYPTLFLRLIRKMGVINIYTPS ;XX ;DR Positions 74828 70253 Accession No AC068809 GenBank (rel. 124.0) ;XX ;SQ Sequence 4576 BP; 1216 A; 893 C; 994 G; 1473 T; 0 other; ATCOPIA89_I tggtatcagagctatacgcttagctccgatttcgttttcctcctctgattctatttctttggcgtgtttt gtgtggttgtttagatcgatatgggagttccggttcgtaaatctactcgtcgtctgggtagatctgttgg tgctggtagctcggggactaagttgaaagctccgatgagctcaaactcgcccgatccagctcctgttgtt ccgattcgctccatctctgagatcgatgccgtggatagtccacactcgccgttctttcttcacagtgctg atcatccgggtctgactttggttgctgatcgtctagatggtacaaactacactcaatggagctctgcgat gaagatttctcttgatgcgaagaacaaaatcgctttcatcgatggatctcttcctcgtcctgcagaaggt actcctctctctcggatctggtctagatgtaacagtatggttaagtcctggttgttgaatgctgtttcaa agcagatctatggtagtattttgaacttggatgatgctactatgatctggaatgatttacatgatcgttt tcacatgacgaatttgccgagaacatttcatcccatccagcaaattcaggatctgcgtcaaggttccatg gatctatcaagctattacacagctttgaaaactctctgggatcaacttgatggcagtgaacccactgagt cgtgtttatgttgtcattcttttaactgtgttagcaaaaggcattatcggggtaaggtggatagaggtcg catcatcaaatttcttgctggtttgaatgaaacctattccatcatcagaggtcagataatcatgaagaag cctcttcctgacattgctgaagtttatcacatcttagatcaagatgacagtcaaaggaaatttggtaaca gcattgttccagctgcttttcaagtagggattgcagggtctcatcctggagtggtgaacgctgatgcatc ctcgccttctgctggttctttgattgctgcttaccagtcttttaagaaagacaaaccaacatgttcctat tgtggtttcactggacatgttgtggatagatgctacaagaagcatggctatcctcctgggtggaaaccta gaaaacaacaagtcaatgcttctccgactcctcaatcgtcttctcctgcaatcacagcacaagtgtctgc tactgttggatcaggtgataaacaggatggtggtctggatagtttgattgggaatctcagtaaagagcag ttacaacacttcattgcttacttcagttctcatctcactactcagtttcctgcttcagttcctttaaatg aggcttctacttctggtatatctttctcaccatctacctatagttttattgggattttaactgttactca gagtgttacaaataagagatcatggataattgattctggtgctactcatcatgtaagtcatgataagagc ttattcagtgatctagatgcttctgttagtcagcatgttaaccttccaaatggtagtgttgttatggtag ctggtgtgggaacagtgattataaacacttctatcagtttgaagaatgttctctacataccagatttcag aatgaatcttctcagtataagctcgttgactacagatcttggttctcgagttatttttgatcctgcttct tgcctcatataggatcctaccaagggattgacgattggaagaggtagacggattgctaatctttacttgt tggatgttgaagaacctgcagattcaagacaactatcttcttatagtttgaatgatgtaatagactctgc tgtttggcataagagattaggacatacctctttttctcggattgatatgcttacagatgttcttggaatt tctaaacaaaggaataaaggagttatacattgtgacatttgtcaaagagctaaacaaaagaaactttcat atcctaatagaaacaatctttgttctgcatcttttgacttgttacacattgatgtttggggtccattctc agagccaacagtagagggattcagatacttcttgacaatagtggatgatcatactcgtgtcacctgggtt tatttgctgagattgaagagtgatgttttgactgtgtttccagaatttctgcaaatggtagagactcagt ttgataagcgagttcgttgtgtgagatcagataatgctccggagttgaagtttactgagctttatcgacg gctgggcatcattccttaccactcctgtcctgaaacaccagagcagaactccgtggttgagaggaaacac cagcatatactgaatgttgcacgagctcttctttttcagtccaacctaccgttgtctctgtggggggatt gtattcttacagctgtgttcttgatcaatcgcacgccatcaccattgttggaaaataagactccttttga gaagctcactaatacaacaccagagtacactgatttgagaacttttggctgcttgtgttatgcgagtact tctcctaagcaacgaactaaatttgaagatagggctaaagcttgtgtcttcttgggttatccagcaggtt acaaaggatataaactgatggacattgaaagtaatgttgtgttcatttccagaaatgttaaattctttga agacatatttctttttcagaattcccaagctagtgacgaagttgatgtcacagggttctttcctcagata agtactcgtgttgctgatccaggagaatcttcagggactagaactgagggggaatattcgtctacaatga gacagggggaatctaactctgagtcagtaatacctaatacagaagatacatcacggcgacgagtttcgag acctcctggttatttgcaagactatcaatgctattcagtgaaagaatctgtgaaagagtctacagaacat ccaatatcacaggtcttttctactgataatctctcttcttcttattgtgcatacattaatgctctcacca aatatcctactccaactagttatactcaagctagtaaagtcaaggaattttgtgatgctataaaggatga gataggggctttagagcgaactaatacttggattgtttgtgctattcctcctggtaagactgtggttgga tgtaaatggatttatactatcaagttgaatgcagatggcagtcttgaacgttataaagctcgtctagtgg caaagggttacactcagaaagaaggacttgattatgtggaaacgttctcaccagtggcaaggatggcaac tgtcaagttcttactctctgttgcagctcctagaaaatggtttttagaccaactggatatatccaatgct tttctcaatggagatcttcatgaagagattttcatggccttgcttcctggttatgttgataaggatggga agccatttccaccaaattcagtttgcaagcttcagaagtctctgtacgggttgaaacaagcttcacgaca gtggtttttgaagctctctcactgcttgatgtctatgggttttagaaatggaacaggagatgatacttta tttctcaggagaacagatgatacttacatggctatattggtttatgttgacgatataatcattgcaagca gctcttctacagccactgcgtcttttacagccgctttgaaagaatcttttaaactgagagatttaggacc tctcaaatacttcctaggtcttgaggttgcaaggacttctgctggcatttctatttgtcaacgcaagtat gtgttagatcttttggatgagacaggtcttctaggatgtaaaccatcttctatacccatggatccaagtc agaaactgaatctagaaactggagatcttttgacagatgtggagatgtatagacggctcgttgggaagct tatgtatctgacttttactcggccagacatcacctttgcagttcataagctgtgtcagttcacctctgcg ccgcgacaaccacacctcacagctgtctacaaagtgttacactacttgaaaggcaccattggtcagggta tgttctattctgccaattctgatcttaagttgaagagtttttcagatgcagactgggggacttgtactga ttcaagaatctcggttactggcttctgtatgttccttggtccatctttggtttcatggaagtccaataag caggaaacagtctccatgtcatctgcagaatccgagtacaaagctatgtcaactgcagtcaaggaaatgt tgtggcttcgaaagctcatgaatgatttgtggattgatgcttcagaagcttctgtgctttactgtgacaa cacggctgctatacatatagctaacaattcagtctttcatgagagaactaagtatttggaccttgcttgt catcttgtcagggaaagggttctaatggggcagatcaagactcttcatgtgcagactgaacatcaattgg cagatgcgttaactaaacctctatatcctactttatttttgagactcattcgcaagatgggtgttattaa catatacactccatcttgaaggggaa1