;ID ATCOPIA44_I DNA ; ATH ; 4190 BP ;XX ;DE Internal portion of the ATCOPIA44 copia-like LTR-retrotransposon. ;XX ;AC AL133315 ;XX ;DT 02-SEP-2001 (Rel. 6.2, Created) ;DT 02-SEP-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; pol; reverse transcriptase; ;KW ATCOPIA44LTR; ATCOPIA44_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] ;RA Choisne,N., Robert,C., Brottier,P., Wincker,P., Cattolico,L., ;RA Artiguenave,F., Saurin,W., Weissenbach,J., Mewes,H.W., Lemcke,K., ;RA Mayer,K.F.X., Quetier,F. and Salanoubat,M. ;RL Direct submission in GenBank (December 1999) ;XX ;RN [2] (bases 1 to 4190) ;RA Kapitonov,V.V. and Jurka,J. ;RL Direct submission (August 2001) ;XX ;CC Two copies of ATCOPIA44LTR flank the ATCOPIA44_I internal sequence, ;CC they are 2% divergent from each other. ;CC ATCOPIA44_I encodes the 1349-aa ATCOPIA44p copia-like polyprotein ;CC (it is disrupted by one stop codon emerged after integration of ;CC ATCOPIA44 in the genome). ;CC ATCOPIA44p: ;CC MAMSSKVEIKTFNGDRDFSLWKIRIEAQLGVLGLMNTLTDYSLTKFVPVPKSEGKKPETDEESSPTEEVP ;CC DLIKIEQSKQAKNIIINHITDAVLLKVQHCVSAADMWATLNKLYMETSLPNRIYTQLRLYSFKMLETMSI ;CC DQNIDQFLRIVAELGSLQIVVAEEVQAILILNSLPVSYIQLKHTLKYGNKTLCVQDVVSSAKSLEHELAE ;CC SKESERGSSTVLYTTERGRPQNRSQQQGNKGKGRSRSNSKTKVTCWFCKKEGHVKKDCFAKKRKLESEGP ;CC GEAGVIIEKLEVSEALNIGDRLVKDMWVLDSGCTSHMSSRRDWFSDFEENDGTTILLGDDHTVKSQGQGS ;CC IRIKANGGSIRILKNVKYVPNLRRNLISTGTLDKLGYHHEGGDGKVRYHKNNATALVGRLINGLYVLDGE ;CC TIMSESFNAEDTKSSTELWHSRLGHMSLNNMKILAGKGLLQKNDVKELEFCEHCVMGKSKKLSFNVSKHI ;CC TEEALGYVHADLWGSPNVTPSLLGMKYFLSIVDGKTRKVWLMFLKSKDETFDRFCEWKELVETQV*KKVK ;CC VLRTDNGLEFCNSKFEDYLKKFGIERHRTCAYTPQQNGVAERMNRTLMEKVRCLLSESGLEEIFWAEAAS ;CC TVAYLVNRSPAFAVDHNVPEELWLNRKSGYKHLRRFGSVAYVHQDQGKLKPRALKGVFLGYPQGTKGYKI ;CC WLLEEMKCVISRNVIFHEDLVYKDLQLKEKSEQEERAEKITQAEKTVSEIVSNQQQVGESSVAGGTVDVS ;CC SSDDESEYFEPEGEAPASSERLSNYQLARDRVRRQIRAPIRFSDYSQFAYALMAAEDMDSSEEPSCYHEA ;CC KETKEWEKWNAGMGDEMQSLLKNYTWDIVDHPKNQKIISCRWLYKKKPGIPGVEPERYKARLVARGFTQR ;CC KGIDYDEVFAPVVKHVSIRILMSIVVQEDLELEQMDVKTAFLHGDLDQPLYMEQPEGYVADEQKDQVCLL ;CC KKSLYGLKQAPRQWNKKFNSFIMDQNFIRSGHDSCVYIKQVSDEEFVYILIYVDDMLIAAKSMTEINKIK ;CC EALSTGFEMKDMGAASRILGIDIIRDRKAGTLRLSQTGYLEKVLHMFNMTEARPVSTPMGAHFKLASVVE ;CC EEECVDTDKVPNSSAIGSIMYAMVGTRPDIAQAIGVLSRFMSKPGKIHWTAVKWLLRYLKGSTDLNLVFT ;CC REKDFRVQGFSDSDYAADLDRRRSTTGYVFTVGGNTVSWKSNLQSIVALSTTEAEYVALTEAVKEALWIQ ;CC GLLTEMGFKQEKVTLWCDSQSAISLAKNNTFHERTKHIAIKFNFIRDVIEEGSVEVLKIHTSQNPADMLT ;CC KGIHVQKFESALEFLKLLR ;XX ;DR Positions 19000 14811 Accession No AL133315 GenBank (rel. 124.0) ;XX ;SQ Sequence 4190 BP; 1392 A; 668 C; 1025 G; 1105 T; 0 other; ATCOPIA44_I aagtggtatcagagccttggtttctgaattgaggaggtttgatgcttcgatagcgttaatggcgatgtcg tcaaaggtagagatcaagacatttaatggagatagagacttttctctatggaagattcggattgaagcac aacttggagttctgggtttgatgaacactttaacggattactctttgacaaagtttgttccagtcccaaa gagtgaaggaaagaaacctgaaaccgatgaagaatcatctccgactgaagaagttccagatctgatcaag attgaacaatcgaaacaagctaagaacatcataatcaatcacattactgatgcggttcttcttaaagttc agcattgtgtatctgcagctgatatgtgggcaacgctaaacaagctctacatggaaacatctctgcctaa caggatctatactcaacttagactttactcattcaagatgcttgaaacaatgagtattgatcagaacatt gatcaattcttaagaattgtggctgaacttggcagtctgcagattgtagttgctgaagaagtgcaagcaa tcttgatcttgaattcattgcctgtgagttatatccagttgaagcacactttgaagtatggtaacaagac tctctgtgtgcaggacgttgtatcatcagctaagtcattggaacatgaacttgctgaatctaaagagtct gaaagaggctcttcaactgtgttgtatacaactgaaagaggtagaccccaaaacaggtctcagcagcaag gaaacaaagggaaaggcaggagcagatctaattccaaaacaaaggtcacctgctggttctgtaaaaagga aggtcatgtgaagaaagattgttttgctaaaaaaagaaaactggaaagtgaaggtccaggagaggctggt gttatcattgagaaactagaagtttctgaagccttaaacattggtgacagattggtcaaggacatgtggg tactagactctggatgcacatcacatatgtcatcaagaagagactggtttagtgattttgaggagaatga tggcacaacaattcttcttggtgacgatcacacagttaagtctcagggacaaggttctattcggattaag gcaaatggtggatcaatcagaatcttgaagaatgtcaagtatgtgcctaatctcaggcgaaatctaattt caacaggcactctagataaactgggatatcaccatgaaggtggagatggtaaagtgagataccataagaa caatgcaactgcattagttggacgtttaatcaatggactgtatgttctggatggagagaccattatgtct gagagctttaatgcagaagacactaaaagcagtactgaattatggcatagcagacttggccatatgagtt taaacaacatgaagatactggctggaaagggactgctacaaaagaacgatgtcaaagaactagagttctg tgagcactgtgtaatgggaaaatccaagaagcttagcttcaatgtcagcaagcacatcacagaggaagct ctaggatacgttcatgcagacctatggggctctccaaatgtaactccatcactcttaggtatgaaatatt ttctgtctattgttgatggtaagacaagaaaggtttggcttatgtttcttaaatctaaagatgagacatt tgaccgtttttgtgagtggaaagaacttgttgaaacacaggtgtgaaagaaggttaaagtgctcaggaca gataatggattggagttttgtaattccaaatttgaagattacctcaagaagtttggtattgaaaggcaca ggacatgtgcttatacccctcagcagaacggtgtagcagagagaatgaacagaactcttatggaaaaagt gagatgtcttttgagtgaatcaggtcttgaagaaatattctgggctgaagctgcttcaactgttgcatat ctggtgaacaggtcacctgcttttgcagtggatcacaatgtacctgaagagttgtggttaaacaggaaat ctgggtacaagcatttaaggaggtttgggtcagttgcttatgtacaccaagatcaaggaaagcttaaacc aagagctttaaaaggtgtgttcctcggttatccgcaaggcactaagggatacaagatctggctcttagaa gaaatgaaatgtgttatcagtcgaaatgtgatatttcatgaggacttggtgtataaggatttgcagttaa aagagaagtctgaacaagaagaaagagcagagaagattactcaagcagaaaagactgtctctgaaatagt aagtaaccaacagcaggttggtgagagttctgttgcaggtggaacagttgatgtttcatctagtgatgat gagtcagagtattttgaacccgaaggagaagctccagcaagcagtgaaagactgagcaattatcagttag ctagagatcgggttagaaggcaaatcagagcacctataagattctctgattactctcaatttgcatatgc tcttatggcagctgaagatatggacagcagtgaagaacctagctgttatcatgaagctaaagaaaccaaa gagtgggaaaaatggaatgcaggaatgggagatgaaatgcagtcactattgaaaaactatacatgggata tagtagaccatcccaagaatcagaaaatcatcagttgtagatggctgtacaagaagaaaccaggaattcc cggtgtggaacctgaaagatacaaagccagactagtagcaagaggctttactcagagaaaaggaatcgac tatgatgaagtgtttgcacctgtagtcaagcatgtgtcgataagaatcttgatgtctattgttgttcaag aagatctagaattggaacaaatggatgttaagactgcgttcttgcatggtgatctggaccagccacttta catggagcaacctgaagggtatgttgctgatgaacaaaaggatcaagtgtgcttgttaaaaaagtcactc tatgggttaaaacaagcaccacgtcagtggaacaagaagtttaactcttttatcatggatcagaacttta tcagaagtggtcatgattcgtgtgtttacataaaacaggtgagtgatgaagagtttgtgtatatactgat atacgttgatgacatgttgatagcagctaagtcaatgactgaaatcaacaagatcaaagaggcgttgagt acaggatttgaaatgaaggatatgggtgcagctagtcgaatactgggaatcgacattataagagacagaa aagcaggtacattgcggttgtctcagacaggatacttagagaaagtgcttcacatgtttaatatgactga agcaagacctgtaagcacacctatgggagcacatttcaaacttgcctcagtagttgaggaagaagagtgt gtggacactgataaagttccaaactcaagtgctattggcagcatcatgtatgccatggttggcaccagac cagatatagctcaagctattggagttctaagcaggtttatgagcaaaccaggtaagattcattggactgc agtgaaatggttgctaagatacttaaaagggtctacagatttgaatctggttttcaccagagaaaaggat ttcagagttcaaggctttagtgactctgactatgcagcagaccttgacaggaggcgttcaacaacaggtt atgtattcactgttggtggaaatacagtaagttggaagtcaaatctgcagagcatagtggctttatcaac cactgaagcagagtatgttgctttaactgaagcagtgaaagaagctttgtggattcaggggttgttaaca gaaatgggattcaagcaagagaaggttactttgtggtgtgactcacagtcagcaatcagtttggcaaaga acaacacgttccatgaaagaactaaacatattgcgatcaagttcaactttatcagggatgtgattgaaga aggaagtgttgaagttcttaagatccatacttctcagaatcctgcagacatgcttaccaaaggcattcat gtgcagaagtttgagtcagctttagagtttctaaagctactcaggtgaggtggaaagacatctcagccga aggtgaaatccaagtaagtgcaagtccactagttatggagagatttgaatcaaggtggag1