;ID ATCOPIA48_I DNA ; ATH ; 4420 BP ;XX ;DE Internal region of ATCOPIA48 copia-like LTR-retrotransposon. ;XX ;AC AL161543 ;XX ;DT 05-NOV-2001 (Rel. 6.2, Created) ;DT 05-NOV-2001 (Rel. 6.2, Last updated, Version 1) ;XX ;KW LTR-retrotransposon; COPIA superfamily; internal region; ;KW copia-like polyprotein; reverse transcriptase; ATCOPIA48LTR; ;KW ATCOPIA48_I. ;XX ;OS Arabidopsis thaliana ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; ;OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons; ;OC Rosidae; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 4420) ;RA Kapitonov,V.V. and Jurka,J. ;RT Internal region of ATCOPIA48 copia-like LTR-retrotransposon. ;RL Repbase Reports 1:(2) p. 2 (2001) ;XX ;CC ATCOPIA48_I is an internal region of the ATCOPIA48 copia-like ;CC endogenous retrovirus flanked by the 98% identical ATCOPIA48LTR ;CC long terminal repeats, and by the 5-bp target-site duplication ;CC (GAAGT). ATCOPIA48_I CDS, which encodes the ATCOPIA48p copia-like ;CC polyprotein, is interrupted by a non-LTR retrotransposon inserted ;CC recently in the genome. Presumably, the insertion has induced a ;CC 2629-bp duplication which flanks the insert. Both the non-LTR ;CC retrotransposon and a copy of the 2629-bp direct repeat were ;CC removed from the ATCOPIA48_I sequence reported here. ;CC ATCOPIA48p: ;CC MERRSMELYTVPQLNISNCVTVTLTQQNYILWKSQFESFLSGQGLLGFVTGSISAPSPTIPVPDINGVTT ;CC DRPNPEFDVWFKTDKVVKSWLLGSFAEDILSVVVNYVTAHEVWSTLANHFNRATSSRLFELQRRLQTLEK ;CC KDKPMQVYLKELQTIYEQLASVGSPVPEKMKIFAALNGLGREYEPIKTSIEGSIDIPPTPKLDEIMPRLN ;CC GYDDRLQAYAANSDVSPHLAFNTVQANSVFYTNRGRGQGNRRFGGSRGQGSFSTRGRGFHQXLSYNDSSS ;CC NASAERPTCQICGKHGHHALNCWHRFDNSYQLDVLPQALPATQITDITDHSGSEWVTDSAATAHITNSPR ;CC HLQQTKSYAGSDSVMVGNGNFLPITHTGSTSIGSTSGKLHLKDVLVCPLITKSLLSVSKVTKDYPCIFEF ;CC DCDEVRVRDKETKKLLLQGSNRDGLYVLDEPKLLVFYSSRQVAASDEVWHRRLGHPNPHVLQQLSSTKSI ;CC LINKHSKAICEACQSGKSSRLSFSASSFVASRPLERIHCDLWGPSPVMSVQGFRYYVIFIDNYSRYCWFY ;CC PLKLKSDFYTIFAKFQALVQNQLQSKISIFQCHGGGEFTSKVFLNHLQEHGIQQYISCPYTPQQNGLAER ;CC KHRHITDFGLSMLFQGKVPQKHWVEAFYTTNFLSNLLPHTALTDAKSPFELLNKKKPDYQALRIFGCACF ;CC PTLRDYAQHKFDPKSLKCVFLGYNEKYKGYRCLLPTTGRVYISRHVIFDEHSFPFSDTYMHLQPTGVTPL ;CC LSAWQQSFMPQTTASSTASATATSPFNAAEIQVSPVITSNNNTGASVLENGSSQLPIQNSSVLSTVASEE ;CC SSECTESINLLPIGNSSSSLANRTDNADTSPLQEAATETNSSTVQEAAESTTSSTMQEPASNQSTHPMIT ;CC RSKKGITKPNPRYGLLTHKVKYAEPKTVTEALKHPGWTAAMHEEYDNCTEAQTWSLVPYTSDMNVLGSKW ;CC VFRTKLNADGSLDKLKARLVAKGFDQEEGIDYLETYSHVVRSATVRMVLHVATVMDWEVKQMDVKNAFLH ;CC GDLTETVYMLQPAGFVNKEKPTHVCHLHKALYGLKQAPRAWFDKFSNYLLEFGFNCSIKDPSLFIYLKGN ;CC DLILLLLYVDDMVLTGSNSATMIKLLEDLNTQFRMKDLGQMHYFLGTQAXFHENXXCLYTLSGLFLSQQK ;CC YAEDLLTIAAMDECSPMPTPLPLQLHKVPHQEELFANPTYFRSLAGKLQYLTLTRPDLQFSVNFVCQKMH ;CC QPTVSDYNLLKRILRYVKGTLSMGIHFSKHSDFQLRVYTEKDPAFSLRAYSDSDWGGCKDTRRSTGGYCT ;CC FLGTNLISWSSKKQPTVSRSSTEAEYRSLSETAQEMTWICHLLRELGIPLPVTPELYGDNLSSVYLTANP ;CC AFHARSKHFEFDYHYVRERVALGSLVVKHIPAHQQIVDIFTKSLPYEAFCNLRFKLGVDLPPTPRLRG ;XX ;DR Positions 16028 11546 Accession No AL161543 GenBank (rel. 124.0) ;XX ;SQ Sequence 4420 BP; 1228 A; 991 C; 870 G; 1331 T; 0 other; ATCOPIA48_I tggtatcagagctcatggaaagaagatccatggagctctatactgttcctcaactcaacatttcaaattg cgttacagtcactcttacgcagcagaactatattctgtggaagagtcagttcgaatctttcctttctggt caaggcttgcttgggtttgtcactggatctatttctgcaccgtcaccaaccattcctgttccagatatca atggtgtcaccacagacagaccaaatccagagtttgatgtttggttcaagacagacaaggttgtcaagtc ttggcttctagggtcctttgctgaagatatcctgagtgttgttgtgaactacgtcactgctcatgaggta tggtctactcttgcaaatcacttcaatagagctacttcatctaggctatttgagcttcaaaggcgtttac aaactctagaaaagaaagataaacctatgcaagtctatcttaaggagttacaaaccatctatgaacagtt agcttctgtagggagtccagttcctgagaagatgaaaatctttgctgctcttaatggtctaggtagggaa tatgagcctatcaaaacaagtattgaaggttccattgatattcctcccactcctaagcttgatgaaatca tgcctagactcaacggttatgatgatagacttcaggcttatgcagcaaactctgatgttagtcctcatct agctttcaatacagttcaagctaactctgtcttctacaccaaccgcggcagaggtcaagggaaccgtcgg tttggtgggtcacgaggccaaggttccttctccactcggggtcgtggcttccatcagtagctctcataca atgattcttcttccaacgcctctgcagaacgtccaacatgtcagatttgtgggaaacatggtcatcatgc tctcaactgctggcacagatttgataatagttatcagcttgatgtgttaccacaggctctcccagcaacg cagatcacagacattactgatcactctggcagcgaatgggtcacagacagtgctgctactgctcacatca ccaactcgccacgtcatctgcaacagacaaagtcctatgctggttctgattctgtaatggttggcaatgg gaattttctacctatcactcataccggttctacaagtattggttctacttcaggtaagcttcatcttaaa gatgtattggtttgtcctctaattactaaatctttgttgtctgtgtcaaaagtcacaaaggattatccct gcatttttgagtttgattgtgatgaagttcgtgtgcgtgataaggaaaccaagaagcttcttcttcaggg aagtaatcgagatggactctatgtgctggatgaaccaaagcttctggtgttctactcttctcgccaagtt gcagcgtctgatgaagtttggcacagacggttaggacacccaaatccccatgttctccagcagctatcct caacaaagtccattcttattaataaacacagcaaggctatttgtgaagcatgtcagtctggtaaaagctc aagactgtcattctctgcatcatcttttgttgctagtagacccttagagagaattcattgtgatctctgg ggtccttctcctgttatgtcagttcaaggattcagatactatgttatatttattgacaattactctagat attgctggttctatcctctcaagttaaagtctgatttctacacaatctttgcaaagtttcaagctttggt tcagaatcagctacaaagcaaaatctcaatttttcaatgtcatggagggggagaattcaccagtaaggtc tttctcaatcatcttcaagaacatgggattcaacaatacatctcctgtccttacactccccaacagaatg gtcttgctgaaagaaaacacagacacattactgattttggtttatctatgctttttcaaggcaaagtccc tcaaaaacattgggtagaagccttttatactacaaactttctcagtaatctgcttcctcatactgctctt actgatgctaaaagtccttttgagctgttaaacaagaagaaaccggattatcaggccttgagaatctttg gatgtgcttgttttccgacactccgagattatgcacaacacaagtttgatcctaagtccttaaaatgtgt gttcttgggctacaatgaaaaatataagggctaccggtgtcttcttcctaccacaggcagagtctacata agtcgtcatgtcatttttgatgaacactccttccctttctcagatacttatatgcatctgcaacccactg gtgttacacctttgctctctgcctggcaacaaagttttatgcctcagacaactgcttcttctacagcctc tgcaactgcaacctctccattcaatgctgcagagattcaggtttctcctgtgattaccagtaacaacaac acaggcgcttcagtgttagaaaatggttcatctcagctgcctatacagaattcatcagtcttgtctactg tggctagtgaagagagttctgagtgtacggagagcatcaatctcttacctattggcaatagctcttcttc acttgctaacaggacggataatgctgatacttctcctcttcaagaagctgcaacagaaacaaacagctct actgtgcaagaagctgcagaatcaacaactagctctacaatgcaagaacctgcttcaaatcagtctactc atccaatgataacacggtctaagaaaggtattacaaagcctaatccacggtatggacttcttacacacaa agttaaatatgcagaaccaaaaacggttacagaagctcttaaacacccgggatggactgctgcaatgcat gaagagtatgataattgtacagaagcacaaacgtggagtctagttccgtatacttctgatatgaatgtcc ttggaagtaaatgggtgtttcgaaccaagttaaatgcagatgggtctttggacaagttaaaagctcgctt ggttgcaaaaggatttgatcaagaagagggaattgattacttagaaacatacagccatgtggtaaggtct gcaacagtcagaatggtgcttcatgttgcaacagtcatggactgggaagtgaaacaaatggatgtgaaga atgcatttcttcatggggacctcactgaaactgtttacatgcttcaaccagctggctttgtgaataagga aaaacctactcatgtctgccatctacataaagctctttatggtttaaaacaggctcctcgggcttggttt gacaaatttagcaattacttgcttgaatttggtttcaactgcagtattaaagatccatctctgttcattt atttaaaagggaatgatcttatactcttacttctttatgttgatgacatggttttaacaggtagtaactc tgcaactatgatcaagctgcttgaggatttgaatacacaatttcgtatgaaagatcttgggcaaatgcat tattttcttgggacacaagcttaatttcacgagaattgatgatgtttatacactctttcaggcctatttc tatctcagcagaagtatgctgaagaccttctcaccattgcagcaatggacgaatgctccccaatgccaac tccactgccacttcagcttcacaaagttcctcatcaagaagaactctttgctaatccaacttatttccgg agtcttgcagggaaacttcagtacttgacattgacaaggccggatcttcagttttctgtaaactttgtgt gccaaaagatgcatcaaccaacagtttcagattacaatcttctcaagcggattcttcggtatgtcaaagg aactctatcaatgggaatccacttctccaagcactctgatttccagctccgggtttacactgaaaaagac cctgcttttagtcttcgtgcttacagtgatagtgactggggtggttgtaaagatactcgtcgttccacag gaggctattgcacatttcttggcactaacctcatctcctggtcgtctaagaagcagccgactgtttctcg aagctcgactgaagcagaatatcgatcactctcagaaacagctcaggaaatgacttggatctgtcattta cttcgagagcttggcatacctcttcctgtcacacccgagctctatggagacaacttatcttctgtgtacc ttactgccaatcctgccttccacgctcgtagcaaacactttgaattcgactatcattatgtccgtgaaag agtcgccttgggatctttggtcgtgaaacacattcctgctcatcaacaaattgttgacatcttcacgaag tctttgccttatgaagctttctgtaatctcaggttcaaacttggtgtggatttaccacccacaccgcgtt tgagggggag1