;ID ATHILA5_I DNA ; ATH ; 7505 BP ;XX ;DE ATHILA5_I is an internal portion of the ATHILA5 endogenous ;DE retrovirus - a consensus sequence. ;XX ;AC . ;XX ;DT 14-DEC-2000 (Rel. 5.9, Created) ;DT 14-DEC-2000 (Rel. 5.9, Last updated, Version 1) ;XX ;KW Gypsy-like endogenous retrovirus; ATHILA5p1; ATHILA5p2; ;KW ATHILA5_LTR; ATHILA5_I. ;XX ;OS consensus ;XX ;OC Arabidopsis thaliana ;OC Eukaryotae; mitochondrial eukaryotes; Viridiplantae; ;OC Charophyta/Embryophyta group; Embryophyta; Magnoliophyta; ;OC Magnoliopsida; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 7505) ;RA Kapitonov,V.V. and Jurka,J. ;RL Direct submission (December 2000) ;XX ;CC ATHILA5_I is an internal portion of the ATHILA5 endogenous ;CC retrovirus. There are several copies of ATHILA5 in the genome; ;CC they are ~96% identical to the consensus sequence. Long terminal ;CC repeats from ATHILA5 are deposited in Repbase Update as ;CC ATHILA5_LTR. ATHILA5 has generated 5-bp target site duplications. ;CC ATHILA5_I encodes two proteins, ATHILA5p1 and ATHILA5p2. ;CC ATHILA5p1 (885 aa, position 292-2949): ;CC MANEANANELPGNIGAGDAPRNHHQRAGIVPPPIQNNNFEIKSGLISMIQGNKFHGLPME ;CC DPLDHLDNFDRLCSLTKINGVSEDSFKLRLFPFSLGDKAHLWEKTLPVESVDTWDDCKKA ;CC FLAKFFSNSRTARLRNEISGFNQKNSESFAEAWERFKGYTTQCPHHGFKKASLLSTLYRG ;CC ALPKIRMLLDTTSNGNFLNKDVAEGWELVENLAQSDGNYNEDYDRTNRGSSDSEDRHKKE ;CC IKALNDKIDKLVLAQQRNVHYITEEELTQLQDGENLTIEEVSYLQNQGGYNKGFNNYKPP ;CC HPNLSYRSNNVANPQDQVYPPQNQPPQAKPFVPYNQGYNQKQNFGPPGFTQQPQQTSAQD ;CC SEMKTLLQQLVQGQASCSMTMDKKLAELTTRIDCSYNDLNIKIDALNTRVKSMEGHIAST ;CC SAPKHPGQLPGKSVQNPKEYAHAISTVNTSATADSGIQEGEVLRPRSRQEIELDFFARLV ;CC ERAHDPSNPIPIPPPYEPKPYFPERIAQINERIFQKHKMMFIKCIKELEEKIPLVDTPKE ;CC VIMERPQEAQQIVELSFECSAIIQRKVIPKKLGDPGSFTLPCSLGPLVFNNSLCDLGASV ;CC SLMPLSVAKRLGFDKFKPSSIHLILADRSVRVPHGMLEDLPVKIGSVEIPTDFVVLEMDE ;CC EPKDPLILGRPFLATAGALIDVQMGKIDLNLGKNLHMSFDIAKKMKKPTIEGQLFFIEEG ;CC NLDAELLSGLENSIPYSIPTHHLGEPEEPLMIEGEPSSEVETKRNHFDVGPIARELMELR ;CC KQYGAQGETMEKLDLKMEELNYAILELKEMIKGYPGPEIEEYFEEPDLGEEDYTTDEKEA ;CC YFEERSNEYSTLQLSRENAEYDSDFEDSASEDEDFSVPLLNLFST ;CC ATHILA5p2 (679 aa, position 3785-5824): ;CC MDPEESRRRASARAVARGFAMINEKGPSRNQEAASPGFTRLTGRVQWPSMAPESSMGRSA ;CC AAREEIARGKRVWESEPVAEEEVPVLEKEASEEDVEIDEEVPIVPARRRRNNPRRKKEPT ;CC IEEHYQYLMELSFEGTRYPHRPTMQALGICRDVDYLMEMAKLETFFSYKCEGYKTESCQF ;CC LATLKLHFYAEERERELHKGVGYITFMVFGIQYSLPIRQLDAVFKFPTKYGIRQNFSKDE ;CC LHDLWLTIAGPLPYKSSRAKSSLIRSPVLRYLHLCIASAFFPKKTTGHVNEGELKMLDLT ;CC LCFILGRTRNGIEMEGDRADTSLSVVLIDHLIGFREYATGIHQSGYGGSLCAGGVITPIL ;CC IAAGVPLHTPTVTANYIDMEYLKRKCYLDRSAPADQLYFKFKHSTLGLSRLALPCKEFTT ;CC VRIGNNIDFDPPQSILVNVLAPLQAEPSIGSESQEEGAEFNQEEAEQEDYTRPSNFQQAE ;CC YGQAEYEQAEFSRAEQFDEQEDSCEAAAEQYFFEDYAESDQERDPGQVHKKLGMLKGLGK ;CC FQRKLFSGLKKKVRKMKRAMDGMAVQIQELQRRQRSPPPPPEFRRCNSTSVAPQRDVRFD ;CC PPRASNYELGRSSTFSDRRFNRRPPGPVNQLVLNADPSREEYYSGYMLDGSTEYNPNTST ;CC HDPEYTQDRLDEFVQNLFV ;XX ;DR [1] (Consensus) ;XX ;SQ Sequence 7505 BP; 2200 A; 1564 C; 1698 G; 2043 T; 0 other; ATHILA5_I atttggcgccgttgccatttgggtgtttttcttgttacatttaggatttctgagttattaagatcaagtt ctatttctttctttctggttactcacttgtttcttcatttgcttgtttttgtcttgcaggtactcatact cgactagcacgtcgagtacatcaaccgagtaagatcaaaacgtatgcaaactcgttctcaaggttctggt aatctcctgcggtacagagacgacatcgacaggattcagcgtgaactcagagaacaacaagccacttcaa acccagtagtaatggctaatgaggcgaatgctaatgagttgccaggcaacattggtgctggtgatgcacc tagaaaccaccatcagagagctggtattgtcccacctcctattcagaacaacaacttcgagatcaagagt ggtcttatctccatgattcaaggcaacaagtttcatggactgccaatggaggacccattggatcatcttg ataactttgatagactctgtagtctcaccaagatcaatggagttagtgaggacagcttcaagctcaggtt attccctttctcacttggagataaagctcatctttgggaaaagactttgcctgttgaatctgtagacact tgggatgattgcaagaaggctttcctagccaagttcttctctaactcaagaacggctagattgaggaatg agatttcgggattcaatcagaagaattcagaatcattcgctgaagcatgggagcgtttcaaaggatatac cactcagtgcccgcaccacggattcaagaaggcctccctcctcagtactctataccgaggtgccttaccc aagatccggatgctcttggatacaacttccaatggtaactttctcaacaaagatgtagctgaaggttggg aacttgttgaaaacctagcacaatctgatgggaattacaatgaggactatgatcgcactaacagaggcag tagcgattctgaggacaggcacaagaaggagatcaaagctcttaatgataagattgacaaattggtgctt gctcaacagagaaatgtccactacattacagaagaagagcttacacaactccaagatggggagaatctta ctattgaggaggtgagctacctacagaatcaaggtggctacaacaagggattcaacaactacaaaccccc tcatcctaatctctcttacagaagcaacaatgtggcaaacccacaagatcaagtctaccccccacagaac caaccgcctcaagctaagccctttgtaccctacaaccaaggctacaaccaaaagcagaactttggacctc caggcttcacccagcaaccacagcaaacttcagcacaagactcagagatgaagactctacttcaacaact tgtgcaaggacaggcctcatgttccatgaccatggataagaagctagctgagctcaccaccaggattgat tgctcttataacgacctgaatataaagatagatgcacttaacactagagtcaagagcatggagggacaca ttgcttctacttcagctcctaagcaccctggacaacttcctggaaaatctgttcaaaatccaaaggagta tgcccatgctatctccacagttaatacttctgccactgcggacagtgggattcaagaaggggaggttttg agaccaagatcaagacaggagattgaactcgacttctttgctcggcttgtcgaacgagcacatgacccga gcaacccaatccctattccacctccctatgaacctaaaccatactttccagaaaggattgcacagattaa tgaaaggatcttccagaaacacaagatgatgttcatcaagtgtatcaaagagttagaagagaagataccc ttggttgatactcccaaggaagtgattatggaaagaccccaagaagctcagcaaatagttgaattgagtt ttgagtgcagtgctatcattcaaaggaaggtgataccaaagaagctaggtgatccaggttccttcactct accttgttcactaggacccttagtgttcaacaatagtctttgtgatttgggagcttcggttagtttgatg cctttgtctgttgcaaagagattgggatttgataagttcaagcctagcagcattcatcttattctagctg atagatcagtgagagtgcctcatgggatgctagaagacttaccggtaaagattggatcagtcgagattcc aaccgactttgtggttctagagatggatgaggaaccgaaagaccctctcattcttgggagaccattttta gcaactgcgggcgctcttatcgatgtgcaaatgggtaagatcgacttgaaccttggtaagaatctccata tgagctttgacattgctaagaagatgaagaagcccaccatagaagggcagcttttcttcattgaagaagg aaatttagatgctgagttgttgagtgggttggaaaattctattccgtactccattccgactcaccaccta ggagagcccgaggagcctcttatgatagaaggagaacctagctcagaggttgagactaagaggaaccatt ttgatgttggtcctattgctagagagcttatggagctcaggaaacagtatggagctcaaggggaaaccat ggaaaagttggacctcaagatggaagagctgaactacgctatcctggagcttaaggagatgattaaaggt tacccaggtcctgagattgaggaatactttgaggaacctgatttgggagaagaggactatactactgatg aaaaggaggcttactttgaggaaagatccaatgagtactctacactccagctatcaagagaaaatgcgga gtatgattcagactttgaggattcagcaagtgaggatgaggacttctcagttcctctcctcaatctcttc tctacctaaacattgtgagagtcaagcttagtgactttaaacaagctcacttgggaggaagtcccatgtc tatccttgtatatattgctttcttgttatttttgatgtttttgtttaagtgtttcaggaaaaaagacttc tgaaaaatttcaggccacactcgaccgtaccactcggctacaggccgagtgtgggcttcaattgaccaaa ggcccaagattgaatagtactcggaccataccactcggcccatggtcgagtatgggcctcactgttcaaa ggcccaagaagatgcagcactcggcccgaagccgagtcagaaaagaagcccatcaggcccaacactgtac tcgacacgcgcgtcgagtatgtcggccgagttcacacgtgcgaggtcaacacaattcaaattcaaatttg aattcgaatttgctcggccccaccaatcaaatcctgctccccaaagcccttcaaagattcttctgcaaat agtgaaagcatgtccccttgaccaaatgaagggtgaagatctaaggtgtttggagggctaggatcaaatc ttcttgtctataaaaccacactcaacaagctaagtcacttacactctctctttgctcaaaaatttcgaat tttacttcttctctccaaacatttcaagattttactctcacttctctctagaattcccagaaactttacc aaaatctcttgttttctctccatacaaaccttttcaaaacctctcttacagcctttgttcagttttaaac atcaatttctcttctactttgacctactttgggttattttgtggtgactatctgcagtaaatcatcccaa gatcatggatcccgaagaatcaagaaggagagcctctgctcgagctgtggcaagagggtttgcgatgatt aatgagaagggtccgtcaaggaatcaggaagctgcgagtcctggttttactcggttgacaggccgagtac agtggccgagtatggccccggagtcttctatgggtcgttctgcggctgctcgagaggaaattgcaagagg caagagggtttgggagtccgagccagttgcagaagaagaagtgcctgtgctagagaaggaagcatctgag gaagatgtggaaattgatgaggaggtcccgatagttcctgcaaggagaagaaggaacaacccaaggagaa agaaagagcctaccattgaagagcactaccagtacctcatggagctgagttttgaggggacaagatatcc ccatagacctaccatgcaagctttggggatatgtagggatgttgactacctcatggagatggccaagctg gagaccttcttctcctacaagtgtgaaggatacaaaactgagagctgccaattcctagccactttgaagc tccatttctatgctgaagaaagggagagagagctacataagggagttggctatatcacattcatggtgtt tggaattcaatactcccttcctattaggcagttggatgctgttttcaagttccccaccaagtatgggatc cgccaaaacttcagcaaggacgagctccatgatctatggttgacaatcgccggtccactcccctacaagt catctagggccaagagttcgttgataaggagcccggtgcttaggtatctccacctttgcattgcaagtgc cttcttcccgaagaagacaaccggccatgttaatgagggagagctgaagatgcttgatctcaccttatgc ttcattttggggcgcacaaggaatgggatagagatggaaggggatagggctgatacatccctttcggtgg tgttgattgatcacttgattggttttagggaatatgcaaccggcatccaccaatccggctatggaggaag cttatgtgctggaggagtgatcacgcccatcctaatagccgctggagttcctcttcatacccccactgtc acagcaaactacattgatatggagtatttgaagaggaaatgctacttggataggtctgccccagctgatc aactctatttcaagttcaagcactccacgctaggtctctctaggctagcactcccttgcaaggagttcac cacagttagaattgggaacaacattgactttgatcctcctcaatcgatcttagtcaatgtccttgcgcct ttacaagcagagccgagcatagggagtgagtctcaagaagaaggagcagagttcaaccaagaagaagctg agcaggaagactatactcggccgagtaactttcagcaggccgagtatggccaggccgagtatgaacaagc tgagttcagtcgagctgaacagtttgatgagcaagaagattcgtgtgaagctgcagcggagcagtatttc tttgaagattatgctgagtccgatcaagagagagatcccggtcaagttcacaagaagttgggaatgctca agggtttgggcaagtttcagagaaagttgttcagtgggttgaagaagaaggtgagaaagatgaagagggc aatggacggcatggcggttcagattcaggagctgcaaagaaggcagagatcgccaccacctccacccgag ttcaggagatgtaactcaacgagtgtggcaccgcaaagggacgttcgtttcgacccgccaagagcctcaa attacgagctcgggaggagctccaccttctcagatcgccgtttcaacaggcgcccacccggtccagtgaa ccagttggttctgaatgctgacccgagccgagaggagtactactcgggctacatgctcgacgggtcaacc gagtacaaccccaacacctccacccatgatcctgagtacacacaagaccgcctggacgagtttgtccaga acctcttcgtctaaatgttgaggtatcactccatttcactgtatatatcattgcatttcttttatttctt gctttgtgtggttatttctcttgaattcttctttgaatttttattacacaagggactgtgtaatttaagt ttgggggagagttcaagatgtatctaacattgtttcatgttttcttattcaaatttttgcatcatctaag gcatagaaaacccataaaaatttgaaaatttttcgaaaatgattccaaaaaaatagagtgtcatgtagtt tgcatttgcattattagggctgtttttagaatgttatcatataggttgttgcattttgcacttgcatagg ggataatgatgatcatagccttgtaaatttgcaatgttcactagatagtttcaatgcccttgttgttagt tgtctagtgcttaaccgattgaacttgaagtaaaaccgcaccatcttttgaattcatatacttgatcttc cttagtcgaaactcgctgtgatttgaagctattccctatcaatttgaaccataatttgacttttaattat catactatgcattgcttgttaaactcatggttacccttaaaatatttggatcttcttattcatttcacca ctcttgttgatccaaatagctgtctctcacctttagagcagtttccccacaccctaacctaagccttctt tcaagccatatatcacttgtgagtgtttgtgaggtcttatttcgattaagcttggtagaaagtgttaggt ttgtaacgacaaagatagtatctcatgtagttctagttcgcgttatccggactagataggactaggtggg tacttattctatgggttgggaagagtttaaaagagaaaaagggttgaattcattgtttacaagaaaaggg aaaagaattctaggagaagtaagctaaagaagttagaaaaagtctagtaaagggtttgagattgttaaag aaagagattgggttattgttagctaatgaagaagggtaaaaagccctaagcttaatagagattaaaaaca gaaccttagtactaaagaaagccaaacccgctagaagtatcaaagagaaaagaaaagcttctcctagagt taagagaaaaagaaaagaatgggttaagaaagagttcaaaagattatgaatgcaaaagggtagagttaag ttcttaattgggatgggagatgggattgccattagatcttcattgattatactttgggtagatgggatct tatctttgtatgcataacttgggacttacctttagcattctactaaagcttaatcattcttgagagatcc cttgttactaaagcctattctttaagggaccatttttgtctcttgaccctttacccttagccaaatgagt ttaatatgcattgtgtagtatgatccatggttcttgcttaatgaatgttaaagggaatatgctgatttga atgcttgaatagactaagtgaaagattaggttgtgttgtgaagaagatggctaaagtttttaagtagaga tcattcaacctagcactctagaactagcaacatggacattgagactatttattttacatgcatattttgg ttctgaatccccaccttcaaacctcactcctagcctagttctatttgttgcttgaggacaagcaaagagc taagtttgggggtgt1