;ID ATHILA4A_I DNA ; ATH ; 2636 BP
;XX
;DE ATHILA4A_I is an internal portion of the ATHILA4A endogenous
;DE retrovirus - a consensus sequence.
;XX
;AC .
;XX
;DT 16-JAN-2001 (Rel. 6.0, Created)
;DT 16-JAN-2001 (Rel. 6.0, Last updated, Version 1)
;XX
;KW Gypsy-like endogenous retrovirus; ATHILA superfamily; internal
;KW portion; gag; pol; ATHILA4A_LTR; ATHILA4A_I.
;XX
;OS consensus
;XX
;OC Arabidopsis thaliana
;OC Eukaryotae; mitochondrial eukaryotes; Viridiplantae;
;OC Charophyta/Embryophyta group; Embryophyta; Magnoliophyta;
;OC Magnoliopsida; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN [1] (bases 1 to 2636)
;RA Kapitonov,V.V. and Jurka,J.
;RL Direct submission (January 2000)
;XX
;CC ATHILA4A_I is an internal portion of the ATHILA4A endogenous
;CC retrovirus. There are several copies of ATHILA4A_I in the genome;
;CC they are ~93% identical with the consensus sequence.
;CC Apparently, ATHILA4A is a non-autonomous element; its internal
;CC portion encodes one protein, ATHILA4Ap, which is related to
;CC the "ORF1" proteins encoded by ATHILA and retroviruses expressed
;CC in Vicia faba cells (GenBank GI: 7488857 and 2522227).
;CC ATHILA4A was active in the A.thaliana genome relatively recently
;CC since its proviral copies are flanked by 89-97% identical LTRs.
;CC There is ~80% identity between 160-bp 5' ends of ATHILA4A_I
;CC and ATHILA4_I.
;XX ;DR [1] (Consensus) ;XX ;SQ Sequence 2636 BP; 736 A; 560 C; 561 G; 779 T; 0 other; ATHILA4A_I aatttggcgccgttgccgaagttctctacgattcaccattagattaggattagaattttgtctaagcttt tgtttttgtttttctttttatttacttagtttttctcttttctttttcttgtgtgtttcaggtgtttgtc taatcacagaaccaggaccaaacgagaaacatgaacgagagtaagattgctgggatcatggctaagctta acgcggtccacaatcaacttgtggaggatgtcgattatgttggtagcagaggttttccgaatcagagatt tgatcaccaacaaggctacagaggttcctatgggaatggtccaactagctacactcagaattctcagttc cagcaaccgcttcagaatagcaatagtttcagcttcacaaggaactatgatcttgcttcttaccaagccc cccctccacccgcacctagaagcaagatagaatcattgctggaacagattctggaaggtcaacagaggat cttagagagacctatccctagatggaatgagagcagcctcactacttcttggcatgctgcttcaatggaa agtgagtatgagttagtgaagattgaggagagtgacgattttgaagttgcagaggcagtgtcgaccgaca ccaacccttggtgtcgatcgacaccatcgcatatatacgagacatcaggctgtgacgaagcaaaatcgac caggctcgacattgatctccagaatcgctcgacacacattgatagtgtcgaccgacaccacctgggtgtc gatcgacaccagtaccaacctgaggaagtgtcgacgtttgacagagcagaatcgaccgagattgacagag caggatcgataccagatagtagtgtcgaccgacactcacctagtgtcgatcgacactggacgcagactgc accaatcagccttgactttccacctgttgctgaactagtctacacacctgaaattccttgtccattgcct caacagtatatgcaagagattcaggaggagcgctgcaaggctccacagcatcaagaagacaaaaggaaga caattatcgtctcagaacagatttcagagctgaattgtcaacatgctaaggaaagtcgagttgtagcacg agtgtcgatcgacaccaccactcgcgtcgatcgacacacatcactggttgaagcaaaatcatctccgata gctacttttcaagctgggatcactgctataagtccaggcactactgtcacgtcacccactgcacctctca tacttcatccacctcctaaacggctaaggtactcatctcctcctcctaaacctccagatttcatcaataa gactttaaaattttcaaaaactgttttacggttaagcaagccacgagtttctcgtcgagcgcttgttcgt tcttttgatgattgtgcaagaaaaaggagtattccacccatacctgagtcccaccctgctgatgctcctc tcttcctagatagacctcatgcgctaactgcttatacgccagcatggatgatgagaggtctttctccgcg atatttactgccaccatctgatccacctgatccccatattccatcaaagtcaagctattgactataaaca agcgcttagtgggaggcaacccactggtatgatttttatttttctttttcttttattttatttttttctt tttctcgcttactccggatttgaatgccactggggacagtgtcgtttaagtctgggggaggcgcttacta actacgtttttctttttgagtcttattcttattcttatttttcttttttgagtcgttttattgagtcaaa 
attttgttaggaattatgatctctattctttagattattgcttggttgattaatgctgcaggttcgactc aatccgactattggaaaatctagatggcaaaccaacaaacttgaggatatcctggtttccaacacaacca ctaaggctggagctaatctcactgctcttccttttacctcttcaaacggttagtattcatagatcttgac tccttgttcatacctgatattgagaccaatgtgataaaaagaatgtgttcctctccccttataaaaagaa aaacaaaatgagagattgatacgattttcagatagtgtataggggtaggaatgtttcctatgaccccatt attatacattatttggaatgatcaattacaaaggttggaaatatgccgagggtgtcgatccagtatgcga gttcccctttgtttactctctaaaaagaataagtctgggggagagaaagaacaccaaagaaaaataaaat ataaagaatggtcgatttatcatggtatagtacaagaatgagtctagaaggatcttgaatgttagcttgt ttgatgagacccagcgatgaaagcctggtggatatagtagtggaacttaggtatggaatctagaatgttg tgtaagcttgcagagaagtacatgttggttaggatttgattagaactgctaggcatatggatttggttag aatgatcatggcaatagactagagaatgttgggagttcctgtgttttcaaacctcttccacgggttcggc ttttgtgttttgcttgagggcaagcaaaagctaagtctgggggagt