;ID ATHILA_I DNA ; ATH ; 7414 BP ;XX ;DE Internal portion of the ATHILA endogenous retrovirus. ;XX ;AC X81801 ;XX ;DT 11-NOV-1996 (Rel. 2.2, Created) ;DT 04-DEC-2001 (Rel. 6.3, Last updated, Version 2) ;XX ;KW LTR retrotransposon; GYPSY superfamily; ATHILA_LTR; ATHILA; ATHILA_I. ;XX ;OS thale cress. ;XX ;OC Arabidopsis thaliana ;OC Eukaryotae; mitochondrial eukaryotes; Viridiplantae; ;OC Charophyta/Embryophyta group; Embryophyta; Magnoliophyta; ;OC Magnoliopsida; Capparales; Brassicaceae; Arabidopsis. ;XX ;RN [1] (bases 1 to 7414) ;RA Pelissier,T., Tutois,S., Deragon,J.M., Tourmente,S., Genestier,S. ;RA and Picard,G. ;RT Athila, a new retroelement from Arabidopsis thaliana ;RL Plant Mol. Biol. 29 (3), 441-452 (1995) ;XX ;RN [2] (bases 1 to 7414) ;RA Pelissier,T. ;RT Direct Submission ;RL Submitted (19-SEP-1994) T. Pelissier, GDR 977 Biomove CNRS, ;RL Universite Blaise Pascal, 24 Avenue des Landais, 63177 Aubiere ;RL Cedex, FRANCE ;XX ;CC ATHILA_I is an internal portion of the ATHILA LTR retrotransposon. ;CC ATHILA 10.5 kb elements, representing 0.3% of the genome, present ;CC several features of retrotransposons and retroviruses. Athila elements ;CC are flanked by 1.5 kb long terminal repeats (ATHILA_LTR) that are ;CC themselves bounded by 5 bp perfect inverted repeats. These LTRs ;CC start and end with the retroviral consensus 5'TG...CA3' nucleotides. ;CC A tRNA-binding site and a polypurine tract are found adjacent to the 5' ;CC and 3' LTR respectively. The central domain is composed of two long ;CC open reading frames (ORFs) of 935 and 694 amino acids. Despite several ;CC indications of recent transposition activity, the translation of these ;CC ORFs failed to reveal significant homology with proteins associated to ;CC retrotransposition. The authors suggest that the Athila family could ;CC result from the transduction and dispersion of a cellular gene by a ;CC retrotransposon. ;CC ;CC primer_bind 4..18 ;CC /note="complementary to Lupine tRNAglu 3' end" ;CC CDS 193..3000 ;CC /note="Athila ORF 1" ;CC /codon_start=1 ;CC mRNA 3709..>5793 ;CC /note="Athila ORF 2" ;CC misc_feature 7407..7413 ;CC /note="poly purine track" ;XX ;DR Positions 1540 8953 Accession No X81801 GenBank (rel. 95.0) ;XX ;SQ Sequence 7414 BP; 2284 A; 1624 C; 1622 G; 1884 T; 0 other; ATHILA_I atttggcgccgttgccagtttggtgtttgtttgctatatttgagatcattagaaacattaagatcaagtt cgtttttgttacacaagttactaacattgtgccttttctgtctttgtgtttcaggtaccttgcaacctgc tttccagacgcatctctgattcgtctgttatagcacggctgctatctgttgcatgcatacacggtcacaa ggaaatcagaacctgctattcaacgataacatcgaccgtattgctcgccaactaagagaacagacagaaa cagacacaatggctgacgttgtagatgagcaagagcaacctaccaacattggtgctggtgacttccctca caaccacaaccagcgtcatggaatcgttccacctccagtacagaacaacaactttgagatcaaaagcggt ctcattgctatggttcaggggaacaagtttcatggcctgctaatggaggatccgctagatcatcttgatg agtttgaaaggctctgtcgccttactaaaatcaatggagttagtgaagatggtttcaagcttcgcttgtt cccattttcacttggagataaagcccatctgtgggaaaaaacgctaccccatggatcaatcacgacctgg gatgactgcaaaaaggccttcttggcaaaattcttctccaactccaggactgcaagactccggaacgaga tatccggattcactcagaagcaaaatgaaagcttctgtgaagcttgggagcgctttaagggttatccaac caaatgccctcatcacggatttaaggaagcatctcttctcagcacactttacagaggcgtcctgcctaag atacgtatgctccttgacaccgcctccaacgggaatttcctgaacaaggatgtcgaagacggatgggagc tagttgaaaacttggctcagtcggatggcaattacaatgaggattatgacagaagcatctgcaccagtac tgaatctgatgacaagcaccgccgagagattaaagctctccatgacaaaatcgacaaactcctccaagtg cagcagaagcatgttcacttcgcctctgaagacgaagtattccaaatccaagagggggagaatgatcagg gcgctgaaatcagttatgttcagaaccaaggcggctccaacaaagggtacaacaattacagacccaaccc aaatctgtcatacagaagcacaaatgtcgccaatcctcaggaccaagtttaccttcaacagcagctgcag cagaaccaacccaaaccgtttgttacttacaaccaaagcctagggttcgttcctaagcaacagttccaag gagggtatcaacagcaacagccaccacctggtttcacaccacaacaacatcaggcctctgcaccccagga atcagacatcaagaatatgctccaacagatcatgcaaggtcaagccgcaggggtaatggaatctgctaag aagatagctgacttaaacaacaaagttgacagacagttcaatgaactgagtagcaggttggaatcattga acacaagggtacggtatctggacggcgtcactacttcaccacctgttacaaacaacccaggccagcttcc aggaaaagcaatccaaaaccctaaggagtacgccactgcacacgcaatcactatcagacatgagcgagag ctgccaactcaacatgtctctacatcaaacactgaggacagcgtaattcaagaaggggaggcttccactc agattgaagtctctgttgctgagatcgactattcagctcaacctttctttcaggctcagtctgacttgga ggagaaggctgctataatcgagaggatggtgaaacgattcaagccagctccatcaccctcacttgctctt ccatggacgttcaggaaagcttggaaaagcagatatgaatctctggcagaaaaacaacttggtgagatgg aagctgtgatgcccttagtagaagttctcaagttgatttcagaccctcacaaagatgtgagaaacctgat actggaaaggctcaacatatatcaggattcagatgacgaagacgatgtcattatgcatcaagccgctggt gagaggattattcaagagaagctggaagatcctggatctttcactctgccatgttcgattaggcaattga ctttcaacaattgtctatgtgatctaggagcctcagtcagtttaatgccactctctgtggcaaggaagct gggattcattcagtataagccctgtgacatcactttgatccttgctgatagaacttcaaggagacctttc agcattctagagaacgtaccagtaatgatagatggagttgaggtacccactgattttgtcgtgcttgaaa tggatgaagagtctaaggatcctttgatcttaggaagacctttcttagcctcagtgggagcagtaataga tgtcagatatggtaagattgatctcaaccttgggaggcatgtcaagttacagtttgacatcaacaaattt ccgacaaggccaccaatagaaggaaaaaccttagaggtccagagagcggatccaagtgaaggccttgaga tcaaaagagggaaggagcaaatttcggttcaaactccgccatcgaactcgactactcgatcgagtatacc ctcctcacttgattggctggaactcaaaaggaagactgatctacagaacagaaccattcagaagctgagc tacactgttgaaaagcttagagacaaactaagtcgaatgcaaaaggaggttcagccccagctcaaccatg acacgatttcaagaaaggagatcacctgggattggtcagaagagaaagactaccctccagaggaggaagc agcatactatgaggaaagaagaattgagtactcagctgtgcaactgtcaagggaggatgctgagtatgat gatgatatcagtgaagaggaggacttctcaagttccctctgtcacattttctccacttgattagtgtgag gagtcaagctagagactctaaacaagctcacttgggaggaagtcccatgactatccctgtatatactttt tatttatttgttacttttgattcgttttggttgtgtctttgattctcaggaaatgaataacagctggagt gatcaacaaggatgtgatcgagtgttaaaatgaaaattttccaaattttttttcaaatactcgatcacaa cataggagtgaaagctcaggagtgcgatatagtagattcgggataacaattttctactagatcgcaagga ggagtgaaagtgtccacagaacgtcaatgacaaacctcagtcccattaagcctactggatttcaatctcc accaaaactccaaaaagcccatgctgtgtccaattgtagcccatgacacgaaattaaaggatcaacaacc catactcgatcacaaggcgcgtgacttgaaggaaccccatcctgccgacataaagcaccatgttttgtca ccttatcacgttgccgacatcataacgggcattgcatcccatccgtccgtttaaaagcaacacacctcct ccactcattcaaatcgcaaagggagataagtgtcatcactcttcacttttcttcatcacttgtgatttca aaacatctctcttgtttatctctctacttcactaccaaaaacgctctcaactctttcgttgttgctagct gttattgactaaaacagaacctcagaaactactctatcgcaaccctgtctctcatttgcttgcttcactc gatcgcgacctgagctctcactatcaaattctactcgatcacaactctatatcaataggaagtcatttct ttcattgtctttgtttgtttctcactaacctactgagctttagttgtgagtttcaaatcgatttcagctt caagatgagttcacacagctacgaatcatctatggatgccgactacaatgttgatgatgttgaatcttgg tcaactaggccaaagagagaagcagatgagtataggcgatttgtggaggaaacagagcgagcagtcgcca atgacaggagaagagaagagatagccagagggaagaggtcaatgacggaaaattatgagctaattgatga agagatggaagatgatgcggagtacattccagagcaaacccgcaagaccaccaagtccctaatgaaagaa gacaagctgacgccgggagactactacaaggctctcaaaagaaatccgttttggggaacaagatatcctc acccggagacaatggcggagttaggcatcttggaagacgttcagctcttgttcgagaaatgccacatgac aacactcatgtctcatccctatcctacctatgaggatgagacacttcaattcctttcatcacttcaagtg gaattgtttgagggattgtcagctgttgagttaagagaagaagggcttggttacttaagcttcaccatcg acgaccgcgactatatcatggcgatcaagacgctggaatccatgtttgggtttcccagtggcacgggaat gaaaccgaagtttgcccgagaggaattgaaatctttgaggaacacaatcggagacaccaccagcttcaac tccgcaagatccaagagcaactccatctgcaatcccgccattcgctatttccagagagccatggccaatg tcccttacgctagggagatatctgccactgtgacgaaccaagacatggagacgttagacctggcgctgct aagtttcctccgttacaccaaggacgggaaaacgatgaagggagataccaatgacacaccaccgtcaatg tatcttctcaatcacttatccagctacagagggtgggcaatggccaatgacaaaaagaacgcgaaaggag ccctttgcattggaggagtggtcacaccaattctcctatcgtgcgacgtcccgctgcgaccggatccaat ccaaccacgttggatggatatcgcccatctcaagcttgcccatgtgatcgaacataaggtctatgacggg aggtacgctttcaagtttgatcacccgtcaacaggcgaagcaacttttcttctcccattcagtgaaatga caaccatcacggtgagagacaacatcgacttctcgccaccacaagaaatcctccatgctgtgattggagg atccgcgtcccgaaatgctgaagaagtagaaggtcagagtgacaatgaggagaccaattgggagaactat gacacgagccgctaccactttgaagaacacaagcccccttcacgagaaagcaagagtctcgctgaagcac accggaagctcaatctaacccaaaggtggtgcaagtttcaagacaagatcatccataagtgtgtcaaggc cattgacaacatgcagaaggcaatcagttgcaccacatccaccagtgcgatcaccagggacaacccgcca gaagacatgccatccaggcgtcatgacattgccccgccaaggcaatctgcttatcaacagagagaacgcg ctgtcccgcaagaacccgcacgacactcgtctcacgaggtcagggagcacaagaggcggaagagcgctag gatggttcgatcatcaagcaagggtcggataatgtcggggcggagaacacgtgagcgtcgtgctgaacaa cctgctggggttgcgatcgagcacaatgacgaggagatgacggagccacatcaagaaccggtcatgcctg ctgagtacactcaagcagatatgaatgactacatctccaggacattctactgaggtaacacaccaaaact ctttgtaaatatcgcttttgtttttgtttctgtgtttatcttgaattctttcacaaattttgattacaca agggactgcataatttaagtttgggggagggttcaagacgtatttgacttgttttttgtgaatatcattt gagtctgcatatcatctaagtcatagaaaaaaacaaaaaaatttgaaaatttttgaaaatgaattccaca aaaacagagtgatcatttagttgcattacatttagaatcaagtctagagtgtttcatataggattgttgc atatgcataggggataatgaggaaatagccttgtaagcatgatgattcactaaaatgagttattagttct ctgaggcatttgaatgactttgaagtaaaaccgcaccatgttctatagaaaccactcgatgcatgtcatg acacctttccctgtcaatttgaacttgaatctgacttataattatcatgtttgcatcgtttttgaactcg tggatagaatacatatttggattttctttcacttattcaccactcttgttaatccaagtagctgcttcta ttattggagtagttcccccacccttaaccaacccttctttcaagccatgttattttgtgagagcatgtga ggcctattttcaggattgagcttggtagaacgtgttaggtttgagccgacaagagtagtatctcatgtag ttccaatccgcgttttcggacttggtaggactaggtgggaacttatttggggattgagattgagtgtgaa aagaaaaagaaagaaaaaaataggttagagtctttaggagaaggaagctgaaaatcatattgtggtctag tgaatgtttgggaagcatagaattgttaagatatgttctaaagaaatagatggcaataaaaaaaaaaaaa aaaaaaaaaagaaaaaaaagaaaaaaaatgataagagccataagagtttaagaaaaacccaagtcactag actagactcatagagtttaagaaaagcttctccttaagaacaagagccaaagaaaagaaaaggaaacgtt gatccaagaaaaaatcgaaatatgcctttagtgcaaaagggtagacttaggatcatcattgggtttagag ataggactatctacttgttattcacggtttatgtcattgggtagatggggtgctattcttgtatgcatag gttggcacttacctttagcattcttctaaagctaagtcctttttattgagagtcccctgctattaaaatc tgaaccactaaaaagggaccatctttgtctcataacctgtcattagccaaatgagttcactagcgatgca tatcttgattcatttttttgaacttaatgaatgttaaagggattggttgatcttgacgcattgtgcattt gagtgtaggtttggatcataacagagcatggctaaagtttttgagtagaagtcgatcacctcgcatctta gaactgttagctggtacattgatcttaattgtcatatctcatgctttggttctgaatccccaacttcaaa cctctccttctgcttatgtcttcttgtttgcttgagggcaaacaaagactaagtttgggggagt1