;ID   ATHILA_I    DNA   ; ATH   ; 7414 BP
;XX
;DE   Internal portion of the ATHILA endogenous retrovirus.
;XX
;AC   X81801
;XX
;DT   11-NOV-1996 (Rel. 2.2, Created)
;DT   04-DEC-2001 (Rel. 6.3, Last updated, Version 2)
;XX
;KW   LTR retrotransposon; GYPSY superfamily; ATHILA_LTR; ATHILA; ATHILA_I.
;XX
;OS   thale cress.
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryotae; mitochondrial eukaryotes; Viridiplantae;
;OC   Charophyta/Embryophyta group; Embryophyta; Magnoliophyta;
;OC   Magnoliopsida; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1]  (bases 1 to 7414)
;RA   Pelissier,T., Tutois,S., Deragon,J.M., Tourmente,S., Genestier,S.
;RA   and Picard,G.
;RT   Athila, a new retroelement from Arabidopsis thaliana
;RL   Plant Mol. Biol. 29 (3), 441-452 (1995)
;XX
;RN   [2]  (bases 1 to 7414)
;RA   Pelissier,T.
;RT   Direct Submission
;RL   Submitted (19-SEP-1994) T. Pelissier, GDR 977 Biomove CNRS,
;RL   Universite Blaise Pascal, 24 Avenue des Landais, 63177 Aubiere
;RL   Cedex, FRANCE
;XX
;CC   ATHILA_I is an internal portion of the ATHILA LTR retrotransposon.
;CC   ATHILA 10.5 kb elements, representing 0.3% of the genome, present
;CC   several features of retrotransposons and retroviruses. Athila elements
;CC   are flanked by 1.5 kb long terminal repeats (ATHILA_LTR) that are 
;CC   themselves bounded by 5 bp perfect inverted repeats. These LTRs 
;CC   start and end with the retroviral consensus 5'TG...CA3' nucleotides.
;CC   A tRNA-binding site and a polypurine tract are found adjacent to the 5'
;CC   and 3' LTR respectively. The central domain is composed of two long
;CC   open reading frames (ORFs) of 935 and 694 amino acids. Despite several
;CC   indications of recent transposition activity, the translation of these
;CC   ORFs failed to reveal significant homology with proteins associated to
;CC   retrotransposition. The authors suggest that the Athila family could 
;CC   result from the transduction and dispersion of a cellular gene by a
;CC   retrotransposon.
;CC
;CC   primer_bind     4..18
;CC                   /note="complementary to Lupine tRNAglu 3' end"
;CC   CDS             193..3000
;CC                   /note="Athila ORF 1"
;CC                   /codon_start=1
;CC   mRNA            3709..>5793
;CC                   /note="Athila ORF 2"
;CC   misc_feature    7407..7413
;CC                   /note="poly purine track"
;XX
;DR   Positions    1540   8953  Accession No X81801        GenBank (rel. 95.0)
;XX
;SQ   Sequence 7414 BP; 2284 A; 1624 C; 1622 G; 1884 T; 0 other;
ATHILA_I
atttggcgccgttgccagtttggtgtttgtttgctatatttgagatcattagaaacattaagatcaagtt
cgtttttgttacacaagttactaacattgtgccttttctgtctttgtgtttcaggtaccttgcaacctgc
tttccagacgcatctctgattcgtctgttatagcacggctgctatctgttgcatgcatacacggtcacaa
ggaaatcagaacctgctattcaacgataacatcgaccgtattgctcgccaactaagagaacagacagaaa
cagacacaatggctgacgttgtagatgagcaagagcaacctaccaacattggtgctggtgacttccctca
caaccacaaccagcgtcatggaatcgttccacctccagtacagaacaacaactttgagatcaaaagcggt
ctcattgctatggttcaggggaacaagtttcatggcctgctaatggaggatccgctagatcatcttgatg
agtttgaaaggctctgtcgccttactaaaatcaatggagttagtgaagatggtttcaagcttcgcttgtt
cccattttcacttggagataaagcccatctgtgggaaaaaacgctaccccatggatcaatcacgacctgg
gatgactgcaaaaaggccttcttggcaaaattcttctccaactccaggactgcaagactccggaacgaga
tatccggattcactcagaagcaaaatgaaagcttctgtgaagcttgggagcgctttaagggttatccaac
caaatgccctcatcacggatttaaggaagcatctcttctcagcacactttacagaggcgtcctgcctaag
atacgtatgctccttgacaccgcctccaacgggaatttcctgaacaaggatgtcgaagacggatgggagc
tagttgaaaacttggctcagtcggatggcaattacaatgaggattatgacagaagcatctgcaccagtac
tgaatctgatgacaagcaccgccgagagattaaagctctccatgacaaaatcgacaaactcctccaagtg
cagcagaagcatgttcacttcgcctctgaagacgaagtattccaaatccaagagggggagaatgatcagg
gcgctgaaatcagttatgttcagaaccaaggcggctccaacaaagggtacaacaattacagacccaaccc
aaatctgtcatacagaagcacaaatgtcgccaatcctcaggaccaagtttaccttcaacagcagctgcag
cagaaccaacccaaaccgtttgttacttacaaccaaagcctagggttcgttcctaagcaacagttccaag
gagggtatcaacagcaacagccaccacctggtttcacaccacaacaacatcaggcctctgcaccccagga
atcagacatcaagaatatgctccaacagatcatgcaaggtcaagccgcaggggtaatggaatctgctaag
aagatagctgacttaaacaacaaagttgacagacagttcaatgaactgagtagcaggttggaatcattga
acacaagggtacggtatctggacggcgtcactacttcaccacctgttacaaacaacccaggccagcttcc
aggaaaagcaatccaaaaccctaaggagtacgccactgcacacgcaatcactatcagacatgagcgagag
ctgccaactcaacatgtctctacatcaaacactgaggacagcgtaattcaagaaggggaggcttccactc
agattgaagtctctgttgctgagatcgactattcagctcaacctttctttcaggctcagtctgacttgga
ggagaaggctgctataatcgagaggatggtgaaacgattcaagccagctccatcaccctcacttgctctt
ccatggacgttcaggaaagcttggaaaagcagatatgaatctctggcagaaaaacaacttggtgagatgg
aagctgtgatgcccttagtagaagttctcaagttgatttcagaccctcacaaagatgtgagaaacctgat
actggaaaggctcaacatatatcaggattcagatgacgaagacgatgtcattatgcatcaagccgctggt
gagaggattattcaagagaagctggaagatcctggatctttcactctgccatgttcgattaggcaattga
ctttcaacaattgtctatgtgatctaggagcctcagtcagtttaatgccactctctgtggcaaggaagct
gggattcattcagtataagccctgtgacatcactttgatccttgctgatagaacttcaaggagacctttc
agcattctagagaacgtaccagtaatgatagatggagttgaggtacccactgattttgtcgtgcttgaaa
tggatgaagagtctaaggatcctttgatcttaggaagacctttcttagcctcagtgggagcagtaataga
tgtcagatatggtaagattgatctcaaccttgggaggcatgtcaagttacagtttgacatcaacaaattt
ccgacaaggccaccaatagaaggaaaaaccttagaggtccagagagcggatccaagtgaaggccttgaga
tcaaaagagggaaggagcaaatttcggttcaaactccgccatcgaactcgactactcgatcgagtatacc
ctcctcacttgattggctggaactcaaaaggaagactgatctacagaacagaaccattcagaagctgagc
tacactgttgaaaagcttagagacaaactaagtcgaatgcaaaaggaggttcagccccagctcaaccatg
acacgatttcaagaaaggagatcacctgggattggtcagaagagaaagactaccctccagaggaggaagc
agcatactatgaggaaagaagaattgagtactcagctgtgcaactgtcaagggaggatgctgagtatgat
gatgatatcagtgaagaggaggacttctcaagttccctctgtcacattttctccacttgattagtgtgag
gagtcaagctagagactctaaacaagctcacttgggaggaagtcccatgactatccctgtatatactttt
tatttatttgttacttttgattcgttttggttgtgtctttgattctcaggaaatgaataacagctggagt
gatcaacaaggatgtgatcgagtgttaaaatgaaaattttccaaattttttttcaaatactcgatcacaa
cataggagtgaaagctcaggagtgcgatatagtagattcgggataacaattttctactagatcgcaagga
ggagtgaaagtgtccacagaacgtcaatgacaaacctcagtcccattaagcctactggatttcaatctcc
accaaaactccaaaaagcccatgctgtgtccaattgtagcccatgacacgaaattaaaggatcaacaacc
catactcgatcacaaggcgcgtgacttgaaggaaccccatcctgccgacataaagcaccatgttttgtca
ccttatcacgttgccgacatcataacgggcattgcatcccatccgtccgtttaaaagcaacacacctcct
ccactcattcaaatcgcaaagggagataagtgtcatcactcttcacttttcttcatcacttgtgatttca
aaacatctctcttgtttatctctctacttcactaccaaaaacgctctcaactctttcgttgttgctagct
gttattgactaaaacagaacctcagaaactactctatcgcaaccctgtctctcatttgcttgcttcactc
gatcgcgacctgagctctcactatcaaattctactcgatcacaactctatatcaataggaagtcatttct
ttcattgtctttgtttgtttctcactaacctactgagctttagttgtgagtttcaaatcgatttcagctt
caagatgagttcacacagctacgaatcatctatggatgccgactacaatgttgatgatgttgaatcttgg
tcaactaggccaaagagagaagcagatgagtataggcgatttgtggaggaaacagagcgagcagtcgcca
atgacaggagaagagaagagatagccagagggaagaggtcaatgacggaaaattatgagctaattgatga
agagatggaagatgatgcggagtacattccagagcaaacccgcaagaccaccaagtccctaatgaaagaa
gacaagctgacgccgggagactactacaaggctctcaaaagaaatccgttttggggaacaagatatcctc
acccggagacaatggcggagttaggcatcttggaagacgttcagctcttgttcgagaaatgccacatgac
aacactcatgtctcatccctatcctacctatgaggatgagacacttcaattcctttcatcacttcaagtg
gaattgtttgagggattgtcagctgttgagttaagagaagaagggcttggttacttaagcttcaccatcg
acgaccgcgactatatcatggcgatcaagacgctggaatccatgtttgggtttcccagtggcacgggaat
gaaaccgaagtttgcccgagaggaattgaaatctttgaggaacacaatcggagacaccaccagcttcaac
tccgcaagatccaagagcaactccatctgcaatcccgccattcgctatttccagagagccatggccaatg
tcccttacgctagggagatatctgccactgtgacgaaccaagacatggagacgttagacctggcgctgct
aagtttcctccgttacaccaaggacgggaaaacgatgaagggagataccaatgacacaccaccgtcaatg
tatcttctcaatcacttatccagctacagagggtgggcaatggccaatgacaaaaagaacgcgaaaggag
ccctttgcattggaggagtggtcacaccaattctcctatcgtgcgacgtcccgctgcgaccggatccaat
ccaaccacgttggatggatatcgcccatctcaagcttgcccatgtgatcgaacataaggtctatgacggg
aggtacgctttcaagtttgatcacccgtcaacaggcgaagcaacttttcttctcccattcagtgaaatga
caaccatcacggtgagagacaacatcgacttctcgccaccacaagaaatcctccatgctgtgattggagg
atccgcgtcccgaaatgctgaagaagtagaaggtcagagtgacaatgaggagaccaattgggagaactat
gacacgagccgctaccactttgaagaacacaagcccccttcacgagaaagcaagagtctcgctgaagcac
accggaagctcaatctaacccaaaggtggtgcaagtttcaagacaagatcatccataagtgtgtcaaggc
cattgacaacatgcagaaggcaatcagttgcaccacatccaccagtgcgatcaccagggacaacccgcca
gaagacatgccatccaggcgtcatgacattgccccgccaaggcaatctgcttatcaacagagagaacgcg
ctgtcccgcaagaacccgcacgacactcgtctcacgaggtcagggagcacaagaggcggaagagcgctag
gatggttcgatcatcaagcaagggtcggataatgtcggggcggagaacacgtgagcgtcgtgctgaacaa
cctgctggggttgcgatcgagcacaatgacgaggagatgacggagccacatcaagaaccggtcatgcctg
ctgagtacactcaagcagatatgaatgactacatctccaggacattctactgaggtaacacaccaaaact
ctttgtaaatatcgcttttgtttttgtttctgtgtttatcttgaattctttcacaaattttgattacaca
agggactgcataatttaagtttgggggagggttcaagacgtatttgacttgttttttgtgaatatcattt
gagtctgcatatcatctaagtcatagaaaaaaacaaaaaaatttgaaaatttttgaaaatgaattccaca
aaaacagagtgatcatttagttgcattacatttagaatcaagtctagagtgtttcatataggattgttgc
atatgcataggggataatgaggaaatagccttgtaagcatgatgattcactaaaatgagttattagttct
ctgaggcatttgaatgactttgaagtaaaaccgcaccatgttctatagaaaccactcgatgcatgtcatg
acacctttccctgtcaatttgaacttgaatctgacttataattatcatgtttgcatcgtttttgaactcg
tggatagaatacatatttggattttctttcacttattcaccactcttgttaatccaagtagctgcttcta
ttattggagtagttcccccacccttaaccaacccttctttcaagccatgttattttgtgagagcatgtga
ggcctattttcaggattgagcttggtagaacgtgttaggtttgagccgacaagagtagtatctcatgtag
ttccaatccgcgttttcggacttggtaggactaggtgggaacttatttggggattgagattgagtgtgaa
aagaaaaagaaagaaaaaaataggttagagtctttaggagaaggaagctgaaaatcatattgtggtctag
tgaatgtttgggaagcatagaattgttaagatatgttctaaagaaatagatggcaataaaaaaaaaaaaa
aaaaaaaaaagaaaaaaaagaaaaaaaatgataagagccataagagtttaagaaaaacccaagtcactag
actagactcatagagtttaagaaaagcttctccttaagaacaagagccaaagaaaagaaaaggaaacgtt
gatccaagaaaaaatcgaaatatgcctttagtgcaaaagggtagacttaggatcatcattgggtttagag
ataggactatctacttgttattcacggtttatgtcattgggtagatggggtgctattcttgtatgcatag
gttggcacttacctttagcattcttctaaagctaagtcctttttattgagagtcccctgctattaaaatc
tgaaccactaaaaagggaccatctttgtctcataacctgtcattagccaaatgagttcactagcgatgca
tatcttgattcatttttttgaacttaatgaatgttaaagggattggttgatcttgacgcattgtgcattt
gagtgtaggtttggatcataacagagcatggctaaagtttttgagtagaagtcgatcacctcgcatctta
gaactgttagctggtacattgatcttaattgtcatatctcatgctttggttctgaatccccaacttcaaa
cctctccttctgcttatgtcttcttgtttgcttgagggcaaacaaagactaagtttgggggagt1