;ID   ATHILA5_I   DNA   ; ATH   ; 7505 BP
;XX
;DE   ATHILA5_I is an internal portion of the ATHILA5 endogenous 
;DE   retrovirus - a consensus sequence.
;XX
;AC   .
;XX
;DT   14-DEC-2000 (Rel. 5.9, Created)
;DT   14-DEC-2000 (Rel. 5.9, Last updated, Version 1)
;XX
;KW   Gypsy-like endogenous retrovirus; ATHILA5p1; ATHILA5p2;
;KW   ATHILA5_LTR; ATHILA5_I.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryotae; mitochondrial eukaryotes; Viridiplantae;
;OC   Charophyta/Embryophyta group; Embryophyta; Magnoliophyta;
;OC   Magnoliopsida; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1]  (bases 1 to 7505)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (December 2000)
;XX
;CC   ATHILA5_I is an internal portion of the ATHILA5 endogenous 
;CC   retrovirus. There are several copies of ATHILA5 in the genome;
;CC   they are ~96% identical to the consensus sequence. Long terminal
;CC   repeats from ATHILA5 are deposited in Repbase Update as
;CC   ATHILA5_LTR. ATHILA5 has generated 5-bp target site duplications.
;CC   ATHILA5_I encodes two proteins, ATHILA5p1 and ATHILA5p2.
;CC   ATHILA5p1 (885 aa, position 292-2949):
;CC   MANEANANELPGNIGAGDAPRNHHQRAGIVPPPIQNNNFEIKSGLISMIQGNKFHGLPME
;CC   DPLDHLDNFDRLCSLTKINGVSEDSFKLRLFPFSLGDKAHLWEKTLPVESVDTWDDCKKA
;CC   FLAKFFSNSRTARLRNEISGFNQKNSESFAEAWERFKGYTTQCPHHGFKKASLLSTLYRG
;CC   ALPKIRMLLDTTSNGNFLNKDVAEGWELVENLAQSDGNYNEDYDRTNRGSSDSEDRHKKE
;CC   IKALNDKIDKLVLAQQRNVHYITEEELTQLQDGENLTIEEVSYLQNQGGYNKGFNNYKPP
;CC   HPNLSYRSNNVANPQDQVYPPQNQPPQAKPFVPYNQGYNQKQNFGPPGFTQQPQQTSAQD
;CC   SEMKTLLQQLVQGQASCSMTMDKKLAELTTRIDCSYNDLNIKIDALNTRVKSMEGHIAST
;CC   SAPKHPGQLPGKSVQNPKEYAHAISTVNTSATADSGIQEGEVLRPRSRQEIELDFFARLV
;CC   ERAHDPSNPIPIPPPYEPKPYFPERIAQINERIFQKHKMMFIKCIKELEEKIPLVDTPKE
;CC   VIMERPQEAQQIVELSFECSAIIQRKVIPKKLGDPGSFTLPCSLGPLVFNNSLCDLGASV
;CC   SLMPLSVAKRLGFDKFKPSSIHLILADRSVRVPHGMLEDLPVKIGSVEIPTDFVVLEMDE
;CC   EPKDPLILGRPFLATAGALIDVQMGKIDLNLGKNLHMSFDIAKKMKKPTIEGQLFFIEEG
;CC   NLDAELLSGLENSIPYSIPTHHLGEPEEPLMIEGEPSSEVETKRNHFDVGPIARELMELR
;CC   KQYGAQGETMEKLDLKMEELNYAILELKEMIKGYPGPEIEEYFEEPDLGEEDYTTDEKEA
;CC   YFEERSNEYSTLQLSRENAEYDSDFEDSASEDEDFSVPLLNLFST
;CC   ATHILA5p2 (679 aa, position 3785-5824):
;CC   MDPEESRRRASARAVARGFAMINEKGPSRNQEAASPGFTRLTGRVQWPSMAPESSMGRSA
;CC   AAREEIARGKRVWESEPVAEEEVPVLEKEASEEDVEIDEEVPIVPARRRRNNPRRKKEPT
;CC   IEEHYQYLMELSFEGTRYPHRPTMQALGICRDVDYLMEMAKLETFFSYKCEGYKTESCQF
;CC   LATLKLHFYAEERERELHKGVGYITFMVFGIQYSLPIRQLDAVFKFPTKYGIRQNFSKDE
;CC   LHDLWLTIAGPLPYKSSRAKSSLIRSPVLRYLHLCIASAFFPKKTTGHVNEGELKMLDLT
;CC   LCFILGRTRNGIEMEGDRADTSLSVVLIDHLIGFREYATGIHQSGYGGSLCAGGVITPIL
;CC   IAAGVPLHTPTVTANYIDMEYLKRKCYLDRSAPADQLYFKFKHSTLGLSRLALPCKEFTT
;CC   VRIGNNIDFDPPQSILVNVLAPLQAEPSIGSESQEEGAEFNQEEAEQEDYTRPSNFQQAE
;CC   YGQAEYEQAEFSRAEQFDEQEDSCEAAAEQYFFEDYAESDQERDPGQVHKKLGMLKGLGK
;CC   FQRKLFSGLKKKVRKMKRAMDGMAVQIQELQRRQRSPPPPPEFRRCNSTSVAPQRDVRFD
;CC   PPRASNYELGRSSTFSDRRFNRRPPGPVNQLVLNADPSREEYYSGYMLDGSTEYNPNTST
;CC   HDPEYTQDRLDEFVQNLFV
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 7505 BP; 2200 A; 1564 C; 1698 G; 2043 T; 0 other;
ATHILA5_I
atttggcgccgttgccatttgggtgtttttcttgttacatttaggatttctgagttattaagatcaagtt
ctatttctttctttctggttactcacttgtttcttcatttgcttgtttttgtcttgcaggtactcatact
cgactagcacgtcgagtacatcaaccgagtaagatcaaaacgtatgcaaactcgttctcaaggttctggt
aatctcctgcggtacagagacgacatcgacaggattcagcgtgaactcagagaacaacaagccacttcaa
acccagtagtaatggctaatgaggcgaatgctaatgagttgccaggcaacattggtgctggtgatgcacc
tagaaaccaccatcagagagctggtattgtcccacctcctattcagaacaacaacttcgagatcaagagt
ggtcttatctccatgattcaaggcaacaagtttcatggactgccaatggaggacccattggatcatcttg
ataactttgatagactctgtagtctcaccaagatcaatggagttagtgaggacagcttcaagctcaggtt
attccctttctcacttggagataaagctcatctttgggaaaagactttgcctgttgaatctgtagacact
tgggatgattgcaagaaggctttcctagccaagttcttctctaactcaagaacggctagattgaggaatg
agatttcgggattcaatcagaagaattcagaatcattcgctgaagcatgggagcgtttcaaaggatatac
cactcagtgcccgcaccacggattcaagaaggcctccctcctcagtactctataccgaggtgccttaccc
aagatccggatgctcttggatacaacttccaatggtaactttctcaacaaagatgtagctgaaggttggg
aacttgttgaaaacctagcacaatctgatgggaattacaatgaggactatgatcgcactaacagaggcag
tagcgattctgaggacaggcacaagaaggagatcaaagctcttaatgataagattgacaaattggtgctt
gctcaacagagaaatgtccactacattacagaagaagagcttacacaactccaagatggggagaatctta
ctattgaggaggtgagctacctacagaatcaaggtggctacaacaagggattcaacaactacaaaccccc
tcatcctaatctctcttacagaagcaacaatgtggcaaacccacaagatcaagtctaccccccacagaac
caaccgcctcaagctaagccctttgtaccctacaaccaaggctacaaccaaaagcagaactttggacctc
caggcttcacccagcaaccacagcaaacttcagcacaagactcagagatgaagactctacttcaacaact
tgtgcaaggacaggcctcatgttccatgaccatggataagaagctagctgagctcaccaccaggattgat
tgctcttataacgacctgaatataaagatagatgcacttaacactagagtcaagagcatggagggacaca
ttgcttctacttcagctcctaagcaccctggacaacttcctggaaaatctgttcaaaatccaaaggagta
tgcccatgctatctccacagttaatacttctgccactgcggacagtgggattcaagaaggggaggttttg
agaccaagatcaagacaggagattgaactcgacttctttgctcggcttgtcgaacgagcacatgacccga
gcaacccaatccctattccacctccctatgaacctaaaccatactttccagaaaggattgcacagattaa
tgaaaggatcttccagaaacacaagatgatgttcatcaagtgtatcaaagagttagaagagaagataccc
ttggttgatactcccaaggaagtgattatggaaagaccccaagaagctcagcaaatagttgaattgagtt
ttgagtgcagtgctatcattcaaaggaaggtgataccaaagaagctaggtgatccaggttccttcactct
accttgttcactaggacccttagtgttcaacaatagtctttgtgatttgggagcttcggttagtttgatg
cctttgtctgttgcaaagagattgggatttgataagttcaagcctagcagcattcatcttattctagctg
atagatcagtgagagtgcctcatgggatgctagaagacttaccggtaaagattggatcagtcgagattcc
aaccgactttgtggttctagagatggatgaggaaccgaaagaccctctcattcttgggagaccattttta
gcaactgcgggcgctcttatcgatgtgcaaatgggtaagatcgacttgaaccttggtaagaatctccata
tgagctttgacattgctaagaagatgaagaagcccaccatagaagggcagcttttcttcattgaagaagg
aaatttagatgctgagttgttgagtgggttggaaaattctattccgtactccattccgactcaccaccta
ggagagcccgaggagcctcttatgatagaaggagaacctagctcagaggttgagactaagaggaaccatt
ttgatgttggtcctattgctagagagcttatggagctcaggaaacagtatggagctcaaggggaaaccat
ggaaaagttggacctcaagatggaagagctgaactacgctatcctggagcttaaggagatgattaaaggt
tacccaggtcctgagattgaggaatactttgaggaacctgatttgggagaagaggactatactactgatg
aaaaggaggcttactttgaggaaagatccaatgagtactctacactccagctatcaagagaaaatgcgga
gtatgattcagactttgaggattcagcaagtgaggatgaggacttctcagttcctctcctcaatctcttc
tctacctaaacattgtgagagtcaagcttagtgactttaaacaagctcacttgggaggaagtcccatgtc
tatccttgtatatattgctttcttgttatttttgatgtttttgtttaagtgtttcaggaaaaaagacttc
tgaaaaatttcaggccacactcgaccgtaccactcggctacaggccgagtgtgggcttcaattgaccaaa
ggcccaagattgaatagtactcggaccataccactcggcccatggtcgagtatgggcctcactgttcaaa
ggcccaagaagatgcagcactcggcccgaagccgagtcagaaaagaagcccatcaggcccaacactgtac
tcgacacgcgcgtcgagtatgtcggccgagttcacacgtgcgaggtcaacacaattcaaattcaaatttg
aattcgaatttgctcggccccaccaatcaaatcctgctccccaaagcccttcaaagattcttctgcaaat
agtgaaagcatgtccccttgaccaaatgaagggtgaagatctaaggtgtttggagggctaggatcaaatc
ttcttgtctataaaaccacactcaacaagctaagtcacttacactctctctttgctcaaaaatttcgaat
tttacttcttctctccaaacatttcaagattttactctcacttctctctagaattcccagaaactttacc
aaaatctcttgttttctctccatacaaaccttttcaaaacctctcttacagcctttgttcagttttaaac
atcaatttctcttctactttgacctactttgggttattttgtggtgactatctgcagtaaatcatcccaa
gatcatggatcccgaagaatcaagaaggagagcctctgctcgagctgtggcaagagggtttgcgatgatt
aatgagaagggtccgtcaaggaatcaggaagctgcgagtcctggttttactcggttgacaggccgagtac
agtggccgagtatggccccggagtcttctatgggtcgttctgcggctgctcgagaggaaattgcaagagg
caagagggtttgggagtccgagccagttgcagaagaagaagtgcctgtgctagagaaggaagcatctgag
gaagatgtggaaattgatgaggaggtcccgatagttcctgcaaggagaagaaggaacaacccaaggagaa
agaaagagcctaccattgaagagcactaccagtacctcatggagctgagttttgaggggacaagatatcc
ccatagacctaccatgcaagctttggggatatgtagggatgttgactacctcatggagatggccaagctg
gagaccttcttctcctacaagtgtgaaggatacaaaactgagagctgccaattcctagccactttgaagc
tccatttctatgctgaagaaagggagagagagctacataagggagttggctatatcacattcatggtgtt
tggaattcaatactcccttcctattaggcagttggatgctgttttcaagttccccaccaagtatgggatc
cgccaaaacttcagcaaggacgagctccatgatctatggttgacaatcgccggtccactcccctacaagt
catctagggccaagagttcgttgataaggagcccggtgcttaggtatctccacctttgcattgcaagtgc
cttcttcccgaagaagacaaccggccatgttaatgagggagagctgaagatgcttgatctcaccttatgc
ttcattttggggcgcacaaggaatgggatagagatggaaggggatagggctgatacatccctttcggtgg
tgttgattgatcacttgattggttttagggaatatgcaaccggcatccaccaatccggctatggaggaag
cttatgtgctggaggagtgatcacgcccatcctaatagccgctggagttcctcttcatacccccactgtc
acagcaaactacattgatatggagtatttgaagaggaaatgctacttggataggtctgccccagctgatc
aactctatttcaagttcaagcactccacgctaggtctctctaggctagcactcccttgcaaggagttcac
cacagttagaattgggaacaacattgactttgatcctcctcaatcgatcttagtcaatgtccttgcgcct
ttacaagcagagccgagcatagggagtgagtctcaagaagaaggagcagagttcaaccaagaagaagctg
agcaggaagactatactcggccgagtaactttcagcaggccgagtatggccaggccgagtatgaacaagc
tgagttcagtcgagctgaacagtttgatgagcaagaagattcgtgtgaagctgcagcggagcagtatttc
tttgaagattatgctgagtccgatcaagagagagatcccggtcaagttcacaagaagttgggaatgctca
agggtttgggcaagtttcagagaaagttgttcagtgggttgaagaagaaggtgagaaagatgaagagggc
aatggacggcatggcggttcagattcaggagctgcaaagaaggcagagatcgccaccacctccacccgag
ttcaggagatgtaactcaacgagtgtggcaccgcaaagggacgttcgtttcgacccgccaagagcctcaa
attacgagctcgggaggagctccaccttctcagatcgccgtttcaacaggcgcccacccggtccagtgaa
ccagttggttctgaatgctgacccgagccgagaggagtactactcgggctacatgctcgacgggtcaacc
gagtacaaccccaacacctccacccatgatcctgagtacacacaagaccgcctggacgagtttgtccaga
acctcttcgtctaaatgttgaggtatcactccatttcactgtatatatcattgcatttcttttatttctt
gctttgtgtggttatttctcttgaattcttctttgaatttttattacacaagggactgtgtaatttaagt
ttgggggagagttcaagatgtatctaacattgtttcatgttttcttattcaaatttttgcatcatctaag
gcatagaaaacccataaaaatttgaaaatttttcgaaaatgattccaaaaaaatagagtgtcatgtagtt
tgcatttgcattattagggctgtttttagaatgttatcatataggttgttgcattttgcacttgcatagg
ggataatgatgatcatagccttgtaaatttgcaatgttcactagatagtttcaatgcccttgttgttagt
tgtctagtgcttaaccgattgaacttgaagtaaaaccgcaccatcttttgaattcatatacttgatcttc
cttagtcgaaactcgctgtgatttgaagctattccctatcaatttgaaccataatttgacttttaattat
catactatgcattgcttgttaaactcatggttacccttaaaatatttggatcttcttattcatttcacca
ctcttgttgatccaaatagctgtctctcacctttagagcagtttccccacaccctaacctaagccttctt
tcaagccatatatcacttgtgagtgtttgtgaggtcttatttcgattaagcttggtagaaagtgttaggt
ttgtaacgacaaagatagtatctcatgtagttctagttcgcgttatccggactagataggactaggtggg
tacttattctatgggttgggaagagtttaaaagagaaaaagggttgaattcattgtttacaagaaaaggg
aaaagaattctaggagaagtaagctaaagaagttagaaaaagtctagtaaagggtttgagattgttaaag
aaagagattgggttattgttagctaatgaagaagggtaaaaagccctaagcttaatagagattaaaaaca
gaaccttagtactaaagaaagccaaacccgctagaagtatcaaagagaaaagaaaagcttctcctagagt
taagagaaaaagaaaagaatgggttaagaaagagttcaaaagattatgaatgcaaaagggtagagttaag
ttcttaattgggatgggagatgggattgccattagatcttcattgattatactttgggtagatgggatct
tatctttgtatgcataacttgggacttacctttagcattctactaaagcttaatcattcttgagagatcc
cttgttactaaagcctattctttaagggaccatttttgtctcttgaccctttacccttagccaaatgagt
ttaatatgcattgtgtagtatgatccatggttcttgcttaatgaatgttaaagggaatatgctgatttga
atgcttgaatagactaagtgaaagattaggttgtgttgtgaagaagatggctaaagtttttaagtagaga
tcattcaacctagcactctagaactagcaacatggacattgagactatttattttacatgcatattttgg
ttctgaatccccaccttcaaacctcactcctagcctagttctatttgttgcttgaggacaagcaaagagc
taagtttgggggtgt