;ID   ATCOPIA69_I DNA   ; ATH   ; 4270 BP
;XX
;DE   Internal region of ATCOPIA69 copia-like LTR-retrotransposon.
;XX
;AC   AL161502
;XX
;DT   05-NOV-2001 (Rel. 6.2, Created)
;DT   05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA69LTR; ATCOPIA69_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4270)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal region of ATCOPIA69 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(2) p. 24 (2001)
;XX
;CC   ATCOPIA69_I is an internal region of the ATCOPIA69 copia-like 
;CC   endogenous retrovirus flanked by the 4% divergent ATCOPIA69LTRs
;CC   and a 5-bp target-site duplication.
;CC   ATCOPIA69p, a 1348-aa copia-like polyprotein encoded by ATCOPIA69_I
;CC   is free of false stop codons and frame-shifts.
;CC   ATCOPIA69p:
;CC   MSSTKLKISQFDGSGDFSLWKTRMFSHLRVMGLKDALVEQEQLPPLKDEEESDPAKKKQRIDEEKARIDR
;CC   DEKAMDMIFINVGDKVLRNIENSKTAAEAWATLDRLYLVKSLPNRVYLQLKVYNYRMQDSKTLEENVDEF
;CC   LKMISDLSNLQIQVPDEVQAILILSALPDSYDMLKETLKYGREGIKLDEVISAAKSKELELRDSSGGSRP
;CC   VGEGLYVRGKSQAGGVGNKSTEGKKVCWICGKEGHFKRQCYKWLEKNKGNSAGETALVKDDAQDLVGLVA
;CC   SEVNISEDKNDQEEWIMDTGCSFHMTPRKEYLINFEKAKSGKVRMANNSYSEVKGIGSVRFVKKDGTSIV
;CC   LQGVRYIPEISRNLISMGTLEAEGYEFKGNHGILRVMQDSNEFIRGRRRASLYILEAQAEMANSESLVTT
;CC   TGETDQTQLWHSRMGHIGQQAMDILSKKGCFGNDKISEIKFCEDCVIGKTHKVSFGTAQHTTKEKLDYVH
;CC   SDLWGSPNVPYSLGRCQYFISFTDDWSRKVWIYFLKTKDEAFQSFTEWKTMVETQSERKLKHLRTDNGLE
;CC   FCNHKFDGICKKEGIVRHRTCTYTPQQNGVAERLNRTIMNKVRSMLSESGLDKKFWAEAAATSVYLINRS
;CC   PSSALENKIPEELWTSAVPSLSGLKRFGCIVYVHSQEGKLDPRAKKGVFLGYPQGVKGFRVWMIDEEKCS
;CC   ISRNVVFREDVMYKDILNKLTSGMSLELPFVTNKVPSFECTGLNKDGERLVQGGAIEKESDETLELSNTD
;CC   QEDTGTERIQRTHQIARDKPKRQIVIPSRLKDYEMNEEILDEIAGYAYLITEDGGNPEPTDFQEVLQDPD
;CC   NKKWLEAADEEIESLIKNKTWTLVDRDNSQKPIGCKWIFKRKAGIAGVEQPRFKARLVAKGYAQKEGIDF
;CC   QEIFSPVVKHVSIRLLLSMVVHLNMELQQMDVKTAFLHGYLDETIYMDQPEGYIHEKYPDKVCLLKRSLY
;CC   GLKQSPRQWNNRFNEFMQRIGYERSKYDSCVYYKILLSGDYIYLLLYVDDILIASKDKEQVCELKTLLNS
;CC   EFEMKDLGDAKKILGMEITRDRQAGTLTISQEGYLLKVLKDFGMDQAKSVNTPMGIHFKLKPANDEEVQK
;CC   QSEVMRAIPYQSVVGSLMYSMISTRPDLAHSVGLVCKFMSKPLKEHWQAVK*ILRYISGTLDRKLCYKNE
;CC   GECILEGYCDSDYAADKGSRRSTSGVVFTFGGNTISWKSNLQKVVALSSTEAEYMALTDAAKEAIWLKGL
;CC   VNELGFTQKTVNIHCDSQSAIALAKNAVYHERTKHIDVKYHFIRDLVNNGEVQVLKIDTEDNPADIFTKV
;CC   LPVSKFQDALELLRVSQN
;XX
;DR   Positions  133749  129480  Accession No AL161502    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4270 BP; 1487 A; 677 C; 1056 G; 1050 T; 0 other;
ATCOPIA69_I
aattggtatcagagcccaggttctgagctcaagaatttcagatcgattcaaggatgtcgaacacgaaact
gaagatttctcagttcgacggatcaggcgacttctcactatggaagatgggattaaaagatgcgctggtg
gaacaaactaagtcatcttcattgacagatgaagaagaagacgatccagcaaagaaaaaaaagattctcg
aagaggaaaaagcaagaattgatcgagatgagaaagcgatggatatgatcttcataaatgtcggagataa
agttctgagaaacatagaacattcaaagacagccgcagaagcatgggcaactcttgataaattgtatttg
gtaaagactctaccaaaccgtgtttaccttcaactcaaggtttacaactatagaatgcaagattcaaaaa
ctcttgaagagaacatagatgagtttctaaagatgatatcagatctaagtaatctttagattcaagttcc
agaagaagtccaagcaatcttgattctaagtgctttaccagaaggctatgatatgcttaaggaaaccttg
aaatatggaagagaaggcataaaacttgatgacgttgtgagtgctgcaaaatcaaaggaactagaactaa
gagatggtttaggaggatcaagaccggttggtgaaggtctctatgtaaagggaaagtttcaggccaaagg
aagtgataacaacaaagggaataactcaacagaaggaaagaaagtctgttggatatgtgaaaaggaaggt
cacttcaagagacaatgttacaagtggcttgagaagaataagggaaatggtgcaggggaaacaacattgg
taaaggacgatgctcaagacttggtcgggctagtagcatcagaagctaacctaagtgaggataagagaga
tcaagaagaatggataatggacactgggtgctctttccacatgacacctaggagagactatcttgtagac
tttgtagaaggcaaagcaggaaaggttagaatggctaataattcattctctgaagtaaaaggaattggaa
aggttaagttcacaaatgaggatggaagacagatcatccttcatggagtgaggtacatcccagagatatc
tagaaatctgatctctatgggaactcttgagtcagagggatatgagtttagaggaggtaacggtgtctta
aaggtaattcagggatcataagtgttcatgaaaggagtcagaagagcctcgttatacattttacaagcgg
aagcgagaaagtcagatgcagactctcttacaacagtctcaggtgaatcagatcagactcagttatggca
tagcagaatgggacatataggacagcaggctatggaagttttgagtaagaaaggttgctttggtaatgac
aagatatcagagataaagttttgtgaagactcataatagggaagactcacagagctagtttcggatcagc
acaacatgtaactaatgagaaacttgactatgttcattctgatctatggggatctcctaacgtaccgcac
agtcttggaaaatatcagtacttcatatcatttacagatgactggtcaagaaaggtttgggtgtactttc
tcaagtctaaagatgaagcctttgcttcattcactgaatggaaaaagatggtggagactcaaagtgacag
aaaactcaagaaattaagaacagacaacgggttagaattctgtaatcaaaagtttgattgtttctgcaag
aaggaagggagagtaagacatagaacatgtacttacactccacagcaaaacggagttgcagaaagattaa
atagaacaatcatgaacaaggttagaagtatgctgagtgaaagtggcttagacaagaaattttgggctga
agcagtttcaacctcagtatacttgattaacaaatcaccatcatctacaatggagaataaaatccctgaa
gaactgtggacctcagtgatttccaatctgtcaagactaagaagatttggctgcattgtatacgttcatt
ctcaagaaggaaaactggatcctagagccaagaaaggagtgtttgtgggttatccaagtggagttaaggg
ttttagagtctggatgattgaggaagagaagtgcaccataagtcgaaacgttgtgttcagagaagatgtg
atgtacaaggacatcatgaacgccacaacctcatgtataagtcttgaactccctttgactactaataaag
ttcccatcttcgaatgtgcaggtgccagtaaaaccagagacagttcagatcatggtggagctacagagag
tatttctgatgagactacagaaatcattgacattgatcaggtagacactacaccagaaggaaatcagaga
acaagacagatagctcgagatcgacctaaaagacaagtgattatcccatcaagactcaaggattatgaga
tggatgaggaagtattagatgagattgcaggctatgcttacctcataacagaggatgggggaaattctga
acctgagtgctatcaggaagcagttcaagaccctgatagtgagaaatggttagaagcagctgatgaggag
atagaatctctgataaagaataagacatgggttcttgtagagagaaacagtctacagaagcctattggat
gtaagtggatattcaaaaggaaagctggaattgcaggagtggagaaaccaaggtttaaggctaggcttgt
agctaagggatactcacagaaagagggaatagactttcaagaaatattttcaccagtggtgaaacatgtc
tctattcgcctcctgctatcaatcgttgctcacctagacatggagttacaacagatggatgtaaagacag
cctttttacacggctacctggatgagacgatctatatggagcaaccagagtgatatactcatgaaagata
tccagacaaagtttgcttactgaagaagtcgctgtatggactgaagcagtctcctagacaatggaacaac
aggttcaatgagtttatgcagaagattggatatgaaagaaacaagtatgatagctgtgtttacttcaaga
tgttgcagagtggagagtacatctacttgcttctatacgttgatgatatactaatagcatctaaggataa
aaaggaggtatgtgagttaaaggttcttctaaactctgaattcgaaatgaaagacttgggggatgctaag
aaaatcttaggtatggagatcgtcagagatagacaagctggaactctctccatttctcaagagggctatc
tcctgaaagttcttggagattttggcatggatcgagccaagacagtcaacacaccctggggatccatttt
aaactgaaacctgcaactgatgaagagattcagaaacagtcagaagtcatgagaacaatcccttatcaaa
gtgcagttgggagcttgatgtactcaatgattggtacaaggccagacctagctcattcagttggtgtagt
atgcagattcatgagtaaaccattgaaggaacactggcaggcagtaaagtggatattaagatacattggt
ggtactttagaccgaaagctgtgctataagaatgaaggagagctagtcttagaaggttattgtgactcgg
actatgctgcagataaggaaacaaggagatccacttcaggagtggtgtttacctttggtggaaacacgat
aagctggaagtcaagcttacagaaagtagtagctctatcaagcactgaagctgagtatatggctctaact
gatgcagcaaatgaagcagtttggctgaaaggtcttgtaagtgagttaggttttgcacaaggatcagtaa
acatccattgtgactcacagagtgctattgccttgactaagaacgcagtctaccacgaaaggaccaaaca
tattgatgttaaatatcacttcatcagagaattggtgaacgatggtgtggtgcagatattgaagattgac
actgaagacaatccagcagatatattcaccaaagtgctaccagtgagcaagtttcaagacgctcttgact
tgctcagagtatctcaaagttaaggtggagctttgctccgggttttaagctgagtgggtttactcaggga
gaagaagctgagtgggtttactcagggagaagaagctgagtgggtttactcaggaaaaagaagctgaaca
agttagttcaggtaccggagaggaattcaaaagctaaaggcagcagagcataaagatcaaggtggagatt
1