;ID   ATCOPIA65_I DNA   ; ATH   ; 4165 BP
;XX
;DE   Internal region of ATCOPIA65 copia-like LTR-retrotransposon.
;XX
;AC   AB025639
;XX
;DT   05-NOV-2001 (Rel. 6.2, Created)
;DT   05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA65LTR; ATCOPIA65_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4165)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal region of ATCOPIA65 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(2) p. 18 (2001)
;XX
;CC   ATCOPIA65_I is an internal region of the ATCOPIA65 copia-like 
;CC   endogenous retrovirus flanked by the 1% divergent ATCOPIA65LTR 
;CC   long terminal repeats, and by a 5-bp target-site duplication.
;CC   ATCOPIA65_I encodes well preserved remnants of the 1374-aa 
;CC   copia-like polyprotein ATCOPIA65p.
;CC   ATCOPIA65p:
;CC   MSEIVEATSKGKEGGGSASIQCPMLNSVNYTVWTMRMEAVLRVHKLWGTIEPGSADEEKNDMARALLFQS
;CC   IPESLILQVGKQKTSSAVWEAIKSRNLGAERVKEARLQTLMAEFDKLKMKDSETIDDYVGRISEITTKAA
;CC   ALGEDIEESKIVKKFLKSLPRKKYIHIVAALEQVLDLKTTTFEDIAGRIKTYEDRVWDDDDSHEDQGKLM
;CC   CVKTXSQDGKLMYANSDSQGQYEFQDRGRGRGRGRFGRGRGRGYQQRDKSKVTCYRCDRLGHYASDCPDR
;CC   LLKLIRLQEQKEKEEDDTHEAESLMMHEVVYLNEKNIRPTELESCINNAWYLDNGASNHMTGNRAWFCKL
;CC   DEMITGKVRFGDDSCINIKGKGSIPFISKGGERKILFDVYYIPDLKSNILSLGQATESGCDIRMREDYLT
;CC   LHDREGNLLIKAQRSRNRLYKVSLEVENSKCLQLTTTNESTIWHARLGHISFETIKAMIKKELVIGISSS
;CC   VPQEKETCGSCLFGKQARHSFPKATSYRAAQVLELIHGDLCGPISPSTAAKKRYVFVLIDDHSRYMWSIL
;CC   LKEKSEAFGKFKEFKALVEQECGAIIKTFRTDRGGEFLSHEFQEFCAKEGINRHLTAPYTPQQNGVVERR
;CC   NRTLLGMTRSILKHMNMPNYLWGEAVRHSTYLINRVGTRSLSNQTPYEVFKHKKPNVEHLRVFGCVSYAK
;CC   VEVPNLKKLDDRSRMLVYLGTEPGSKAYRLLDPTKRRIFVSRDVVFDENRSWMWQESSSETDKESGTFTI
;CC   TLSEFGNNGVTENDISTEPEETEEAEINGEDENIIEEAETEEHDQSQEEPQPVRRSQRQVIRPNYLKDYV
;CC   LCAEIEAEHLLLAVNDEPWDFKEANKSKEWRDACKEEIQSIEKNRTWSLVDLPVGSKAIGVKWVFKLKHN
;CC   SDGSINKYKARLVAKGYVQRHGVDFEEVFAPVARIETVRLIIALAASNGWEIHHLDVKTAFLHGELREDV
;CC   YVSQPEGFTNKESKEKVYKLHKALYGLRQAPRAWNTKLNEILKELKFEKCHKEPSLYRKQEGENILVVAV
;CC   YVDDLLVTGSNLDIILNFKKGMVGKFEMSDLGKLTYYLGIEVLQSKDGITLKQERYAKKILEEAGMSKCN
;CC   TVNTPMIASLELSKAQDEKRIDETDYRRNIGCLRYLLHTRPDLSYNVGILSRYLQEPRESHGAALKQILR
;CC   YLQGTTSHGLYFKKGENAGLIGYSDSSHNVDLDDGKSTGGHIFYLNDCPITWCSQKQQVVTLSSCEAEFM
;CC   AATEAAKQAIWLQELLAEVIGTECEKVTIRVDNKSAIALTKNPVFHGRSKHIHRRYHFIRECVENGQIEV
;CC   EHVPGVRQKADILTKALGKIKFLEMRELIGVQGVSKEDFKLKRE
;XX
;DR   Positions  13692  17856  Accession No AB025639   GenBank (rel. 124.0)
;XX
;SQ   Sequence 4165 BP; 1476 A; 708 C; 961 G; 1020 T; 0 other;
ATCOPIA65_I
tttggtatcagagcatctaggtttgatattgaaacacaaacatgagtgaaatcgttgaagcaacaagcaa
aggtaaagaaggtggaggatcagcgtcgatccaatgtccgatgctaaactccgtcaactatactgtatgg
accatgaggatggaggctgtgcttagagtacacaaactttggggaacaattgaacccggatcagccgacg
aagagaagaatgatatggctcgggctttgctctttcaatccatacctgagtcgttaattttacaagttgg
taaacaaaagacttcttcagctgtctgggaagccataaaatcaagaaatcttggtgcagaacgagtaaaa
gaggcgagattacagacacttatggcagaatttgataagctgaagatgaaggatagtgagacgattgatg
attacgttggtaggatctcagagattactacaaaagctgcagctttaggagaagatatagaagaatccaa
gatcgttaaaaagtttctcaaaagtttgccaagaaagaaatacatacacattgttgcagccttagaacaa
gttcttgatctgaaaacaactaccttcgaagacattgcaggaagaatcaagacttatgaagacagagttt
gggacgatgatgactcacatgaagaccaaggcaaacttatgtgtgttaaaacataatcacaagatggcaa
actcatgtatgcaaattcggattcacaagggcaatacgaatttcaggacagaggtagaggaagaggtcgt
ggacgatttggaagaggaagaggaagaggttatcaacaaagagataaaagcaaagtcacatgttataggt
gtgatagactcgggcactatgcctctgattgtccagaccgtcttctcaagctgatccgactccaagaaca
gaaagaaaaagaggaagatgacactcatgaagcagaatcacttatgatgcatgaggtggtatatctcaac
gagaagaatattcgcccaacagagttagaatcgtgtattaacaatgcttggtatcttgacaatggtgcta
gtaaccatatgacgggaaatcgtgcttggttctgtaagcttgatgagatgatcacagggaaagtaaggtt
cggtgatgattcatgcatcaatataaaaggaaagggttcgattccttttattagtaaaggaggtgaaaga
aaaatactatttgatgtttactacataccagacttgaagagtaacatcttaagtttaggacaagcaactg
aatcagggtgtgacatcagaatgagagaagactacttaaccttgcatgatcgagaaggaaatctactaat
aaaggcgcagcgatcaaggaacagattatataaagtgagtctagaagttgaaaactccaagtgcctgcag
ctcacaacaacaaatgaatcaacaatatggcatgccagactaggacacatcagttttgagaccattaaag
ctatgataaagaaagaacttgttattgggatatctagctcagttccacaagaaaaggaaacatgcggttc
ttgtttgttcggaaaacaagctagacattcattcccaaaagcaacttcttatcgtgcagcacaagtactt
gaactcatccatggtgatctctgtggacctatttcaccatctacagcagctaagaagaggtatgtatttg
tattgattgacgatcattcacgatacatgtggtctattctactaaaggagaaaagtgaagcgtttggaaa
gtttaaagagtttaaggcactagttgagcaagagtgtggggctatcatcaagacattcagaactgataga
gggggagagttcttatcacacgaatttcaagagttttgtgcaaaagagggaatcaatagacacttaactg
caccatacacgcctcagcagaatggagttgtggagagaaggaacagaacactcctaggaatgacaagaag
tattctcaaacacatgaacatgccgaattatctttggggagaagctgtgagacattcgacttatcttata
aacagagttggaacaagatcactttcaaatcaaacaccttatgaagtctttaaacataagaagccgaatg
ttgaacatttaagagtgtttggttgtgttagctatgctaaagtcgaagttccaaatctgaagaaattgga
tgataggtctcggatgcttgtttatcttggtacagaacctggttctaaagcgtatcgactacttgatcca
acaaaaagaagaatctttgtgagcagagatgtcgtctttgatgaaaacagaagctggatgtggcaagaat
caagctcagaaactgacaaggaatcagggacattcacaattaccttaagcgagtttggaaataatggagt
cacagagaatgatatctctacagaaccagaagaaacagaagaagctgagataaatggagaagatgagaat
atcattgaagaagcagaaactgaagagcatgatcaatctcaagaagaacctcaacccgtaagaagatcac
aaagacaagtaatccgacctaactacttgaaagactacgtgttatgtgcagaaatcgaagcagaacacct
tttacttgctgtcaatgatgaaccgtgggacttcaaagaagcaaacaagtcaaaagaatggagagatgct
tgtaaagaggaaattcaatcaatagagaagaatcgcacttggagtttggtcgatctccctgttggaagca
aagcaataggagtcaagtgggtttttaaactgaagcataactctgatggcagcataaataaatataaagc
aagactagtggcaaaaggatacgttcaacgacatggtgtagactttgaagaagtatttgctccggtggct
cgtattgaaacagttcgtctcataattgctttagcagcctcaaatggttgggagatacatcatttggatg
ttaaaactgcattccttcatggggaattaagagaagatgtctacgtctcacaacctgaaggcttcacaaa
caaagaaagcaaagagaaagtctacaaactgcacaaagctctctatggattacgtcaagcaccccgggct
tggaacactaagctaaatgaaattctcaaagagttgaagttcgaaaaatgtcacaaagaaccctcattat
acagaaaacaagaaggcgagaacattcttgttgtagccgtatatgtggatgaccttcttgtcacaggctc
taacttagacatcattctcaactttaaaaagggaatggttggaaagttcgagatgagtgacttaggtaaa
ctcacatattatcttggtatagaggttctacaaagtaaagatgggattacgctgaaacaagaaagatatg
caaagaagatcttagaggaagctggaatgagtaagtgtaacacagtcaatacaccaatgatagctagttt
ggagctgtctaaagcacaagatgagaagaggattgatgagactgattacagaagaaatattggttgtctt
cgttacttactccacacccgtccggatctctcttacaatgttggcatactgagtagatacttgcaggaac
cgagagaatcacatggagctgcactcaagcaaatcctaaggtaccttcaaggaacaacttcacatggact
ttacttcaagaaaggagaaaatgcaggattgatcggctatagtgacagcagtcacaacgtggacttagat
gatggtaaaagcaccggaggtcatattttctatcttaatgattgtcccatcacatggtgttcacagaagc
aacaagtggttacgctatcttcttgtgaagctgaatttatggcagccactgaggcagctaaacaggctat
ttggctccaagaacttctggctgaagtcattggtactgagtgtgagaaggtaacaattcgagttgataac
aagtctgctatagctctcacaaagaacccggtgttccacgggagaagtaaacacatccaccggagatacc
atttcattcgggagtgcgttgaaaatggacaaattgaagtagaacacgttcccggagtaagacaaaaagc
tgatatactaacaaaggctcttgggaagattaagttcttagagatgagagagcttattggagtacaagga
gtgtcaaaagaagatttcaagcttaaaagggagat1