;ID   ATCOPIA45_I DNA   ; ATH   ; 4211 BP
;XX
;DE   Internal region of the ATCOPIA45 copia-like LTR-retrotransposon.
;XX
;AC   AL161592
;XX
;DT   01-OCT-2001 (Rel. 6.2, Created)
;DT   01-OCT-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA45LTR; ATCOPIA45_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4211)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Repbase Reports 1:(1) p. 2 (2001)
;XX
;CC   ATCOPIA45_I is an internal region of the ATCOPIA45
;CC   copia-like endogenous retrovirus flanked by 1% divergent 
;CC   ATCOPIA45LTR LTRs and 5 bp-long target-site duplication.
;CC   ATCOPIA45_I encodes the 1367-aa ATCOPIA45p copia-like polyprotein
;CC   (it is disrupted by two stop codons emerged after integration of
;CC   ATCOPIA45 in the genome).
;CC   ATCOPIA45p:
;CC   MGDIVPVTNNTKEGSSSSSIQCPMLTATNYTFWTIRMTMALKVHKVWETIEEGLDDIDKNNMASALLLQS
;CC   IPEALTLRVGKLKTAKKIWDAIKARNLGADRVKDARLQTLMGEFERIKMKETEKIDDFAGRLSELSTKSA
;CC   DLGNDIEEPKLVKKFLNSLPRKRYIHIIAALEQVLDLNTTSFEDIVGRLKAYEERICDEEDNQDDQGKLL
;CC   YANSEEKSAQNNWNPNKGKSQGGRGYGRGRGRDRFGNAQGNRDMTNVICYRCDKLGHYASDCPDRLLKLQ
;CC   ETQDVKNDDTQKADALMMHEVVFLNEKNVMPNKLEASLDVDNVWYLDNGASNHMTGNLAYFGEIDERVTG
;CC   KVRFGDDSRIDIKGKGSITFIAKNEERKILADVYYIPDLRSNIVSLGQATESGCDVRMRDDHLTLYDRDG
;CC   KLLIKATRSRNRLYKVIMEVDDTKCLQLESLSETTKWHARLGHIGTDNLKRMMQKELVIGIPNIKVEKEM
;CC   CGSCLLGKQARKPFPQATPYRATSILELLHGDLCGPITPSTVA*NRYIFVLIDDYSRYMWSLLLKEKSEA
;CC   FNKFKSFKACVEQETGATIKTFRTDRGGEFVSQEFQAFCDASGIKRHLTAPYSPQQNGVVERRNRTLMEM
;CC   TRSILKHMSVPNYLWGEAVRHSTYLINRVATRTLVDQTPYKVLKSKKPNVEHLRVFGCIGYAKAEAVHLR
;CC   KLDDRSRMLVHLGTEPGSKGYRLLDPTRRKVIVSRDVVFDEEKRWKWNNSEDEINNVPGMFSLSFEEFGN
;CC   NGIREEDDITEETEINDGENHDREAEIPTQAIETVEQ*VEPQVTLRKSVRVISKPSYLDDYVLLASIECE
;CC   RLLLMINEEPWDYNEAKELQEWKKACVEEIASITKNHTWDLVDLPIGAKPIGLKWVFKLKRNSDGSVNKH
;CC   KARLVAKGYVQRHGIDFDEVFAPVARIETVRLIIALAASNGWEIHHLDVKTAFLHGELKEVVYVSQPEGF
;CC   VIGGSEDKVYKLNKALYGLKQAPRAWNNKLNKILMELKFTKCSKEPSLYCRRDKDELLVVVVYVDDLLVT
;CC   GSNLQVILEFKEEMAKKFEMSDLGKLTYYLGIEVFQHEGGIMLKQERYANKILEETKMDDCNAVQIPMDA
;CC   NLKLSKAQEEKNIDEKEYRRNIGCLRYLLHTRPDLSYCVGVLSRYMHEPKESHGAALKQILRYLRGTQSF
;CC   GLCFKRMNKTELVGFSDSSHNVDEDDGRSTTGHIFYLNDCPITWCSQKQETVALSSCEAEFMAATEAAKQ
;CC   AVWLQELLEEIVGKTCKQVLILIDNKSAIALTKNPVFHGRSKHIHKRYHFIRECVANEQVEVEHVPGTEQ
;CC   RADILTKALGRIKFKEMRDLVGVQDMTKCSFKLKGEH
;XX
;DR   Positions   14885   19095  Accession No AL161592    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4211 BP; 1448 A; 686 C; 1019 G; 1058 T; 0 other;
ATCOPIA45_I
attggtatcagagcataatattgattcaagcatctttacgatcaaacaaaggcaatcaacgaagttagcg
ttgattccgtacggagaaagaaggtaacttgtgaagaagacatgggcgatatagtaccagtaacgaacaa
caccaaagaaggtagcagctcttcctctattcaatgtccgatgcttacggcaacaaactatacgttttgg
acaatacgtatgacgatggctctaaaagttcacaaggtatgggaaacgattgaagaaggattagatgata
tcgataaaaacaatatggcaagcgctctccttctccaatctatccctgaagcattaaccttgcgagttgg
aaaacttaaaaccgcaaagaagatatgggatgcgataaaggctagaaatctaggagctgatagggtcaaa
gatgcaagacttcagacattgatgggcgagtttgaaaggataaagatgaaagaaactgagaagatagatg
acttcgctggaagactctcagaattgtccaccaaatcagcagatcttgggaatgatattgaagaacctaa
gttggtgaaaaagtttctcaatagcctaccacggaaacgttatatacatatcattgctgcactagaacaa
gttcttgatcttaatacaacaagtttcgaggatatagtcggcagactaaaggcatatgaagagagaatct
gcgatgaagaagataaccaagatgatcagggaaagcttttgtatgcgaactctgaagaaaagtcagctca
gaacaattggaatcccaataaaggaaaaagtcaaggtggtcgaggatacggacgaggaagaggcagagat
cggtttgggaatgcacaaggcaacagagacatgacaaacgtgatttgctataggtgtgataagctaggac
actatgcttccgactgtcccgatagattgcttaaacttcaagaaacacaagatgttaagaatgatgatac
acaaaaggctgacgcattaatgatgcacgaagtagtgtttcttaacgagaagaatgtaatgccaaataaa
cttgaggcgagcttggatgttgataatgtgtggtacttggataatggcgcgagcaaccatatgacgggaa
acttggcttactttggtgagattgacgaaagagttacgggaaaggttcgttttggtgacgattctcgtat
tgatatcaaaggaaaaggctcgattacgtttatagctaagaatgaagagagaaaaatcttggctgatgtc
tattacattcccgatttaagaagcaacatcgtgagtcttggtcaggctaccgagtccggatgtgatgtta
gaatgcgagatgaccatctaactctatatgatagagatggaaagctgctaataaaagcgacaagatccag
aaatcgactttacaaggtgatcatggaagtcgatgatacaaagtgtttgcaacttgagagcttgagtgaa
acgacaaagtggcatgcaagattgggtcatattggaactgacaacttgaaaagaatgatgcaaaaggaat
tggtcattggcattcctaatatcaaagtcgagaaagaaatgtgtggctcatgcttgcttggtaagcaagc
tagaaaaccatttccacaagcaactccatatcgtgcaactagcatactcgagctcttacatggagatctt
tgtggaccaattacgccttctacagtagcataaaataggtatatctttgtcctaattgatgattattcac
gttatatgtggtcacttcttctcaaggaaaagagcgaagcgttcaataagtttaaaagctttaaggcatg
tgtggagcaagaaactggtgctaccattaaaacgtttcgaacggatagagggggagagtttgtttctcaa
gagtttcaagcgttttgtgacgcctccgggattaaaaggcacttaactgcaccgtattctccacaacaaa
acggagtggtcgagaggcgcaatagaacgttaatggaaatgactagaagcatcttgaaacacatgagtgt
cccaaattatctatggggagaagcagtgagacattcgacttatcttataaacagagtagcgacgagaact
ttggtagatcaaaccccatacaaagtcttaaagagtaagaagccaaatgtggagcatttacgggtctttg
gttgcattggttacgctaaagcagaggctgtacatttgagaaagttagacgatcgctcacgaatgctagt
acacctaggaacagagcctggatcaaaaggctatcgtttattagatccaacaaggagaaaggtgatagtt
agcagagatgttgtatttgatgaagaaaaaaggtggaagtggaataatagtgaagatgaaattaataatg
ttccaggaatgttcagtcttagttttgaagaatttggcaacaatggtataagagaggaggatgatatcac
agaagaaacagagatcaatgatggagagaatcatgatcgtgaagctgaaattcctactcaagcaatagag
actgtagaacaataagtagagccgcaagtcacactaagaaaatccgtgagagtgatctcaaaaccaagtt
acttagacgattatgttttgttggcctcaatcgaatgtgaacggctcttacttatgataaatgaagaacc
atgggactacaatgaggcaaaggaattgcaagaatggaagaaagcgtgtgtagaagagattgcgtcgata
actaagaaccacacttgggatttggtggatcttccgatcggagctaaaccaattggactcaaatgggtct
ttaagttaaaacggaactctgatggaagcgttaacaaacacaaagcaaggcttgtagctaagggttatgt
tcaacgacacggtatcgactttgatgaagtatttgctccggttgcccgaattgaaaccgttcgacttatc
attgctttagcagcttctaatggatgggaaatacatcatcttgatgtgaaaactgcatttcttcatggag
aattaaaagaagtggtctacgtttcacaacctgaaggatttgtgataggaggaagtgaagataaagtgta
caagctgaacaaagctctttacggactcaaacaagctcctagagcatggaataacaagttgaacaagatt
ctcatggaacttaagtttactaagtgctctaaagaaccgtcattatattgtcgaagagacaaggatgagc
tacttgttgtggtggtctatgtggatgacctacttgtcacgggatctaacttacaagtcatacttgagtt
taaagaagagatggcaaagaagtttgaaatgagtgatcttggaaagcttacgtattacttaggtattgaa
gtgtttcaacatgaaggcggcattatgcttaaacaagagaggtatgcaaacaagatcctagaagaaacaa
aaatggatgattgtaatgcggttcagattccgatggatgcaaacctaaagctaagtaaagcacaagaaga
gaaaaacatcgatgagaaggaatatagaagaaatattggatgtcttcgatatttgctgcacacacgtcct
gacctttcttattgtgttggagtgcttagtagatatatgcacgagccaaaggaatcacatggagcagcct
tgaaacagattcttaggtacttgcgaggaactcaatcttttggtctctgtttcaaacgaatgaataaaac
agagctagtaggatttagtgatagcagtcacaatgttgatgaggatgatggaagaagtacgacgggtcat
attttctatcttaatgactgtcctatcacttggtgctcgcaaaagcaagaaactgtggctctatcgtctt
gtgaagctgagtttatggcggctacagaggcagcaaaacaagcggtttggcttcaagagttacttgaaga
gattgttggaaaaacgtgtaagcaagtgttgatattaattgacaacaagtcggctatagcactcacgaaa
aacccggttttccatggacgaagcaaacacattcacaagaggtatcactttattcgtgagtgcgttgcga
atgaacaagttgaggtggagcacgtccccggaaccgaacagagagccgacattctaacaaaggcactcgg
gagaatcaagttcaaagagatgagagatcttgtgggagttcaagatatgacaaagtgtagcttcaagctt
aagggggaaaa1