;ID   ATCOPIA87_I DNA   ; ATH   ; 4087 BP
;XX
;DE   Internal region of the ATCOPIA87 copia-like LTR-retrotransposon.
;XX
;AC   AB028606
;XX
;DT   30-NOV-2001 (Rel. 6.3, Created)
;DT   30-NOV-2001 (Rel. 6.3, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; the ATCOPIA87 
;KW   family; ATCOPIA87LTR; ATCOPIA87_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4087)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal portion of the ATCOPIA87 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(3) p. 18 (2001)
;XX
;CC   ATCOPIA87_I is an internal region of the ATCOPIA87 copia-like 
;CC   endogenous retrovirus flanked by the 99% identical ATCOPIA87LTR
;CC   long terminal repeats, and a 5-bp target-site duplication (TAGAC). 
;CC   ATCOPIA87 forms a young family of copia-like endogenous retroviruses
;CC   present today in the A. thaliana genome. 
;CC   ATCOPIA87_I encodes well preserved remnants of the 1338-aa 
;CC   ATCOPIA87p copia-like polyprotein. 
;CC   ATCOPIA87p:
;CC   MVSELQVKEETSQEPRFERFDGRGDYTLWKRKLLAQLEVMGISDALKEKEEKKEAVETERVKVVSSSSER
;CC   RREEHKKDHSREEKENKARSVIILSVADNILRRIRTEETAAGMISVLDKLYLSDPLSSRISLKRKLFEFK
;CC   MSENKAVEENIEDFFRIVEDLEKLDVYVSDEDKAFMLLLSLPRKLEQLKYSLDYCEEPLTLGRVMTAIYK
;CC   KELEVAQIERQTEEEEKRLSLRERERSDYREEQAKGKEKVRSEAREKKGPCWRCGQKGHVKTECFQEKKN
;CC   KSRKKSVRYEESSAQSIVSGGSVFMVSEAAARASKGSSEEWICDTGCTSHMSSRKEWFEDLVFSESGNVS
;CC   MANDTTLQVKGIGSVRILNDDGTTVLLTNVMYIPGMSKNLISLGTLENKGCWFKSKNGILKVIKGCITLM
;CC   KAEKVGTLYMLKGKAVTARRRAVQGPKEETKMEHIKPAHMSQTSLEIPVKKGCIRKQKIGEVKVCEDRAQ
;CC   KEVKRIKFDSEKIVTEMKPKIVHSKIWSFASTPRKIKEGLRIKGITAKETGKSVCDTILLKSCTGGVDIA
;CC   VNLKKSLFSSSFEVSRDKETSSASNCRKLVISQDEKFQRDIICECCQEHECERRKCVSEKGIFSLHDKNE
;CC   LVYNESVIQGGASSSENSQQHSSLNSHQVMATFMRSRKEEMAKDGTYHHSTSHQVSMKRSGVTVYAPSSQ
;CC   RKPSPRERNIISVGCYDGEIFFKILLPEDKEDCFIRKNAEFLEGYLIKVLNLKFQDLEDQRRKSVRVSLK
;CC   SLQDYETSSFQVQGGAYSAKGSFISEGQESLNTSLQGRDAASETSEEETDLQGYLLTRINVKREVQPSKK
;CC   DEETKLVGSVQFIIEDAGKTEPTSFEEEKRDPSWSKVEEIQFLHNRDTWDLADKLMNQEASICKGIYKKK
;CC   PEFSGIGKPSFKVNLRVKGHSKEKVVLKLMEMKQKTDFLHENSEKNIRMLQPAVYEEKSRKEKAFLLENF
;CC   LYGLKNLPRSQNQRNDTLLIQDGYMRSKYVLWVYFKVSKMEIHQNRGFESPRLSQEAYVIKVIRLFKEDQ
;CC   CEFVFTSSETLFEFQALEEEKQGDQADYMKLISCLSDVDSVVVAIIGTRVNIAWFRGVVFRSKKEHWKSM
;CC   LRRMRGIKGTTESVQVHRKQEDFELTRVNDSKVAVGADQKRVLTCRSFTTAGSSISCKYMFQEVDFVSTK
;CC   KGERNRWTGAYKRVVWVRDFKDLIESSLVFKKQEESMLRGCYSSKFAAGSVLKVNFQQCVFSFAGNTLRR
;CC   ESGLLKVGVSSTTEAEKRVQEMDIQETVLSRELLDMDTGGLKKTTKLGMLSSAAGVVTKGLTRSMFQDGG
;CC   PMLRFTKN
;XX
;DR   Positions 25401 29487 Accession No AB028606    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4087 BP; 1479 A; 582 C; 1018 G; 1008 T; 0 other;
ATCOPIA87_I
aaaatggtatcagagctccaggttaaggaggaaacctctcaagaacctagattcgaaagattcgacggga
gaggagattacacactgtggaaaagaaagcttcttgctcagcttgaagttatgggtatctcagatgctct
aaaggagaaagaagaaaagaaggaagctgttgagacagaaagggtgaaggtcgtgtcttcaagctcagaa
agaagaagagaagaacacaagaaagaccattcacgagaagagaaggagaacaaggccagatctgttatca
tacttagtgttgcagacaatatcctaagaagaatcagaacagaggagactgctgcaggtatgataagtgt
tctagataaactctacttgtctgatccactatcaagtcgtatatctcttaaaaggaagctctttgagttt
aagatgagtgaaaacaaggctgtagaagagaatatagaagatttcttcaggatagttgaagatctagaaa
agttagatgtttatgtgtctgatgaagacaaagcattcatgttgcttttgtctctccctagaaagcttga
acagcttaagtactctttagattattgcgaggaacccttaactttgggtagagtaatgactgcaatatac
aagaaagagcttgaggttgctcaaatagaaagacaaacagaggaagaagagaagcgtttgtctttaaggg
aaagagaaaggtcagattacagagaagaacaagctaagggaaaagagaaggtcagatctgaggcaagaga
aaagaaaggtccatgttggagatgtggacaaaagggacatgtcaagacagagtgcttccaagagaaaaag
aacaagtcaaggaagaagtcagttagatatgaagaatcatcagcgcaaagtattgtttctggtgggtctg
tgttcatggtttcagaagctgctgctcgagcaagtaaaggaagctctgaagaatggatttgtgacacagg
ttgtacttctcatatgagctcaagaaaagaatggtttgaggacttggtattttctgaatctggaaatgtg
tcaatggcaaatgacactactttacaggttaaagggattggaagtgtgaggatcttgaatgacgatggaa
caacagtcttgttgactaacgtcatgtatattccgggcatgtcaaagaatctcatatccttagggacact
tgagaacaagggatgctggtttaaatccaagaatgggattttaaaggtcataaagggatgcataacattg
atgaaggctgaaaaagttggtacactttacatgctaaagggtaaagcagtaacagcgagacgaagagctg
tacaaggaccaaaagaggagactaagatggagcatatcaagcctgctcacatgagtcaaacgagtctcga
gattccggtcaagaaagggtgtatcagaaaacagaaaatcggtgaagtgaaagtctgtgaagacagggca
cagaaagaagtaaaaagaatcaagtttgattctgaaaagattgtcactgaaatgaagcctaagattgtgc
attcaaagatatggagtttcgcatctactccgagaaaaatcaaagaaggattgagaataaaaggcatcac
tgcaaaagaaacagggaaatcggtttgtgacactattctgttaaagtcatgtacaggaggtgttgatatt
gcagtgaatctcaagaaaagtttattctcatcaagttttgaagtctcaagagataaagagacaagttctg
catcaaactgcaggaaattggtcataagtcaagatgagaaatttcagagagacatcatttgtgaatgttg
tcaagaacatgaatgtgaaaggcgaaagtgtgtctcagaaaagggaattttcagtctacatgacaagaat
gagttagtctacaacgaatctgttattcaaggtggagcaagctcaagtgaaaattcacagcaacattcaa
gtttgaattctcatcaagtcatggcaacgttcatgagatcaagaaaagaagaaatggcaaaagatgggac
atatcatcactcaacaagtcatcaagtgagtatgaaacgttctggagttacagtatatgcaccatcaagc
caaaggaagccaagtccaagagaaagaaatattatttctgtcggatgttatgatggagaaatctttttca
agatcttgttgcctgaagataaagaggactgcttcataaggaaaaatgcagagtttctagaaggttattt
gatcaaagtgttgaatcttaagtttcaggatttagaggatcaaagaagaaaatctgtgagagtaagtctg
aaatctctacaagattatgaaaccagtagctttcaagttcaaggtggagcttattcagcaaaaggcagtt
tcataagtgaaggacaagagtctttaaatacgtcgttgcaaggtagagatgcagcatcagaaaccagtga
agaagaaacggatctgcagggttatctattgacaagaattaatgtcaaaagagaagttcagccaagtaag
aaagatgaagaaacaaagttagttggatcagttcagtttatcattgaagatgcaggaaaaacagaaccaa
ctagctttgaggaggaaaagagagatccaagctggtcaaaagttgaagaaatccagtttttgcataatag
ggatacatgggatctggctgataaactcatgaatcaagaggcaagtatatgcaaagggatctacaagaag
aaaccagaattttctggaattggaaaaccaagttttaaagtcaacttgagggttaaaggtcactcaaaag
aaaaagtggttctgaagctaatggagatgaagcaaaagacagatttcttacatgaaaattctgagaagaa
cattcggatgcttcaacctgcagtttatgaagaaaaatcaagaaaagagaaagcatttctcttggagaat
tttctgtatggtttgaagaatttaccaagatcacagaatcagagaaatgacacactcttgattcaagatg
gttatatgagaagcaagtacgttttgtgggtatacttcaaggtgtcaaagatggagatacatcaaaacag
aggatttgaaagtccaaggctgtctcaggaagcttatgtgataaaggtaataaggttgttcaaggaagat
caatgtgagttcgtttttacttcatcggaaactctatttgagttccaagcactcgaagaagaaaaacaag
gagatcaagcggattatatgaagttaatatcatgcttaagtgatgttgatagtgtggtagtcgccatcat
tggcacaagagttaatattgcttggtttagaggagtggtgtttcgttctaagaaagaacattggaagtca
atgttacggaggatgcgaggcattaaaggaaccacggaatcagttcaagttcaccggaaacaagaagatt
ttgagcttacaagagttaatgactcaaaagttgcagttggtgctgatcaaaaacgagttttgacgtgtcg
gagtttcacgactgcaggcagctcaataagctgcaaatatatgtttcaagaagtagattttgtgtctaca
aagaagggtgaacgcaacagatggactggggcttataagcgagttgtttgggtgagagactttaaggatt
tgatagaatcaagtctggttttcaagaaacaagaagagtctatgttaagaggatgctacagctcaaagtt
tgcagcaggatcagttttaaaggtaaactttcaacaatgtgtgttcagttttgcaggaaacacattaagg
agagaatctggtctgctgaaagtgggtgtttcatctacaacagaggctgaaaagagggtccaggaaatgg
acatacaagagacagttttgtcaagagaattgcttgacatggacacaggagggctgaagaaaactaccaa
gttgggtatgttgtctagtgcagccggtgttgtcaccaaagggttaacaaggagcatgttccaagatggc
gggccgatgctcaggtttaccaagaactgaccaaggtcagggtccggggaaggaatccaagaactagaat
catcaactcaaagacaaggtggagttt1