;ID   ATCOPIA58_I DNA   ; ATH   ; 4801 BP
;XX
;DE   Internal region of the ATCOPIA58 copia-like LTR-retrotransposon -
;DE   a consensus sequence.
;XX
;AC   .
;XX
;DT   01-OCT-2001 (Rel. 6.2, Created)
;DT   01-OCT-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; gag, reverse transcriptase; integrase; 
;KW   ribonuclease H; ATCOPIA58LTR; ATCOPIA58_I.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4801)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Repbase Reports 1:(1) p. 18 (2001)
;XX
;CC   ATCOPIA58_I is a consensus sequence of an internal region of the 
;CC   ATCOPIA58 copia-like endogenous retrovirus. There are 4 copies of 
;CC   ATCOPIA58_I present in the genome; they are 98% identical to the 
;CC   consensus sequence, and are flanked by ~1% divergent ATCOPIA58LTR 
;CC   LTRs and 5-bp target-site duplications.
;CC   ATCOPIA58_I encodes the 1021-aa ATCOPIA58p copia-like polyprotein.
;CC   ATCOPIA58p:
;CC   MDYPKEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGEDVLKTEDQWNDA
;CC   EEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKRSRIDMLASQFENLSMEET
;CC   ENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFESKRTAMGTSLDTDSIDFEEVVGMLQAYE
;CC   LEITSGKGGYSKGLALAASAKKNEIQELKDTMSMKNEIQELKDTISMMAKDFSRAMRRVEKKGFGRNQGT
;CC   DRYRDRSSKRDEIQCHECQGYGHIKAECPSLKRKDLKCSECKGLGHTKFDCVGSKSKPDRSCSSESESDS
;CC   NDGDSEDYIKGFVSFVGIIEEKDESSDSEADGEDEDNSADEDSDIEKDVNINEEFRKLYDSWLMLSKEKV
;CC   AWLEEKLKVQELTEKLKGELTAANQKNSELTQKCSVAEEKNRELSQELSDTRKKIHMLNSGTKDLDSILA
;CC   AGRVGKSNFGLGYNGAGSGTKTNFVRSEAAAPTKSQTGFRSNYDAVPARRVYQNHDHYHSRRTVTGYECY
;CC   YCGRHGHIQRYCYRYAARLNKLKRQGKLYPYQGRTSKMYVRREDLYCHVAYTSIEEGIKKPWYFDSGASR
;CC   HMTGSQSNLENYTSVKESKVTFGGGDKGKIKGKGDLTKAEKPQLTNVYFVEGLTANLISVSQLCDEGLTV
;CC   SFNSVKCWATNEKNQNTLTGVRTGNNCYMWEEPKECLRAEKEDPVVWHQRLGHMNARSMSEIVSKEMVRG
;CC   VQELKHIEKIVYDAYNQGKQIRVQHKRVVGVVERKNQTFQEMARAMIHGHGVPEKFWTEAISTACYVINH
;CC   VYVRIGGTFDKLVKAFVKTMTTEFRLSMVGELKYFLGLQINQIDEGIAISQSTYAQNLVKRFDMCSSNPV
;CC   ETPMSTTNLCSCCTKILWMKHMGLDYGMSFSDPLLVKCDNESAIAISKNPVQHSITKHIAIRHHFVRELV
;CC   EEKQITVEHVPTEIQLADIFTKPLDLNMFVNLQKSLGIGEV
;CC   Presumably, ATCOPIA58 was a semi-autonomous endogenous retrovirus. 
;CC   It is ~85% identical to ENDOVIR1.
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 4801 BP; 1598 A; 755 C; 1186 G; 1262 T; 0 other;
ATCOPIA58_I
tttggtatcagagcgggcatctgaaccaagttgtacttaacaacaggtgcagatcctgcggagaggatgg
actaccccaaagagttcgttgcggtcggtaaagcaatcatgttggaaaaaggaaattacggacactggaa
agtgaagatgagagctctcatacgtggtctaggaaaggaagcctggattgctacgagcattggatggaag
gctccggtcatcaagggagaagatggagaagatgtgctaaaaactgaagatcaatggaatgatgcagaag
aggcaaaggccacagctaattcaagggcactgtctttgatattcaactccgtgaatcaaaatcaattcaa
gcggattcaaaactgtgaatcagctaaagaagcatgggataaacttgctaaagcatatgaagggacaagt
agtgtcaaaagatccagaatcgacatgttagcatctcagtttgaaaatctcagtatggaagaaacagaga
acattgaggagttcagtgggaaaatcagtgccatagcgagtgaagcacacaatctaggaaagaaatacaa
agataagaaactggtcaagaaactgttgaggtgtctcccatcaaggtttgaaagtaagagaacggccatg
ggaacgtcgttggacactgactcaatcgattttgaagaagtagtgggaatgctccaagcatatgaattag
aaatcacttccggaaagggaggttactccaaaggacttgctttagctgcatcggcaaagaagaatgagat
acaggaattgaaggatacaatgagcatgaagaatgagatacaggaattgaaggatacaataagcatgatg
gcaaaagacttcagtagagcaatgaggagagttgagaagaaaggattcggaagaaatcagggaactgata
gatatcgagaccgaagttcaaaaagggatgagattcaatgtcatgaatgtcaaggatacggacatattaa
agctgaatgtccctccttaaagagaaaagatctcaagtgctctgagtgtaagggtcttggacacactaag
ttcgactgtgttggatcaaagtctaagcctgatagatcctgcagttctgaaagtgaaagtgactcaaatg
atggagactcggaagattatataaaaggtttcgtgtcttttgtaggaatcattgaagaaaaagatgaaag
ttcagacagtgaagcagatggtgaggatgaggacaactcagctgatgaggattctgacatcgaaaaggac
gttaacatcaatgaagagttcaggaaactgtatgacagctggttgatgctgagtaaagagaaagttgcct
ggctggaagagaagctaaaagttcaagaactgacagaaaagctgaaaggagagttaactgctgcaaatca
gaagaactctgagctgactcagaaatgcagtgtggctgaagagaaaaacagagaactttctcaagagctt
agtgacactcgcaagaagatccacatgctgaacagtggaacaaaagatttggatagtatacttgctgctg
gaagagtgggaaaatcaaattttggtttaggatacaatggtgctggatcaggtacaaagacgaattttgt
acgaagcgaagctgctgctccaacaaaaagtcaaacaggttttcgaagcaactatgatgctgttccagca
agacgcgtgtaccagaatcacgatcactatcattcccggagaactgtgacaggttacgaatgttactact
gtggaagacatggtcatattcagagatattgctacaggtatgctgctaggttgaataagctgaagagaca
aggaaaactatatccatatcaaggaagaacctccaagatgtatgtcagaagggaggatctctattgtcat
gtagcatacacctcgattgaagaaggaataaagaaaccatggtattttgacagtggagcatccagacata
tgacaggaagtcaatccaatcttgaaaattacacctctgtcaaggaaagtaaagttacttttggaggtgg
ggataaaggaaaaatcaagggaaaaggtgatttgactaaagcagaaaagcctcagcttacaaatgtgtac
tttgtcgaagggcttactgcaaatctgattagtgtgagtcagctatgtgatgaagggctgactgtgagtt
tcaatagtgtaaaatgctgggctacaaacgagaagaaccaaaacactctcactggagttagaactgggaa
caattgctacatgtgggaagaacctaaagagtgtcttagagctgaaaaagaggatccagtggtatggcat
caacgtcttggtcacatgaatgcgaggagcatgtcagaaatagtgagcaaggaaatggttagaggagtac
aagagctgaaacacatagagaaaattgtgtacgatgcctacaatcaaggtaaacagattagagtccaaca
caagagagttgtaggtgttgttgagagaaagaaccagacttttcaagagatggccagagccatgattcat
ggacatggagttcctgaaaaattctggacagaggctatcagtacagcatgttatgtgataaatcatgttt
atgtgaggattggaggcacattcgacaagttggtcaaagcgtttgtgaagacaatgacaactgagttcag
gttgagtatggtaggcgaacttaagtactttctggggttgcaaatcaatcagattgatgaagggattgct
atctcgcaaagcacctatgctcagaacctggtgaaacgcttcgatatgtgttccagcaatccagttgaaa
ctcctatgagcactaccaacctctgcagctgctgtactaaaattctttggatgaaacacatgggtttgga
ttacggtatgtcattttctgaccctttacttgttaaatgtgataatgaaagtgctattgccatatctaag
aatccggtacaacactcaatcactaaacacatagctataagacatcattttgttagagaattagttgaag
aaaaacaaattaccgtagaacatgtgcctactgaaattcaacttgctgatattttcactaagcctttgga
cttgaacatgtttgtgaacttgcaaaagtccctgggtattggtgaagtctaactatcttcttgatgagtg
tttgtgctgaaacagggttggttgtacagtgagaagcaagttttcatatctattttacaaagtgtcataa
ccggttttcaaatcttctgctgtaaaaagttcaccaagagaaagagctacccttcatggttcggagatgc
aaagtcagtaaagtgtgtgacagacatagtgtgtagagcattaaggaagggatgaaatgcagaaaggggc
agaaaagaagcatttcatttcataagatgaagcatgagcttatctaagacacgggggacaaaggagaaat
cttgtcactaacccgatgggaacaaaggagaaatcttgttcacctacactttggagagaagggatgtcaa
acaaaaatattgagagaacaagtaagaaagagcatcactacactctgcaacaagacaagagaaaaaaaaa
aaaaataaaaaaaaaaaaggggggtgcaaaaacaagttctctcactgacatctcgtctctctgcagaaag
gagcagtaagaagcaacaataagttctgaatacaaaagatcagacttgtaatttatcaatagcctgaatg
ttgctgcacagtctgtcacactccttgttaaaaggactcgctaacactctgagatatgacggtgtgaaga
ctactcttggagaatcaacattctcttttgtgttgtgtcagcagattatgttctgaatttttttgaatca
agttacaatgagcttagtattgtacaatcacctctgggacagttctggacctcagtatgagggagagatg
gttattactgtgctgatactaatggggacttgttctgtgtatatcaagatggcttcagaaaaatgagttg
actttcagaggtacttagttcaattctataagctaattgggttgttcggttttgttttatggtggttatt
gtgttttatgagatttgaccggaccaaaatccggttagcactgatttttggttttgagtttctatggttt
gcataaagggatctatcgaatgaaacaagtgagaaaacccagaaaattgagctctcacgccgaaacacta
gcgatgggagaacacgcagcgatgcccggcgtatgcgaaagatgtttaatccagagtatgtgtttggcac
atccttacctgtagatttgagatatgtctctaccttgctgacgacaataacgacacatatggaaggtatg
gtgtcttggtcctgtgctcagtgtctggtttaattggtagagtgtctagatagagtttttgatcaattgt
gtcatgcaagcagggggagattatggactggtccctgagaactctaattcatttgtaaactctctagatg
ttgataacgaagaaggctttgaagatactcaatcttcttagtgtgtttagatctagttgtgggggagttt
aggattaagggggagtttttaagctgtgtttgttaatcatgtccttttggttttagtaggtctttgtttc
taagtactggatgatgtgttttgaatcttaaaaacttatttatgcaatctatgatgagaccctctattgg
ttatttccgctgcttattatctctatgacttgatgtgtgcttgtgagttttctttgaggggtttcttgca
ggattgcttgaattgacaaattgtatcaaaaagggggagat1