;ID   ATCOPIA51_I DNA   ; ATH   ; 4214 BP
;XX
;DE   Internal region of ATCOPIA51 copia-like LTR-retrotransposon.
;XX
;AC   AL161515
;XX
;DT   02-SEP-2001 (Rel. 6.2, Created)
;DT   02-SEP-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA51LTR; ATCOPIA51_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1]
;RA   Spiegel,L.A., Huang,E.N., Nascimento,L.U., de la Bastide,M.,
;RA   Vil,D.M., Preston,R.R., Matero,A., Shah,R., O'Shaughnessy,A.,
;RA   Rodriguez,M., Shekher,M., Schutz,K., See,L.H., Swaby,I.,
;RA   Habermann,K., Dedhia,N.N., Mewes,H.W., Lemcke,K. and Mayer,K.F.X
;RL   Direct submission (March 2000)
;XX
;RN   [2]
;RA   Terol,J., Castillo,M.C., Bargues,M., Perez-Alonso,M.
;RA   and de Frutos,R.
;RT   Structural and evolutionary analysis of the copia-like elements in
;RT   the Arabidopsis thaliana genome
;RL   Mol. Biol. Evol. 18, 882-892 (2001)
;XX
;CC   ATCOPIA51 was found by [1], minor modifications of the sequence
;CC   coordinates were made by [2].
;CC   ATCOPIA51_I is an internal region of the ATCOPIA51 copia-like 
;CC   endogenous retrovirus flanked by 1% divergent ATCOPIA51LTR 
;CC   LTRs. ATCOPIA51_I encodes the 1392-aa ATCOPIA51p copia-like 
;CC   polyprotein. 
;CC   ATCOPIA51p:
;CC   MAPAYPFPDNVHVSSSVTLKLNDSNYLLWKTQFESLLSSQKLIGFVNGVVTPPAQTRLVVNDDVTSEVPN
;CC   PQYEDWFCTDQLVRSWLFGTLSEEVLGHVHNLTTSRQIWISLAENFNKSSIAREFSLRRNLQLLTKKDKS
;CC   LSVYCRDFKIICDSLSSIGKPVEESMKIFGFLNGLGREYDPITTVIQSSLSKLPAPTFNDVISEVQGFDS
;CC   KLQSYDDTVSVNPHLAFNTERSNSGAPQYNSNSRGRGRSGQNRGRGGYSTRGRGFSQHQSASPSSGQRPV
;CC   CQICGRIGHTAIKCYNRFDNNYQSEVPTQAFSALRVSDETGKEWYPDSAATAHITASTSGLQNATTYEGN
;CC   DAVLVGDGTYLPITHVGSTTISSSKGTIPLNEVLVCPAIQKSLLSVSKLCDDYPCGVYFDANKVCIIDLT
;CC   TQKVVSKGPRNNGLYMLENSEFVALYSNRQCAASMETWHHRLGHSNSKILQQLLTRKEIQVNKSRTSPVC
;CC   EPCQMGKSTRLQFFSSDFRALKPLDRVHCDLWGPSPVVSNQGFKYYAVFVDDFSRFSWFFPLRMKSKFIS
;CC   VFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGL
;CC   SMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKF
;CC   DPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTP
;CC   PSVPSSQLQPLARQMTPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNAL
;CC   GQGSLENLHPMITRSKDGIQKPNPRYALIVSKSSFDEPKTITTAMKHPSWNAAVMDEIDRIHMLNTWSLV
;CC   PATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPVVRTATIRLVLDTATANEW
;CC   PLKQLDVSNAFLHGELQEPVFMFQPSGFVDPNKPNHVCRLTKALYGLKQAPRAWFDTFSNFLLDFGFECS
;CC   TSDPSLFVCHQNGQSLILLLYVDDILLTGSDQLLMDKLLQALNNRFSMKDLGPPRYFLGIEIESYNNGLF
;CC   LHQHAYASDILHQAGMTECNPMPTPLPQHLEDLNSEPFEEPTYFRSLAGKLQYLTITRPDIQYAVNFICQ
;CC   RMHAPTNSDFGLLKRILRYVKGTINMGLPIRKHHNPVLSGFCDSDYAGCKDTRRSTTGFCILLGSTLISW
;CC   SAKRQPTISHSSTEAEYRALSDTAREITWISSLLRDLGISQHQPTRVFCDNLSAVYLSANPALHKRSKHF
;CC   DKDFHYIRERVALGLIETQHIPATIQLADVFTKSLPRRPFITLRAKLGVSASPVSPTPSLKE
;XX
;DR   Positions  4999   9212  Accession No AL161515    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4214 BP; 1202 A; 1030 C; 791 G; 1191 T; 0 other;
ATCOPIA51_I
tggtatcagagctattataccttaacaggtgattcaatggcccctgcttacccttttccagacaatgtcc
atgtctctagttccgttaccttaaagctcaacgatagtaactacttgttgtggaagacacagtttgagtc
ccttctatcgagccaaaagctcataggttttgtcaatggagtcgtcactcctccagctcagactcgtctt
gttgttaatgatgatgtcaccagcgaagttccgaatcctcaatatgaagactggttttgcacagaccagc
tcgtccggtcgtggttgtttggtacgctttcagaggaagtgcttggtcatgtccacaacctcactacatc
tcgtcagatttggatctctctagctgaaaatttcaacaaaagtagcatcgccagagagttttctcttcgt
cgtaatcttcaacttctgacaaaaaaagataagtctctatctgtttactgtcgtgattttaaaataatat
gcgactctctaagctccattggcaaaccagtagaggaatccatgaaaatctttggctttctcaatggact
cggcagagagtacgatcctatcaccacagttatccaaagctccctaagcaagctccctgctccgacgttt
aacgacgtcatctccgaagttcaagggtttgacagtaagctgcaatcttatgacgacactgtctctgtta
atcctcatcttgcgttcaatactgaaagatctaactctggcgctcctcaatacaattccaattcccgtgg
tcgtggtcgttctgggcaaaacagaggacgcggtggctactctacacgcggcagaggattttctcaacat
caatccgcttcaccatcatcaggacaaagaccagtttgtcaaatttgtggtcgcataggacacactgcta
tcaaatgctacaaccgatttgacaacaactaccaaagtgaagtccctactcaagcattttctgctctccg
tgtctctgatgaaaccggcaaggaatggtaccccgattctgcagccacagcccacataacagcctcaaca
tctggtctgcaaaacgcaacaacatatgagggaaacgatgcagtcttggttggagatggaacatacctcc
ctattacacatgttggatccaccacaatttcctcatccaaaggtactattccgttgaatgaagtcttagt
gtgccctgctatacaaaaatctcttctatctgtgtccaaactttgcgatgattatccatgcggtgtttat
tttgatgctaataaggtttgcataattgatttaaccactcagaaagtggtgtccaagggtccacgaaata
atgggctctacatgctggagaattcagagtttgtagcactctattcaaatcgtcaatgtgcagctagcat
ggaaacatggcatcatcgacttggccactcaaactcaaagattcttcagcaacttttaacccgcaaggaa
atccaagtgaataaaagcagaacttctcccgtttgtgagccttgccaaatgggaaagagcactagattac
agtttttctcttctgattttcgagctttaaaacctttagatcgagttcattgtgatctttggggaccatc
accggttgtatcaaaccaaggattcaaatactatgcagtttttgttgatgatttctcaagattctcttgg
ttttttcctttgcgcatgaagtcaaagtttatttcagtgtttattgcatatcagaaattggttgagaatc
aacttggtacaaaaatcaaagagtttcaaagcgatggagggggagaatttacaagcaacaaattaaaaga
acactttagagagcatggcattcatcatcgtatatcttgtccatatacaccgcaacaaaacggtgttgcc
gaaaggaagcacagacatttggtagagcttgggctttcaatgttatatcacagtcatacacctctcaagt
tctgggtagaagctttcttcactgccaactatctcagtaatctcttgccttcttctgtcctcaaggaaat
aagtccctatgaaactttgtttcaacaaaaagttgattatacacctctccgagtgtttggtacagcctgc
tacccctgcttgagaccgttagcaaagaacaagtttgatccacgctcgttgcaatgcgtgtttcttggct
atcacaaccaatacaagggataccgctgtttgtatcctcctaccggtaaagtctacatctctagacatgt
catttttgatgaagctcaattcccatttaaagaaaagtaccacagtctggttccaaaataccagacgacc
ttactacaggcttggcaacatactgatctcacaccaccttcagtgccttcttctcaattacaacctcttg
caagacaaatgactcctatggcaacaagtgagaatcagccaatgatgaattatgagacagaggaagccgt
caatgttaatatggaaactagctctgatgaggaaactgaatcaaatgatgaatttgaccacgaagtagct
cccgtactaaatgatcaaaatgaagacaatgcactaggacaaggctcattagaaaatctccatcccatga
ttacaagatcaaaagatggaattcagaagccaaacccccggtatgctctcattgtctctaaatcctcttt
tgatgaaccaaaaactattactactgctatgaagcatcctagctggaacgctgcagttatggatgagata
gatcgcattcacatgctaaacacttggtctctagttcctgcaacagaggacatgaatattctgacatcca
aatgggttttcaagactaaactcaaacctgatggcaccatagataagttgaaagctcgtctagttgccaa
agggtttgatcaagaagaaggagtcgactatcttgagacattcagtccggttgttcgaactgcaactata
cgtcttgttctcgataccgctactgcaaatgagtggcctctcaaacagcttgatgtgtccaacgcgtttc
tccatggagaattacaagaaccggtgtttatgttccaaccctctggttttgttgatcctaacaagcctaa
tcacgtttgtcggctcaccaaagctctttatggtctaaaacaagcgcctagagcctggtttgacaccttt
agcaactttcttcttgactttggctttgagtgcagcacatctgatccttccctcttcgtttgtcatcaaa
atgggcaaagtctcatactcctcttatatgtcgacgatatactcctcacaggaagtgatcaactgctcat
ggataaacttcttcaagctctcaacaaccgcttttcgatgaaagatcttgggcctcctcgctattttttg
ggtatagaaattgaatcttacaacaatggtctatttttacatcaacacgcatacgcttccgacattcttc
atcaagcaggcatgacagaatgcaaccctatgcctacccctctgccacaacacttggaagacctcaattc
agaaccctttgaagagccaacatactttcggagtttagctggcaagttacaatacttaacaatcacaaga
ccggatattcaatatgccgtgaacttcatctgccaaagaatgcacgctccgaccaactctgattttggcc
ttctcaaacgcatactcaggtatgtgaaaggaactatcaacatggggcttccaatcagaaaacaccacaa
ccctgttctttcgggattttgcgatagtgattacgctggctgcaaggacactagacgctccactactggt
ttctgcatcctcttgggatctactctgatatcttggtctgcaaagagacaacccactatctctcactcct
caacagaagccgaatatagagctctttccgatacagctcgagaaatcacttggatttcctctcttctccg
agatcttggaatctctcaacatcaacctacacgagtgttctgtgataacctatctgctgtctacctctct
gcaaatcctgctcttcataaacgatctaaacacttcgataaagactttcactacatcagggaacgtgtgg
ctctcggtctcatagaaacgcaacacatcccagcaactattcaacttgctgatgtcttcaccaagtcact
accgcgacggccctttatcacgcttagagccaaactcggcgtgtctgcgtcaccggtctcacccacgcca
agtttgaaggaggg1