;ID   ATCOPIA76_I DNA   ; ATH   ; 4272 BP
;XX
;DE   Internal region of ATCOPIA76 copia-like LTR-retrotransposon.
;XX
;AC   AC006841
;XX
;DT   05-NOV-2001 (Rel. 6.2, Created)
;DT   05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA76LTR; ATCOPIA76_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4272)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal region of ATCOPIA76 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(2) p. 34 (2001)
;XX
;CC   ATCOPIA76_I is an internal region of the ATCOPIA76 copia-like
;CC   endogenous retrovirus flanked by the 99% identical ATCOPIA76LTR
;CC   long terminal repeats and a 5-bp target-site duplication (GGTTT).
;CC   ATCOPIA76_I encodes the 1361-aa ATCOPIA76p copia-like polyprotein.
;CC   The corresponding CDS is interrupted by a copy of ATGP2NLTR, which
;CC   is flanked by the GACAT target-site duplication. The ATGP2NLTR and
;CC   one of the GACAT repeats were removed from the ATCOPIA76_I
;CC   nucleotide sequence.
;CC   ATCOPIA76p:
;CC   MSAARIEVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRRELLEEK
;CC   RRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFKMSENLSIEGNIDEF
;CC   LRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVTLSLDEVVAAIYSKELELGSNKKSI
;CC   KGQAEGLFVKEKTETRGRTEQRGNNNNNKKSRSKSRSKKGCWICGEEGHFKSSCPNKNKTQPQTKTTNNN
;CC   NKGESSNGSSNYSEANGLYVSEALSSTDIHLEDEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTV
;CC   SKVRGIGTIRVKNEAGMVVRLTNVRYIPEMDRNLLSLGTFEKSGYSFKLENGTLSIIAGDSVLLTVRRCY
;CC   TLYLLQWRPVTEESLSVVKRQDDTILWHRRLGHMSQKNMDLLLKKGLLDKKKVSKLETCEDCIYGKAKRI
;CC   GFNLAQHDTREKLEYVHSDLWGAPSVPFSLGKCQYFISFIDDYTRKVRIYFLKTKDEAFDKFVEWANLVE
;CC   NQTDKRIKTLRTDNGLEFCNRSFDEFCSQKGILWHRTCAYTPQQNGVAERMNRTLMEKVRSMLSDSGLPK
;CC   KFWAEATHTTAILINKTPSSALNYEVPDKRWSGKSPIYSYLRRFGCIAFVHTDDGKLNPRAKKGILVGYP
;CC   IGVKGYKIWLLEEKKCVVSRNVIFQENASYKDMMQSKDAEKDENEAPPSSYLDLDLDHEEVITSGGDDPI
;CC   VEAQSPFNPSPATTQTYSEGVNSETDIIQSPLSYQLVRDRDRRTIRAPVRFDDEDYLAEALYTTEDSGEI
;CC   EPADYSEAKRSMNWNKWKLAMNEEMESQIKNHTWTVVKRPQHQKVIGSRWIYKFKLGIPGVEEGRFKARL
;CC   VAKGYAQRKGIDYHEIFAPVVKHVSIRILMSIVAQEDLELEQLDVKTAFLHGELKEKIYMVPPEGYEEMF
;CC   KEDEVCLLNKSLYGLKQAPKQWNEKFNAYMSEIGFIRSLYDSCAYIKELSDGSRVYLLLYVDDMLVAAKN
;CC   KEDISQLKEELSQRFDMKDLGAAKRILGMEIIRNREENTLWLSQNGYLNKILETYNMAESKHVVTPLGAH
;CC   LKMRAATVEKQEQDEDYMKSIPYSSAVGSIMYAMIGTRPDLAYPVGIISRYMSQPAREHWLGVKWVLRYI
;CC   KGSLGTKLQYKRSSDFKVVGYCDADHAACKDRRRSITGLVFTLGGSTISWKSGQQRVVALSTTEAEYMSL
;CC   TEAVKEAVWMKGLLKEFGYEQKSVEIFCDSQSAIALSKNNVHHERTKHIDVRYQYIRDIIANGDGDVVKI
;CC   DTEKNPADIFTKIVPVNKFQAALTLLQVKPE
;XX
;DR   Positions  95983  89644  Accession No AC006841    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4272 BP; 1402 A; 707 C; 1069 G; 1094 T; 0 other;
ATCOPIA76_I
gagtggtatcagagccataggttcattgttctgtggatctctaaagattcaaagatcctgaactatgtct
gcagcaagaatagaggttgaaaagtttgatggtcgtggggattacacgatgtggaaagagaagctcatgg
cgcatctagacattctaggcctgagtgtggctcttaaggaagaagacgatttggtggagaaagttgcaga
gatgcagcttactgaagaagaagagaaggaagaggtactgagacgagagctcttggaagaaaaaaggaga
aaggcgagaagcgccattgtgctgagtgttactgatagagtcttgaggaagattaagaaggagcagtctg
cagctgctatgctaggcgtgcttgacaaactgtacatgtctaaggcgctaccaaacagaatctaccaaaa
acagaagctttacagcttcaagatgtcagaaaacctcagcatagaaggtaatatagatgagttcttgcgt
attatagctgatttagagaacacgaacgtgttagtttctgatgaagaccaagctatattactactcatgt
cattacctaagccgtttgatcaacttagagatactctgaagtatgggttaggaagagtcactttatcgtt
agatgaggttgtagcagctatctactctaaggaactagagttagggtctaacaagaagagtattaagggt
caggctgaaggtctctttgtcaaggaaaagactgagacaagaggaaggactgaacagcgaggcaacaaca
acaacaacaagaagtctagatccaaatctagatctaagaagggttgttggatttgcggtgaagagggaca
cttcaagagttcatgtcctaacaagaacaaaacacaacctcagacaaagacaactaacaacaacaacaaa
ggtgaatcatcaaacggtagcagtaactactctgaagctaatgggctttatgtttctgaagctttgtctt
ctacagacatacatcttgaggacgaatgggtcatggacactggctgtagttaccacatgacatacaagcg
tgaatggtttgaagatttgaatgaggatgctggtgggtctgtgaggatgggaaacaagactgtttcaaag
gtcagaggaataggcacaatccgggttaagaatgaagcaggaatggtggttcgtcttacaaatgtgagat
acattccagaaatggataggaatcttttatctctgggaacatttgagaagtctggctacagtttcaagtt
agaaaatggaacactgagcatcattgcgggagacagtgttctactcacagtaagaaggtgttatacactc
tacctgttacagtggagaccagtaacagaagagtctctctctgtggtgaagagacaagatgacacaatct
tgtggcatcgaagattgggacacatgagtcaaaagaacatggatttattgctgaaaaaaggtcttttgga
caagaaaaaagtgtccaagctggagacatgtgaggactgcatatacgggaaagctaagaggattggattc
aacttagctcaacatgatacaagagagaagctggagtatgtgcattcggatttatggggagctccatcag
tgccattctctctaggtaaatgtcaatacttcatatcgtttattgatgattacactagaaaagttaggat
ttatttcctgaagactaaggatgaggcatttgataaatttgttgaatgggctaaccttgttgagaaccaa
actgacaagaggataaaaactcttagaacagacaatggtcttgagttttgtaacaggtcatttgatgagt
tctgctcacagaaagggattctatggcatagaacgtgtgcatacacgccacaacagaatggtgttgcaga
gcgcatgaacaggaccttaatggagaaagtcaggagtatgcttagtgattctggtcttccaaagaagttt
tgggcagaggctactcatacgacagctatactcatcaacaaaaccccatcatcagctctaaactatgaag
taccagacaagagatggtcagggaagtcaccaatctacagctacttaagaagattcgggtgcattgcgtt
tgttcacactgacgatggaaagctcaatccgagagctaagaaaggaatactagtaggatatcctattggt
gttaagggttacaagatttggttgttagaagagaagaagtgtgtggtgagcagaaatgtgatttttcaag
aaaatgcatcttacaaggacatgatgcagagtaaagatgctgagaaagatgaaaacgaggcaccaccaag
ctcttatttggatttggatcttgatcatgaagaagttatcacctcaggtggagatgatccgattgtcgaa
gctcagtctccattcaatccaagtccggcaaccactcaaacctacagtgaaggagttaactcagaaactg
atataattcagtcaccactgagttatcaactggtgagagatcgagatagaagaacaatcagagctccagt
gaggtttgatgatgaagactatctcgctgaagctctctatactacagaagacagtggagaaatagaacct
gcagattacagtgaagctaaaagaagcatgaattggaataagtggaaacttgctatgaatgaagaaatgg
agtcacagatcaaaaaccatacttggacagtggtcaaaagacctcaacatcagaaggttattggtagtag
gtggatctacaagtttaaacttgggattcctggagttgaagaaggtagattcaaggcaaggcttgttgcc
aaagggtatgctcaacgtaaaggaatcgattaccatgagatctttgctcctgttgtgaaacatgtctcca
ttagaatactgatgtctattgttgctcaagaagacttggagctggaacaacttgatgtaaagacagcatt
tctacacggtgagctgaaagagaagatttacatggtacctcctgaaggttatgaagaaatgtttaaagaa
gatgaagtttgtcttcttaataagtcactgtatggactcaagcaagctccaaagcaatggaatgagaagt
ttaatgcttacatgtctgagattggctttataaggagtttgtatgacagttgcgcatacattaaggaatt
gagtgatggttcaagggtttatctgcttctgtatgtggacgatatgctagtggcagctaagaacaaagaa
gatatatctcagcttaaagaagaactcagtcagagattcgacatgaaggatttgggggctgctaaaagaa
tcctcggtatggagattatcagaaatagagaagagaacactctgtggctgtcacagaatggctatctgaa
taaaattcttgagacttacaacatggcagagtcaaaacatgtggtgacaccacttggagctcacttgaag
atgcgagcagccacagttgagaagcaagagcaagacgaggactacatgaagtcaattccctactcaagtg
cagtaggaagtatcatgtacgcaatgataggtactcgccctgatctagcttatcctgttggaatcattag
tcgctacatgagtcaaccggctagagaacactggcttggagtcaaatgggtcttgaggtacatcaaaggc
tcactgggaactaagttgcaatacaagagaagcagtgactttaaggttgtgggatactgcgatgctgacc
acgctgcatgtaaagatcggagaagatcaattacagggcttgtgtttactcttggaggaagcactatcag
ttggaaatcaggtcaacagagagttgtagctctctcaactacagaggcagagtacatgtctctaactgaa
gctgtgaaagaagcagtgtggatgaagggtttgttgaaggaatttggttatgaacaaaagagcgtggaga
tcttttgtgattctcaaagtgctattgcactctccaagaataatgttcatcatgaaagaacgaagcatat
agatgttcgatatcaatatattcgggacataattgctaatggtgatggtgatgtggtgaagattgacact
gaaaagaatccagctgatatcttcaccaagatcgtgcctgtaaacaagtttcaggcggctttgaccttgt
tacaggtcaagcctgagtagtaaaactcaggaggaatccgagtatggaatcctacaactaggttctcggg
gtactctcttatctctctcattgacagtttgtgcaagtttttggttttatcatcaagtttcaggtggaga
tt