;ID   ATCOPIA89_I DNA   ; ATH   ; 4576 BP
;XX
;DE   Internal region of the ATCOPIA89 copia-like LTR-retrotransposon.
;XX
;AC   AC068809
;XX
;DT   30-NOV-2001 (Rel. 6.3, Created)
;DT   30-NOV-2001 (Rel. 6.3, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; the ATCOPIA89 
;KW   family; ATCOPIA89LTR; ATCOPIA89_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4576)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal portion of the ATCOPIA89 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(3) p. 22 (2001)
;XX
;CC   ATCOPIA89_I is an internal region of the ATCOPIA89 copia-like 
;CC   endogenous retrovirus flanked by the 99% identical ATCOPIA89LTR
;CC   long terminal repeats. and a 5-bp target-site duplication (AACAT). 
;CC   ATCOPIA89 forms a separate family of copia-like retroviruses
;CC   present in the A. thaliana genome since members of other families
;CC   are less than 75% identical to ATCOPIA89_I and ATCOPIA89LTR. 
;CC   ATCOPIA89_I (positions 91-4566) encodes remnants of the 1492-aa 
;CC   ATCOPIA89p copia-like polyprotein. The ORF which encodes 
;CC   ATCOPIA89p is damaged by one false stop codon at position 
;CC   1690-1692 (marked by X in the ATCOPIA89p sequence).
;CC   ATCOPIA89p:
;CC   MGVPVRKSTRRLGRSVGAGSSGTKLKAPMSSNSPDPAPVVPIRSISEIDAVDSPHSPFFLHSADHPGLTL
;CC   VADRLDGTNYTQWSSAMKISLDAKNKIAFIDGSLPRPAEGTPLSRIWSRCNSMVKSWLLNAVSKQIYGSI
;CC   LNLDDATMIWNDLHDRFHMTNLPRTFHPIQQIQDLRQGSMDLSSYYTALKTLWDQLDGSEPTESCLCCHS
;CC   FNCVSKRHYRGKVDRGRIIKFLAGLNETYSIIRGQIIMKKPLPDIAEVYHILDQDDSQRKFGNSIVPAAF
;CC   QVGIAGSHPGVVNADASSPSAGSLIAAYQSFKKDKPTCSYCGFTGHVVDRCYKKHGYPPGWKPRKQQVNA
;CC   SPTPQSSSPAITAQVSATVGSGDKQDGGLDSLIGNLSKEQLQHFIAYFSSHLTTQFPASVPLNEASTSGI
;CC   SFSPSTYSFIGILTVTQSVTNKRSWIIDSGATHHVSHDKSLFSDLDASVSQHVNLPNGSVVMVAGVGTVI
;CC   INTSISLKNVLYIPDFRMNLLSISSLTTDLGSRVIFDPASCLIXDPTKGLTIGRGRRIANLYLLDVEEPA
;CC   DSRQLSSYSLNDVIDSAVWHKRLGHTSFSRIDMLTDVLGISKQRNKGVIHCDICQRAKQKKLSYPNRNNL
;CC   CSASFDLLHIDVWGPFSEPTVEGFRYFLTIVDDHTRVTWVYLLRLKSDVLTVFPEFLQMVETQFDKRVRC
;CC   VRSDNAPELKFTELYRRLGIIPYHSCPETPEQNSVVERKHQHILNVARALLFQSNLPLSLWGDCILTAVF
;CC   LINRTPSPLLENKTPFEKLTNTTPEYTDLRTFGCLCYASTSPKQRTKFEDRAKACVFLGYPAGYKGYKLM
;CC   DIESNVVFISRNVKFFEDIFLFQNSQASDEVDVTGFFPQISTRVADPGESSGTRTEGEYSSTMRQGESNS
;CC   ESVIPNTEDTSRRRVSRPPGYLQDYQCYSVKESVKESTEHPISQVFSTDNLSSSYCAYINALTKYPTPTS
;CC   YTQASKVKEFCDAIKDEIGALERTNTWIVCAIPPGKTVVGCKWIYTIKLNADGSLERYKARLVAKGYTQK
;CC   EGLDYVETFSPVARMATVKFLLSVAAPRKWFLDQLDISNAFLNGDLHEEIFMALLPGYVDKDGKPFPPNS
;CC   VCKLQKSLYGLKQASRQWFLKLSHCLMSMGFRNGTGDDTLFLRRTDDTYMAILVYVDDIIIASSSSTATA
;CC   SFTAALKESFKLRDLGPLKYFLGLEVARTSAGISICQRKYVLDLLDETGLLGCKPSSIPMDPSQKLNLET
;CC   GDLLTDVEMYRRLVGKLMYLTFTRPDITFAVHKLCQFTSAPRQPHLTAVYKVLHYLKGTIGQGMFYSANS
;CC   DLKLKSFSDADWGTCTDSRISVTGFCMFLGPSLVSWKSNKQETVSMSSAESEYKAMSTAVKEMLWLRKLM
;CC   NDLWIDASEASVLYCDNTAAIHIANNSVFHERTKYLDLACHLVRERVLMGQIKTLHVQTEHQLADALTKP
;CC   LYPTLFLRLIRKMGVINIYTPS
;XX
;DR   Positions 74828 70253  Accession No AC068809    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4576 BP; 1216 A; 893 C; 994 G; 1473 T; 0 other;
ATCOPIA89_I
tggtatcagagctatacgcttagctccgatttcgttttcctcctctgattctatttctttggcgtgtttt
gtgtggttgtttagatcgatatgggagttccggttcgtaaatctactcgtcgtctgggtagatctgttgg
tgctggtagctcggggactaagttgaaagctccgatgagctcaaactcgcccgatccagctcctgttgtt
ccgattcgctccatctctgagatcgatgccgtggatagtccacactcgccgttctttcttcacagtgctg
atcatccgggtctgactttggttgctgatcgtctagatggtacaaactacactcaatggagctctgcgat
gaagatttctcttgatgcgaagaacaaaatcgctttcatcgatggatctcttcctcgtcctgcagaaggt
actcctctctctcggatctggtctagatgtaacagtatggttaagtcctggttgttgaatgctgtttcaa
agcagatctatggtagtattttgaacttggatgatgctactatgatctggaatgatttacatgatcgttt
tcacatgacgaatttgccgagaacatttcatcccatccagcaaattcaggatctgcgtcaaggttccatg
gatctatcaagctattacacagctttgaaaactctctgggatcaacttgatggcagtgaacccactgagt
cgtgtttatgttgtcattcttttaactgtgttagcaaaaggcattatcggggtaaggtggatagaggtcg
catcatcaaatttcttgctggtttgaatgaaacctattccatcatcagaggtcagataatcatgaagaag
cctcttcctgacattgctgaagtttatcacatcttagatcaagatgacagtcaaaggaaatttggtaaca
gcattgttccagctgcttttcaagtagggattgcagggtctcatcctggagtggtgaacgctgatgcatc
ctcgccttctgctggttctttgattgctgcttaccagtcttttaagaaagacaaaccaacatgttcctat
tgtggtttcactggacatgttgtggatagatgctacaagaagcatggctatcctcctgggtggaaaccta
gaaaacaacaagtcaatgcttctccgactcctcaatcgtcttctcctgcaatcacagcacaagtgtctgc
tactgttggatcaggtgataaacaggatggtggtctggatagtttgattgggaatctcagtaaagagcag
ttacaacacttcattgcttacttcagttctcatctcactactcagtttcctgcttcagttcctttaaatg
aggcttctacttctggtatatctttctcaccatctacctatagttttattgggattttaactgttactca
gagtgttacaaataagagatcatggataattgattctggtgctactcatcatgtaagtcatgataagagc
ttattcagtgatctagatgcttctgttagtcagcatgttaaccttccaaatggtagtgttgttatggtag
ctggtgtgggaacagtgattataaacacttctatcagtttgaagaatgttctctacataccagatttcag
aatgaatcttctcagtataagctcgttgactacagatcttggttctcgagttatttttgatcctgcttct
tgcctcatataggatcctaccaagggattgacgattggaagaggtagacggattgctaatctttacttgt
tggatgttgaagaacctgcagattcaagacaactatcttcttatagtttgaatgatgtaatagactctgc
tgtttggcataagagattaggacatacctctttttctcggattgatatgcttacagatgttcttggaatt
tctaaacaaaggaataaaggagttatacattgtgacatttgtcaaagagctaaacaaaagaaactttcat
atcctaatagaaacaatctttgttctgcatcttttgacttgttacacattgatgtttggggtccattctc
agagccaacagtagagggattcagatacttcttgacaatagtggatgatcatactcgtgtcacctgggtt
tatttgctgagattgaagagtgatgttttgactgtgtttccagaatttctgcaaatggtagagactcagt
ttgataagcgagttcgttgtgtgagatcagataatgctccggagttgaagtttactgagctttatcgacg
gctgggcatcattccttaccactcctgtcctgaaacaccagagcagaactccgtggttgagaggaaacac
cagcatatactgaatgttgcacgagctcttctttttcagtccaacctaccgttgtctctgtggggggatt
gtattcttacagctgtgttcttgatcaatcgcacgccatcaccattgttggaaaataagactccttttga
gaagctcactaatacaacaccagagtacactgatttgagaacttttggctgcttgtgttatgcgagtact
tctcctaagcaacgaactaaatttgaagatagggctaaagcttgtgtcttcttgggttatccagcaggtt
acaaaggatataaactgatggacattgaaagtaatgttgtgttcatttccagaaatgttaaattctttga
agacatatttctttttcagaattcccaagctagtgacgaagttgatgtcacagggttctttcctcagata
agtactcgtgttgctgatccaggagaatcttcagggactagaactgagggggaatattcgtctacaatga
gacagggggaatctaactctgagtcagtaatacctaatacagaagatacatcacggcgacgagtttcgag
acctcctggttatttgcaagactatcaatgctattcagtgaaagaatctgtgaaagagtctacagaacat
ccaatatcacaggtcttttctactgataatctctcttcttcttattgtgcatacattaatgctctcacca
aatatcctactccaactagttatactcaagctagtaaagtcaaggaattttgtgatgctataaaggatga
gataggggctttagagcgaactaatacttggattgtttgtgctattcctcctggtaagactgtggttgga
tgtaaatggatttatactatcaagttgaatgcagatggcagtcttgaacgttataaagctcgtctagtgg
caaagggttacactcagaaagaaggacttgattatgtggaaacgttctcaccagtggcaaggatggcaac
tgtcaagttcttactctctgttgcagctcctagaaaatggtttttagaccaactggatatatccaatgct
tttctcaatggagatcttcatgaagagattttcatggccttgcttcctggttatgttgataaggatggga
agccatttccaccaaattcagtttgcaagcttcagaagtctctgtacgggttgaaacaagcttcacgaca
gtggtttttgaagctctctcactgcttgatgtctatgggttttagaaatggaacaggagatgatacttta
tttctcaggagaacagatgatacttacatggctatattggtttatgttgacgatataatcattgcaagca
gctcttctacagccactgcgtcttttacagccgctttgaaagaatcttttaaactgagagatttaggacc
tctcaaatacttcctaggtcttgaggttgcaaggacttctgctggcatttctatttgtcaacgcaagtat
gtgttagatcttttggatgagacaggtcttctaggatgtaaaccatcttctatacccatggatccaagtc
agaaactgaatctagaaactggagatcttttgacagatgtggagatgtatagacggctcgttgggaagct
tatgtatctgacttttactcggccagacatcacctttgcagttcataagctgtgtcagttcacctctgcg
ccgcgacaaccacacctcacagctgtctacaaagtgttacactacttgaaaggcaccattggtcagggta
tgttctattctgccaattctgatcttaagttgaagagtttttcagatgcagactgggggacttgtactga
ttcaagaatctcggttactggcttctgtatgttccttggtccatctttggtttcatggaagtccaataag
caggaaacagtctccatgtcatctgcagaatccgagtacaaagctatgtcaactgcagtcaaggaaatgt
tgtggcttcgaaagctcatgaatgatttgtggattgatgcttcagaagcttctgtgctttactgtgacaa
cacggctgctatacatatagctaacaattcagtctttcatgagagaactaagtatttggaccttgcttgt
catcttgtcagggaaagggttctaatggggcagatcaagactcttcatgtgcagactgaacatcaattgg
cagatgcgttaactaaacctctatatcctactttatttttgagactcattcgcaagatgggtgttattaa
catatacactccatcttgaaggggaa1