;ID   ATCOPIA72_I DNA   ; ATH   ; 4483 BP
;XX
;DE   Internal region of ATCOPIA72 copia-like LTR-retrotransposon.
;XX
;AC   AC007109
;XX
;DT   05-NOV-2001 (Rel. 6.2, Created)
;DT   05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA72LTR; ATCOPIA72_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4483)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal region of ATCOPIA72 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(2) p. 28 (2001)
;XX
;CC   ATCOPIA72_I is an internal region of the ATCOPIA72 copia-like 
;CC   endogenous retrovirus flanked by the identical ATCOPIA72LTRs
;CC   and a 5-bp target-site duplication (GGTGA). ATCOPIA72_I encodes
;CC   the 1471-aa ATCOPIA72p copia-like polyprotein.
;CC   ATCOPIA72p:
;CC   MVPGIRVTRKSARSKVSTGSVARKSSKSTGVLDSASDSPPMARSQTAGASRGVFSSGFDDPTQSPFFLHS
;CC   ADHPGLSIISHRLDETTYGDWSVAMRISLDAKNKLGFVDGSLPRPLESDPNFRLWSRCNSMVKSWLLNSV
;CC   SPQIYRSILRLNDATDIWRDLFDRFNLTNLPRTYNLTQEIQDLRQGTMSLSEYYTLLKTLWDQLDSTEAL
;CC   DDPCTCGKAVRLYQKAEKAKIMKFLAGLNESYAIVRRQIIAKKALPSLAEVYHILDQDNSQKGFFNVVAP
;CC   PAAFQVSEVSHSPITSPEIMYVQSGPNKGRPTCSFCNRVGHIAERCYKKHGFPPGFTPKGKSSDKPPKPQ
;CC   AVAAQVTLSPDKMTGQLETLAGNFSPDQIQNLIALFSSQLQPQIVSPQTASSQHEASSSQSVAPSGILFS
;CC   PSTYCFIGILAVSHNSLSSDTWVIDSGATHHVSHDRKLFQTLDTSIVSFVNLPTGPNVRISGVGTVLINK
;CC   DIILQNVLFIPEFRLNLISISSLTTDLGTRVIFDPSCCQIQDLTKGLTLGEGKRIGNLYVLDTQSPAISV
;CC   NAVVDVSVWHKRLGHPSFSRLDSLSEVLGTTRHKNKKSAYCHVCHLAKQKKLSFPSANNICNSTFELLHI
;CC   DVWGPFSVETVEGYKYFLTIVDDHSRATWIYLLKSKSDVLTVFPAFIDLVENQYDTRVKSVRSDNAKELA
;CC   FTEFYKAKGIVSFHSCPETPEQNSVVERKHQHILNVARALMFQSNMSLPYWGDCVLTAVFLINRTPSALL
;CC   SNKTPFEVLTGKLPDYSQLKTFGCLCYSSTSSKQRHKFLPRSRACVFLGYPFGFKGYKLLDLESNVVHIS
;CC   RNVEFHEELFPLASSQQSATTASDVFTPMDPLSSGNSITSHLPSPQISPSTQISKRRITKFPAHLQDYHC
;CC   YFVNKDDSHPISSSLSYSQISPSHMLYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEITS
;CC   LPPGKKAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVKLLLKVSASKKWY
;CC   LNQLDISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIYGLKQASRQWFLKFSNSLLALGF
;CC   EKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAAQSLTEALKASFKLRELGPLKYFLGLEVARTSE
;CC   GISLSQRKYALELLTSADMLDCKPSSIPMTPNIRLSKNDGLLLEDKEMYRRLVGKLMYLTITRPDITFAV
;CC   NKLCQFSSAPRTAHLAAVYKVLQYIKGTVGQGLFYSAEDDLTLKGYTDADWGTCPDSRRSTTGFTMFVGS
;CC   SLISWRSKKQPTVSRSSAEAEYRALALASCEMAWLSTLLLALRVHSGVPILYSDSTAAVYIATNPVFHER
;CC   TKHIEIDCHTVREKLDNGQLKLLHVKTKDQVADILTKPLFPYQFAHLLSKMSIQNIFVFS
;XX
;DR   Positions  16028  11546  Accession No AC007109    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4483 BP; 1153 A; 911 C; 940 G; 1479 T; 0 other;
ATCOPIA72_I
tggtatcagagccatcgagctcaattttttcttgattttgttcgatttcttcttcttttgagctagtttt
cttcgtttcttcacgatccaatggttcctggaattcgtgttactcgaaaatcagctcgctcgaaggtgtc
gacgggttctgttgctcggaaatcatcaaagtccaccggtgttctcgattctgcttccgattctcctcca
atggctcgttctcagaccgctggagcttcgcgaggtgttttctcatcgggatttgatgatccgacgcagt
ctcctttcttccttcatagtgcagatcatccaggtttgagcatcatttctcatcgtttagatgaaacaac
ttatggtgactggagtgtggctatgaggatctcgttggatgctaagaacaaactaggatttgtagacgga
tctttacctcgtcctttagaatcggatccaaatttccgtttatggtctagatgcaacagtatggtgaaat
cctggttgcttaactctgtttctcctcagatctatcgtagcatcttacgtctgaatgatgctacagatat
ttggcgtgatctctttgacaggtttaacctgacgaatcttccacgtacctacaatctgacacaggagatt
caggatcttcgtcaaggaacaatgtctctatctgagtactatactcttttaaagactctctgggatcagc
ttgacagtacagaggctttggatgatccttgtacttgtggaaaagctgttcgtctgtatcagaaggcaga
gaaagctaagataatgaaatttcttgcaggattgaatgagtcttatgccattgttcgtagacagatcatt
gcaaagaaggcacttcctagtttggcagaagtttatcatatcttggatcaggataatagccagaagggat
tctttaatgttgttgctccacctgcagcttttcaagtctctgaggtatctcattctcctatcacttctcc
tgagataatgtatgttcagagtggaccaaacaaaggtcgtcctacgtgttcattctgcaacagagttggt
catatagctgaaagatgctataagaagcatggttttccaccaggtttcactcctaaagggaagtcctctg
ataaacctccaaaacctcaggcagtggcagctcaggttactctttctccggataagatgacaggacaact
tgagactcttgctggtaacttctcccctgatcagatacagaatttgattgccttgttcagttctcagttg
cagccacagattgtttctcctcagactgcttcttctcagcatgaagcaagttcttctcagtctgttgctc
cttctggtatcttattctctccttccacatattgctttattggcatcttggcagtttcacataactcttt
gtccagtgacacttgggttattgactctggggctacacatcatgtgtcccatgacagaaaattgtttcag
actttagatacttctattgtgagttttgtgaatcttccaacaggtccaaatgtcagaatcagtggagtgg
gaacagttttgataaacaaagacattattctccagaatgttttgtttattcctgaattcagattgaattt
gatcagtatcagctctttgactactgaccttggtactagagtgatctttgatccttcttgctgtcaaata
caggatcttaccaaggggttgacgcttggagaaggtaaaaggattgggaatctctatgtgttggacacac
aatctcctgctatctcggtgaatgcagttgtggatgtgagcgtgtggcacaagagacttggacacccatc
tttttcaagactggattctctttctgaagttttgggaactactagacataagaataagaaatcagcttat
tgtcatgtttgtcatttagccaaacaaaagaagttgtcatttccttctgcgaacaacatttgtaattcaa
catttgagctgttacacattgatgtttggggacccttttcagtggagacagttgaaggatacaaatattt
cttaactatagttgatgatcattctagagcaacgtggatttatttgcttaagtctaagagtgacgtcctc
acagtgtttcctgccttcattgacttagttgagaatcagtatgatacaagagttaaatctgtgagatctg
ataatgctaaagagttggctttcacagaattttacaaagcaaagggaatcgtttcttttcattcttgtcc
tgagacaccagaacaaaattcagtggttgagaggaagcatcagcatattcttaatgtggctcgggctttg
atgtttcagtctaacatgtctttgccatattggggtgactgtgttttaactgctgtcttcttgattaaca
ggacaccttctgctttgttatcaaacaagactccttttgaggttctcactggaaagctaccagattactc
tcagctcaagacatttggttgcctttgctacagctctacttcatcgaaacagcgacacaagttccttcca
aggtcaagagcgtgtgttttcttgggctatccgtttggttttaaaggctacaagttgttggatttagaga
gcaacgtggttcatatatcgaggaatgtggagtttcatgaggagttgtttccattagcgagttctcaaca
gtctgctactacagcttcagatgttttcacaccaatggatcctttgtcctcaggtaattccatcacttct
catcttccatcaccacaaatttctccatcaacacaaatttctaaacgtaggattactaaattccctgctc
atctccaagactatcactgttattttgtcaataaagatgactcacatcctatttcatcttctctttctta
ctctcaaatctcaccatctcatatgttatacatcaataacatttccaaaattccaatccctcaatcttat
catgaggcaaaggattccaaagaatggtgtggtgctattgatcaggaaattggtgcaatggaaaggactg
atacttgggagattacaagtttacctcctgggaagaaggcagttggatgtaagtgggtatttacagtgaa
gtttcacgcagatggcagtttggaaagattcaaggccagaattgttgctaagggttatactcagaaggaa
ggtttggattacactgagactttctctcctgttgctaagatggccacagtaaagttacttttgaaagttt
cagcttctaagaagtggtatttgaatcagctggatatatctaatgcttttctcaatggagatttagagga
aaccatatatatgaagctgcctgatggttatgcagatattaagggaacttctctgccacctaatgttgtt
tgtcgtttgaagaagtccatttatggtcttaaacaggcatctcgtcaatggtttttgaagttttctaact
ctctgttggctctgggtttcgaaaaacagcatggtgatcatactctctttgttcgctgtattggttctga
gttcattgtcctcttagtttatgttgatgacatagtgattgcgagtactacagaacaagcagcacagtcg
ttgacagaggctttaaaagctagctttaagctgagggaacttggtccactgaagtatttcttgggtttag
aggttgctcgcacttctgaaggtatttccctatctcaaaggaagtatgctttagaattgctcacttctgc
agatatgttggactgtaaaccatcctccatacctatgactccgaatattagattatctaagaatgatggt
ctactcttggaggacaaagaaatgtatcgaagacttgttggcaagttgatgtatctgaccataactcgcc
ctgatatcacatttgcggtgaacaagttatgtcagttctcttctgctcctcgtactgcacatcttgcagc
tgtctacaaagtcttacaatacattaaaggtacagtgggtcaaggtctgttttattctgctgaggatgat
ctgactttaaaaggctatactgatgcggattggggtacttgcccagatagtcgtcgatcaaccacaggtt
tcactatgtttgttggttcctctctgatatcctggcgctccaagaaacagcctactgtctcacggtcgtc
tgcagaggcagagtatcgagcattggctttggcttcttgtgaaatggcgtggctgtctacactgttattg
gctttgcgtgttcattcaggtgtgcctattttatactctgacagtaccgccgctgtgtatatagccacta
acccagtgtttcacgaacgaacgaaacacatcgaaatcgattgtcacaccgttcgtgagaagctggataa
tggtcagttgaagctgcttcatgtcaagactaaagatcaggttgctgatatccttactaaaccactcttc
ccttatcaatttgctcatttattgtccaagatgagtatccaaaacatctttgtattctcatcttgagggg
gac