;ID   ATCOPIA82_I DNA   ; ATH   ; 4279 BP
;XX
;DE   Internal region of the ATCOPIA82 copia-like LTR-retrotransposon.
;XX
;AC   AP001298
;XX
;DT   30-NOV-2001 (Rel. 6.3, Created)
;DT   30-NOV-2001 (Rel. 6.3, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; the ATCOPIA82 
;KW   family; ATCOPIA82LTR; ATCOPIA82_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4279)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal portion of the ATCOPIA82 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(3) p. 8 (2001)
;XX
;CC   ATCOPIA82_I is an internal region of the ATCOPIA82 copia-like 
;CC   endogenous retrovirus flanked by the 98% identical ATCOPIA82LTR
;CC   long terminal repeats, and a 5-bp target-site duplication (TTGGT). 
;CC   ATCOPIA82 forms a separate family of copia-like retroviruses
;CC   present in the A. thaliana genome.
;CC   ATCOPIA82_I encodes (positions 14-4279) well-preserved remnants
;CC   of the ~1422-aa ATCOPIA82p copia-like polyprotein. A false stop
;CC   codon at position 1756 corresponds to H (based on the polyproteins
;CC   closest to ATCOPIA82p).
;CC   ATCOPIA82p:
;CC   MDSQIQLYSHPVIHISNYVTVQLTERNYLLWKTQFESFLSGQNLLGFVNGAIKPPPAVNTLTQINGLTTE
;CC   VQNPDYQAWQRSDQVVRAWLLGSLSEDILREVVHTITAQEVWTALAQHFNKVSSSRLFKLQRKLQTIEKL
;CC   DKSMEDYVREIKRICEQLASIGNPVSQKMKIFAALHDLGRDYEPIKTSIEGSMDLHPPPTFESVIPRLTG
;CC   FADRMAGYNAGNEVSPHLAFNITTTNGSHYYSSQGRGNGKPGNNNKGRLNFTTKGRGFHQQISSGSSGGD
;CC   RIICQICGKPGHPALKCWHRFNNSYQHEELPSALAALRITDVTETAGHDWFTDSAATAHVTNSTNRLQQS
;CC   QPYSGSDAVMVGNGEFLPITHTGSTSLQSTSGNLPLTDVLVCPDINKSLISVSKLTSDYPCCVEFDCDTV
;CC   RITDKATKRLLTMGHHNKGLYMLKNHSPLEVYYSSRQQAASDAVWHRRLGHPNAQILQHLSTTKAISVNK
;CC   NTKMVCEACQLGKSLKLPFSASSFVASRPLQRIHCDLWGPSPIMSVQGFRYYAVLIDNYSRFSWFYPLKL
;CC   KSDFALIFPVFQAMVENQFQ*KIGTFQCDGGGEFISKDFIAHLQKHGIQQLMSCPHTPQQNGLAERKHRH
;CC   IIELGLSMIFQSSMPQKYWVEAFYTANFLINLLPSSVLEKKCSPYEVLMGKPPNYTSLRVFGCACYPTLR
;CC   DYATTKFDPRSLKCVFLGYNDKYKGYRCLLPTTGRVYISRHVIFDESLFPFSSMAYIHLQPANVTPLMSA
;CC   WLKGCSVQEQNQTGTSTENQDHNDQNSVPSGRVLLREEESTGCTAGFDHVPIGNSSSSSTQQITTSEDSP
;CC   IIQPLPSTTEQSTQNQSSQSASSNSESSQVQTAQPSQSTHPMTTRLKDGIRKPNPRYGLHTQRVSYPEPK
;CC   TVTAALKDEGWTDAMHEEMDNCSEAKTWSLVPYTPNMHVLGSKWVFRTKLNADGSLDKLKARLVAKGFNQ
;CC   EEGIDYLETYSLVVRTPTVRSVLHLATIMQWDIKQMDVKNAFLHGDLTETVYMMQPAGFVEKSKPDHVCL
;CC   LHKSIYGLKQFPRAWFDKFSTYLIEFGFECSKPDPSLFVYIKNKSIILLLLYVDDMIITGNSSDAMSKLL
;CC   DSLNTEFRMKDMGRLHYFLGIQVQFHSEGMFLSQQKYAEDLLAVAVMSDCAPMPTHLPLQLTAIPAQDEI
;CC   FDNPTYFRSLAGKLQYLTLTRPDIQFAVNFVCQKMHAPTVSDFNLLKRILRYIKGTITMGISFNKNTYCR
;CC   LRAYCDSDYGNCIDSRRSIGGYCTFLGTNIISWSSQKQDSVSKSSTEAEYRTLSDTASEVTWLGSVLKEL
;CC   GIPLLDTPEIYCDNLSSVYLSANPAFHKRSKHFQLHYHYVRERVALGALIVKHIPGHQQIADIFTKSLPI
;CC   KPFCDLRYKLGVDVPPTPSLRG
;XX
;DR   Positions  60791 56513  Accession No AP001298    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4279 BP; 1277 A; 940 C; 852 G; 1210 T; 0 other;
ATCOPIA82_I
tggtatcagagccatggattctcaaatccagctctactcgcatcctgtcattcacatttccaactacgtt
acggttcaacttactgagagaaactatcttctctggaagactcagtttgagtcctttctctctggacaaa
accttctagggtttgtcaatggtgctatcaagcctcctccagctgtcaacactctcacacagatcaatgg
tctcaccacagaagtccaaaatcctgactatcaagcatggcagagatctgatcaagttgttcgagcatgg
cttctgggatctctatcagaagatattctcagagaggttgttcataccatcacagcacaagaggtttgga
cagctttagctcagcacttcaataaggtatcttcatcccgtctctttaagctgcaaagaaaactgcaaac
catagaaaagttagataaatctatggaagactatgttagagagatcaagagaatctgtgaacaacttgca
tctattggtaatccggttagtcagaaaatgaagatttttgctgcattacatgacctaggaagagattatg
aaccaatcaagacttctatagaaggatctatggacttacatcctccacctactttcgagtctgtgattcc
taggttgactggttttgctgatagaatggctggttacaatgctggaaatgaggtgtctccccacttggca
ttcaacattaccacaactaatggttcccattactatagtagccaaggtcgtggaaatggaaaacctggaa
acaacaacaaagggagattaaattttacaacaaaaggaagaggctttcaccagcaaatctcatcaggttc
ttcaggaggtgacagaataatatgtcagatatgtggcaaacctggacatcctgctctgaaatgctggcac
cgcttcaacaacagctaccaacacgaggaactgccaagtgctctagctgcgttaaggattacagatgtta
cagagactgctggtcatgattggtttactgattcggctgcaacagctcatgtcacaaactcaactaatag
gcttcagcagtctcagccttactcaggatctgatgcagtaatggttggtaatggtgagtttctccccata
actcacactggatcaactagtcttcagtcaacctcaggtaatcttcctttaactgatgttctagtttgtc
ctgatattaataaatccttgatatctgtttctaagctcacatcagactatccctgttgtgttgaatttga
ctgtgacactgtgcgtattactgataaggcaacaaagaggttgttaacaatggggcatcacaataagggg
ttgtacatgttgaagaatcactcacctcttgaagtctactattcctcaagacagcaagctgcaagtgatg
ctgtttggcatagaagactcggtcatcctaatgctcagattcttcagcacctgtcaacaactaaagctat
ttcagtcaacaaaaacaccaagatggtatgtgaagcctgtcagcttgggaagagtcttaagttacctttt
tctgcttcttcgtttgtagcctctagacctttgcaaagaatacattgtgatctttggggtccttcaccaa
taatgtcagtacaaggttttcgatactatgctgttctcattgacaattactcacgcttcagctggtttta
tcccctcaagttgaaatcagactttgctttgatatttcctgtgtttcaagcaatggttgagaatcagttt
caatagaaaattggaacctttcaatgtgatggtgggggtgagtttataagcaaagacttcatagctcatc
tacaaaaacatggcattcaacaactgatgtcatgtccacacacacctcaacaaaacggtttagcagaaag
aaaacacagacacattattgaattaggactctcaatgatattccaaagtagtatgccacagaagtattgg
gttgaggcgttctacactgcaaattttcttatcaatctactaccaagttcagtattggagaagaaatgca
gtccctatgaagttctaatgggaaagcctccaaattacacatcactacgtgtctttgggtgtgcctgcta
tcctaccctaagagattatgcaaccacaaagttcgaccctagatcactcaagtgtgtgttcctaggatac
aatgacaagtataagggctacagatgcttacttccaaccacagggcgtgtgtatatcagccgtcatgtca
tctttgatgagtcactctttcctttttcatctatggcttacattcacttgcaacctgcaaatgttacacc
attgatgtctgcttggttaaagggttgctcagtacaagaacagaatcagactggtacttcaacagaaaat
caagatcacaatgatcaaaattcagtaccttctggtagggttttgttgagagaagaggagagtactgggt
gtacggcaggctttgatcatgttcctataggcaacagctcttcttcttctactcagcaaatcactacctc
agaagactctcctatcattcaaccgctgccatcaactacagaacagtctactcagaatcaatcttctcag
agtgcatcttccaacagtgagtcatctcaggtccaaacagctcagccatctcagtctacccaccctatga
caacaagattaaaagatggaatcagaaaaccaaatccgaggtatggtttacacactcaaagagtatctta
cccggaacctaaaacagtgacagctgctttaaaagatgaaggctggactgatgcaatgcacgaagaaatg
gacaattgctctgaagctaaaacctggtccttggtgccctatacaccaaatatgcatgtattaggcagca
agtgggttttcagaactaaactcaatgctgatgggtcccttgacaagctgaaagccagactggttgcaaa
gggatttaatcaagaagaaggcattgattacctggaaacttatagcctagtggtgagaactccgacagtt
agatcagttcttcacctagcaaccatcatgcagtgggacataaagcagatggatgtgaagaatgccttcc
tccatggtgatctcacagagacagtgtacatgatgcagcctgcaggttttgtagagaaatcaaagccaga
tcatgtctgtcttttgcataagtctatctacggattaaaacaattcccacgtgcttggtttgataaattt
agcacttatctgattgagtttggctttgaatgtagcaagccggatccatccttgtttgtatacatcaaga
acaagagtatcatcctgctgctgctttacgttgatgacatgataataacaggaaacagttcagatgcgat
gtcaaagctattagacagcttgaatactgaattcagaatgaaagatatgggaagactacattactttttg
gggattcaagtgcagtttcattcagaaggaatgtttctctcccaacagaagtatgcagaagatcttcttg
ctgtggctgtgatgagtgattgtgctccaatgcctactcatctgccacttcagttgactgctatacctgc
acaagatgaaatctttgacaatccaacatatttcagaagtcttgcaggtaaacttcaatatcttacctta
acaagacctgatatacagtttgctgttaactttgtttgccagaagatgcatgctccaacagtgtctgact
tcaatctgctcaaacgaattctgaggtacatcaagggaactataacaatggggatctcctttaataagaa
cacatattgcagattgagagcttactgtgatagtgattatggaaattgcatagactcacgaagatccatt
ggaggttactgtaccttccttggtacaaatatcatctcatggtcctcacaaaagcaagattcagtctcca
aaagctcaaccgaagctgaatatcgaactctgtctgacactgcctcggaggtaacctggttaggatctgt
tctcaaggaattgggcattcctctacttgacactccagaaatctattgcgacaatctctcctccgtgtat
ctctctgcgaatccagctttccacaaaagaagtaaacactttcagctacactatcattatgttcgagaga
gagtagcacttggagctctgattgtgaagcacatacctggtcatcaacagatcgcggacatcttcaccaa
atctctccccatcaagccgttctgcgatcttcgttacaaactaggcgtcgatgttccacctacgccgagt
ttgcggggg