;ID   ATCOPIA55_I DNA   ; ATH   ; 4424 BP
;XX
;DE   Internal region of ATCOPIA55 copia-like LTR-retrotransposon.
;XX
;AC   AL161511
;XX
;DT   01-OCT-2001 (Rel. 6.2, Created)
;DT   01-OCT-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; ATCOPIA55LTR; 
;KW   ATCOPIA55_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4424)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Repbase Reports 1:(1) p. 12 (2001)
;XX
;CC   ATCOPIA55_I is an internal region of the ATCOPIA55 copia-like 
;CC   endogenous retrovirus flanked by 1% divergent ATCOPIA55LTR 
;CC   LTRs and a 5-bp target-site duplication.
;CC   ATCOPIA55_I encodes the 1421-aa ATCOPIA55p copia-like polyprotein.
;CC   ATCOPIA55p:
;CC   MEQPIELYSQPLLNISNCVTVKLNGRNYLLWKTQFESFLSGQGLLGFVTGALKPPDPVLATPLTAEAAAV
;CC   ETVNPAYLSWVKSDQVVRSWLLGSLSEDILSEVVNTTTSQEVWLALAKHFNRVSSSRLFELQRKLQTIEK
;CC   RDRSMSDYLKEIKSICEQLASVGSPVNEKMKIFAALHGLGREYEPIKTSIEGSMDTVPTTFEDISPRLTG
;CC   FDDRLLAYTDAASITPHLAFNTQRYDSTTYYNKGRGSSSQKSKGRGGYTTQGRGFHQQISSGSSVSSGQS
;CC   VERPVCQICGKIGHPALKCWHRFDNAYQHEDMPTALAALRITDVTDQAGSEWCADSAATAHVTSSPHHLQ
;CC   QSRAYSGSDTVMVGDGNFLPITHTGSALLPTTSGTLPLLDVLVVPDIAKSLLSVSKLTTDYPCTLEFDAN
;CC   GVIVKDKVTKRLLTLGQNKNGLYTLKDPPVQAFYSSRQQAASDEVWHRRLGHPNSKILQQLVSTKAIIIN
;CC   KSTNRMCESCQIGKSSRLSFSDSQFVATRLLERVHCDLWGPSPVLSNQGFKYYVIFIDHWSRYCWFYPLK
;CC   CKADFYITFCKFQKFVETQFNQKISTFQCDGGGEFISHRFLKHLEESGIQQSISCPYTPQQNRLAERKHR
;CC   HITELGLSMLFSAKLPQKVWVEAFFTSNFLSNILPTTTLPNQMSPFERLHGHQPEYSALRTFGCSCFPTL
;CC   RNYASNKFDPRSLKCVFLGYNDRYKGYRCIYPPTGRVYISRHVIFDESSFPFQDTYLHLQNLGSTKLLEA
;CC   WQQNFMPSQKNQSETQAASVFSEDDFPPLPVTRVQVSPPNVTPQAAQSTVQREEQPADTDIQSNSPRNQA
;CC   ESPALVDRECIERTTGSDPASIGDNALSPQDSATQRSPVQSTETAGTSDQNQRTEAAVDPVQQVHPMVTR
;CC   SKKGVVKPNPRYVLLTQKASHPEPKTVTQALKHEGWKGAMGEEIDTCVETNTFSLVPYTPDMNVLGSKWV
;CC   FRTKINADGSLNKLKARLVAKGYHQEEGIDYLETYSPVVRTATVRLVLHIATVMEWNLKQLDVKNAFLHG
;CC   DLNETVFMHQPAGFVDKTKPNHVWHLHKSIYGLKQSPRAWYDKFTNYLLEFGFVCSIQDPSLFFYEQGRD
;CC   VLILLLYVDDIVLTGSNNILMDRLLQEMSKEFRMTDMGSLQYFLGIQAQNSDQGLFLSQQKYAEDLLQVA
;CC   GMIDCAPMPTPLPVQLHKVPKQNELFSNSTYFRSLAGKLQYLTLTRPDIQFSVNFVCQKMHAPTTADYNL
;CC   LKRILRYVKGTITMGLLFNKNTDFTLRTYTDGDYSQHSKQKKSATNNDAVFKLRAFSDSDEKQDVLQEDS
;CC   VPFLATISSPGRRRSNQLSPRAQQKPSIKPCQIQLLKSSGSITCSEISTFHNLIHRSSMETTFPPSILLQ
;CC   TRYFTHALNTFKLTIILLEKG
;XX
;DR   Positions  190949  195248  Accession No AL161511    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4424 BP; 1304 A; 1013 C; 898 G; 1209 T; 0 other;
ATCOPIA55_I
tggtatcagagccatggagcaacctattgagctctattctcaaccactcttaaacatttcaaattgtgtt
actgtcaaactaaatggaaggaactatcttctgtggaaaacacagtttgaatcgtttctctccggccaag
gtttactgggtttcgtcaccggcgctctcaaaccaccagatcctgttcttgcgactccactcaccgctga
agctgcagctgtggagacagtgaaccctgcgtatctctcttgggtgaaatctgatcaagtggtccggtca
tggcttcttggatctctgtctgaagacattctctctgaagtcgtcaacacaaccacgtctcaggaggtat
ggctagctctagcaaaacatttcaatcgtgtttcttcttcacgcttgtttgaactacaaagaaagttaca
aaccattgaaaagcgtgacagatccatgagtgattatttgaaagagattaagtctatctgtgagcaactt
gcttctgttggcagtccagtgaatgaaaagatgaaaatttttgctgccttacatggtctaggcagagagt
acgaaccgattaagacatctattgaggggtccatggatactgttcctacaacctttgaagacatctctcc
tcgtcttactggttttgatgatcgtcttttggcttacactgacgctgcaagcatcactcctcatcttgca
ttcaatacacagcgttatgactcaacaacctactacaacaaaggcagaggcagttcatctcaaaagtcca
aagggcgtggaggctatacaacacaaggaaggggatttcatcagcaaatctcctctggttcttctgtgtc
ttcgggtcagtctgttgaaagaccagtgtgtcagatttgtggaaaaataggacatccggctctaaagtgt
tggcatcgctttgacaatgcatatcagcatgaagatatgccaactgctctcgctgctctccgaatcactg
atgtcacagatcaagcaggcagtgaatggtgtgcagactctgcagctactgctcatgttacaagctcacc
tcatcacctgcagcagagtagagcttattcaggatctgacacggtcatggtaggagatgggaacttctta
ccaatcactcacacagggtctgctctcttaccaacgacatcaggtactctccctcttcttgatgttttag
ttgtccctgatattgcaaagtctctgttatcagtttcaaaactcacaaccgattacccatgtactcttga
atttgatgctaatggggtcattgtaaaggacaaggtaacaaagaggcttctcactctgggtcaaaataag
aatggtctgtacacgctgaaggatccacctgttcaagccttctattcatctagacagcaagcagcctcag
atgaagtgtggcatagacgtcttggacatccgaatagtaagatcctgcagcagttagtcagtactaaagc
tatcatcatcaataagagcaccaataggatgtgtgaatcatgtcagattgggaagagtagtagactttct
ttttcagattctcagtttgttgcaactagactactagagagagttcattgtgatctttggggaccctctc
cagttttgtcaaatcaggggtttaagtactatgtaatcttcattgaccattggtctcgttattgctggtt
ttatcctttgaaatgcaaggctgatttctacattactttctgcaagttccaaaagtttgttgaaacacag
tttaatcaaaagatcagtacctttcaatgtgatggagggggtgaatttataagccatagatttctcaaac
atttagaggaaagtggtatacaacagtcaatatcgtgtccttacacgcctcagcaaaatagacttgctga
gaggaagcacagacacatcacagagcttgggctgtcaatgctgttctcagctaagctgccacaaaaagtt
tgggtggaagcgttcttcacttcaaatttcctgagcaacattcttcctacaactactctaccaaatcaga
tgagtccatttgagagattacatggccatcaaccggaatattcagctttaagaacctttggctgcagttg
ttttcccactctaagaaactatgcatcaaataagtttgaccctcgttctcttaagtgcgtgttcttgggc
tacaatgatcgctataaaggctatagatgcatctatcctccaacaggaagagtttatattagccgccatg
tgatcttcgatgagtcttcttttcctttccaagatacctatcttcacctgcagaacttgggatcaacaaa
gcttcttgaagcgtggcaacagaatttcatgccttctcaaaagaatcaaagtgaaactcaagctgcttct
gtgttctctgaagacgactttcctcctctaccagtcacacgggttcaagtttcaccaccaaatgtcacac
ctcaagctgctcagtccacagtacaacgagaagaacaacctgcagatacagacattcaatcaaactcacc
aagaaatcaagccgagtcaccggctcttgtggacagagagtgcattgagcgtacgacaggctcagatcct
gcttctataggcgacaacgctctcagtccacaagacagtgccactcaacgttctcctgttcagtcaacag
aaacagctggaacttcagatcaaaatcagaggacagaagctgcagttgatccggttcagcaagttcaccc
aatggtaacaagatcaaagaagggagtagtcaaaccaaaccccagatacgtccttctaacacagaaagca
tcacatccagaaccaaaaactgtgacacaagcactgaaacatgaaggctggaaaggtgctatgggcgaag
aaattgacacttgtgttgaaaccaacactttttctttagtcccatacacacctgacatgaatgttttagg
aagtaaatgggtgttcagaaccaaaataaatgctgatggcagtttgaacaagttgaaagctagactagtg
gctaaaggatatcaccaagaagaaggaatagactacttggagacctacagtccagttgtgagaacagcca
cagtgagacttgtcttacatatagcaacagtgatggaatggaatctgaaacagttggatgtgaagaatgc
tttcttacatggagacttaaatgaaacagtctttatgcatcaaccagctggatttgtggataagacaaaa
ccaaatcatgtttggcatctccacaaatctatatacgggttaaaacaatctccccgagcctggtatgata
agtttactaactacttgttggagtttggttttgtttgcagcatacaagatccatcactattcttctatga
acaaggacgagatgtgctcattctacttttgtatgtagatgatatagtcctaaccggtagcaacaacatt
ctcatggatagacttctgcaggaaatgagcaaggagtttcgaatgactgacatgggatctctgcaatact
ttctcgggattcaagcacagaactctgaccaaggcttgttcttatctcaacagaagtatgctgaggatct
tctacaagtcgcaggaatgatcgattgtgcaccaatgcctactcctttgccagttcaacttcacaaagtt
cctaaacaaaatgagctattctcaaactccacttacttccgcagtttggctggcaagcttcagtatctga
cattgactaggccagatattcagttttcagtaaacttcgtatgtcaaaagatgcacgctccaacaacagc
tgattacaatctgcttaagaggatccttaggtatgtaaagggaaccataaccatggggttactcttcaac
aagaacacagacttcactcttcgaacctacactgacggtgactatagtcaacactcaaagcaaaagaagt
ctgctacaaataatgatgcagtcttcaagcttcgagccttcagtgatagtgatgagaaacaagacgttct
acaggaggattctgtacctttcttggcaacaatatcatctcctggtcgtcgaagaagcaaccaactgtct
ccaagagctcaacagaagccgagtataaagccttgtcagatacaacttctgaaatcatctggctcaataa
catgctcagagatctccacattccacaacctgatccaccggagctctatggagacaacctttcctccatc
tatcttgctgcaaacccggtacttcacacacgctctaaacactttcaaactcactatcattttgttagag
aaagggtagcgttgggttcgttgattgtcaagcatgtgccatcccaccagcagttggctgatatattcac
caagccattgcccttcgatgctttcacttcgctaaggtacaaactgggtgtagatttgccacccacacca
agtttgcgggggag1