;ID   ATCOPIA91_I DNA   ; ATH   ; 4988 BP
;XX
;DE   Internal region of the ATCOPIA91 copia-like LTR-retrotransposon.
;XX
;AC   AB026643
;XX
;DT   30-NOV-2001 (Rel. 6.3, Created)
;DT   30-NOV-2001 (Rel. 6.3, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; the ATCOPIA91 
;KW   family; ATCOPIA91LTR; ATCOPIA91_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4988)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal portion of the ATCOPIA91 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(3) p. 26 (2001)
;XX
;CC   ATCOPIA91_I is an internal region of the ATCOPIA91 copia-like 
;CC   endogenous retrovirus flanked by the 93% identical ATCOPIA91LTR
;CC   long terminal repeats and a 5-bp target-site duplication (TTTTT). 
;CC   ATCOPIA91 forms a separate family of copia-like retroviruses
;CC   present in the A. thaliana genome since members of other families
;CC   are less than 75% identical to ATCOPIA91_I. 
;CC   ATCOPIA91_I (positions 357-4973) encodes remnants of the ~1540-aa 
;CC   ATCOPIA91p copia-like polyprotein. The ORF which encodes 
;CC   ATCOPIA91p is damaged by two indels that produce false frame shifts
;CC   at positions 735 and 2162 (marked by Xs in the ATCOPIA91p 
;CC   sequence).
;CC   ATCOPIA91p:
;CC   MTTSSNTNVDSSRQSIDQYENPFFLHSSDHAGLVLVFDRLTTGADFHSWRRSVRMTLNVRNKLGFIDGNI
;CC   TKPSPDHHDFGSWSRCNDMVATWLMNSVCKKIGQSLLFISTAEGIWKNLLARFKQDXXAPRVYEIEQRLS
;CC   IIQQGAMDVSSYYTELVTLWEEYRNYVELPVCTCGRSECNAAVLWERLQQRSRVTKFLMGLNESFESTRR
;CC   QILMLKPIPTIEDAFNMVTQDERQRSIKTPSSKTVVFQASGPNQSSGQCYQDVSSYQGQMDNTAFAVQNE
;CC   YRPRPPRPVCTQCGQSGHVVQKCFKIIGYPPGYIPRFKSTISNYQSQRSPAPSTFQPRGYSANAASKPHS
;CC   VANIMTNPPSLYIPPPATEVNNLDINKLSGDQIQTLIQQLSGRIQTSEPLAPSPSTSAPSTVTEHGIMAV
;CC   QSSAGTIPFPSSSLRFENDQLTFQHQCLSSLYTNLPHGSWIIDSGATTHVCSDLAMFNETNTVFGVTVSL
;CC   PNDTRVQITHTGTIPLSHSLILQDVLHVPSFKFNLISVSSLLKTNHCSSHYYVDSCIIQEFIQGLTIGRG
;CC   ILLHNLYILRLDAPSTHDQFAGSLVVDGILWHQRLGHPFADKLKCIXMHRISGTLSSSKSSFMHPFHCSI
;CC   CHLAKQKRLSYESHNHLSSLSFDLVHLDVWGPFSIESLEGYKYFLTIVDDCTRVTWVYMMRNKSEVTQHF
;CC   SDFIKHVLTQYKAVIKMIRTDNAPELAFKSIVKEHGMVHQFSCAYTPQQNSVVERKHQHLLNVARSLLFQ
;CC   SNVPIAYWSDCLLTVVFLINRIPSVLLKNVSPYELLTKRKPAYDFLRSFGCLCYVSTLQKDRNKFSPRSE
;CC   KCIFLGYSSSYKGYKVLHIDLNCVSVSRNVVFHENIFPFHDNPISPVSDVFSHAILPLPIHVNDEIHTSS
;CC   SPNAHNEHSHASSASSSSTTSSQPSTSSSSIIPVLPEAVTTETACVSLSVARPKRQGKAPSYLSDYHCSF
;CC   TQISEPSPQNSFPHLKVYSTPYLISSVLSYSSLKSPFQSFVLSYSAETEPKTFKQAIISVQWTKAMDEEL
;CC   GAMELNKTWSVVSLPPGKNVIGCKWVYTIKYNPDGTIERYKARLVAKEFTQQEGVDYFDTFSHVAKLASV
;CC   KLVLGCAAKKGWSLTQIDVSNAFLHSELDEEIYMSLPQGYTPSSGFLPSHHVCRLHKSIYGLKQTSRQWY
;CC   KCLSKTLLDAAFIQSQSDNTLFVQLQGTSFIAILVYVDDILIASNDSNQVTLIKAYMADHFKIKDLGPAR
;CC   FFLGLEIARNSEGIAICQRKYCLDLLTDAGLLGCKPSTVPMDPKVNLTADMGTLLDNAKPYRKLVGRLLY
;CC   LCVTRPDITFAVHRLSRFLSCDTDVHMQAAQRVLKYLKGNPGQGLFYSTNTSLCLNGFADADWGTCLDSR
;CC   RSVSGVCIFLGTFLISWKYKKQDVCSSSSTEAEYRSMAVATKDLLWFSYMLKDLHIKVETKAKLFCDNKS
;CC   AMHIANNPVFHERTKHVEIDCHTTRDQMKFGFLQVHHIGTENQLADILTKPLHPGPFKSLLNRLGVSNLF
;CC   LPKE
;XX
;DR   Positions 28125 23138  Accession No AB026643    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4988 BP; 1419 A; 964 C; 912 G; 1693 T; 0 other;
ATCOPIA91_I
tggtagcagagcacaaaagacattttctctcactataacagaaatctcagagtttcttcttccgccaaaa
ctcatcttcttcttcgatttcatccaatttctatggaatcaccatcgttatcaccctaaatcttccattt
cttttctttactgagctccgattcatcaatagaggtctcaaaatcaccaatcgcaatcacattcgcgatt
ttcactaacagttgaagctcaagctcgaatctccttcatctttcagatcgattcgttctacatcacgatc
gattcaactctaatcgtttttcggttcgtcgttctaatcaatttcagtatcatctctcgattcttgattt
gtttcaatgacgacatcatccaatacaaatgtggatagctcgcgacagtctattgatcagtatgagaatc
cattctttcttcacagctccgatcatgccggattggttctagttttcgatcgtctcactactggagcaga
ttttcattcttggagaagatccgttcgtatgacactcaatgtacgtaacaagcttggttttattgatggt
aatatcactaaaccttcacctgatcatcatgattttggatcttggtctcgttgcaatgatatggtagcaa
catggttaatgaattctgtatgcaagaagattggtcaaagcttgctgtttatatctactgctgagggaat
ttggaaaaatcttttagctaggttcaaacaagatggcacctagagtctatgaaatagaacagcgtttgag
tattattcaacaaggtgctatggatgttagttcatattatacagaacttgtgacattatgggaagaatat
agaaattatgtggaattgcctgtttgcacttgtggaagatctgagtgcaatgctgcagtgttgtgggaaa
gattgcagcagaggagccgagtcactaagtttcttatgggactgaatgagtcatttgagtctacaagaag
acaaattttgatgctgaagccgattccaactattgaggatgcattcaatatggtaactcaagatgaaagg
cagagaagtattaagacaccttcatcaaaaactgtagtgtttcaagcttcaggacctaatcaatcttctg
gtcaatgttatcaagatgtctcttcatatcaaggtcagatggataatacagcttttgcagtacaaaatga
atatcgtccgagaccacctagaccagtttgcacccaatgtggtcaatctggacatgtagtacagaaatgt
ttcaagattattggttatcctccagggtatattccaaggtttaagagcaccatctctaactatcaatctc
agagatctcctgctccatcaacctttcagcctagaggatattctgcaaatgctgcatctaaacctcattc
agttgcaaatatcatgactaatcctcctagtctatacattccaccaccagctacagaagttaacaacctt
gatattaacaagctgagtggagatcaaattcagacattgattcaacaactctcaggtcgtattcaaactt
cagagcctttggctccttctccatcaacatcggcaccatcaacagttacagaacatggtataatggctgt
tcaatcgtctgctggtacaattccttttccttcttcatctcttcgttttgaaaatgatcaacttactttc
caacatcaatgtttatcttccttatacacaaatcttcctcatggtagctggataattgatagtggtgcca
ctactcatgtgtgttctgatctagcaatgtttaatgaaactaatacagtttttggggtaactgtttcttt
accaaatgatactagagttcagatcacacacacaggcacaattccattatctcattcactcattttgcaa
gatgtgttgcatgttccttcttttaaattcaatttgatttctgtgagtagtcttttaaaaacaaatcatt
gctcatctcactactatgttgattcttgtattattcaggagtttattcagggattgacgattggtagagg
catcctacttcataatctctatattttgcgacttgatgcaccatctacacatgatcaatttgctggatcc
ttggtagttgatggaattctttggcaccaacgccttggacatccatttgctgacaagttaaaatgcatcg
catttcgggtactctttcttcgtctaagtctagtttcatgcatccatttcattgttctatttgtcattta
gcaaaacagaaaagattgtcatatgagtctcataatcatttgtcttcactttcatttgatcttgttcatt
tagatgtgtggggtcctttttctattgagtctttagaaggatacaaatattttcttactatagtagatga
ttgtactcgtgtaacatgggtttatatgatgagaaataaaagtgaagttacacaacatttttcagatttt
attaaacatgttcttacacaatataaagctgttattaaaatgattagaactgataatgctcctgaacttg
cttttaaatctattgttaaagaacatggaatggtacatcagttttcttgtgcatacacacctcaacagaa
ttcagtagtagaacgtaaacatcaacatctactgaatgttgcaagatcattgttgtttcagtctaatgtc
ccgattgcatattggagtgattgcttgttaacagtcgtgtttcttattaataggattccatcagttttgt
tgaaaaatgtatcaccttatgaattacttacaaagagaaaacctgcttatgattttttaagatcttttgg
ttgtctttgttatgtttctactttgcaaaaggataggaataagttctctcctaggtctgaaaagtgtatt
ttcttgggttattcatcaagttataagggatataaagtcttgcatattgatttaaattgtgtttcagttt
ctaggaatgtagtatttcatgagaacatctttcctttccatgataatccaatctcacctgtttctgatgt
ttttagtcatgctatcttaccattacctattcatgttaatgatgaaatccatacatcatcatcacctaat
gcacataatgaacattctcatgcatcatctgcatcatcgtcatccacaacatcatcacaaccatccactt
caagctctagtatcataccggttttgccagaagcagttacaacagagacagcatgtgtttcactttctgt
tgctaggccaaaacgccaaggtaaggctcctagttacttatctgactatcactgttctttcacacaaatt
tctgagccttcaccacaaaattcttttccacatctcaaagtttattcaacaccatatcttatatcttcag
ttctctcttattctagcttaaaatctccatttcagtcttttgttttatcttattctgctgaaacagaacc
taagacttttaaacaagccattatatctgttcaatggactaaagccatggatgaggaacttggggctatg
gagcttaataagacttggagtgtggtttctttgcctccgggaaagaatgtaataggttgtaaatgggttt
ataccattaagtacaatccggatggtactattgagagatacaaagcaagattggtagcaaaagaatttac
acaacaagaaggtgtggattattttgatacattctctcatgttgctaaactagctagtgttaagctggtt
cttggttgtgctgccaagaagggttggagtttaactcaaattgacgtttctaatgccttcttgcatagtg
agcttgatgaagagatctacatgagtttgccacagggatatacaccttcttctggatttcttccttctca
tcatgtgtgcagattacataagtccatctatgggttgaagcaaacatcacgccaatggtacaaatgtctg
tcgaagactctcttggatgccgcttttattcagtctcaatctgataacaccttgttcgtgcagctacaag
gtacctctttcattgctatccttgtttatgttgatgacattctcattgctagtaatgacagtaatcaagt
cactttgataaaagcatatatggctgaccatttcaaaatcaaagatctaggaccggctagattttttctt
ggtcttgagattgctaggaattcagaaggaattgccatttgtcaaaggaaatattgcttggatcttctca
ctgatgcaggcttgttgggttgcaaaccgagtactgttccaatggatccaaaggtgaatcttactgcaga
tatgggaactctcttggataatgcaaagccatatagaaaattggttggtcgcttgttatatctctgtgtt
acaaggcctgatatcacatttgcggttcatcgacttagtcgtttcctgtcatgtgatactgatgtgcata
tgcaagcagctcagagagtattgaagtatcttaaaggtaatccagggcaaggtttgttttactctacaaa
cacttctctttgccttaatggatttgctgatgctgattggggaacatgcttagactctcgacgttctgtt
tctggtgtttgcatttttctgggaacttttttgatttcctggaaatataagaagcaggatgtgtgtagta
gtagtagcacagaggctgagtatcgcagtatggcagtggctacaaaggatcttttatggttcagttatat
gttgaaggatttgcatattaaagttgaaacgaaggcaaagcttttctgcgataataagtctgcgatgcat
atagcaaacaatccagtttttcacgagaggaccaaacatgttgaaatcgattgtcacacaacgagagatc
aaatgaagtttggattcttgcaagtgcatcacattggaacagagaatcagctggccgatatactgaccaa
gccattgcatcctggccctttcaaatcacttcttaatcgtcttggagtttcaaacctcttccttcctaaa
gagtgatttgtggggggc1