;ID   ATCOPIA81_I DNA   ; ATH   ; 4502 BP
;XX
;DE   Internal region of the ATCOPIA81 copia-like LTR-retrotransposon.
;XX
;AC   AL157735
;XX
;DT   30-NOV-2001 (Rel. 6.3, Created)
;DT   30-NOV-2001 (Rel. 6.3, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; the ATCOPIA81 
;KW   family; ATCOPIA81LTR; ATCOPIA81_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4502)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal portion of the ATCOPIA81 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(3) p. 6 (2001)
;XX
;CC   ATCOPIA81_I is an internal region of the ATCOPIA81 copia-like 
;CC   endogenous retrovirus flanked by the 98% identical ATCOPIA81LTR
;CC   long terminal repeats, and a 5-bp target-site duplication (AACAT). 
;CC   ATCOPIA81 forms a separate family of copia-like retroviruses
;CC   present in the A. thaliana genome. 
;CC   ATCOPIA81_I encodes remnants of the ATCOPIA81p copia-like 
;CC   polyprotein. The ORF which encodes ATCOPIA81p is slightly damaged
;CC   by false stop codons.
;CC   ATCOPIA81p:
;CC   MWLKICGKHLKKRFSVMNGSRLQQLKSELACCKQRGLAIESYFGKLTRIWDNLATHRPLRVCRCGKCTCN
;CC   LGAAQEADREEDKTHEFLNGFDEQFRAVRSTLVARTPIQPLEEVYNIIRQEEDLRVPVEESPSMTAFAVQ
;CC   SKPRLPTDEKEKSFFCKLCNRSGHTAERCYAVIGYPEWWGDRPKGRTLQGKGRGGSTSRGGRGRGASQEV
;CC   ANRVYVPNVENVTTEQANHVLTDDDRSGAHGLNDTQWKMIKSILNAGKQPSTEQQTSMSSLSPWIMDTGA
;CC   SHHLTGRFETLTNVREMPPVLIIMADGREQVSYKEGSISLGSHLVMKSVYYVEELKTDLMSLGQLMDENK
;CC   CVVQLADRFLVVQDRISRMVIGCGKRVGGTFHFRSTKIAASAATKEIKSFELWHNRMGHPSTQVVGKLPF
;CC   VSASVSSSSLNKPCDICLRSKQTRDSFPLSMNKSSNCFELIHVDLWGPYRTPSHSGARYFLTIVDDYSRG
;CC   VWLYLLTTKSEAPEQLKKFCALTERQFNTKIKRIRSDNGTEFLCLTKYFLTNGIIHETSCVATPQQNARA
;CC   ERKHRHILNIARALRFQASLPIEFWGECVLTAAYLINRTPSSVLDFATPFERLFNKAPTYDHIRVFGSLC
;CC   YAHDQNKSGDKFASRSKCCVFVGYPYGKKGWRLYDLEKLEFFVSRDVVFSETQFPFAPINHLQASDESKA
;CC   LWAPISEFLENDDCGLRKPNPIRSVALGPVSSSQLSTPISSANDTRRSENSDGDNGGARQLPEPTAGDKI
;CC   LPSTAPNPIPVAPTSRLTPAAVVMPPVPTEELLGKGKRNRTPSVRLKDFVVPHAPKPKQQEEINLVCAEN
;CC   LTHNVDVHRFSETHVAYVAAVLSNLEPRSFKQAMQEEKWRNAVGSEYGTLEENNTWTIEDLPPNKKAIGS
;CC   QWIFKVKFKSDGTIERYKARLVPMGNKQIEGEDYGETFSPVVKMGTVRLFLDIAVKKGWIIHQMDVHNAF
;CC   LHGDLEEEVYMKLPPGFESVDKNKVCRLRKSLYGLKQAPRCWFAKLSSALLEYGFQQLRSDYSLFTYAQG
;CC   TTRLNILVYVDDLVIAGSSLKATESFKAYLSSCFHMKDLGELKYFLGIEVARNASGIYLCQRKYALDIIT
;CC   ETDQLGAKPAHFPLEANHKLALSESVLLHDPKPYRRLLGRLIYLGVTRPDLAFSVHVLAQFMQNPRLDHW
;CC   LATLRLVRYLKSDPGQGILLRADGNFQVTGWCNADWDNCPITRCSVTGYLCS
;XX
;DR   Positions  64171 59670 Accession No AL157735    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4502 BP; 1285 A; 900 C; 1021 G; 1296 T; 0 other;
ATCOPIA81_I
tggtatcagagcctgaaaagcaaaactcttgattttttttttctcgacaatgtcaactgaaagcaacacc
tccgctgcgactgaagtaaggagaacaatctctccttatgacctgacttctgctgataatcctggagccg
taatctctcatcctctcttgaagggaagcaactatgaagagtgggcttgtggttttcgaaccacgttgat
atcacggaaaaagtatggatttctcgatggttcaatttccaaaccagaagagacttcccctgattttgaa
gactggacgacgattcaagctcttcttgtgtcatggatcaaaatgacaattgaaccaactcttcgatcca
ccatttctcatagagatgtggctcaagatctgtgggaaacatctgaaaaagaggttctctgtcatgaatg
ggtcacgactacaacaactgaaatctgagctagcttgctgcaaacagagaggtttggctattgaatctta
ctttggcaagttgactaggatctgggataacctcgccacacatcgtccgttgcgtgtttgtcgttgtggt
aagtgtacctgcaatcttggtgctgctcaagaggcagatcgggaggaagacaagactcatgaatttctca
atggttttgatgagcagtttcgggcggtccgatcaactctagtggctcggactcccattcaaccacttga
agaggtgtacaacatcattcgtcaggaagaggacttgcgtgttcccgtagaagagtcaccgtcaatgaca
gcttttgcggtgcaatccaaaccacgtcttccaactgacgaaaaggagaaaagtttcttctgcaaactgt
gtaatcgatctggtcacactgcagagcggtgctatgctgtcatcgggtatccagagtggtggggtgatag
acccaagggtcgtactctacaagggaagggacgtggtggcagtacgtctcgtggtggcaggggacgcgga
gcctcccaagaggttgctaatcgggtttatgtgccgaatgtggaaaacgtgacaactgaacaagctaatc
atgttctcacagacgacgaccgatctggagctcatggactgaatgatacccagtggaaaatgataaaatc
aattctcaatgcgggaaaacaaccttcaactgaacaacaaacgagtatgtcttctctctctccttggatt
atggatacaggggcgtcacatcatttaactggaaggtttgagaccttaacaaatgtgcgagaaatgcccc
ctgttttgatcattatggcagacggtagagagcaagtttcttacaaagaaggttctattagtcttggtag
tcatttggtaatgaaatctgtttactatgttgaagagttgaaaactgatttgatgtctttggggcagtta
atggatgaaaacaagtgtgttgttcagttggctgatcggtttctcgtggttcaggaccgcatttcgagga
tggtgattgggtgtggtaaaagagtgggtggtacctttcactttcgtagtacaaagatcgcagcttcagc
cgcaacaaaggaaatcaagtcatttgagttgtggcataatcggatgggtcatccgtcgacgcaagtggtg
ggaaagcttccgtttgtttctgcttctgtttcttcttctagtttgaataagccttgtgatatttgtcttc
gttcaaaacagacaagggatagttttccattgagtatgaataaaagttcaaattgttttgaattaataca
tgttgatttatggggtccttatagaactccatctcattctggagcaagatacttcttaactattgttgat
gactactccagaggagtgtggctgtatttgttaaccactaaaagtgaagctccagagcagttgaaaaagt
tttgtgctttaacagagagacaattcaataccaaaataaagaggatccggagtgacaacgggactgagtt
tctatgcttaactaagtattttctcaccaatggtatcatccatgaaacctcgtgtgtagctactcctcaa
cagaatgcaagagctgaaaggaaacatcgccatatcttaaatattgctagggcattgagatttcaagcgt
ctttacccattgaattttggggtgaatgtgtgttaaccgcagcgtatttgataaaccggactcctagttc
agttctcgacttcgcaacaccattcgaacgcttatttaacaaagctccaacttatgatcatatcagagtt
ttcggctccctgtgctatgcccatgatcagaataaaagtggtgataaatttgcttcccgaagtaaatgtt
gtgtttttgtgggttatccgtatgggaaaaaaggctggagattatatgacttagaaaaattggaattttt
tgtctcaagagatgttgttttctctgagacacagtttccttttgctccgattaatcacttgcaggcttca
gacgaatctaaagctttgtgggctccaatatctgaatttttggaaaatgatgattgtgggcttcgtaagc
ccaatcctattaggtctgttgcattagggcctgtatcatccagccagctgtcgactcccatctcatctgc
aaatgatacacgccggtctgaaaattcagacggtgacaacggtggtgcccgacaacttcctgaaccaacc
gccggtgacaaaattctgccttctacggcacccaatccaattccagttgctccaactagccgattaacac
cggctgctgtggttatgcctccagttccgacagaagagctattaggtaaagggaaacgcaacaggacccc
gtcggttcgtttaaaagattttgttgtcccacacgctcccaaaccgaaacaacaagaagaaattaatctg
gtttgtgctgaaaatctgactcacaacgtggatgttcaccgtttctccgaaacacatgttgcttatgttg
ctgcagttctctccaacttggaaccaagatcattcaaacaagctatgcaagaagaaaagtggcgaaatgc
tgttggttcagaatatggaacattagaagaaaacaacacatggaccattgaagacttacctcctaacaaa
aaggcaatagggagtcaatggatcttcaaggtcaaatttaaatcagatggtacaatagagagatataaag
ccagattggttccaatgggaaataaacagattgagggagaagattatggagagactttctctcctgtggt
aaagatgggaacagtgagattgtttcttgacattgctgttaaaaaagggtggattatacatcaaatggac
gtccacaatgccttcttacatggtgatttagaagaagaggtatatatgaagttgcctcctggttttgagt
ctgttgataagaataaagtttgccgattgcgcaaatctctgtatggccttaagcaggcacctcgatgctg
gtttgcaaagctctcttcagctcttcttgagtatgggtttcagcaattacgcagtgactactcattattc
acttatgcacaaggtactactcgtctaaatattctggtttatgttgacgatttagttatcgctggaagta
gcctgaaagctacagaatcatttaaagcctatctctcttcatgcttccatatgaaagatctaggagaact
taaatattttctgggaatagaggtggctagaaatgcatctggcatatatctttgtcagagaaaatatgca
ctagacatcattactgaaacagatcaactaggagcaaagccagcacactttcctttggaagcaaatcaca
agctagcactctccgagtctgttctattacatgatccgaaaccatatcgtcgtttgttaggacgtcttat
ttacttgggagttacacgtcctgatctcgcgttctcggttcatgttcttgctcaatttatgcagaatcca
cgattggatcattggctcgcaactttacgactcgtacggtatttaaaatctgatccaggacaaggtatct
tgttgcgagccgatgggaattttcaagtcactggctggtgtaatgcagactgggacaactgcccgataac
acgttgctcagtcacaggctatttgtgcagctaggagactctccgatcagttggaaaacgaaaaagcaga
aaacggttagtctttcatcagctgaagcagaatatcgggcacttgctaaacttgttcaagagcttatttg
gatcaagatgatgcttaaaactcttggggttgttcatacttaaccaatgttagtgcaatgtgatagtaaa
tctgcaatctacatcgccacaaatccggtcttctacgaacatacgaaacatatcgagattgatctacatt
ttgtcagagacgaagttctaaaacgagagattcagctttgtcatgtggattctttttcacagcttgcaga
catcttaacaaaacctattgggaaagatggttttcgttacttcaagtccaagctgggaacactaaatctg
tattctccagcttgagggaggg1