;ID   ATCOPIA62_I DNA   ; ATH   ; 4639 BP
;XX
;DE   Internal region of ATCOPIA62 copia-like LTR-retrotransposon.
;XX
;AC   AL163975
;XX
;DT   26-OCT-2001 (Rel. 6.2, Created)
;DT   26-OCT-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; ATCOPIA62LTR; ATCOPIA62_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4639)
;RA   Jordan,N., Bangert,S., Wiedelmann,R., Voss,H., Unseld,M., Mewes,H.W.,
;RA   Rudd,S., Lemcke,K., Mayer,K.F.X., Quetier,F. and Salanoubat,M.
;RL   Direct submission to GenBank (April 2000)
;XX
;RN   [2] (bases 1 to 4639)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (August 23, 2001)
;XX
;CC   ATCOPIA62 has been found by [1] and slightly modified by [2].
;CC   ATCOPIA62_I is an internal region of the ATCOPIA62 copia-like 
;CC   endogenous retrovirus flanked by the 1% divergent ATCOPIA62LTRs
;CC   and a 5-bp target-site duplication [2].
;CC   ATCOPIA62_I encodes the 1481-aa ATCOPIA62p copia-like polyprotein.
;CC   ATCOPIA62p
;CC   MAPGRKISTRRTIRVPVSARRSANRDDSSPEGSPEPRARTRNPVTESHDSIHSPYYLTNSDNPGASITSE
;CC   VFDGTNYDDWKISIKIALDAKNKLVFIDGSVPRPPESDPMFRIWSRCNSLVKSWLLNSVSKPIYKSILRF
;CC   DDASEIWNDLSTRYHITNLPRSYPLTQQIWSLQQGTMDLTTYYTTLRTLWNELDGSDCVTLCKRCDCCKA
;CC   MDKKAEHARVIKFLAGLNESYAVIRSQIIMKKHVPSLAEIYNLLDQDHSQRSFTPVPSNAAAFIVSAPEQ
;CC   VQPSVNATFNNAKPQKVICSHCGYTGHTVDRCYKIHGYPLGFKHKNKNQSDKSVSLEKSVSTVKPVVAHM
;CC   ALTDSTTNDLINGLTKVLTKDQINGVVAYFNSQMQNSSIASSSGATITALPGIAFSSSTLGFIGVLKATV
;CC   NVLSSETWIIDSGATHHVCHDKNLLMRLSETMNSSVTLPTGFGVKITCIGTVKLNEFLVLNNVLYIPDFR
;CC   LNLLSVSQLTKDLGYRVTFDEDYCLIQDHVKGLMIGRGEQINNLYVLDVPRIKDFPTKEISFHANIVVDS
;CC   SLWHSRLGHPSVTTSDIVTDVLGFKQRNERSFHCTICPLAKQKRLPFVSKNHVCDSAFDLVHIDVWGPFN
;CC   VPTPDGFRYFLTIVDDHTRVTWLYLMKNKNEVLTIFPDFLKMIETQYKSQVKGVRSDNAPELKFVKLFKE
;CC   KGIIHYFSCPETPEQNSVVERKHQHILNVARSLMFQAQVPVEYWGECVLTAVFLINRLPTPLLHDKSPFE
;CC   VLTNKMPDFNGLRVFGCLCYSSTSTKNRDKFQPRAKACVFLGYPPGVKGYRLLDLETNIIYVSRNVVFHE
;CC   DIFPFAKSGSTVLPDYFATETSNVDASSTEASTSEAPAVVNDSVTPSNINPVVVSESPTDTNDIVDSTIP
;CC   AVSSTDKTSKGRTSKTPAYLQDYYCNLSTNGVEHPISNFLNYDGLADSHRAYICSITKYAEPTSFTQARK
;CC   SDDWLKAMNDELKALEGTATWKICSLPPDKHAIGCRWVYKVKLNADGSLERYKARLVAKGYTQQEGVDFV
;CC   DTFSPMAKMTIVKTLLVVAAAKKWSLHQLDISNAFLYGDLEEEIYMTLPPGYTTKEGETLPPNAVCKLQK
;CC   SLYGLKQASRQWFLKFSTTLMLLGFQRSQADHTLFVRNVNGKYIAVLVYVDDIIIASNDDAEVVELKADL
;CC   ERAFKLRDLGTLKYFLGLEIARNASGISVCQRKYALGLLEETGLLACKPSNIPMEPSIKLISDGDEPPME
;CC   DPASYRRLVGKMMYLTITRPDITYAVNRLCQFTSAPKESHMKAAHKVLHYVKGTVGTGLFYSADCDMTLQ
;CC   AYTDADWASCRDTRRSTSGFCMFLGTSLISWKSKKQQTASHSSAESEYRAMEFAVREVAWLVNLLREFQA
;CC   PQLKSVAFFCDSTAAIHIANNAVFHERTKHVELDCHILRDKVMSGLIKTLHLKTDQQVADVFTKPLFPTQ
;CC   FKALVGKMALQ
;XX
;DR   Positions 63260  58622  Accession No AL163975    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4639 BP; 1313 A; 859 C; 945 G; 1522 T; 0 other;
ATCOPIA62_I
tggtatcagagcatgaatggtgaatcattcatacgattcttgacctcgttttcttcttttctttttcatt
tcgatcagtttcttcagtctcgtcactgagaaactgttgtatcgaccaaaaatttcaacaaaaatttctt
cctttgccgtgaaattggagcttctctaatggcgcctggacgtaaaatctctactcgtcgcacgattcgc
gttcctgtttcagctcgtagatcggcaaatcgcgatgattcttcacctgaaggttcacctgaacctcgag
ctcgtactcgaaatccggtaactgaatctcatgatagtatacattcaccctattatcttacgaatagtga
taatcctggagcttctattacttctgaagtgtttgatggaacgaattatgatgattggaaaatttcgatc
aagattgctttagatgcgaagaataagcttgttttcattgatggatctgttcctcgacctcctgaatcag
atcctatgtttcgaatttggtcccgatgtaacagcttggttaagtcttggctcttgaattcggtgtcaaa
accaatatataagagtatccttcgtttcgatgatgcctcagagatatggaatgatctttcaactcgttat
cacattactaatcttccaagatcttatccgttaactcaacagatttggtcacttcaacaagggactatgg
atcttactacttactatacgacattgaggactctctggaatgaattggatggttctgattgtgtgacttt
gtgtaaacgttgtgattgttgcaaagctatggataagaaagctgaacatgctcgtgtgataaagtttttg
gctggcttgaatgaatcatatgccgtcattagaagccaaatcatcatgaagaaacatgtgccttccttag
ctgagatttacaatttgttggatcaagatcacagtcaacgcagcttcacaccggttccttctaatgcagc
tgcatttattgtatctgcgccagaacaagttcaaccttctgtgaatgccacattcaacaatgcgaaacca
cagaaagtcatatgttctcattgtggttacacaggacatactgttgatcgttgttacaagattcatggat
atccacttggttttaaacacaagaataagaaccaatctgataagagtgtttctttggaaaaatcagtttc
tacagttaaacctgttgttgctcatatggctttgacagatagtactacaaatgatcttattaatggtctg
actaaggttcttaccaaggatcaaattaatggagttgttgcatacttcaattctcaaatgcagaattcct
ctattgcttcctcgtctggtgctactattaccgcattacctggtattgctttctcctcctctactcttgg
ttttattggtgttttgaaagctactgttaatgttttatcctcggaaacttggataatagacagtggagca
actcatcatgtttgtcatgataagaatttgcttatgagattatctgaaactatgaatagttcagttacct
tacctactggttttggagttaagatcacatgtataggtacagtgaagctgaatgagttcctcgtcttgaa
taatgtgctttacattccggattttcgccttaatcttctgagtgtcagtcagctgactaaagatctggga
tatagagtgacatttgatgaggattattgccttatacaggatcatgtcaaggggctgatgattggtagag
gtgagcagatcaacaatctatacgtcctggatgttccgagaattaaggattttcctactaaggaaataag
ttttcatgcaaacattgttgttgattctagtctttggcatagtagactaggtcatccatctgtaactact
tctgatatagttactgatgtacttggatttaaacaaaggaatgaaagatcttttcattgcaccatttgtc
ctcttgcaaaacagaagcgtcttccctttgtttccaagaatcatgtttgcgactcagcttttgatttagt
tcatatcgacgtctggggtccattcaatgttcctactccagatggttttcgatattttctaaccattgtt
gatgatcatacacgggtcacttggttgtatcttatgaagaacaagaatgaagtgttgactatcttcccag
attttctgaaaatgatagagactcagtacaagagtcaggtgaaaggtgttagatcagacaatgcaccaga
attgaagtttgtgaagttgtttaaagaaaagggcatcattcattatttctcttgtccagaaacaccagaa
caaaactcggtggtggaaaggaaacatcaacacatattgaatgttgctcgttctcttatgtttcaagctc
aagtgcctgtggaatattggggagagtgtgtgttaactgcagtctttctcatcaatcgattgcctacacc
attgcttcatgacaaatctccttttgaagtgcttactaacaaaatgcctgattttaatggtcttcgtgtg
tttggctgtctttgttacagttctacatcaaccaaaaatcgagataagtttcaaccaagagctaaggcgt
gtgtgtttcttggttatccaccaggtgttaagggttatcgacttttggatttggaaaccaatatcatata
cgtctcacgcaatgttgtttttcatgaagacatttttccatttgctaaaagtggatctactgttcttcct
gattattttgctactgaaacatctaatgttgatgcatcttctactgaagcatctacttctgaagcacctg
cagttgtgaatgattctgtcactccatctaatatcaatcctgtagttgtgagtgaatctcctacggatac
taatgatattgttgacagtactattcctgcagtttcttctacggataagacaagtaaaggccgaacaagt
aagactcctgcttacctccaagactattattgtaatttgtctactaatggagtggagcacccaatttcaa
atttcttgaactatgatggtttagctgattcacaccgagcatatatttgttctataacaaaatatgcaga
gcctacttctttcactcaagccaggaaatctgatgattggttaaaggcaatgaatgatgaattgaaggct
ctagaaggaacagcaacttggaagatatgttctttaccacctgataaacatgccataggctgcagatggg
tttataaagtgaagttaaacgcagatggaagtttagagcgttacaaggcacggttagttgccaagggtta
cacacagcaggagggtgttgactttgttgacactttttccccaatggcaaagatgactattgtcaaaaca
ttgttggttgttgcagcagcaaagaaatggagtttgcatcagttggatatatcgaatgcctttttatatg
gcgaccttgaagaagaaatttatatgactcttcctccgggttacacaactaaagaaggcgagactcttcc
acctaatgcagtctgtaagttgcaaaaatctctctatggtttaaaacaagcttcaagacagtggtttctg
aagtttagtaccactttaatgctattaggattccaaagatcacaggctgatcacactttgtttgtgagaa
atgtgaatgggaaatatatagcagtacttgtgtatgttgatgatattatcatcgcaagtaatgatgatgc
agaggttgttgaactaaaagcagacttggaaagagcttttaaactgagagatttgggtactttgaagtat
tttttgggcttggagatagctcgtaatgcttcaggtatttcagtttgtcaacgtaagtatgcgttgggat
tacttgaagaaacaggtttattggcttgtaagccatctaatattcctatggaaccaagtataaagttgat
atcggatggagatgagcctccgatggaagatccagcttcttacagacgcttagtgggtaaaatgatgtat
cttaccatcactagacctgacattacatatgcagtgaatagactttgtcagtttacttcagctccaaaag
aatcacatatgaaggcagctcacaaggttttacactatgttaaagggactgttggaacaggtctcttcta
ttcagctgattgtgatatgacattacaggcatatactgatgcagattgggcttcatgtcgtgatacaaga
cgttccacttctggcttctgtatgtttctaggcacatctttgatctcatggaagtcaaagaagcagcaga
ctgcatctcattcttcagctgagtctgagtatcgagcaatggaatttgcagttcgtgaggttgcttggct
tgttaatcttctcagagagtttcaagcacctcagctaaagtccgttgctttcttctgtgattcaactgca
gcaatacatattgcaaataatgcagtatttcacgaaagaaccaaacatgtggaacttgattgccacatcc
ttagagacaaggttatgagtggtttgattaagactttgcaccttaaaactgatcaacaggttgcagatgt
ttttaccaagcccttatttccgactcaattcaaggctcttgttggcaagatggctctccaatgaatatac
ttgccatcttgagggaggc1