;ID   ATCOPIA8AI  DNA   ; ATH   ; 4487 BP
;XX
;DE   Internal region of ATCOPIA8A LTR-retrotransposon.
;XX
;AC   AC005171
;XX
;DT   12-APR-1999 (Rel. 3.2, Created)
;DT   12-APR-1999 (Rel. 3.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   ATCOPIA8ALTR; ATCOPIA8AI.
;XX
;OS   thale cress
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Charophyta/Embryophyta group;
;OC   Embryophyta; Tracheophyta; euphyllophytes; Spermatophyta;
;OC   Magnoliophyta; eudicotyledons; Rosidae; Capparales; Brassicaceae;
;OC   Arabidopsis.
;XX
;RN   [1] 
;RA   Rounsley,S.D., Lin,X., Kaul,S., Shea,T.P., Fujii,C.Y., Mason,T.M.,
;RA   Shen,M., Ronning,C.M., Fraser,C.M., Somerville,C.R. and Venter,J.C.
;RT   Arabidopsis thaliana chromosome II BAC T4E14 genomic sequence
;RL   Unpublished
;XX
;RN   [2] 
;RA   Rounsley,S.D. and Lin,X.
;RT   Direct Submission
;RL   Submitted (23-JUN-1998) The Institute for Genomic Research, 9712
;RL   Medical Center Dr, Rockville, MD 20850, USA, rounsley@tigr.org
;XX
;RN   [3]  (bases 1 to 4487)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (March 1999)
;XX
;RN   [4]
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Molecular paleontology of transposable elements from
;RT   Arabidopsis thaliana.
;RL   Genetica 107 (1-3), 27-37 (1999)
;XX
;CC   ATCOPIA8A is a copia-like retrovirus; its ORF encoding copia-like 
;CC   polyprotein is not interrupted by any stop-codons; its LTRs are
;CC   identical. We presume that ATCOPIA8 may be an active retroelement [3].   
;CC   ATCOPIA8A is 80% identical on the DNA level with ATCOPIA8B element.
;CC   Since ATCOPIA8B is flanked by 99% identical LTRs and has preserved
;CC   almost perfect ORF, we may say that both divergent forms, ATCOPIA8A
;CC   and ATCOPIA8B have infected A.thaliana approximately at the same
;CC   time [3]. Originally reported [1-2] LTR, internal region and 6 bp- 
;CC   long target site are not correct [3]. ATCOPIA8, as a copia-like
;CC   retroelement, has 5 bp-long target site [3]. 
;CC   343 bp-long 3' tail of ATCOPIA8AI is a perfect copy of the sequence
;CC   followed by the ATCOPIA8A provirus. Based on the known LTRs from
;CC   other ATCOPIAs, we consider this sequence as a part of the internal
;CC   region [3].
;XX
;DR   Positions   64373   59887  Accession No AC005171    GenBank (rel. 109.0)
;XX
;SQ   Sequence 4487 BP; 1239 A; 1046 C; 876 G; 1326 T; 0 other;
ATCOPIA8AI
tggtatcagagctcatggaacatagatccatggaactctactctgttccttctctcaatatttcaaattg
tgtcaccgtcactcttactgccaagaactatattctctggaaatctcagttcgaatcttttcttgatggt
caagggcttctgggttttgtcacaggctcgattcctgcaccaagccaaaccagcgttgtttcagacattg
atgggtcaacatcggcttcacccaatcctgagtactacacctggttcaagacagatcgagttgtcaaatc
ctggctccttggttctttcttagaagatattctcagtgttgtggtcaactgcaacacttctcatgaggta
tggatctctgtagcaaaccactttaatagggtttcctcctctaggttatttgagctgcaaagacgcctcc
aaaatgttagcaaacgtgataaatctatggatgaatatcttaaggaccttaagactatttgtgaccaact
agcctctgtaggaagtcctgtaacagaaaagatgaagatttttgctgctttgaatgggttagggcgagag
tatgagcctatcaaaaccactattgaaaactccatggatgctctgcctggtcctagtctagaggatgtta
ttcctaagcttacaggctatgacgatcgacttcagggttatcttgaagagacagcagtttcacctcatgt
cgccttcaacatcaccacttcagatgactctaatgcttctggttacttcaatgcttacaatcgtggcaaa
gggaaatctaacagaggcagaaactcatttagtactcgtggtcgtggtttccatcagcagatatcttcaa
caaacagttcttcaggttctcagtctggtggtacttcggttgtctgtcagatttgcgggaaaatgggtca
tccagctcttaagtgttggcatcgcttcaacaacagctaccaatacgaagagcttcctcgtgccttagct
gcaatgcgcatcactgatattacagatcaacatggcaatgaatggcttccagattctgctgcaactgctc
atgtcacaaacagtcctcgatccctgcaacaatctcaaccataccatggatctgatgctgtcatggttgc
tgatggtaattttctgcccattacacacactggctcaacaaacttggcttcctcatcaggtaatgtccct
ctcactgatgttcttgtttgcccaagcataacgaaatctctcttgtctgtgtctaagcttactcaagatt
atccatgtactgttgagtttgactctgatggtgtgcgtatcaatgataaggcaaccaagaagcttctcat
aatgggaagcacttgtgatggtttatattgtctgaaggatgactctcagttcaaggctttcttctccact
cgtcagcagtccgcaagtgatgaagtgtggcacagacgccttggacatcctcatcctcaagtcctgcagc
aactggtcaagaccaactctatctctatcaataagacttctaagtcactctgtgaagcatgtcagcttgg
gaagagcactaggctgccatttgtttcttcttcatttacttcaaatagacctcttgagagggttcactgt
gacttgtggggaccctctccaattacttctgtacaaggctttagatattatgcagtctttattgatcatt
attctcgattcagttggatttatcctttgaagctcaagtcagatttctacaatatctttgttgcatttca
caagctagttgaaaaccaacttaatcataagattagtgtgtttcaatgtgatggtggtggagagtttgtc
aatcataagtttctgcaacatcttcagaatcatgggatacaacaacacatctcatatcctcacactcctc
aacaaaatggactagcagaaaggaagcacaggcatttggtggaattaggcttatctatgctgtttcagag
taaagtccctcttaagttttgggttgaagccttcttcactgccaactttctgattaatcttctccctaca
tctgctgttgaggatgctatttcaccttatgaaaagctgcatcagacgactccggactatacagctctca
gatctttcggttgtgcttgttttccaactatgcgtgactatgctatgaacaaatttgaccctcgctctct
taagtgcgttttcctggggtacaatgacaaatacaaggggtataggtgtttatatccacctacaggacgg
gtttatataagcaggcatgtgatttttgatgaaacagcttatcctttctctcatcactataaacaccttc
attcgcagcctacaactccattacttgcagcatggttcaaggggtttgaatcctccgtgtctcaggcacc
accaaaagtgtctccagcacaaccaccacagagaaaggcaacactacccacgcctcctctttttactgct
gctgattttcctcctttaccacggagaagccctcagttgtctcagaattctgctgctgcacttgtgtctc
aaccttcaacaacaacaatcaattcaactcatccacctgctgtggtgaatgagagttctgagcgtacgat
aaacttcgattctgcttctattggcgacagctctcactcatcccagcttttggtggatgacactgtagaa
gatctcatggcagctccagttcctactcaacaagctccacctcctactaacactcacccaatgatcacaa
gagctaaggtgggaatcacaaaaccaaatcctcgttatgtttttctgtctcacaaggttacttatcctga
gccaaagacagtaactgcggctttaaagcatccgggttggacaggcgccatgacagaagaaatgggcaac
tgttctgaaactaatacatggtccctggtgccatacacacctaacatgcatgttcttggaagcaaatggg
tcttccggactaaactccacgctgatggaaccttaaataagctcaaggctcggatagttgcaaaatgttt
tcttcaagaagaagggattggctatcttgagacctacagtcctgtagtaagaacacctacagtccaattg
gttctccatttggctactgctttgaactgggagttgaagcaaatggatgttaaaaacgccttcttacatg
gcgatctaaatgagactgtttacatgactcagcctgctggttttgttgataagagtaaaccaactcatgt
ttgcttgcttcacaaatctatctatggtttaaaacagtccccaagagcttggtttgacaagttcagcacc
tttctcttggagtttggatttttctgcagtaaatctgacccttcactattcatctatgctcataataata
acctcattctgcttcttctttatgttgatgacatggtgattacaggaaatagttcacagacattatccag
tcttctagcagctctcaacaaagaattcagaatgactgatatgggacaactccactacttcctgggaatt
caggttcagcgaaatcaacacggcctgtttatgtctcagcagaagtatgctgaagatctcttggtggctt
ctgcaatggagaactgcactcctctgccaactcctctacccgttcagcttgacagagttccacaccaaga
agaacctttcactgatccaacttatttcaggagtattgctggaaagctccaatatctcaccttgactcgc
cctgacatacattttgctgtaaacttcgtgtgccaaaagatgcaccagccaacaatgtcagactttcatc
ttctgaagcggattctaaggtacataaaaggtaccatcactatgggaatctcttacaatcaaaattctcc
tactcttttgcaagcttacagtgacagtgactggggcaactgtaagctcacaagacgctctgttggtggc
ctctgcacctttatggccacgaacctggtgtcatggtcgtcaaagaaacatccaactgtctcccgaagct
ccacagaagctgaataccgcaccttatctgatgctgcctctgagatcctctggctgagcactcttctccg
tgagcttggcattcctctcccagatactcctgaattgttttgtgacaacttgtctgcagtctaccataca
gctaatccagcgtttcatgcaatgattgataaagcaacaaaaagtaccaactttcgagtgctatatccaa
atggtgacagtattttgacgtggcatgaaaatgcataacaagacagcacagaaaaattaatatttatact
cagagacaaagaagatgatgcaaggttcatgagtttacccggagtaaccggaacatcagagaatatgaga
tggatctttggaggagaatcgaaattaaatctacgaaatgaaaagagtcaaaaatgtcagaattttgtga
atatataatatcagaaaggaacatttttcatttctttacagtaaagaagatgaatcatgaaacaagactt
tgggaat1