;ID   ATCOPIA41I  DNA   ; ATH   ; 5262 BP
;XX
;DE   Internal region of ATCOPIA41 copia-like LTR-retrotransposon.
;XX
;AC   AC007261
;XX
;DT   31-AUG-2000 (Rel. 5.8, Created)
;DT   31-AUG-2000 (Rel. 5.8, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; endogenous retrovirus; COPIA superfamily; 
;KW   internal region; pol; env; ATCOPIA41LTR; ATCOPIA41I.
;XX
;OS   thale cress
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 5262)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (August 2000)
;XX
;RN   [2]
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Molecular paleontology of transposable elements from
;RT   Arabidopsis thaliana.
;RL   Genetica 107 (1-3), 27-37 (1999)
;XX
;CC   ATCOPIA41I is an internal region of the ATCOPIA41 copia-like
;CC   endogenous retrovirus flanked by 4% divergent ATCOPIA41LTR 
;CC   LTRs; 5-bp target-site duplication. 
;CC   It encodes env-like protein (position 4045-4899):
;CC   MFGAQSSDSLPEISRPTATECQEVKSDEALTEPNPDWALVLVKDPSPPLIIEVEDETLPSIDSVQNPKPK
;CC   DPEATPSQPSVASRLRKRKSSAADPRIKRMKQGKGVTGSSSFDGIRFVSTEAEKRYAQFSQRNFIEEVEL
;CC   SRKTNEAAREFIKQAGLIRTVTKFNPFTQNLVFEFWANLPTMKVDTYMVKVLVRNREYELSPGKINEMYG
;CC   LPSVDARQQRMDIAGLVDEQVAEFLTGGKVSVLSKLQVSAFTPTSLELFKLCCSNSSPTSNAGYAQPDRL
;CC   YITFL
;CC   Gag and RT portions of the polyprotein are damaged by multiple
;CC   mutations. 
;XX
;DR   Positions   5435   174  Accession No AC007261    GenBank (rel. 116.0)
;XX
;SQ   Sequence 5262 BP; 1597 A; 967 C; 1258 G; 1440 T; 0 other;
ATCOPIA41I
attggtatcagagctgacacctaacacaagtggtatctgttcaacaggtgagatcttgctacagggtaat
ggagaaatcacaacagtacgttacaatacaacagccattgaagttggatgctgagcattatggctactgg
aaggtgtctatccgataggctatccaaagtgctaaaatggatgcgtggttcgcagtggaagaaggatgga
aagctccgatggtcatggatgccaaggggttagaaataccaaaatctaagaaggattggactactgaaga
gaagacagctgctaagaacaactcaacggccttgtcaatcatcttcagatcattgccaatgagtcaattc
acgcatgtgcaagggagcacatcagcaaaggaagcatgtgacattctagaaacaacctttgagggaacca
gtaatgtgaagaggaccagattggatattttagcgtctgaatttgaaaacttgacaatggagaatgaaga
gtcgattgaggaattcagcagtcggctgagttcaatttcctagtaaactgtcgtcttgggaaaaacgtac
aaggacaagaagctggtcagcagtcggaaccaaggatttggataaacttttaaccatgggcagaacgtct
aatgttacgtggggtcttggatacaatggaggaaatacaaaaggtgagactcagtttgtcaaaggatcca
cttcagacgataaatctcaggtcaaacccacaaccactgttcggaatctgtttcgctctaagcctaaacg
agcatctcaggcttacaacttgagacaaacttacttcaaacctgagttcagaagcaagagaacaggatgc
tggtactgtggaggtctgagtcactacaaagcagactgctacaactacttgaaacatgttgttcagacta
ggaggaattatcaagtgggtgctcaaggacgacgagtcagacaagtgtatgtcaagaaagaacatctata
gtgtcatgttactcaaacatcagcgaatgcagaactcaaggatctgatgtggtgctttgatagtggatgc
tccagacacatgactggaacactagacaatcttgctggatataaggatgtgccatctaggaaagtcagat
ttggagatggaggtcatgccttgatcaaaggaaagggatatacttccggacaagccttgcctcatctaac
cgatgtatatcatgttgatggacttaaggcaaatctgatcagtatcagtcaactgtgtgacgatggtttg
agtgttttctttacacaaacggaatgcaaagcctttgacaaacatggatttgtaaagatggaaggacacc
gtgctgcaaataactgttacatgtggaatccagaagagtcatgctacactgcatcacagatcgatcaatc
tacttcagaagatcaatcgctagtgtctctgctagaatctgatcatatgggtgaagccatgagttcggtt
caggagtttaccatatacacaatgagctggcaggctccattcaagttcacaatcaacaaggtatgcaatt
gtttagcttggtatctgaccagtctgtatggcttacttgtctcaggtggttggtgggaaggtcagctgat
ctcaggaaacaagcaatctcagttccagttcctcaaacacacggttcacgtcatctcacaaccaatacag
tgtggaatctcattaacactagtgtcatgaatgagattcaagggggaagagtgatcttggattcttcgtg
tgttaacaattaggaagatgcaatggagtgggaactgaacttacttaagtctgtcaagctattgggatct
gatatctgcaatcagatcattgatgaagccggtgttgcaatccgtcttagacatctgctgttggctcagg
gctatattcggatggaagacgagtcaaagattgatacacatcttgcttgatgtgaatctatcaggtttct
gctaggatgtgtgtgtgcaacacagttctatgtttatcagatggtactccagaaagggatcttgagtagt
gaaccccagagggaatggtttgagagtcacccaagagcattcggtgttccgggacatccatactcggctc
atagattgaaaacagttgtctactgcttaaagcagcttccggagaaaatgtgttgtatggctcgacaaat
caaacacttgtacaagcttttgttgaacttattaccagagagtttgaagtaagtatgtacggagttctga
ggtacatgcagggcatacagattaagcaaactgatgagggcatatctatgtcgcagaatatatacgcaag
aacaatgattgagaagttcaaactggatgcacaagagattgtcacaacaccgatgaagatctcgacaaga
cttactgcagatgagagaggaggagatgtaaatgttagtatgtatcaggggatgatcgagagtcttcagt
acttgactgtgagtagaccagatatctgtcatgcagtgaatgtgtgtgctcagtatcaagtcaaccccaa
gatgtcgcacttgtcagctgtaagaagaattctcaagtatgtgaatggtataccaactttcgagctgtac
tacaccaaggacactgacaatagactgaagggatattgtgctgcggattgggcttgatctttggatgaca
tacggggatcacttagaggatgctattttgttggaaacaacatggtgtcttggaagagcatgaagcagaa
cagtcagcttctatcaactgctgaagcagagctaagttctttggagagttcgagttctcaacttgtacga
ctgaaacagctcatagaggagtgtggtatgatctctgattttccagtactgtattgcaataatcatagtg
ctatacattgtttccaagaacctgctaggaagtctcgtacaatgcacatagatcataaacataactatat
tcatgaattggttaaggagaaactaatagcaattgaacatgtgggtactaaagctcaactagctgatgtg
tttatcaaacctctgatgtctaacaaattatgtactctacgaatgttcattggaatgtttgaactataat
tgtgtttgagttgtgctgatctagaatagggacctgaagcaagaacaagtcatggaaatccaaagatgca
tagaggatcagcacttagcaaagtcatgtggaaaatagttgcatgtcatgccggttcaaatcagacgtaa
aaagttgtcgtctaacaggaggttaaagagctgagaagatcaacaaactgaatgaacttggtacacaaaa
aaaaaaaaaaatgtgtatcaagttcaccaggttcgaatatcaagcaattgagaagtcaagtgatattgct
gatataacaaaataagggagtaattcaaaagaattcatatcatcacgcccctgtttagtcaaactatctt
caaaggacattccagctgcctgttacacgttgatctgattcatgagagctaaaggcatgacagtgtgaaa
gactagccactgaataatgagctgattcatcaacgatctgattggttatgacagtttcatgcggaagtcg
tgaacacctgattggcacaaagcacattttcaagtactaggaagaggtaaactttgaaatcagtgttgga
aagcttttaccctcatgtgtcaattagctgtttaggtcactaaactagttcaggctataaaatgttcaat
gttgctctgaatgtgtttgaaatttaatcacgtgtttgattgaaaattctcttggcaaagaagcacttat
gtgtttgttaagtgtttcagaatattaaaacagtttgggcccaagcctgtttcaagcccacagaatttca
aggaggagtgctcgagtattggagggaaaaagaggagttagggtttgaggagtcaaatcaatttcttatt
tggaagtcgagaaggcaatcccctgtgttcaacgcaatttcgagtgagatcgagcatgaaatcacttctc
tccggtctgcaacgcctggttcaatagaaagccgagtccgcatcttctgcagacatgttcggtgcacaat
ctagtgattcgttgccagagatttcacggccgactgcaacagaatgtcaagaggtgaaatccgatgaagc
actcactgaacccaaccctgattgggctctagttctcgtcaaagacccatcgcctccactcatcatcgag
gttgaggatgagactctgccttcaatcgacagtgttcagaaccctaaaccgaaggatcctgaggctaccc
cctcgcaaccctctgttgcatctcgattgcgcaagaggaagtcatctgctgcggatccacgcatcaaaag
gatgaagcagggaaagggagttaccggttcatcttccttcgatggaattcgatttgtctctacggaggct
gaaaagaggtatgctcaattctctcaacgaaatttcattgaggaagtagaactgtctaggaagacaaatg
aggcagctagggagtttataaagcaggcaggactgattcgaacggttactaagttcaacccgttcactca
gaatctagtgtttgagttctgggctaatctgcccactatgaaggtagacacgtatatggtcaaagtcttg
gtgcgcaatcgggagtatgagctctcacctgggaagatcaacgagatgtatggtctcccttctgttgatg
ctagacagcagcggatggatatcgctggtctggttgatgaacaagtggctgaatttctcactggtgggaa
agtcagtgttctgagcaagcttcaggtgagtgcctttacacctacgagtttggaactgtttaaactctgc
tgctcaaattcgtctcccacatccaacgctgggtatgctcagcctgatcgattgtatattacatttcttt
gattgagtatttgatttttgactaagttctgtgctcaagcagggggagataaggccccttcggactccgt
tgatgctgggggagattaagttgtgggggagctgtgttgtttaagtttttaattgttataagcttcattg
tatttcagaccaaaacagtttcttttgtgtttttgtttaaaccgatgattatatcatcgtaaacatgggt
ctgtaatatttttaacacaatgtttactttcagactaatcatttatgctagatgatgtttctagagtatg
tgtgttatgtctccgatttgagtttcaggttatatcaataaaggagtgcttgagtatcaaactcagtcaa
aaagggggagat1