;ID   ATCOPIA48_I DNA   ; ATH   ; 4420 BP
;XX
;DE   Internal region of ATCOPIA48 copia-like LTR-retrotransposon.
;XX
;AC   AL161543
;XX
;DT   05-NOV-2001 (Rel. 6.2, Created)
;DT   05-NOV-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; internal region; 
;KW   copia-like polyprotein; reverse transcriptase; ATCOPIA48LTR; 
;KW   ATCOPIA48_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] (bases 1 to 4420)
;RA   Kapitonov,V.V. and Jurka,J.
;RT   Internal region of ATCOPIA48 copia-like LTR-retrotransposon.
;RL   Repbase Reports 1:(2) p. 2 (2001)
;XX
;CC   ATCOPIA48_I is an internal region of the ATCOPIA48 copia-like 
;CC   endogenous retrovirus flanked by the 98% identical ATCOPIA48LTR
;CC   long terminal repeats, and by the 5-bp target-site duplication 
;CC   (GAAGT). ATCOPIA48_I CDS, which encodes the ATCOPIA48p copia-like 
;CC   polyprotein, is interrupted by a non-LTR retrotransposon inserted 
;CC   recently in the genome. Presumably, the insertion has induced a
;CC   2629-bp duplication which flanks the insert. Both the non-LTR 
;CC   retrotransposon and a copy of the 2629-bp direct repeat were
;CC   removed from the ATCOPIA48_I sequence reported here.
;CC   ATCOPIA48p:
;CC   MERRSMELYTVPQLNISNCVTVTLTQQNYILWKSQFESFLSGQGLLGFVTGSISAPSPTIPVPDINGVTT
;CC   DRPNPEFDVWFKTDKVVKSWLLGSFAEDILSVVVNYVTAHEVWSTLANHFNRATSSRLFELQRRLQTLEK
;CC   KDKPMQVYLKELQTIYEQLASVGSPVPEKMKIFAALNGLGREYEPIKTSIEGSIDIPPTPKLDEIMPRLN
;CC   GYDDRLQAYAANSDVSPHLAFNTVQANSVFYTNRGRGQGNRRFGGSRGQGSFSTRGRGFHQXLSYNDSSS
;CC   NASAERPTCQICGKHGHHALNCWHRFDNSYQLDVLPQALPATQITDITDHSGSEWVTDSAATAHITNSPR
;CC   HLQQTKSYAGSDSVMVGNGNFLPITHTGSTSIGSTSGKLHLKDVLVCPLITKSLLSVSKVTKDYPCIFEF
;CC   DCDEVRVRDKETKKLLLQGSNRDGLYVLDEPKLLVFYSSRQVAASDEVWHRRLGHPNPHVLQQLSSTKSI
;CC   LINKHSKAICEACQSGKSSRLSFSASSFVASRPLERIHCDLWGPSPVMSVQGFRYYVIFIDNYSRYCWFY
;CC   PLKLKSDFYTIFAKFQALVQNQLQSKISIFQCHGGGEFTSKVFLNHLQEHGIQQYISCPYTPQQNGLAER
;CC   KHRHITDFGLSMLFQGKVPQKHWVEAFYTTNFLSNLLPHTALTDAKSPFELLNKKKPDYQALRIFGCACF
;CC   PTLRDYAQHKFDPKSLKCVFLGYNEKYKGYRCLLPTTGRVYISRHVIFDEHSFPFSDTYMHLQPTGVTPL
;CC   LSAWQQSFMPQTTASSTASATATSPFNAAEIQVSPVITSNNNTGASVLENGSSQLPIQNSSVLSTVASEE
;CC   SSECTESINLLPIGNSSSSLANRTDNADTSPLQEAATETNSSTVQEAAESTTSSTMQEPASNQSTHPMIT
;CC   RSKKGITKPNPRYGLLTHKVKYAEPKTVTEALKHPGWTAAMHEEYDNCTEAQTWSLVPYTSDMNVLGSKW
;CC   VFRTKLNADGSLDKLKARLVAKGFDQEEGIDYLETYSHVVRSATVRMVLHVATVMDWEVKQMDVKNAFLH
;CC   GDLTETVYMLQPAGFVNKEKPTHVCHLHKALYGLKQAPRAWFDKFSNYLLEFGFNCSIKDPSLFIYLKGN
;CC   DLILLLLYVDDMVLTGSNSATMIKLLEDLNTQFRMKDLGQMHYFLGTQAXFHENXXCLYTLSGLFLSQQK
;CC   YAEDLLTIAAMDECSPMPTPLPLQLHKVPHQEELFANPTYFRSLAGKLQYLTLTRPDLQFSVNFVCQKMH
;CC   QPTVSDYNLLKRILRYVKGTLSMGIHFSKHSDFQLRVYTEKDPAFSLRAYSDSDWGGCKDTRRSTGGYCT
;CC   FLGTNLISWSSKKQPTVSRSSTEAEYRSLSETAQEMTWICHLLRELGIPLPVTPELYGDNLSSVYLTANP
;CC   AFHARSKHFEFDYHYVRERVALGSLVVKHIPAHQQIVDIFTKSLPYEAFCNLRFKLGVDLPPTPRLRG
;XX
;DR   Positions  16028  11546  Accession No AL161543    GenBank (rel. 124.0)
;XX
;SQ   Sequence 4420 BP; 1228 A; 991 C; 870 G; 1331 T; 0 other;
ATCOPIA48_I
tggtatcagagctcatggaaagaagatccatggagctctatactgttcctcaactcaacatttcaaattg
cgttacagtcactcttacgcagcagaactatattctgtggaagagtcagttcgaatctttcctttctggt
caaggcttgcttgggtttgtcactggatctatttctgcaccgtcaccaaccattcctgttccagatatca
atggtgtcaccacagacagaccaaatccagagtttgatgtttggttcaagacagacaaggttgtcaagtc
ttggcttctagggtcctttgctgaagatatcctgagtgttgttgtgaactacgtcactgctcatgaggta
tggtctactcttgcaaatcacttcaatagagctacttcatctaggctatttgagcttcaaaggcgtttac
aaactctagaaaagaaagataaacctatgcaagtctatcttaaggagttacaaaccatctatgaacagtt
agcttctgtagggagtccagttcctgagaagatgaaaatctttgctgctcttaatggtctaggtagggaa
tatgagcctatcaaaacaagtattgaaggttccattgatattcctcccactcctaagcttgatgaaatca
tgcctagactcaacggttatgatgatagacttcaggcttatgcagcaaactctgatgttagtcctcatct
agctttcaatacagttcaagctaactctgtcttctacaccaaccgcggcagaggtcaagggaaccgtcgg
tttggtgggtcacgaggccaaggttccttctccactcggggtcgtggcttccatcagtagctctcataca
atgattcttcttccaacgcctctgcagaacgtccaacatgtcagatttgtgggaaacatggtcatcatgc
tctcaactgctggcacagatttgataatagttatcagcttgatgtgttaccacaggctctcccagcaacg
cagatcacagacattactgatcactctggcagcgaatgggtcacagacagtgctgctactgctcacatca
ccaactcgccacgtcatctgcaacagacaaagtcctatgctggttctgattctgtaatggttggcaatgg
gaattttctacctatcactcataccggttctacaagtattggttctacttcaggtaagcttcatcttaaa
gatgtattggtttgtcctctaattactaaatctttgttgtctgtgtcaaaagtcacaaaggattatccct
gcatttttgagtttgattgtgatgaagttcgtgtgcgtgataaggaaaccaagaagcttcttcttcaggg
aagtaatcgagatggactctatgtgctggatgaaccaaagcttctggtgttctactcttctcgccaagtt
gcagcgtctgatgaagtttggcacagacggttaggacacccaaatccccatgttctccagcagctatcct
caacaaagtccattcttattaataaacacagcaaggctatttgtgaagcatgtcagtctggtaaaagctc
aagactgtcattctctgcatcatcttttgttgctagtagacccttagagagaattcattgtgatctctgg
ggtccttctcctgttatgtcagttcaaggattcagatactatgttatatttattgacaattactctagat
attgctggttctatcctctcaagttaaagtctgatttctacacaatctttgcaaagtttcaagctttggt
tcagaatcagctacaaagcaaaatctcaatttttcaatgtcatggagggggagaattcaccagtaaggtc
tttctcaatcatcttcaagaacatgggattcaacaatacatctcctgtccttacactccccaacagaatg
gtcttgctgaaagaaaacacagacacattactgattttggtttatctatgctttttcaaggcaaagtccc
tcaaaaacattgggtagaagccttttatactacaaactttctcagtaatctgcttcctcatactgctctt
actgatgctaaaagtccttttgagctgttaaacaagaagaaaccggattatcaggccttgagaatctttg
gatgtgcttgttttccgacactccgagattatgcacaacacaagtttgatcctaagtccttaaaatgtgt
gttcttgggctacaatgaaaaatataagggctaccggtgtcttcttcctaccacaggcagagtctacata
agtcgtcatgtcatttttgatgaacactccttccctttctcagatacttatatgcatctgcaacccactg
gtgttacacctttgctctctgcctggcaacaaagttttatgcctcagacaactgcttcttctacagcctc
tgcaactgcaacctctccattcaatgctgcagagattcaggtttctcctgtgattaccagtaacaacaac
acaggcgcttcagtgttagaaaatggttcatctcagctgcctatacagaattcatcagtcttgtctactg
tggctagtgaagagagttctgagtgtacggagagcatcaatctcttacctattggcaatagctcttcttc
acttgctaacaggacggataatgctgatacttctcctcttcaagaagctgcaacagaaacaaacagctct
actgtgcaagaagctgcagaatcaacaactagctctacaatgcaagaacctgcttcaaatcagtctactc
atccaatgataacacggtctaagaaaggtattacaaagcctaatccacggtatggacttcttacacacaa
agttaaatatgcagaaccaaaaacggttacagaagctcttaaacacccgggatggactgctgcaatgcat
gaagagtatgataattgtacagaagcacaaacgtggagtctagttccgtatacttctgatatgaatgtcc
ttggaagtaaatgggtgtttcgaaccaagttaaatgcagatgggtctttggacaagttaaaagctcgctt
ggttgcaaaaggatttgatcaagaagagggaattgattacttagaaacatacagccatgtggtaaggtct
gcaacagtcagaatggtgcttcatgttgcaacagtcatggactgggaagtgaaacaaatggatgtgaaga
atgcatttcttcatggggacctcactgaaactgtttacatgcttcaaccagctggctttgtgaataagga
aaaacctactcatgtctgccatctacataaagctctttatggtttaaaacaggctcctcgggcttggttt
gacaaatttagcaattacttgcttgaatttggtttcaactgcagtattaaagatccatctctgttcattt
atttaaaagggaatgatcttatactcttacttctttatgttgatgacatggttttaacaggtagtaactc
tgcaactatgatcaagctgcttgaggatttgaatacacaatttcgtatgaaagatcttgggcaaatgcat
tattttcttgggacacaagcttaatttcacgagaattgatgatgtttatacactctttcaggcctatttc
tatctcagcagaagtatgctgaagaccttctcaccattgcagcaatggacgaatgctccccaatgccaac
tccactgccacttcagcttcacaaagttcctcatcaagaagaactctttgctaatccaacttatttccgg
agtcttgcagggaaacttcagtacttgacattgacaaggccggatcttcagttttctgtaaactttgtgt
gccaaaagatgcatcaaccaacagtttcagattacaatcttctcaagcggattcttcggtatgtcaaagg
aactctatcaatgggaatccacttctccaagcactctgatttccagctccgggtttacactgaaaaagac
cctgcttttagtcttcgtgcttacagtgatagtgactggggtggttgtaaagatactcgtcgttccacag
gaggctattgcacatttcttggcactaacctcatctcctggtcgtctaagaagcagccgactgtttctcg
aagctcgactgaagcagaatatcgatcactctcagaaacagctcaggaaatgacttggatctgtcattta
cttcgagagcttggcatacctcttcctgtcacacccgagctctatggagacaacttatcttctgtgtacc
ttactgccaatcctgccttccacgctcgtagcaaacactttgaattcgactatcattatgtccgtgaaag
agtcgccttgggatctttggtcgtgaaacacattcctgctcatcaacaaattgttgacatcttcacgaag
tctttgccttatgaagctttctgtaatctcaggttcaaacttggtgtggatttaccacccacaccgcgtt
tgagggggag1