;ID   ATCOPIA44_I DNA   ; ATH   ; 4190 BP
;XX
;DE   Internal portion of the ATCOPIA44 copia-like LTR-retrotransposon.
;XX
;AC   AL133315
;XX
;DT   02-SEP-2001 (Rel. 6.2, Created)
;DT   02-SEP-2001 (Rel. 6.2, Last updated, Version 1)
;XX
;KW   LTR-retrotransposon; COPIA superfamily; pol; reverse transcriptase; 
;KW   ATCOPIA44LTR; ATCOPIA44_I.
;XX
;OS   Arabidopsis thaliana
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
;OC   euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
;OC   Rosidae; Capparales; Brassicaceae; Arabidopsis.
;XX
;RN   [1] 
;RA   Choisne,N., Robert,C., Brottier,P., Wincker,P., Cattolico,L.,
;RA   Artiguenave,F., Saurin,W., Weissenbach,J., Mewes,H.W., Lemcke,K.,
;RA   Mayer,K.F.X., Quetier,F. and Salanoubat,M.
;RL   Direct submission in GenBank (December 1999)
;XX
;RN   [2] (bases 1 to 4190)
;RA   Kapitonov,V.V. and Jurka,J.
;RL   Direct submission (August 2001)
;XX
;CC   Two copies of ATCOPIA44LTR flank the ATCOPIA44_I internal sequence,
;CC   they are 2% divergent from each other. 
;CC   ATCOPIA44_I encodes the 1349-aa ATCOPIA44p copia-like polyprotein
;CC   (it is disrupted by one stop codon emerged after integration of
;CC   ATCOPIA44 in the genome).
;CC   ATCOPIA44p:
;CC   MAMSSKVEIKTFNGDRDFSLWKIRIEAQLGVLGLMNTLTDYSLTKFVPVPKSEGKKPETDEESSPTEEVP
;CC   DLIKIEQSKQAKNIIINHITDAVLLKVQHCVSAADMWATLNKLYMETSLPNRIYTQLRLYSFKMLETMSI
;CC   DQNIDQFLRIVAELGSLQIVVAEEVQAILILNSLPVSYIQLKHTLKYGNKTLCVQDVVSSAKSLEHELAE
;CC   SKESERGSSTVLYTTERGRPQNRSQQQGNKGKGRSRSNSKTKVTCWFCKKEGHVKKDCFAKKRKLESEGP
;CC   GEAGVIIEKLEVSEALNIGDRLVKDMWVLDSGCTSHMSSRRDWFSDFEENDGTTILLGDDHTVKSQGQGS
;CC   IRIKANGGSIRILKNVKYVPNLRRNLISTGTLDKLGYHHEGGDGKVRYHKNNATALVGRLINGLYVLDGE
;CC   TIMSESFNAEDTKSSTELWHSRLGHMSLNNMKILAGKGLLQKNDVKELEFCEHCVMGKSKKLSFNVSKHI
;CC   TEEALGYVHADLWGSPNVTPSLLGMKYFLSIVDGKTRKVWLMFLKSKDETFDRFCEWKELVETQV*KKVK
;CC   VLRTDNGLEFCNSKFEDYLKKFGIERHRTCAYTPQQNGVAERMNRTLMEKVRCLLSESGLEEIFWAEAAS
;CC   TVAYLVNRSPAFAVDHNVPEELWLNRKSGYKHLRRFGSVAYVHQDQGKLKPRALKGVFLGYPQGTKGYKI
;CC   WLLEEMKCVISRNVIFHEDLVYKDLQLKEKSEQEERAEKITQAEKTVSEIVSNQQQVGESSVAGGTVDVS
;CC   SSDDESEYFEPEGEAPASSERLSNYQLARDRVRRQIRAPIRFSDYSQFAYALMAAEDMDSSEEPSCYHEA
;CC   KETKEWEKWNAGMGDEMQSLLKNYTWDIVDHPKNQKIISCRWLYKKKPGIPGVEPERYKARLVARGFTQR
;CC   KGIDYDEVFAPVVKHVSIRILMSIVVQEDLELEQMDVKTAFLHGDLDQPLYMEQPEGYVADEQKDQVCLL
;CC   KKSLYGLKQAPRQWNKKFNSFIMDQNFIRSGHDSCVYIKQVSDEEFVYILIYVDDMLIAAKSMTEINKIK
;CC   EALSTGFEMKDMGAASRILGIDIIRDRKAGTLRLSQTGYLEKVLHMFNMTEARPVSTPMGAHFKLASVVE
;CC   EEECVDTDKVPNSSAIGSIMYAMVGTRPDIAQAIGVLSRFMSKPGKIHWTAVKWLLRYLKGSTDLNLVFT
;CC   REKDFRVQGFSDSDYAADLDRRRSTTGYVFTVGGNTVSWKSNLQSIVALSTTEAEYVALTEAVKEALWIQ
;CC   GLLTEMGFKQEKVTLWCDSQSAISLAKNNTFHERTKHIAIKFNFIRDVIEEGSVEVLKIHTSQNPADMLT
;CC   KGIHVQKFESALEFLKLLR
;XX
;DR   Positions   19000   14811  Accession No AL133315  GenBank (rel. 124.0)
;XX
;SQ   Sequence 4190 BP; 1392 A; 668 C; 1025 G; 1105 T; 0 other;
ATCOPIA44_I
aagtggtatcagagccttggtttctgaattgaggaggtttgatgcttcgatagcgttaatggcgatgtcg
tcaaaggtagagatcaagacatttaatggagatagagacttttctctatggaagattcggattgaagcac
aacttggagttctgggtttgatgaacactttaacggattactctttgacaaagtttgttccagtcccaaa
gagtgaaggaaagaaacctgaaaccgatgaagaatcatctccgactgaagaagttccagatctgatcaag
attgaacaatcgaaacaagctaagaacatcataatcaatcacattactgatgcggttcttcttaaagttc
agcattgtgtatctgcagctgatatgtgggcaacgctaaacaagctctacatggaaacatctctgcctaa
caggatctatactcaacttagactttactcattcaagatgcttgaaacaatgagtattgatcagaacatt
gatcaattcttaagaattgtggctgaacttggcagtctgcagattgtagttgctgaagaagtgcaagcaa
tcttgatcttgaattcattgcctgtgagttatatccagttgaagcacactttgaagtatggtaacaagac
tctctgtgtgcaggacgttgtatcatcagctaagtcattggaacatgaacttgctgaatctaaagagtct
gaaagaggctcttcaactgtgttgtatacaactgaaagaggtagaccccaaaacaggtctcagcagcaag
gaaacaaagggaaaggcaggagcagatctaattccaaaacaaaggtcacctgctggttctgtaaaaagga
aggtcatgtgaagaaagattgttttgctaaaaaaagaaaactggaaagtgaaggtccaggagaggctggt
gttatcattgagaaactagaagtttctgaagccttaaacattggtgacagattggtcaaggacatgtggg
tactagactctggatgcacatcacatatgtcatcaagaagagactggtttagtgattttgaggagaatga
tggcacaacaattcttcttggtgacgatcacacagttaagtctcagggacaaggttctattcggattaag
gcaaatggtggatcaatcagaatcttgaagaatgtcaagtatgtgcctaatctcaggcgaaatctaattt
caacaggcactctagataaactgggatatcaccatgaaggtggagatggtaaagtgagataccataagaa
caatgcaactgcattagttggacgtttaatcaatggactgtatgttctggatggagagaccattatgtct
gagagctttaatgcagaagacactaaaagcagtactgaattatggcatagcagacttggccatatgagtt
taaacaacatgaagatactggctggaaagggactgctacaaaagaacgatgtcaaagaactagagttctg
tgagcactgtgtaatgggaaaatccaagaagcttagcttcaatgtcagcaagcacatcacagaggaagct
ctaggatacgttcatgcagacctatggggctctccaaatgtaactccatcactcttaggtatgaaatatt
ttctgtctattgttgatggtaagacaagaaaggtttggcttatgtttcttaaatctaaagatgagacatt
tgaccgtttttgtgagtggaaagaacttgttgaaacacaggtgtgaaagaaggttaaagtgctcaggaca
gataatggattggagttttgtaattccaaatttgaagattacctcaagaagtttggtattgaaaggcaca
ggacatgtgcttatacccctcagcagaacggtgtagcagagagaatgaacagaactcttatggaaaaagt
gagatgtcttttgagtgaatcaggtcttgaagaaatattctgggctgaagctgcttcaactgttgcatat
ctggtgaacaggtcacctgcttttgcagtggatcacaatgtacctgaagagttgtggttaaacaggaaat
ctgggtacaagcatttaaggaggtttgggtcagttgcttatgtacaccaagatcaaggaaagcttaaacc
aagagctttaaaaggtgtgttcctcggttatccgcaaggcactaagggatacaagatctggctcttagaa
gaaatgaaatgtgttatcagtcgaaatgtgatatttcatgaggacttggtgtataaggatttgcagttaa
aagagaagtctgaacaagaagaaagagcagagaagattactcaagcagaaaagactgtctctgaaatagt
aagtaaccaacagcaggttggtgagagttctgttgcaggtggaacagttgatgtttcatctagtgatgat
gagtcagagtattttgaacccgaaggagaagctccagcaagcagtgaaagactgagcaattatcagttag
ctagagatcgggttagaaggcaaatcagagcacctataagattctctgattactctcaatttgcatatgc
tcttatggcagctgaagatatggacagcagtgaagaacctagctgttatcatgaagctaaagaaaccaaa
gagtgggaaaaatggaatgcaggaatgggagatgaaatgcagtcactattgaaaaactatacatgggata
tagtagaccatcccaagaatcagaaaatcatcagttgtagatggctgtacaagaagaaaccaggaattcc
cggtgtggaacctgaaagatacaaagccagactagtagcaagaggctttactcagagaaaaggaatcgac
tatgatgaagtgtttgcacctgtagtcaagcatgtgtcgataagaatcttgatgtctattgttgttcaag
aagatctagaattggaacaaatggatgttaagactgcgttcttgcatggtgatctggaccagccacttta
catggagcaacctgaagggtatgttgctgatgaacaaaaggatcaagtgtgcttgttaaaaaagtcactc
tatgggttaaaacaagcaccacgtcagtggaacaagaagtttaactcttttatcatggatcagaacttta
tcagaagtggtcatgattcgtgtgtttacataaaacaggtgagtgatgaagagtttgtgtatatactgat
atacgttgatgacatgttgatagcagctaagtcaatgactgaaatcaacaagatcaaagaggcgttgagt
acaggatttgaaatgaaggatatgggtgcagctagtcgaatactgggaatcgacattataagagacagaa
aagcaggtacattgcggttgtctcagacaggatacttagagaaagtgcttcacatgtttaatatgactga
agcaagacctgtaagcacacctatgggagcacatttcaaacttgcctcagtagttgaggaagaagagtgt
gtggacactgataaagttccaaactcaagtgctattggcagcatcatgtatgccatggttggcaccagac
cagatatagctcaagctattggagttctaagcaggtttatgagcaaaccaggtaagattcattggactgc
agtgaaatggttgctaagatacttaaaagggtctacagatttgaatctggttttcaccagagaaaaggat
ttcagagttcaaggctttagtgactctgactatgcagcagaccttgacaggaggcgttcaacaacaggtt
atgtattcactgttggtggaaatacagtaagttggaagtcaaatctgcagagcatagtggctttatcaac
cactgaagcagagtatgttgctttaactgaagcagtgaaagaagctttgtggattcaggggttgttaaca
gaaatgggattcaagcaagagaaggttactttgtggtgtgactcacagtcagcaatcagtttggcaaaga
acaacacgttccatgaaagaactaaacatattgcgatcaagttcaactttatcagggatgtgattgaaga
aggaagtgttgaagttcttaagatccatacttctcagaatcctgcagacatgcttaccaaaggcattcat
gtgcagaagtttgagtcagctttagagtttctaaagctactcaggtgaggtggaaagacatctcagccga
aggtgaaatccaagtaagtgcaagtccactagttatggagagatttgaatcaaggtggag1