;ID   ATLINE1_1   DNA   ; ATH   ; 5851 BP
;XX
;DE   ATLINE1_1, non-LTR retrotransposon - a consensus.
;XX
;AC   .
;XX
;DT   15-DEC-2000 (Rel. 5.9, Created)
;DT   15-DEC-2000 (Rel. 5.9, Last updated, Version 1)
;XX
;KW   non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2; 
;KW   reverse transcriptase; ATLINE1_1.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 5851)
;RA   Kapitonov,V. and Jurka,J.
;RL   Direct submission (December, 2000)
;XX
;CC   ATLINE1_1 is a non-LTR retrotransposon. Its individual copies are 
;CC   99% identical to each other. There are only a few copies of 
;CC   ATLINE1_1 present in the genome. ATLINE1_1 belongs to the L1
;CC   superfamily of non-LTR retrotransposons, its copies are flanked
;CC   by ~15-bp target site duplications. 
;CC   Two proteins, ATLINE1_1p1 and ATLINE1_1p2, are encoded by ORF1 
;CC   (position 17-1609) and ORF2 (position 1652-5782) respectively. 
;CC   Function and classification of the first protein is unclear, although
;CC   it is expected to be a nucleic acid binding protein analogously to
;CC   the protein encoded by L1 in mammals. The second protein contains 
;CC   the reverse transcriptase and endonuclease domains.
;CC   ATLINE1_1p1 (530 aa):
;CC   MESGARVLGEKAQDATMTDVGEKGRPPGDPPDKSSSWANKVRGGHVGGMLAPTDVLSDEFVRARLSLTFP
;CC   DGEDGEPLIIIGLEVFEAMNSLWKNCMLVKVLGRSVPIAVLSKKLRELWKPIGAMHVVDLPRQYFMVRFE
;CC   SEEEYLTALTGGPWRVFGSYLLVQAWSPDFDPMKDEIVTTPVWVRLSNIPLNLYHPSILMGITGGLGNLI
;CC   KVDMTTLTCERARFARVCVEVNLRKPLKGTVMINEDRYFVAYEGLTNICSGCGLYGHLVHNCPQVKQSNV
;CC   VKATQSIVAVETSLAPVSQSVDGFTTVGQSGRRGTKQPATVVFAAGGTRSGLGKSQRDLGKKSDSANISV
;CC   TNSFGSLLTDMEGTDLSADVVELEGNKENEEILIQSRNEKKVIHGNVVPLGENSKRANEGARIGKKDKRN
;CC   GLKKVAVSNGPRPNQMNHVRPTRGLVFGPTRGEMERSFNGKRLRIEEKNPGRPGGVFTQTGDGSSNVENS
;CC   GQGRVVKNMVGQLDSNQSTMPMEETMRSVPQGSGNGNMVA
;CC   ATLINE1_1p2 (1376 aa):
;CC   MNCLLWNFQGANKPHFRRSIRYIVKKFPTEILAIFETHASGDRAGSICQGLGFDKSFRVDAVGQSGGIWL
;CC   LWRSEIGEVTIVQSTDQFIFATVDTGDEVLNLIVVYAAPSISRRSGLWDQLRDVIREAIGPIVIVGDFNS
;CC   IVRLDERTGGNGQLSPDSLAFGDWINTSSLIDMGFRGNKFTWKRGKTESNFVAKRLDRVLCCAHSCLKWQ
;CC   EAIVTHLPFLSSDHAPLYVQLSPTVRGDPRRRPFRFEAAWLSHEGFKELLNVSWNRNLSTPIALQELQKI
;CC   LKKWNKEVFGDIQQRKEKLVVEIKEVQDLLDVTQTDELLQKEEQLIKEFDIVLEQEETLWYQKSRERWIV
;CC   LGDRNTTYFHTSTVIRRRRNRIEMLKNNEDQWVSDSRELEXLALDYYSKLYSLDDVEPVVTKLPPEGFMS
;CC   LSQADKTELLRCFSAGEVEKAVRCMGKFKAPGPDGYQPVFYQECWEVVGESVVKLVLEFFETSVLPSRLN
;CC   DALVVLLPKVGKPEKITQFRPISLCNVLFKIITKTMVERLKPLMTNLIGPAQASFIPGRVSTDNIVLVQE
;CC   AVHSMRRKKGVKGWMLLKLDLEKAYDTIRWDYLEDTLISAGFPEVWVRWIMCCVSGPEMSLLWNGERTDS
;CC   FKPLRGLRQGDPLSPYLFVLCMERLCHLIERSIDNKQWKPISLSQGGPKLSHICFADDLILFAEASVMQI
;CC   RVIRKVLETFCIASGQKISLEKSKIFFSGNVSRDPSKLISEESGIKATNDLGKYLGMPVLHKRINKDTFS
;CC   ELLEKVSSRLSGWKERTLSFAGRMTLTKAVLSSIPVHTMSSIALPQSTITRLDKASRSFLWGSTAEKRKQ
;CC   HLVSWKKVCLPKKDGGLGIRNAKLMNKALIAKVGWRLLQDQSSLWAEVFRKKYKIGDLRDCQWLRKKGTW
;CC   SSTWRSIVTGLVDVISRGTCWVPGDGHHIRFWRDIWVSGKPLWEDGQGVVPANLETESVRSLWKDDSGWD
;CC   ISRISPYVSESRCLELRAIVLDHETVARDRISWNESQNGQFSVSSAYNMLSWDDSPRPNMEKFFNRIWRV
;CC   KVPERVRAFFWLVVNQGIMTDAERYRRHIGESEICQICKGGVETTLHILRDCPAMSGIWTRIVPQRKRRA
;CC   FFDKTLFEWVFDNLHEETPFQESSWSTVFAMAVWWGWKWRCGNIFGEKRKCRDRVQFIKDVAKEVFVANA
;CC   RATVLNGIPTRVERQIGWVAPSTGWYKVNTDGASRGNPGLATAGGVIRDGAGNWCGGFALNIGRCSAPLA
;CC   ELWGVYYGLYLAWTKALTRVELEVDSELVVGFLKTGIGDQHPLSFLVRLCHGLLSKDWIVRITRVYREAN
;CC   RLADGLANYAFSLPLGFHSLIDVPDDLEVILHEDSLGSTRPRRVRL
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 5851 BP; 1555 A; 970 C; 1731 G; 1595 T; 0 other;
ATLINE1_1
atcaagttcgtcttccatggagagcggtgctagggttttgggcgagaaagcgcaggacgcgacgatgacc
gatgttggtgagaaagggaggccgccgggggatcctccggataagtcatcgtcgtgggcgaataaggtga
gaggaggccatgtgggagggatgttggctccgacggatgtgttaagtgatgagtttgtaagggcacggtt
gagtctgacgtttccagatggcgaggatggagaaccactaatcataatcgggctagaggtgtttgaagct
atgaacagtttgtggaagaattgcatgttggtcaaggtgttaggaaggagtgtccctattgcagtgttaa
gcaagaagttaagagagctgtggaagcctataggagcgatgcatgttgtggatctaccacgacaatactt
catggttcggttcgaatcagaggaagagtatttgacagcactgacgggagggccgtggagggtgttcggc
agttacctgttagttcaagcgtggtccccagattttgatccaatgaaggacgagattgttacgacaccgg
tgtgggtgagattgtcgaacatcccgttgaatctttatcatccgtcgatcttgatgggaattactggtgg
cctgggtaatctgatcaaagtggatatgacaacgttgacttgtgaaagggctaggtttgctcgtgtttgt
gttgaagttaacttgagaaagccgcttaaagggacggtgatgataaatgaggacagatattttgtggctt
atgaaggccttacgaatatttgctccggttgtggcttgtatggacatctagtgcacaactgcccacaagt
gaagcagagtaacgtggtgaaagctacacaatctattgtggcggtcgagacaagcttagctccagtgagt
caatcagtagatgggttcacgacggtcggacagtcaggtcgaagggggacgaagcagccggcgacggttg
tgttcgcagctggaggcactaggagcggtttaggaaagtctcaacgtgatttggggaagaagagtgattc
agcgaatatttcggtaacaaacagttttgggagtttgttgactgatatggaaggtactgatttaagtgca
gatgtggtggaattagaggggaataaggagaatgaggagattctaattcaatcaaggaatgagaagaaag
ttattcatggtaatgttgtaccgttaggggagaattctaagagggccaatgagggtgcgcgtatagggaa
gaaagataagcggaatgggctcaaaaaagttgcagtttcgaatgggcctaggcctaatcaaatgaaccac
gttaggcccacaagaggtctggtgtttgggccaactagaggcgagatggagagatcttttaacgggaaac
gtctaaggatagaggagaagaatccaggacgtccggggggtgtgtttactcagacaggggacggaagttc
gaatgtagagaattccggtcaaggtcgagtcgttaaaaacatggtcggtcagttggattcgaatcaaagt
acgatgcctatggaggaaacgatgcgtagtgttccacagggtagtggtaatgggaacatggtcgcataac
ggattgccccggcttttctttttactcaagaagaaataatgatgaattgtttactatggaatttccaggg
ggcgaataaaccccactttcgaagatcaattcgatatattgtaaaaaagtttccaaccgaaatcttggca
atctttgaaactcatgcgagtggggatcgagcagggagcatttgccagggattaggctttgataaatctt
ttcgtgtcgacgcagtcgggcagagtgggggtatttggttgttgtggcggtcggagattggggaagtgac
gattgttcagtcaacagatcaatttatctttgcgacggtggatacaggggatgaggttctgaatctcatt
gtagtatatgctgcgccttctataagtcggagaagtggtttatgggatcaacttcgagatgtgatacgcg
aggctataggaccgattgtaattgttggagactttaattcaatagtaaggttggatgaaaggactggtgg
taatggccaactctcgccggattcgttagcctttggagactggatcaatacttcatctcttattgatatg
ggattccggggtaataaattcacttggaaacgagggaaaacagaatcgaactttgtggcaaaacggctgg
atagggttctgtgttgtgctcactcctgtttaaaatggcaggaagctattgtaacgcaccttccgtttct
ctcttcggatcatgctcctctttatgttcaactctctccaacagtaagaggagacccacgaagaagacca
ttccgttttgaagcggcgtggctaagtcacgaagggtttaaagagttgttaaatgtttcgtggaatcgaa
atctgagtactccaatagctttgcaggagttacagaaaattctcaaaaaatggaacaaagaggtgtttgg
agacattcagcaacgtaaagagaagttggtggtggaaattaaagaagttcaggacttgttggatgttacc
caaactgatgagttgttgcagaaggaggaacaactaattaaagagtttgatatcgttcttgagcaggagg
agacgttgtggtatcaaaaatcaagagaacggtggattgtgttgggcgacagaaataccacatattttca
cacttctacggtgattaggagacggagaaacagaatagagatgctcaagaataatgaagatcagtgggtt
tcagattctcgagaattggaatagctcgcccttgattattattctaagctttattctttagacgatgttg
agcctgtggttaccaaactaccaccagaaggatttatgagcctgtcccaagcagataaaacagagctgct
acggtgtttctctgccggtgaagtggagaaggctgttcgttgtatgggtaagtttaaagcacctggaccc
gatggatatcagccagtcttttatcaagagtgctgggaggtagtgggggaatcagtggttaagcttgtgt
tggagttttttgaaacgtccgtgttaccgagtagactaaatgacgcccttgttgtgctgttaccaaaggt
tggaaagcctgaaaaaataactcagttccggcctatcagcctatgcaatgtgcttttcaagatcatcact
aagacgatggtagagcgtctgaaaccgttaatgacaaatctgattggtccggctcaagctagttttatcc
ccgggagagtaagtacggataatatagtgttggttcaagaagcagtccattccatgagaagaaaaaaagg
ggtaaagggctggatgcttttgaagttggatttggagaaggcttatgatacgatccgatgggattatctg
gaggatacacttatttctgcggggttccccgaagtttgggtaagatggataatgtgctgtgtgtcgggcc
cggagatgagtttgttgtggaatggagagagaacagactccttcaagccgttacggggcctccgccaggg
tgatcccttatctccgtatttgtttgtgctatgtatggagaggctttgtcacttgattgagcgatcgatt
gataacaaacagtggaagcccattagtttgtctcaaggcggtcccaagttatctcatatctgttttgctg
acgatctaattttatttgcggaggcttcagtaatgcagataagagtgattcggaaggtgctggagacatt
ttgtatagcctctggtcaaaagattagcctggagaagtcaaaaatatttttttcagggaatgtctcacgt
gatccgagcaaactgataagtgaggagagcgggattaaagcaacaaacgacttgggtaaatatcttggga
tgccagtacttcataaacgcatcaataaggatactttcagtgaacttttggagaaggtctcttcccgttt
gtcgggttggaaggagagaactttgagttttgcgggacggatgacgcttaccaaagctgtattatcatct
ataccggtccatacaatgagttctattgcgctaccacaatcgactattactcgattggacaaagcttctc
gttcctttctttggggaagcacggctgagaagagaaaacaacacttggtgtcttggaaaaaggtctgctt
acctaagaaagatgggggtttgggcattcgcaatgcgaaattgatgaacaaggctcttattgctaaggtt
ggttggcgtctgctgcaggaccagtcgagtctttgggctgaggtcttcagaaaaaagtataaaattggcg
atctccgggattgtcagtggctgcgtaagaaggggacttggtcatcaacttggagaagcattgtaacagg
gcttgtggatgtgatatcgcgggggacttgctgggttcctggtgatggacatcatattcgtttttggcga
gatatttgggtctcagggaaacccttatgggaagatgggcagggcgtggttccggcgaacttggagacag
agtcagtgcggagtttgtggaaggacgatagtggctgggatatcagtaggatcagtccgtatgtttcaga
gagccgatgtctagagctacgggcaatcgtattggatcatgagacagtggcgagagacagaatctcctgg
aatgagagtcagaatggccaattcagtgtatcgtccgcttataatatgctgagttgggatgatagtccac
gacctaatatggagaaattctttaatcggatatggagagttaaggttccagaacgtgtgagagctttctt
ctggttagtggtcaatcaagggattatgacagatgcagagcgttatcgaaggcatatcggtgagtcggag
atttgtcagatttgtaaagggggagtagagaccactttacacattctcagagattgtccagctatgtcag
ggatctggactagaattgtgcctcagaggaagcggcgtgcgttcttcgacaagactttgtttgaatgggt
gtttgataatctgcacgaagaaaccccttttcaagagagctcttggtccacggtgtttgctatggcggtt
tggtggggatggaagtggagatgcggtaacatttttggtgagaaaaggaaatgcagggatagagttcagt
tcatcaaggatgtcgcgaaggaagtatttgtggcgaatgcaagggctacagtgcttaacgggattccgac
aagagtggagaggcagattggatgggtcgcaccaagtacgggttggtataaggtgaacacagacggtgct
tctcgtggaaacccggggttagcgacggcgggtggtgtgatacgggatggagctggaaattggtgtggag
gctttgcgcttaatatcgggagatgctcagcgcctttagcggagctgtggggggtttattatggacttta
cctagcttggactaaggcgttgactcgtgtggagctcgaagttgattcggagttggtagttggttttctc
aagacagggatcggtgatcaacatccgctgtcgttcctggtgcggttgtgccatggcttattatcgaagg
actggattgtccgaatcactcgcgtgtatagggaggctaatcgtctagccgacggtctagctaactatgc
attttctttaccattaggttttcattctcttattgatgttccggatgatttagaggtgattttgcatgag
gatagtcttgggtcaacgcgaccaagacgagtccggttgtaagctttgttttatttcttttttaataata
ttccgggagcttccgtctcccggtgaatcaccaaaaaaaaa1