;ID ATLINE1_1 DNA ; ATH ; 5851 BP
;XX
;DE ATLINE1_1, non-LTR retrotransposon - a consensus.
;XX
;AC .
;XX
;DT 15-DEC-2000 (Rel. 5.9, Created)
;DT 15-DEC-2000 (Rel. 5.9, Last updated, Version 1)
;XX
;KW non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2;
;KW reverse transcriptase; ATLINE1_1.
;XX
;OS consensus
;XX
;OC Arabidopsis thaliana
;OC Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC Dilleniidae; Capparales; Brassicaceae.
;XX
;RN [1] (bases 1 to 5851)
;RA Kapitonov,V. and Jurka,J.
;RL Direct submission (December, 2000)
;XX
;CC ATLINE1_1 is a non-LTR retrotransposon. Its individual copies are
;CC 99% identical to each other. There are only a few copies of
;CC ATLINE1_1 present in the genome. ATLINE1_1 belongs to the L1
;CC superfamily of non-LTR retrotransposons; its copies are flanked
;CC by ~15-bp target site duplications.
;CC Two proteins, ATLINE1_1p1 and ATLINE1_1p2, are encoded by ORF1
;CC (positions 17-1609) and ORF2 (positions 1652-5782), respectively.
;CC The function and classification of the first protein are unclear,
;CC although it is expected to be a nucleic acid binding protein, analogous
;CC to the protein encoded by L1 in mammals. The second protein contains
;CC the reverse transcriptase and endonuclease domains.
;CC ATLINE1_1p1 (530 aa): ;CC MESGARVLGEKAQDATMTDVGEKGRPPGDPPDKSSSWANKVRGGHVGGMLAPTDVLSDEFVRARLSLTFP ;CC DGEDGEPLIIIGLEVFEAMNSLWKNCMLVKVLGRSVPIAVLSKKLRELWKPIGAMHVVDLPRQYFMVRFE ;CC SEEEYLTALTGGPWRVFGSYLLVQAWSPDFDPMKDEIVTTPVWVRLSNIPLNLYHPSILMGITGGLGNLI ;CC KVDMTTLTCERARFARVCVEVNLRKPLKGTVMINEDRYFVAYEGLTNICSGCGLYGHLVHNCPQVKQSNV ;CC VKATQSIVAVETSLAPVSQSVDGFTTVGQSGRRGTKQPATVVFAAGGTRSGLGKSQRDLGKKSDSANISV ;CC TNSFGSLLTDMEGTDLSADVVELEGNKENEEILIQSRNEKKVIHGNVVPLGENSKRANEGARIGKKDKRN ;CC GLKKVAVSNGPRPNQMNHVRPTRGLVFGPTRGEMERSFNGKRLRIEEKNPGRPGGVFTQTGDGSSNVENS ;CC GQGRVVKNMVGQLDSNQSTMPMEETMRSVPQGSGNGNMVA ;CC ATLINE1_1p2 (1376 aa): ;CC MNCLLWNFQGANKPHFRRSIRYIVKKFPTEILAIFETHASGDRAGSICQGLGFDKSFRVDAVGQSGGIWL ;CC LWRSEIGEVTIVQSTDQFIFATVDTGDEVLNLIVVYAAPSISRRSGLWDQLRDVIREAIGPIVIVGDFNS ;CC IVRLDERTGGNGQLSPDSLAFGDWINTSSLIDMGFRGNKFTWKRGKTESNFVAKRLDRVLCCAHSCLKWQ ;CC EAIVTHLPFLSSDHAPLYVQLSPTVRGDPRRRPFRFEAAWLSHEGFKELLNVSWNRNLSTPIALQELQKI ;CC LKKWNKEVFGDIQQRKEKLVVEIKEVQDLLDVTQTDELLQKEEQLIKEFDIVLEQEETLWYQKSRERWIV ;CC LGDRNTTYFHTSTVIRRRRNRIEMLKNNEDQWVSDSRELEXLALDYYSKLYSLDDVEPVVTKLPPEGFMS ;CC LSQADKTELLRCFSAGEVEKAVRCMGKFKAPGPDGYQPVFYQECWEVVGESVVKLVLEFFETSVLPSRLN ;CC DALVVLLPKVGKPEKITQFRPISLCNVLFKIITKTMVERLKPLMTNLIGPAQASFIPGRVSTDNIVLVQE ;CC AVHSMRRKKGVKGWMLLKLDLEKAYDTIRWDYLEDTLISAGFPEVWVRWIMCCVSGPEMSLLWNGERTDS ;CC FKPLRGLRQGDPLSPYLFVLCMERLCHLIERSIDNKQWKPISLSQGGPKLSHICFADDLILFAEASVMQI ;CC RVIRKVLETFCIASGQKISLEKSKIFFSGNVSRDPSKLISEESGIKATNDLGKYLGMPVLHKRINKDTFS ;CC ELLEKVSSRLSGWKERTLSFAGRMTLTKAVLSSIPVHTMSSIALPQSTITRLDKASRSFLWGSTAEKRKQ ;CC HLVSWKKVCLPKKDGGLGIRNAKLMNKALIAKVGWRLLQDQSSLWAEVFRKKYKIGDLRDCQWLRKKGTW ;CC SSTWRSIVTGLVDVISRGTCWVPGDGHHIRFWRDIWVSGKPLWEDGQGVVPANLETESVRSLWKDDSGWD ;CC ISRISPYVSESRCLELRAIVLDHETVARDRISWNESQNGQFSVSSAYNMLSWDDSPRPNMEKFFNRIWRV ;CC KVPERVRAFFWLVVNQGIMTDAERYRRHIGESEICQICKGGVETTLHILRDCPAMSGIWTRIVPQRKRRA ;CC FFDKTLFEWVFDNLHEETPFQESSWSTVFAMAVWWGWKWRCGNIFGEKRKCRDRVQFIKDVAKEVFVANA ;CC RATVLNGIPTRVERQIGWVAPSTGWYKVNTDGASRGNPGLATAGGVIRDGAGNWCGGFALNIGRCSAPLA ;CC 
ELWGVYYGLYLAWTKALTRVELEVDSELVVGFLKTGIGDQHPLSFLVRLCHGLLSKDWIVRITRVYREAN ;CC RLADGLANYAFSLPLGFHSLIDVPDDLEVILHEDSLGSTRPRRVRL ;XX ;DR [1] (Consensus) ;XX ;SQ Sequence 5851 BP; 1555 A; 970 C; 1731 G; 1595 T; 0 other; ATLINE1_1 atcaagttcgtcttccatggagagcggtgctagggttttgggcgagaaagcgcaggacgcgacgatgacc gatgttggtgagaaagggaggccgccgggggatcctccggataagtcatcgtcgtgggcgaataaggtga gaggaggccatgtgggagggatgttggctccgacggatgtgttaagtgatgagtttgtaagggcacggtt gagtctgacgtttccagatggcgaggatggagaaccactaatcataatcgggctagaggtgtttgaagct atgaacagtttgtggaagaattgcatgttggtcaaggtgttaggaaggagtgtccctattgcagtgttaa gcaagaagttaagagagctgtggaagcctataggagcgatgcatgttgtggatctaccacgacaatactt catggttcggttcgaatcagaggaagagtatttgacagcactgacgggagggccgtggagggtgttcggc agttacctgttagttcaagcgtggtccccagattttgatccaatgaaggacgagattgttacgacaccgg tgtgggtgagattgtcgaacatcccgttgaatctttatcatccgtcgatcttgatgggaattactggtgg cctgggtaatctgatcaaagtggatatgacaacgttgacttgtgaaagggctaggtttgctcgtgtttgt gttgaagttaacttgagaaagccgcttaaagggacggtgatgataaatgaggacagatattttgtggctt atgaaggccttacgaatatttgctccggttgtggcttgtatggacatctagtgcacaactgcccacaagt gaagcagagtaacgtggtgaaagctacacaatctattgtggcggtcgagacaagcttagctccagtgagt caatcagtagatgggttcacgacggtcggacagtcaggtcgaagggggacgaagcagccggcgacggttg tgttcgcagctggaggcactaggagcggtttaggaaagtctcaacgtgatttggggaagaagagtgattc agcgaatatttcggtaacaaacagttttgggagtttgttgactgatatggaaggtactgatttaagtgca gatgtggtggaattagaggggaataaggagaatgaggagattctaattcaatcaaggaatgagaagaaag ttattcatggtaatgttgtaccgttaggggagaattctaagagggccaatgagggtgcgcgtatagggaa gaaagataagcggaatgggctcaaaaaagttgcagtttcgaatgggcctaggcctaatcaaatgaaccac gttaggcccacaagaggtctggtgtttgggccaactagaggcgagatggagagatcttttaacgggaaac gtctaaggatagaggagaagaatccaggacgtccggggggtgtgtttactcagacaggggacggaagttc gaatgtagagaattccggtcaaggtcgagtcgttaaaaacatggtcggtcagttggattcgaatcaaagt acgatgcctatggaggaaacgatgcgtagtgttccacagggtagtggtaatgggaacatggtcgcataac ggattgccccggcttttctttttactcaagaagaaataatgatgaattgtttactatggaatttccaggg ggcgaataaaccccactttcgaagatcaattcgatatattgtaaaaaagtttccaaccgaaatcttggca 
atctttgaaactcatgcgagtggggatcgagcagggagcatttgccagggattaggctttgataaatctt ttcgtgtcgacgcagtcgggcagagtgggggtatttggttgttgtggcggtcggagattggggaagtgac gattgttcagtcaacagatcaatttatctttgcgacggtggatacaggggatgaggttctgaatctcatt gtagtatatgctgcgccttctataagtcggagaagtggtttatgggatcaacttcgagatgtgatacgcg aggctataggaccgattgtaattgttggagactttaattcaatagtaaggttggatgaaaggactggtgg taatggccaactctcgccggattcgttagcctttggagactggatcaatacttcatctcttattgatatg ggattccggggtaataaattcacttggaaacgagggaaaacagaatcgaactttgtggcaaaacggctgg atagggttctgtgttgtgctcactcctgtttaaaatggcaggaagctattgtaacgcaccttccgtttct ctcttcggatcatgctcctctttatgttcaactctctccaacagtaagaggagacccacgaagaagacca ttccgttttgaagcggcgtggctaagtcacgaagggtttaaagagttgttaaatgtttcgtggaatcgaa atctgagtactccaatagctttgcaggagttacagaaaattctcaaaaaatggaacaaagaggtgtttgg agacattcagcaacgtaaagagaagttggtggtggaaattaaagaagttcaggacttgttggatgttacc caaactgatgagttgttgcagaaggaggaacaactaattaaagagtttgatatcgttcttgagcaggagg agacgttgtggtatcaaaaatcaagagaacggtggattgtgttgggcgacagaaataccacatattttca cacttctacggtgattaggagacggagaaacagaatagagatgctcaagaataatgaagatcagtgggtt tcagattctcgagaattggaatagctcgcccttgattattattctaagctttattctttagacgatgttg agcctgtggttaccaaactaccaccagaaggatttatgagcctgtcccaagcagataaaacagagctgct acggtgtttctctgccggtgaagtggagaaggctgttcgttgtatgggtaagtttaaagcacctggaccc gatggatatcagccagtcttttatcaagagtgctgggaggtagtgggggaatcagtggttaagcttgtgt tggagttttttgaaacgtccgtgttaccgagtagactaaatgacgcccttgttgtgctgttaccaaaggt tggaaagcctgaaaaaataactcagttccggcctatcagcctatgcaatgtgcttttcaagatcatcact aagacgatggtagagcgtctgaaaccgttaatgacaaatctgattggtccggctcaagctagttttatcc ccgggagagtaagtacggataatatagtgttggttcaagaagcagtccattccatgagaagaaaaaaagg ggtaaagggctggatgcttttgaagttggatttggagaaggcttatgatacgatccgatgggattatctg gaggatacacttatttctgcggggttccccgaagtttgggtaagatggataatgtgctgtgtgtcgggcc cggagatgagtttgttgtggaatggagagagaacagactccttcaagccgttacggggcctccgccaggg tgatcccttatctccgtatttgtttgtgctatgtatggagaggctttgtcacttgattgagcgatcgatt gataacaaacagtggaagcccattagtttgtctcaaggcggtcccaagttatctcatatctgttttgctg 
acgatctaattttatttgcggaggcttcagtaatgcagataagagtgattcggaaggtgctggagacatt ttgtatagcctctggtcaaaagattagcctggagaagtcaaaaatatttttttcagggaatgtctcacgt gatccgagcaaactgataagtgaggagagcgggattaaagcaacaaacgacttgggtaaatatcttggga tgccagtacttcataaacgcatcaataaggatactttcagtgaacttttggagaaggtctcttcccgttt gtcgggttggaaggagagaactttgagttttgcgggacggatgacgcttaccaaagctgtattatcatct ataccggtccatacaatgagttctattgcgctaccacaatcgactattactcgattggacaaagcttctc gttcctttctttggggaagcacggctgagaagagaaaacaacacttggtgtcttggaaaaaggtctgctt acctaagaaagatgggggtttgggcattcgcaatgcgaaattgatgaacaaggctcttattgctaaggtt ggttggcgtctgctgcaggaccagtcgagtctttgggctgaggtcttcagaaaaaagtataaaattggcg atctccgggattgtcagtggctgcgtaagaaggggacttggtcatcaacttggagaagcattgtaacagg gcttgtggatgtgatatcgcgggggacttgctgggttcctggtgatggacatcatattcgtttttggcga gatatttgggtctcagggaaacccttatgggaagatgggcagggcgtggttccggcgaacttggagacag agtcagtgcggagtttgtggaaggacgatagtggctgggatatcagtaggatcagtccgtatgtttcaga gagccgatgtctagagctacgggcaatcgtattggatcatgagacagtggcgagagacagaatctcctgg aatgagagtcagaatggccaattcagtgtatcgtccgcttataatatgctgagttgggatgatagtccac gacctaatatggagaaattctttaatcggatatggagagttaaggttccagaacgtgtgagagctttctt ctggttagtggtcaatcaagggattatgacagatgcagagcgttatcgaaggcatatcggtgagtcggag atttgtcagatttgtaaagggggagtagagaccactttacacattctcagagattgtccagctatgtcag ggatctggactagaattgtgcctcagaggaagcggcgtgcgttcttcgacaagactttgtttgaatgggt gtttgataatctgcacgaagaaaccccttttcaagagagctcttggtccacggtgtttgctatggcggtt tggtggggatggaagtggagatgcggtaacatttttggtgagaaaaggaaatgcagggatagagttcagt tcatcaaggatgtcgcgaaggaagtatttgtggcgaatgcaagggctacagtgcttaacgggattccgac aagagtggagaggcagattggatgggtcgcaccaagtacgggttggtataaggtgaacacagacggtgct tctcgtggaaacccggggttagcgacggcgggtggtgtgatacgggatggagctggaaattggtgtggag gctttgcgcttaatatcgggagatgctcagcgcctttagcggagctgtggggggtttattatggacttta cctagcttggactaaggcgttgactcgtgtggagctcgaagttgattcggagttggtagttggttttctc aagacagggatcggtgatcaacatccgctgtcgttcctggtgcggttgtgccatggcttattatcgaagg actggattgtccgaatcactcgcgtgtatagggaggctaatcgtctagccgacggtctagctaactatgc 
attttctttaccattaggttttcattctcttattgatgttccggatgatttagaggtgattttgcatgag gatagtcttgggtcaacgcgaccaagacgagtccggttgtaagctttgttttatttcttttttaataata ttccgggagcttccgtctcccggtgaatcaccaaaaaaaaa