;ID ATLINE1_2 DNA ; ATH ; 5814 BP ;XX ;DE ATLINE1_2, non-LTR retrotransposon - a consensus. ;XX ;AC . ;XX ;DT 15-DEC-2000 (Rel. 5.9, Created) ;DT 15-DEC-2000 (Rel. 5.9, Last updated, Version 1) ;XX ;KW non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2; ;KW reverse transcriptase; ATLINE1_2. ;XX ;OS consensus ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; ;OC Dilleniidae; Capparales; Brassicaceae. ;XX ;RN [1] (bases 1 to 5773) ;RA Kapitonov,V. and Jurka,J. ;RL Direct submission (December, 2000) ;XX ;CC ATLINE1_2 is a non-LTR retrotransposon. Its individual copies are ;CC 98% identical to each other. There are only a few copies of ;CC ATLINE1_2 present in the genome. ATLINE1_2 belongs to the L1 ;CC superfamily of non-LTR retrotransposons, its copies are flanked ;CC by ~15-bp target site duplications. ;CC Two proteins, ATLINE1_2p1 and ATLINE1_2p2, are encoded by ORF1 ;CC (position 2-1579) and ORF2 (position 1622-5752), respectively. ;CC Function and classification of the first protein is unclear, although ;CC it is expected to be a nucleic acid binding protein analogously to ;CC the protein encoded by L1 in mammals. The second protein contains ;CC the reverse transcriptase and endonuclease domains. ;CC ATLINE1_2p1 (525 aa): ;CC MGERGDFRVPVTSSQDAPMMDIGDGGRPSGDPLDVVESWASKVSGSNAGGRLSPERVLDKEFVMARMRLE ;CC FPDGEDGEPVITIAQEVLDVMNGLWKQCIIVKVLGRHIALPALNKRLREMWNPKGGMHVLDLPRQFFMIR ;CC FDLEDEYLVALTGGPWRAFGSHLMVQAWSPDFDPLRNEIVTTPVWVRLSNIPLNLYHETILLGIVQGLGK ;CC PIKVDLTTLHHAKARFARVCVEVNLSKPLKGTITINGERYFVAYEGLSNICSGCGIYGHLVHNCPRRVVE ;CC RTVAPVTVEVPVNVSGGTPQDDGFTVVRRTGRKGGAPENSGVATAGRLSTNLERNLRDISGRANMESIDT ;CC SNSFGNLEEVIEGSVREVAASVDANKENMMNGNYAKKGKSVAQGKAWAPVDLMNKIRAGKKDKTAGGKVG ;CC EANGPKPRNGNSNRPVRGLVFGPTRKEIELSGSGKRLRVDESSVYRQGGGGSPEKESRGVDGVSMSVAGE ;CC AVAKSSLVDLAEAPQNREVEMQNGLSAVVATSLAA ;CC ATLINE1_2p2 (1376 aa): ;CC MDVLFWNCRGANKPLFRRTIRYMLKKNNIDILALFETHAAGDRASRICQKLGFEHTFRVDAVGHSGGVWL ;CC LWRASVGVVTVVASSEQFIHAKIVSETETLHLIVVYAAPSVSRRSGLWGCLKTAIEGVDGPLVIRGDFNM ;CC IVRLDERTGGNGRLSLDSLAFGEWINELMLIDMGFKGSQYTWRRGRLEENFIAKRLDRILCCPQARLRWQ ;CC EATVTHLPVVASDHAPLYLQLSPAFRGDPKRRPFRFEAAWLLHDGFKELLQLSWNNSLSTPEALNGLQIR ;CC LKKWNREVFGDINQRKDRLTTEIKSVQDLLDVVQTDALLRKEEELIKELDVVMEQEEVIWFQKSREKWVL ;CC DGDRNTKYFHTSTIIRRRRNRVEMLKSDSGVWISDPQELEKLATAYYKRLYSMEDVDQEVEMLPPGGFAR ;CC LTEREVAELTKPFSAVEVEASVRSMGKLKAPGPDGYQPIFYQDCWEVVGQSIARFVLDFFVTGILPEGTN ;CC DVLMVLIPKLAKPSKIMQFRPINLCNVLFKTITKTMMRRLQNVMSKLVGPAQSSFIPGRLSTDNILVVQE ;CC AVHSMRRKKGRKGWMLLKLDLEKAYDRIRWDFLHDTLVSVGLPDCWREWIMKCVAGPSMTLLWNGEKADP ;CC FKPARGLRQGDPLSPYLFVLCMERLCHQIEISVASKEWKPINLSQGGPKLSHICFADYLILFAEASVAQI ;CC RVIRQVLERFCVASGQKVSLEKSKIFFSDNVSRDLATLISNESGIKATKDLGKYLGMPVLHKRINKDTFG ;CC DVVERVASRLAGWRCRFLSLAGRITLTKSVLSSIPVHTMSTISLPQSILNKLDSISRSFLWGSTMEKRKQ ;CC HLIAWDRVCLPKQDGGLGIRCSTQMNTALLSKIGWRLLHDDVSLWSKILRSKYRVGDIHNRAWMVSKGTW ;CC SSTWRSVVVGLKEVVFSGLSWVLGDGVDILFWKDRWMSQTPLCEVVTCELPANWEAVKVVDVWRDGVGWD ;CC LQRLTPYFTEGMKLKLLSLVVDNVTGARDRLSWGGCSNGNSTVKSAYSFLSLDWSSKQQMARFFSRIWRV ;CC VAHERVRVFIWLAANQVLMTNVERYRRHLCDSSLCSVCKSGEETILHILRDCPAMAGIWTRLLPARRLSS ;CC FFSKSLLEWIYANLGEEIEINGCPWAVTFSQAIWWGWKWRCGNIFGENRKCRDRVRFIKDRALDVWKAHV ;CC HKMGVTTRTAREERLIAWSPPRVGWFKLNTDGASRGNPGLATAGGVVRDGDGNWCYGFSLNIGICSAPLA ;CC ELWGAYYGLNIAWERGVTQLEMEIDSEMVVGFLRTGIDDSHPLSFLVRLCHGLLSKDWSVRISHVYREAN ;CC RLADGLANYAFLLPLGFHLFNSTPDNVMSIVHDDVAGSAYPRNVQV ;XX ;DR [1] (Consensus) ;XX ;SQ Sequence 5814 BP; 1486 A; 965 C; 1747 G; 1616 T; 0 other; ATLINE1_2 gatgggtgagagaggcgattttagggttccagtgacgagttcgcaagatgcgccaatgatggatatcgga gatggtggtcgtccctcgggagaccctctggatgtggtagagtcatgggcgagtaaggtgtccggtagca atgctggagggaggttgtcaccggagagagtgttggataaggagttcgtcatggcgaggatgcggctgga gtttcctgatggggaggatggagaaccagtcattacgatcgcccaggaagttcttgatgtgatgaatggt ttgtggaagcagtgtattattgtgaaggtgttagggagacacatagcgttaccagcgttgaacaagagat tgagggaaatgtggaatccaaagggagggatgcatgtgttggatctcccaagacagttctttatgattcg ctttgatctggaagatgaatacttggtggctcttaccggtggaccatggagagcgtttgggagccaccta atggtgcaagcttggtctccggactttgatccgttgaggaatgagatagtaacaacgccagtgtgggttc gtttgtcgaatattcctttgaatctgtatcacgaaacgatcctgttgggtattgttcagggattagggaa acctatcaaagtggatctcacgacattgcatcatgcgaaagctagatttgcacgagtatgtgtggaggtg aatctatcaaaacctctcaaaggtacaataacgattaatggagagagatatttcgttgcttatgagggtc tatcaaacatttgctcgggatgtgggatatatggtcatttggtgcacaattgtcctaggagagtggtgga gaggacagtggcgcctgtgaccgtggaggttccagtgaatgtttccggtgggacgccgcaggatgacggt tttacggtggttcgaagaacgggccgtaagggaggagcaccggaaaatagtggggtggcaacagctggtc ggttgagcacaaatttggaaaggaatctgcgtgatatttctgggcgggcgaatatggaaagcattgatac ctctaacagttttgggaatttggaggaagtgattgaggggagtgtaagggaagttgctgcatcagtggat gcgaataaagaaaatatgatgaatggtaattatgctaaaaaagggaagagtgttgcacaagggaaggctt gggcgccagtggatttaatgaataaaattagggccggaaaaaaagataaaacggctggaggcaaggtagg agaggctaatgggccaaagcccagaaatggtaactccaacaggcctgtgcgtgggttagtttttggcccg acaaggaaggaaattgagttatctggctcggggaaaaggctgagagtggatgaatcttcggtgtatcgac agggtggaggtggttcaccggagaaggagagtagaggagttgatggcgtgtcaatgtctgtggcgggaga ggctgtggcgaagagttctcttgtggatttggcagaggcgccacagaatagagaagtggaaatgcagaat gggttgtcggcggttgtggcgacctctcttgcggcatgacagcttgccccggtagttcaagcttttttca atctctttatgatggatgttttattttggaattgccggggggcaaataaacctcttttccgaagaacaat acggtacatgttgaagaagaataacatcgacattctggctctgtttgaaacacacgctgcaggagataga gccagcagaatttgtcagaagttgggtttcgagcacacgttccgggtcgatgcagtggggcatagtggtg gagtatggttgctgtggagagctagtgtgggagtggttacagttgtggcctcgtctgaacagtttattca cgccaagattgtgagtgagacagagaccttgcatttgattgtggtttacgcagctccatcagttagtcga agaagcgggctatggggatgtttgaaaactgcgattgaaggagtagatggtcctttagtgattcgtggtg atttcaacatgattgtaagacttgatgagcggacaggggggaatggacgtttgtctctggactctttagc attcggtgaatggattaatgagcttatgttaattgatatgggttttaaagggagtcagtatacttggaga agaggtagattggaagagaattttattgctaagcgcttggaccggattctttgctgtccccaggcacgct tgagatggcaagaggcgacagtaactcacctccccgttgttgcctcagatcatgcgcctctttatctgca actttcgcctgcattccgaggtgacccgaaacgaagaccattccggttcgaggcagcttggttactacat gacgggtttaaggagcttcttcaactctcttggaataacagtctatcaacgcctgaagctcttaatgggt tgcaaattaggctgaaaaagtggaatcgggaggtgtttggtgatattaatcaacgtaaggatcgattaac cacggaaatcaaatcggtacaagacttgctcgatgttgttcaaactgatgctctgttacgcaaagaagaa gagctgattaaagagttagatgttgtcatggagcaggaggaagtcatttggttccaaaagtcacgggaga aatgggtgttagacggagatcgaaatactaaatactttcacacatctaccattattcgaaggagaaggaa tcgtgttgagatgttgaagagtgatagtggtgtatggatctcagacccgcaggagctggagaaattggcg actgcttattataaaagattatattccatggaggatgttgatcaagaggtggaaatgctaccaccgggag gttttgctaggcttacagagagagaagtagcagagcttaccaaaccgttctcggcagttgaggtggaagc ttcagtccggagtatggggaagttgaaagctccggggccagatggataccaaccaatcttttaccaagac tgttgggaggtggtgggacagtcaatagcccgctttgtgttggatttctttgtaacagggatcttacctg aaggaaccaatgatgtgttgatggtgcttatccctaagctagctaagccgagtaaaattatgcaattcag accaatcaatttatgtaatgttttgtttaaaacaattacaaaaaccatgatgcggcgtttacagaatgtg atgagtaagctcgttggtccagctcagtcaagcttcataccgggcagattgagtaccgacaatattctgg tggttcaagaagcagtccattcaatgcggaggaaaaaagggcgaaaaggttggatgctcctaaaactgga tttagagaaagcctatgatcgtatacgctgggattttttacatgatactttagtgtctgtgggactgcca gattgttggagagagtggatcatgaagtgtgttgcaggaccatctatgactttgttgtggaatggggaga aggcagatccgtttaaaccggctagagggttgagacaaggtgacccgctgtcaccatatctctttgttct ttgtatggagcggttgtgccatcagatagagatttcagtagcttcgaaggagtggaaaccgattaatctc tctcagggagggccgaagttatcacatatttgtttcgcggattatcttattctttttgccgaagcatcgg tggcacaaattcgggttatcagacaagttctagaacgattctgcgtagcgtcggggcagaaggttagtct cgaaaaatcaaagatcttcttctctgataatgtgtctcgggatttagcgactcttatcagtaatgaaagc ggcattaaggcaactaaagatctgggcaaatatttgggaatgccagttcttcataagcggattaataaag acacatttggtgatgtggtagagcgagtagcttcaaggctggcgggttggagatgtcgcttcctcagtct tgcgggtcggattacacttactaaatcagtcctctcatccattccggttcataccatgtcaacaatttcg ctgccacagtctattttgaataaattagacagtatctcacgctcttttttatgggggagtacaatggaaa aacgaaaacaacatcttattgcttgggatcgtgtgtgcttgccaaaacaggatggagggttaggcatccg gtgctctacacagatgaatacggctcttctctcgaaaattgggtggcgtttacttcatgatgatgtgagt ctatggtcgaaaatcttgagaagcaagtaccgtgtcggtgatattcataacagagcgtggatggtgtcta aaggtacatggtcttctacatggaggagcgttgtcgtgggtctgaaggaggtggtcttctcgggtttgag ctgggttctaggggatggtgttgatatcctcttctggaaagataggtggatgtcgcagaccccgttatgt gaggtggtaacatgcgaattaccggcaaactgggaggcggttaaagttgtggatgtttggagagatggag tgggctgggatttacagaggcttacgccgtatttcacagaaggtatgaagctcaagttactatctttggt ggtggataatgtgacaggggctagggatcgtttgtcttggggagggtgttccaatggtaattctacagtc aagtcagcttactcttttttaagcttggactggagttcgaagcagcagatggcgcgttttttctctagaa tttggcgtgttgtagctcatgaaagagttcgagtgttcatatggttggctgcaaatcaggtgttgatgac gaatgttgagagatacagacggcatttatgtgattcgagcttatgttcagtatgcaagagtggggaggag acaattctacacattttacgggactgtccggcgatggcgggaatttggacccgcttgttaccagcacgga gactctcttctttcttttcgaaatcgctgttagaatggatctatgcaaatttaggagaggagatagagat taatggttgtccatgggcggtcactttttcgcaggccatatggtgggggtggaaatggcgttgcggtaac atctttggcgagaacaggaagtgtcgggatcgggttcgttttattaaggatcgtgcgttggatgtttgga aggcgcatgtgcacaaaatgggagtgacgacgcggacagctagggaggagagattgattgcgtggtctcc gccaagggtgggttggtttaagctcaatactgatggggcttcgcgtggtaacccgggactagctacagca ggtggagtagttcgagacggggatggaaattggtgttatgggttttcgttgaatattgggatttgttcgg ctccgcttgcggaactatggggagcatattacggtttaaatatcgcttgggagcgcggtgtcacacagtt ggagatggagattgattcggagatggtagtgggttttcttcggacagggattgatgattcgcatccgctg tccttcctggtgcggttgtgccatggcttactttcaaaggactggtcagtccggatttcgcatgtgtata gagaagctaatcgtctcgcggatgggttagctaactatgcttttcttttaccgttaggttttcatttgtt taattctactccggataatgttatgtcgattgttcacgacgatgtagcggggtctgcgtacccccggaac gttcaagtgtaatttttttagtttttcagttttaataaaaatgggggttcgcccccctcttctaccaaaa aaaa1