;ID   ATLINE1_2   DNA   ; ATH   ; 5814 BP
;XX
;DE   ATLINE1_2, non-LTR retrotransposon - a consensus.
;XX
;AC   .
;XX
;DT   15-DEC-2000 (Rel. 5.9, Created)
;DT   15-DEC-2000 (Rel. 5.9, Last updated, Version 1)
;XX
;KW   non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2; 
;KW   reverse transcriptase; ATLINE1_2.
;XX
;OS   consensus
;XX
;OC   Arabidopsis thaliana
;OC   Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida;
;OC   Dilleniidae; Capparales; Brassicaceae.
;XX
;RN   [1]  (bases 1 to 5773)
;RA   Kapitonov,V. and Jurka,J.
;RL   Direct submission (December, 2000)
;XX
;CC   ATLINE1_2 is a non-LTR retrotransposon. Its individual copies are 
;CC   98% identical to each other. There are only a few copies of 
;CC   ATLINE1_2 present in the genome. ATLINE1_2 belongs to the L1
;CC   superfamily of non-LTR retrotransposons, its copies are flanked
;CC   by ~15-bp target site duplications. 
;CC   Two proteins, ATLINE1_2p1 and ATLINE1_2p2, are encoded by ORF1 
;CC   (position 2-1579) and ORF2 (position 1622-5752), respectively. 
;CC   Function and classification of the first protein is unclear, although
;CC   it is expected to be a nucleic acid binding protein analogously to
;CC   the protein encoded by L1 in mammals. The second protein contains 
;CC   the reverse transcriptase and endonuclease domains.
;CC   ATLINE1_2p1 (525 aa):
;CC   MGERGDFRVPVTSSQDAPMMDIGDGGRPSGDPLDVVESWASKVSGSNAGGRLSPERVLDKEFVMARMRLE
;CC   FPDGEDGEPVITIAQEVLDVMNGLWKQCIIVKVLGRHIALPALNKRLREMWNPKGGMHVLDLPRQFFMIR
;CC   FDLEDEYLVALTGGPWRAFGSHLMVQAWSPDFDPLRNEIVTTPVWVRLSNIPLNLYHETILLGIVQGLGK
;CC   PIKVDLTTLHHAKARFARVCVEVNLSKPLKGTITINGERYFVAYEGLSNICSGCGIYGHLVHNCPRRVVE
;CC   RTVAPVTVEVPVNVSGGTPQDDGFTVVRRTGRKGGAPENSGVATAGRLSTNLERNLRDISGRANMESIDT
;CC   SNSFGNLEEVIEGSVREVAASVDANKENMMNGNYAKKGKSVAQGKAWAPVDLMNKIRAGKKDKTAGGKVG
;CC   EANGPKPRNGNSNRPVRGLVFGPTRKEIELSGSGKRLRVDESSVYRQGGGGSPEKESRGVDGVSMSVAGE
;CC   AVAKSSLVDLAEAPQNREVEMQNGLSAVVATSLAA
;CC   ATLINE1_2p2 (1376 aa):
;CC   MDVLFWNCRGANKPLFRRTIRYMLKKNNIDILALFETHAAGDRASRICQKLGFEHTFRVDAVGHSGGVWL
;CC   LWRASVGVVTVVASSEQFIHAKIVSETETLHLIVVYAAPSVSRRSGLWGCLKTAIEGVDGPLVIRGDFNM
;CC   IVRLDERTGGNGRLSLDSLAFGEWINELMLIDMGFKGSQYTWRRGRLEENFIAKRLDRILCCPQARLRWQ
;CC   EATVTHLPVVASDHAPLYLQLSPAFRGDPKRRPFRFEAAWLLHDGFKELLQLSWNNSLSTPEALNGLQIR
;CC   LKKWNREVFGDINQRKDRLTTEIKSVQDLLDVVQTDALLRKEEELIKELDVVMEQEEVIWFQKSREKWVL
;CC   DGDRNTKYFHTSTIIRRRRNRVEMLKSDSGVWISDPQELEKLATAYYKRLYSMEDVDQEVEMLPPGGFAR
;CC   LTEREVAELTKPFSAVEVEASVRSMGKLKAPGPDGYQPIFYQDCWEVVGQSIARFVLDFFVTGILPEGTN
;CC   DVLMVLIPKLAKPSKIMQFRPINLCNVLFKTITKTMMRRLQNVMSKLVGPAQSSFIPGRLSTDNILVVQE
;CC   AVHSMRRKKGRKGWMLLKLDLEKAYDRIRWDFLHDTLVSVGLPDCWREWIMKCVAGPSMTLLWNGEKADP
;CC   FKPARGLRQGDPLSPYLFVLCMERLCHQIEISVASKEWKPINLSQGGPKLSHICFADYLILFAEASVAQI
;CC   RVIRQVLERFCVASGQKVSLEKSKIFFSDNVSRDLATLISNESGIKATKDLGKYLGMPVLHKRINKDTFG
;CC   DVVERVASRLAGWRCRFLSLAGRITLTKSVLSSIPVHTMSTISLPQSILNKLDSISRSFLWGSTMEKRKQ
;CC   HLIAWDRVCLPKQDGGLGIRCSTQMNTALLSKIGWRLLHDDVSLWSKILRSKYRVGDIHNRAWMVSKGTW
;CC   SSTWRSVVVGLKEVVFSGLSWVLGDGVDILFWKDRWMSQTPLCEVVTCELPANWEAVKVVDVWRDGVGWD
;CC   LQRLTPYFTEGMKLKLLSLVVDNVTGARDRLSWGGCSNGNSTVKSAYSFLSLDWSSKQQMARFFSRIWRV
;CC   VAHERVRVFIWLAANQVLMTNVERYRRHLCDSSLCSVCKSGEETILHILRDCPAMAGIWTRLLPARRLSS
;CC   FFSKSLLEWIYANLGEEIEINGCPWAVTFSQAIWWGWKWRCGNIFGENRKCRDRVRFIKDRALDVWKAHV
;CC   HKMGVTTRTAREERLIAWSPPRVGWFKLNTDGASRGNPGLATAGGVVRDGDGNWCYGFSLNIGICSAPLA
;CC   ELWGAYYGLNIAWERGVTQLEMEIDSEMVVGFLRTGIDDSHPLSFLVRLCHGLLSKDWSVRISHVYREAN
;CC   RLADGLANYAFLLPLGFHLFNSTPDNVMSIVHDDVAGSAYPRNVQV
;XX
;DR   [1] (Consensus)
;XX
;SQ   Sequence 5814 BP; 1486 A; 965 C; 1747 G; 1616 T; 0 other;
ATLINE1_2
gatgggtgagagaggcgattttagggttccagtgacgagttcgcaagatgcgccaatgatggatatcgga
gatggtggtcgtccctcgggagaccctctggatgtggtagagtcatgggcgagtaaggtgtccggtagca
atgctggagggaggttgtcaccggagagagtgttggataaggagttcgtcatggcgaggatgcggctgga
gtttcctgatggggaggatggagaaccagtcattacgatcgcccaggaagttcttgatgtgatgaatggt
ttgtggaagcagtgtattattgtgaaggtgttagggagacacatagcgttaccagcgttgaacaagagat
tgagggaaatgtggaatccaaagggagggatgcatgtgttggatctcccaagacagttctttatgattcg
ctttgatctggaagatgaatacttggtggctcttaccggtggaccatggagagcgtttgggagccaccta
atggtgcaagcttggtctccggactttgatccgttgaggaatgagatagtaacaacgccagtgtgggttc
gtttgtcgaatattcctttgaatctgtatcacgaaacgatcctgttgggtattgttcagggattagggaa
acctatcaaagtggatctcacgacattgcatcatgcgaaagctagatttgcacgagtatgtgtggaggtg
aatctatcaaaacctctcaaaggtacaataacgattaatggagagagatatttcgttgcttatgagggtc
tatcaaacatttgctcgggatgtgggatatatggtcatttggtgcacaattgtcctaggagagtggtgga
gaggacagtggcgcctgtgaccgtggaggttccagtgaatgtttccggtgggacgccgcaggatgacggt
tttacggtggttcgaagaacgggccgtaagggaggagcaccggaaaatagtggggtggcaacagctggtc
ggttgagcacaaatttggaaaggaatctgcgtgatatttctgggcgggcgaatatggaaagcattgatac
ctctaacagttttgggaatttggaggaagtgattgaggggagtgtaagggaagttgctgcatcagtggat
gcgaataaagaaaatatgatgaatggtaattatgctaaaaaagggaagagtgttgcacaagggaaggctt
gggcgccagtggatttaatgaataaaattagggccggaaaaaaagataaaacggctggaggcaaggtagg
agaggctaatgggccaaagcccagaaatggtaactccaacaggcctgtgcgtgggttagtttttggcccg
acaaggaaggaaattgagttatctggctcggggaaaaggctgagagtggatgaatcttcggtgtatcgac
agggtggaggtggttcaccggagaaggagagtagaggagttgatggcgtgtcaatgtctgtggcgggaga
ggctgtggcgaagagttctcttgtggatttggcagaggcgccacagaatagagaagtggaaatgcagaat
gggttgtcggcggttgtggcgacctctcttgcggcatgacagcttgccccggtagttcaagcttttttca
atctctttatgatggatgttttattttggaattgccggggggcaaataaacctcttttccgaagaacaat
acggtacatgttgaagaagaataacatcgacattctggctctgtttgaaacacacgctgcaggagataga
gccagcagaatttgtcagaagttgggtttcgagcacacgttccgggtcgatgcagtggggcatagtggtg
gagtatggttgctgtggagagctagtgtgggagtggttacagttgtggcctcgtctgaacagtttattca
cgccaagattgtgagtgagacagagaccttgcatttgattgtggtttacgcagctccatcagttagtcga
agaagcgggctatggggatgtttgaaaactgcgattgaaggagtagatggtcctttagtgattcgtggtg
atttcaacatgattgtaagacttgatgagcggacaggggggaatggacgtttgtctctggactctttagc
attcggtgaatggattaatgagcttatgttaattgatatgggttttaaagggagtcagtatacttggaga
agaggtagattggaagagaattttattgctaagcgcttggaccggattctttgctgtccccaggcacgct
tgagatggcaagaggcgacagtaactcacctccccgttgttgcctcagatcatgcgcctctttatctgca
actttcgcctgcattccgaggtgacccgaaacgaagaccattccggttcgaggcagcttggttactacat
gacgggtttaaggagcttcttcaactctcttggaataacagtctatcaacgcctgaagctcttaatgggt
tgcaaattaggctgaaaaagtggaatcgggaggtgtttggtgatattaatcaacgtaaggatcgattaac
cacggaaatcaaatcggtacaagacttgctcgatgttgttcaaactgatgctctgttacgcaaagaagaa
gagctgattaaagagttagatgttgtcatggagcaggaggaagtcatttggttccaaaagtcacgggaga
aatgggtgttagacggagatcgaaatactaaatactttcacacatctaccattattcgaaggagaaggaa
tcgtgttgagatgttgaagagtgatagtggtgtatggatctcagacccgcaggagctggagaaattggcg
actgcttattataaaagattatattccatggaggatgttgatcaagaggtggaaatgctaccaccgggag
gttttgctaggcttacagagagagaagtagcagagcttaccaaaccgttctcggcagttgaggtggaagc
ttcagtccggagtatggggaagttgaaagctccggggccagatggataccaaccaatcttttaccaagac
tgttgggaggtggtgggacagtcaatagcccgctttgtgttggatttctttgtaacagggatcttacctg
aaggaaccaatgatgtgttgatggtgcttatccctaagctagctaagccgagtaaaattatgcaattcag
accaatcaatttatgtaatgttttgtttaaaacaattacaaaaaccatgatgcggcgtttacagaatgtg
atgagtaagctcgttggtccagctcagtcaagcttcataccgggcagattgagtaccgacaatattctgg
tggttcaagaagcagtccattcaatgcggaggaaaaaagggcgaaaaggttggatgctcctaaaactgga
tttagagaaagcctatgatcgtatacgctgggattttttacatgatactttagtgtctgtgggactgcca
gattgttggagagagtggatcatgaagtgtgttgcaggaccatctatgactttgttgtggaatggggaga
aggcagatccgtttaaaccggctagagggttgagacaaggtgacccgctgtcaccatatctctttgttct
ttgtatggagcggttgtgccatcagatagagatttcagtagcttcgaaggagtggaaaccgattaatctc
tctcagggagggccgaagttatcacatatttgtttcgcggattatcttattctttttgccgaagcatcgg
tggcacaaattcgggttatcagacaagttctagaacgattctgcgtagcgtcggggcagaaggttagtct
cgaaaaatcaaagatcttcttctctgataatgtgtctcgggatttagcgactcttatcagtaatgaaagc
ggcattaaggcaactaaagatctgggcaaatatttgggaatgccagttcttcataagcggattaataaag
acacatttggtgatgtggtagagcgagtagcttcaaggctggcgggttggagatgtcgcttcctcagtct
tgcgggtcggattacacttactaaatcagtcctctcatccattccggttcataccatgtcaacaatttcg
ctgccacagtctattttgaataaattagacagtatctcacgctcttttttatgggggagtacaatggaaa
aacgaaaacaacatcttattgcttgggatcgtgtgtgcttgccaaaacaggatggagggttaggcatccg
gtgctctacacagatgaatacggctcttctctcgaaaattgggtggcgtttacttcatgatgatgtgagt
ctatggtcgaaaatcttgagaagcaagtaccgtgtcggtgatattcataacagagcgtggatggtgtcta
aaggtacatggtcttctacatggaggagcgttgtcgtgggtctgaaggaggtggtcttctcgggtttgag
ctgggttctaggggatggtgttgatatcctcttctggaaagataggtggatgtcgcagaccccgttatgt
gaggtggtaacatgcgaattaccggcaaactgggaggcggttaaagttgtggatgtttggagagatggag
tgggctgggatttacagaggcttacgccgtatttcacagaaggtatgaagctcaagttactatctttggt
ggtggataatgtgacaggggctagggatcgtttgtcttggggagggtgttccaatggtaattctacagtc
aagtcagcttactcttttttaagcttggactggagttcgaagcagcagatggcgcgttttttctctagaa
tttggcgtgttgtagctcatgaaagagttcgagtgttcatatggttggctgcaaatcaggtgttgatgac
gaatgttgagagatacagacggcatttatgtgattcgagcttatgttcagtatgcaagagtggggaggag
acaattctacacattttacgggactgtccggcgatggcgggaatttggacccgcttgttaccagcacgga
gactctcttctttcttttcgaaatcgctgttagaatggatctatgcaaatttaggagaggagatagagat
taatggttgtccatgggcggtcactttttcgcaggccatatggtgggggtggaaatggcgttgcggtaac
atctttggcgagaacaggaagtgtcgggatcgggttcgttttattaaggatcgtgcgttggatgtttgga
aggcgcatgtgcacaaaatgggagtgacgacgcggacagctagggaggagagattgattgcgtggtctcc
gccaagggtgggttggtttaagctcaatactgatggggcttcgcgtggtaacccgggactagctacagca
ggtggagtagttcgagacggggatggaaattggtgttatgggttttcgttgaatattgggatttgttcgg
ctccgcttgcggaactatggggagcatattacggtttaaatatcgcttgggagcgcggtgtcacacagtt
ggagatggagattgattcggagatggtagtgggttttcttcggacagggattgatgattcgcatccgctg
tccttcctggtgcggttgtgccatggcttactttcaaaggactggtcagtccggatttcgcatgtgtata
gagaagctaatcgtctcgcggatgggttagctaactatgcttttcttttaccgttaggttttcatttgtt
taattctactccggataatgttatgtcgattgttcacgacgatgtagcggggtctgcgtacccccggaac
gttcaagtgtaatttttttagtttttcagttttaataaaaatgggggttcgcccccctcttctaccaaaa
aaaa1