;ID ATLINE1_4 DNA ; ATH ; 6700 BP ;XX ;DE ATLINE1_4, non-LTR retrotransposon - a fossil. ;XX ;AC AC006248 ;XX ;DT 30-NOV-2001 (Rel. 6.3, Created) ;DT 30-NOV-2001 (Rel. 6.3, Last updated, Version 1) ;XX ;KW non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2; ;KW reverse transcriptase; ATLINE1_4. ;XX ;OS thale cress ;XX ;OC Arabidopsis thaliana ;OC Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; ;OC Dilleniidae; Capparales; Brassicaceae. ;XX ;RN [1] (bases 1 to 6700) ;RA Kapitonov,V. and Jurka,J. ;RT ATLINE1_4, a non-LTR retrotransposon. ;RL Repbase Reports 1:(3) p. 33 (2001) ;XX ;CC ATLINE1_4 is a non-LTR retrotransposon. Its individual copies are ;CC ~88% identical to each other. There are only 3 copies of ;CC ATLINE1_4 present in the genome. ATLINE1_4 belongs to the L1 ;CC superfamily of non-LTR retrotransposons, its copies are flanked ;CC by ~15-bp target site duplications. ;CC Two proteins, ATLINE1_4p1 and ATLINE1_4p2, are encoded by ORF1 ;CC (position 389-2380) and ORF2 (position 2551-6609), respectively. ;CC ATLINE1_4p2 contains the reverse transcriptase and endonuclease ;CC domains. ;CC ATLINE1_4p1 (664 aa): ;CC MSKRIRPSWYRESPPKQPPFAFEPEEEDDVVILPQVDNSALLARLHLSLVGRMFHQGGRSTKALLSFLPK ;CC ENIWDVEGRVRGVSLGDARFQFFFESEVDLQKVLNKRPCHFNKWSFALERWEPHVGTSFPNIMTFWVRTE ;CC GIPAEFWDEEVLRNFGNSLGLVRRVDPSKGRILISVTADVPLRFNKNAQLPSGTVVKVKLSYEKLFRWCS ;CC YCRRICHELEQCPLLDAEQKAVLSAEESQRNLRLSLRDGDSSQARLPLQSFPESNRSRSERNHLPLLNGP ;CC PSSRPYHSRGVIRREENNLASRSGRNRSHRQPYPASLNQYPAEHASKLNRPHGAVSHKRPASPVVTRNAG ;CC VDDGRKRRFGDSFSKEKSSAPPNKQSSPLLSDSQLTLSDTVLAPSRAKQITSSPTYVRERPFRLNLSKKA ;CC SALEKGKGKVVEHPTPLLGESSVVGSSAKKSLNFEPSEPAPKNLDTPISVTSKSLEPQLEKRKSWYDMTV ;CC EEDEATARSLESGPDTILAAKFSQVVSFASPSVVSPSLSVLPSPAAPEDEWNESLNPLSEALNLDWTEED ;CC EAAYHLADDLDVDADDLLSEELQESLQDQSGLVIPGSHVAPISELQPEKLKSLIVSSLQSEEVGPSIAIP ;CC RRKDSKKKVVPHSRKAQLNSGLCLNLASKKIHHI ;CC ATLINE1_4p1 (1354 aa, two false frame-shifts are corrected based on ;CC alignments with related proteins): ;CC MRLISWNCQGVGPKTTSRRLEEMCRMYSPGFLFLSETKNDLVYLQNVQVSLGFDCLKTVEPIGNSGGLAL ;CC FYSRDYPVKFIYVCDRLIDIETIIDGNRVFITFVYGDPVVQYRELVWKRLTRIGIVRSEPWFMIGDFNEI ;CC IGNHEKRGGKKRSESSFLPFCCMIENCGMIDFPSTGSLFSWVGKRSCGVAGRKRRDLIKCRLDRAMGNEE ;CC WHSIYSHTNVEYLQHRGSDHKPLLASIQNKPYRPYKHFIFDKRWINKPGFKESVQEGWAFPSRGEGVPFF ;CC QKIKNCRQTISIWKKSNKTNTEKLILELHSQLDLAYEDENFSTEDLLALKWKLCQAYRDEEIFWKLKSRE ;CC IWLQLGDMNTKFFHASVHKQRRARNKILGLLNQDGLWVDNEVGVEHLAENYFETLFTTSDPQVFDSALQE ;CC VPVLITEEMNKSLTKVISPEEVKRALFSLNPDKAPGPDGMTAFFYQHYWDLTGPDLIKLVQNFHSTGFFD ;CC ERLNETNICLIPKTERPRKMAEFRPISLCNVSYKVISKVLSSRLKRLLPELISETQSAFVAERLITDNIL ;CC IAQENFHALRTNPACKKKYMAIKTDMSKAYDRVEWSFLRALMLKMGFAQKWVDWIIFCISSVSYKILLNG ;CC SPKGFIKPSRGIRQGDPISPFLFILCTEALVAKLKDAEWHGRIQGLQISRASPSTSHLLFADDSLFFCKA ;CC DPLQGKEIIDILRLYGEASGQQLNPDKSSVMFGHEVDNSIRNTIKVSLGIHKDGGMGSYLGLPKQIHGSK ;CC TQVFSFVRDRLQKRINGWTSKFLSKGGKEILIKSVAQALPTYVMSCFLLPKAIRSKLSSVVANFWWKTRE ;CC ESNGIHWIAWDKLCTPFSDGGLGFRTLEEFNLVLLAKQLWRLIRFPNSLLSRVLRGRYFRYSDPIQIGKA ;CC NRPSFGWRSIMAAKPLLLSGLRRTIGSGMLTRVWEDPWIPSFPPRPAKSILNIRDTHLYVNDLIDPVTKQ ;CC WKLGRLQELVDPSDIPLILGIRPSRTYKSDDFSWSFTKSGNYTVKSGYWAARDLSRPTCDLPFQGPSVSA ;CC LQAQVWKIKTTRKFKHFEWQCLSGCLATNQRLFSRHIGTEKVCPRCGAEEESINHLLFLCPPSRQIWALS ;CC PIPSSEYIFPRNSLFYNFDFLLSRGKEFDIAEDIMEIFPWILWYIWKSRNRFIFENVIESPQVILDFAIQ ;CC EANVWKQANSKEVATEYPPPQVVPANLPPTRNVCQFDASWHLKDTLSGHGWVLVDQDIVLLLGLKSARKS ;CC LSPLHAEVDSLLWAMECMISLGVSDCSFASDSADFISLLENPSEWPTFVAELATFSSLVCFFPSFSIKFF ;CC SRIYNVRADCLSKKARARNSLFSM ;XX ;DR Positions 35574 28875 Accession No AC006248 GenBank (rel. 124.0) ;XX ;SQ Sequence 6700 BP; 1793 A; 1441 C; 1452 G; 2014 T; 0 other; ATLINE1_4 gacggattatattataccaaaatgttatctttctttcttgactcttttctctttcaggcacagtctccga gataactgaaagaggcaaatgggttcaaaattagtagatctgaactaatttttttagctattttgggaaa ttttccctcttttcttgccggagtctcctttttctcttctcccctcatggttttcttctctgtttctttt agccacagactttgcttctcatctgctctctctcaagaattcttctgctgagcttttgtgaatgcaaaga aaggctctcatctcctctgaagtttctcctgcaatagcagagtttttgcttgtttctagcttttagctaa gtctgtgtgtgttattacacatctctcctctgcttaccatgtcgaagagaattcgcccaagctggtacag agagtcccctccaaaacaacctccttttgcttttgaacctgaagaagaagatgatgttgtcattctccct caagtggataattcagcgctccttgctcgtctccatctcagcctcgtggggagaatgttccaccaaggag gacgcagtactaaagcccttctctcttttctccctaaggaaaacatatgggatgtggagggtcgagtccg aggcgtttctctcggagatgctcgatttcaattcttttttgaatctgaagttgacctccaaaaagttctc aacaagagaccttgtcacttcaacaagtggtctttcgcattggagagatgggaaccccatgttggaacct ccttcccaaacatcatgactttttgggttagaactgaaggcatccctgctgagttttgggatgaagaagt gttaagaaattttggtaactctcttggtttagttcggagagttgatccctctaaaggaagaattctaatc tctgtgacagcagatgttcctctaagattcaacaagaatgcacaactcccctctggaacagttgtcaaag tgaaactttcttatgaaaagctctttcgctggtgctcgtattgtcgccgtatttgtcacgaactagagca gtgtcctttactcgatgcagaacaaaaagctgttctctcagctgaggaatctcaacgtaaccttcgactc tctcttagagatggagatagttctcaagctcgacttcctctacaatcttttccagagtctaaccgttcta gatctgagagaaatcacttacccttgctgaatggtcctccttcctcaagaccatatcactctcgtggagt catcagaagggaagaaaacaacttagcttcaaggtctggtcgcaatcgatcccaccgacagccttaccct gcttccctcaatcagtaccctgctgagcatgcttccaagctgaatcgacctcatggtgctgtctcccaca aacgcccagcctcccctgttgtcactagaaacgccggtgttgatgacggaaggaaacgtcgttttggtga ttctttctccaaggaaaaaagctctgcacctcctaacaaacaatcatcaccactcctctcggattcacaa ttgacgctttcagacacggtcttggctccctcaagggctaaacagataacctcttccccgacttatgtca gggaaagaccattccggttaaatctctctaagaaggcttccgctttagagaaaggtaaaggaaaggttgt ggagcatcctactccgttgctaggtgaatcctctgttgttgggagctcggctaagaaatctcttaacttt gaaccctctgaaccggctccgaagaaccttgatacgcccatctctgtaacaagcaagtcgctggagccac aactggaaaaacgaaagagttggtatgatatgactgtagaagaagatgaagctactgcgagaagtcttga gtccggtccggatacaatacttgcagccaaattctcgcaggtggtctctttcgcaagtccatctgtcgtc tctccatctctctctgttcttccatctcctgctgctccggaggatgagtggaatgagtctctcaatccgc tctcggaagctctaaatttggattggactgaagaagacgaggctgcttaccatcttgccgacgacctaga tgtggatgctgacgatttgttgagtgaggagcttcaggagagtcttcaggatcaatccggtttggtgatt ccggggagccatgtagcacccatctctgagcttcagccggaaaaattgaagagcttgatcgtgagctcgc ttcaatcggaggaggtggggccgtctattgctatccctaggaggaaggactcaaagaagaaagttgtgcc ccattccagaaaagcccaactaaattcgggcctttgtttgaacttggcttcaaaaaaaattcatcacatt tagggtttatctcccgagaaaaggctatccaacaaatccggcccatctaggccgagatcctctaggagaa ccacgacgggtaaaagaaatggagtgccgtctacttccaagactaagagtccggttttgagtgttgtttc ggaagggatcccaccttccgacaacaaaacatgaggcttatttcgtggaactgtcaaggggtgggaccga aaactacatcccgtcgactggaagagatgtgtcggatgtactctccaggttttctttttttatcggaaac taagaatgatcttgtgtatcttcagaatgtacaagtttcattaggctttgattgtttaaaaactgttgaa cctatcggtaacagtggtggtcttgctcttttttattctcgagactacccggttaagtttatctatgtat gtgatcgtttgattgatattgaaacaattattgatggaaaccgcgtatttattacttttgtttacggtga tccggtagttcaataccgtgaattagtttggaaacgtttgacccgtattggtattgttcggtctgagcca tggtttatgatcggtgattttaatgagattatcggtaatcatgaaaaaagaggaggaaaaaagagatctg aatcctctttcctccctttctgttgtatgattgaaaactgtggaatgattgatttcccctctacagggag tttattttcttgggtcggaaaacgaagttgtggagtggcgggccgaaagagacgtgacctcattaagtgt cggttagatagagctatgggaaatgaagaatggcatagtatttattcccacacgaatgtggagtacctgc agcataggggttcagatcataaacccctacttgcatccatacaaaacaaaccttatcgcccttataaaca ttttatttttgataaacgatggataaataaaccaggttttaaggaatccgtccaagaaggatgggctttc ccctctcgaggagagggtgttccgttttttcagaagattaaaaattgtagacaaactatatctatctgga aaaaatccaacaaaacaaatacggagaagcttattcttgagctacattcccaacttgatttggcgtatga agatgagaatttctctactgaagaccttcttgccctaaaatggaagctctgccaagcgtacagagacgag gagattttttggaaacttaagagtagggagatctggctccaacttggagacatgaacacaaaattttttc atgcgtccacaagcaaaggagggctaggaataaaattctaggtttgctaaatcaagatggcctttgggtg gataatgaggtgggagttgaacatcttgcagagaattactttgagactctttttaccacgtcagacccac aagtttttgattccgctctccaggaagtccccgtgctaattactgaagaaatgaataaatctctcacaaa ggtgatctcaccagaggaagtcaagcgagctcttttttcgctaaatcctgataaagctccagggccggat gggatgacagctttcttttatcagcactattgggaccttacggggcctgatcttattaaacttgtccaaa atttccattccaccggtttctttgatgaacggctgaatgagaccaatatctgtttgatcccaaaaacaga aagacctaggaaaatggcagaatttcgccctatcagtctttgcaatgtcagttacaaagttatatccaaa gttctaagttctcgtctaaaacgtcttcttccggaattaatatccgagacgcaatctgcctttgtagcgg agcgcttgattactgataacattcttatcgctcaagaaaattttcatgcccttcgaacaaatccagcctg caagaagaaatacatggccattaaaactgatatgagcaaggcttatgatcgagttgaatggtctttcttg cgagctctaatgttgaaaatgggtttcgctcagaaatgggtagattggataattttctgcatctcttcgg tatcttacaaaattcttttaaatggatcacctaaagggtttatcaaaccatcgagaggtattcgtcaggg cgaccctatctctccgtttctgtttatcttgtgtacagaagctcttgtagcaaagttgaaagatgcagaa tggcatggccgcattcaaggacttcagatttcgagagctagcccatcaacatcccatcttctttttgctg atgatagcctatttttctgtaaagctgaccctttacaaggtaaagagattattgatattctccgcctata tggggaggcttcaggacaacagttgaatcccgacaagtcctcggtaatgttcggtcatgaggtggataac tctatcagaaataccattaaagtatctcttgggattcataaggatgggggtatgggatcttatttgggac tcccaaaacaaattcatggctcgaagactcaagttttttcctttgtaagagatagacttcaaaaaagaat aaatggatggacgtccaaattcttgtctaaaaggaggcaaagagatacttatcaagtctgtggcgcaggc tcttccgacatatgttatgtcgtgtttccttctccccaaagctatccgctcaaaactaagtagtgttgtg gctaatttctggtggaagacaagggaggaaagtaatggtattcattggatagcttgggacaagctttgta caccattctctgatggtggtctcgggtttagaacgttggaggaatttaatcttgtgttactagcaaagca actttggcgcctaattagatttccaaattctctcctaagcagggtgttgaggggaaggtactttaggtat agtgatcctattcaaattggaaaggctaatcgtccttcgtttggatggagaagcattatggctgctaaac cactccttctctcgggtctaaggagaacaataggatctggaatgctaacccgagtgtgggaagacccttg gatcccttcttttccaccaaggcctgcaaaaagtattcttaacataagggacacacacctctatgttaat gatttaattgatcccgtgacaaagcaatggaaattaggtagacttcaagagttggttgacccttccgata ttcctttgattttaggaattcggccgagtcgtacttacaaaagtgatgatttcagctggtcatttaccaa atctgggaattatactgtaaaatccggttattgggccgcgagagatctctctcgtcctacttgtgacctc ccttttcagggacctagtgtttcggcgcttcaggcacaagtatggaaaattaaaactacgcgtaaattta aacatttcgagtggcaatgcctttcgggctgtctagcgacaaaccaaagacttttctcccgtcacattgg taccgaaaaagtttgccctcgatgtggtgcagaggaagagtctataaaccatcttctttttctctgtccg ccttcgagacaaatctgggccttgtcccctatcccgtcttcggagtacatttttccccggaattctctct tctataattttgatttcttactctcgcgaggaaaagaatttgatatagcagaagatattatggagatctt tccatggatactttggtatatttggaagagtcgtaaccggtttattttcgaaaacgttattgaatccccg caagttattctggattttgcaattcaagaagcaaatgtttggaaacaagcaaattcaaaggaggtggcta cggagtaccctcctcctcaggtggtccctgctaatcttccccctactcggaatgtttgccagtttgacgc atcttggcatctaaaggacaccttgagtggacatggatgggtcttggtagaccaagacatcgtcctactt ttgggccttaagagtgcacggaagagtctatcaccactacacgctgaagtagattccttgttgtgggcaa tggaatgcatgatctccttaggtgtctcggattgctctttcgcatctgacagtgctgattttatctctct tcttgaaaatccttcggagtggcctacctttgttgctgagttggcgacttttagttcccttgtctgtttt ttcccttcttttagtattaagttcttttctcgtatttacaatgtgcgggccgattgcctttcaaaaaaag cacgagcccgcaatagtcttttttccatgtaagcaagtcggttcctgattggctctctctaaaaaagagt atctttccaattacttaatagaaatggtgtttcgacggaagaaaaaaaaa1