| 2001, Volume 1, Issue 3 |
| March 31, 2001 |
| Copyright © 2001-2007 - Genetic Information Research Institute, Mountain View, California |
| ISSN# 1534-830X |
| Page 33 |
ATLINE1_4 |
|||
|---|---|---|---|
ATLINE1_4, non-LTR retrotransposon - a fossil. |
|||
|
Submitted: 31-Mar-2001 |
Accepted: 31-Mar-2001 |
||
|
Key Words: non-LTR retrotransposon; L1 superfamily; poly(A) tail; ORF1; ORF2; reverse transcriptase; ATLINE1_4 |
|||
|
Source: thale cress |
Organism: Arabidopsis thaliana |
Taxonomy: Eukaryota; Plantae; Embryobionta; Magnoliophyta; Magnoliopsida; Dilleniidae; Capparales; Brassicaceae |
|
| [1] |
Authors: Kapitonov,V. and Jurka,J. |
||
|
Title: ATLINE1_4, a non-LTR retrotransposon. |
|||
|
Journal: Repbase Reports 1:(3) p. 33 (2001) |
|||
Abstract: ATLINE1_4 is a non-LTR retrotransposon. Its individual copies are ~88% identical to each other. There are only 3 copies of ATLINE1_4 present in the genome. ATLINE1_4 belongs to the L1 superfamily of non-LTR retrotransposons; its copies are flanked by ~15-bp target site duplications. Two proteins, ATLINE1_4p1 and ATLINE1_4p2, are encoded by ORF1 (positions 389-2380) and ORF2 (positions 2551-6609), respectively. ATLINE1_4p2 contains the reverse transcriptase and endonuclease domains.

ATLINE1_4p1 (664 aa):
MSKRIRPSWYRESPPKQPPFAFEPEEEDDVVILPQVDNSALLARLHLSLVGRMFHQGGRSTKALLSFLPK
ENIWDVEGRVRGVSLGDARFQFFFESEVDLQKVLNKRPCHFNKWSFALERWEPHVGTSFPNIMTFWVRTE
GIPAEFWDEEVLRNFGNSLGLVRRVDPSKGRILISVTADVPLRFNKNAQLPSGTVVKVKLSYEKLFRWCS
YCRRICHELEQCPLLDAEQKAVLSAEESQRNLRLSLRDGDSSQARLPLQSFPESNRSRSERNHLPLLNGP
PSSRPYHSRGVIRREENNLASRSGRNRSHRQPYPASLNQYPAEHASKLNRPHGAVSHKRPASPVVTRNAG
VDDGRKRRFGDSFSKEKSSAPPNKQSSPLLSDSQLTLSDTVLAPSRAKQITSSPTYVRERPFRLNLSKKA
SALEKGKGKVVEHPTPLLGESSVVGSSAKKSLNFEPSEPAPKNLDTPISVTSKSLEPQLEKRKSWYDMTV
EEDEATARSLESGPDTILAAKFSQVVSFASPSVVSPSLSVLPSPAAPEDEWNESLNPLSEALNLDWTEED
EAAYHLADDLDVDADDLLSEELQESLQDQSGLVIPGSHVAPISELQPEKLKSLIVSSLQSEEVGPSIAIP
RRKDSKKKVVPHSRKAQLNSGLCLNLASKKIHHI

ATLINE1_4p2 (1354 aa; two false frame-shifts are corrected based on alignments with related proteins):
MRLISWNCQGVGPKTTSRRLEEMCRMYSPGFLFLSETKNDLVYLQNVQVSLGFDCLKTVEPIGNSGGLAL
FYSRDYPVKFIYVCDRLIDIETIIDGNRVFITFVYGDPVVQYRELVWKRLTRIGIVRSEPWFMIGDFNEI
IGNHEKRGGKKRSESSFLPFCCMIENCGMIDFPSTGSLFSWVGKRSCGVAGRKRRDLIKCRLDRAMGNEE
WHSIYSHTNVEYLQHRGSDHKPLLASIQNKPYRPYKHFIFDKRWINKPGFKESVQEGWAFPSRGEGVPFF
QKIKNCRQTISIWKKSNKTNTEKLILELHSQLDLAYEDENFSTEDLLALKWKLCQAYRDEEIFWKLKSRE
IWLQLGDMNTKFFHASVHKQRRARNKILGLLNQDGLWVDNEVGVEHLAENYFETLFTTSDPQVFDSALQE
VPVLITEEMNKSLTKVISPEEVKRALFSLNPDKAPGPDGMTAFFYQHYWDLTGPDLIKLVQNFHSTGFFD
ERLNETNICLIPKTERPRKMAEFRPISLCNVSYKVISKVLSSRLKRLLPELISETQSAFVAERLITDNIL
IAQENFHALRTNPACKKKYMAIKTDMSKAYDRVEWSFLRALMLKMGFAQKWVDWIIFCISSVSYKILLNG
SPKGFIKPSRGIRQGDPISPFLFILCTEALVAKLKDAEWHGRIQGLQISRASPSTSHLLFADDSLFFCKA
DPLQGKEIIDILRLYGEASGQQLNPDKSSVMFGHEVDNSIRNTIKVSLGIHKDGGMGSYLGLPKQIHGSK
TQVFSFVRDRLQKRINGWTSKFLSKGGKEILIKSVAQALPTYVMSCFLLPKAIRSKLSSVVANFWWKTRE
ESNGIHWIAWDKLCTPFSDGGLGFRTLEEFNLVLLAKQLWRLIRFPNSLLSRVLRGRYFRYSDPIQIGKA
NRPSFGWRSIMAAKPLLLSGLRRTIGSGMLTRVWEDPWIPSFPPRPAKSILNIRDTHLYVNDLIDPVTKQ
WKLGRLQELVDPSDIPLILGIRPSRTYKSDDFSWSFTKSGNYTVKSGYWAARDLSRPTCDLPFQGPSVSA
LQAQVWKIKTTRKFKHFEWQCLSGCLATNQRLFSRHIGTEKVCPRCGAEEESINHLLFLCPPSRQIWALS
PIPSSEYIFPRNSLFYNFDFLLSRGKEFDIAEDIMEIFPWILWYIWKSRNRFIFENVIESPQVILDFAIQ
EANVWKQANSKEVATEYPPPQVVPANLPPTRNVCQFDASWHLKDTLSGHGWVLVDQDIVLLLGLKSARKS
LSPLHAEVDSLLWAMECMISLGVSDCSFASDSADFISLLENPSEWPTFVAELATFSSLVCFFPSFSIKFF
SRIYNVRADCLSKKARARNSLFSM
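The ORF coordinates given in the abstract can be checked against the stated protein lengths with simple arithmetic. The sketch below assumes the positions are 1-based and inclusive on the consensus sequence, which the record does not state explicitly. The helper name `orf_stats` is hypothetical.

```python
def orf_stats(start, end):
    """Return (nucleotides, whole codons, leftover nt) for a coordinate range.

    Assumption: start and end are 1-based and inclusive.
    """
    length = end - start + 1
    return length, length // 3, length % 3


# Coordinates as quoted in the abstract.
orfs = {"ORF1": (389, 2380), "ORF2": (2551, 6609)}

for name, (start, end) in orfs.items():
    nt, codons, leftover = orf_stats(start, end)
    print(f"{name}: {nt} nt, {codons} codons, {leftover} nt left over")
```

Under that assumption ORF1 spans 664 codons, matching the 664 aa quoted for ATLINE1_4p1. ORF2 spans 1353 codons, while the ORF2 protein is quoted at 1354 aa after frame-shift correction, so the two figures need not coincide exactly.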
|
|||
|
Derived: Positions 35574-28875, Accession No. AC006248, GenBank (rel. 124.0) |
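The "Derived" coordinates list the larger position first. A minimal sketch of how such a region could be pulled from a sequence string follows. It assumes 1-based inclusive coordinates, and it assumes that a first position larger than the second means the element lies on the complementary strand; the record states neither. The function names are hypothetical, and the input sequence must be obtained separately.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def region_length(first, second):
    """Length of a 1-based inclusive region, whichever end is listed first."""
    return abs(first - second) + 1


def extract_region(seq, first, second):
    """Extract a 1-based inclusive region from seq.

    Assumption: first > second means the region is read on the
    complementary strand, so the slice is reverse-complemented.
    """
    low, high = sorted((first, second))
    region = seq[low - 1:high]
    return reverse_complement(region) if first > second else region


# Span implied by the coordinates quoted in the record.
print(region_length(35574, 28875))
```

With the quoted coordinates the span is 6700 bp, which is long enough to contain ORF2 ending at position 6609.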
|||