Repbase Reports

2003, Volume 3, Issue 2
February 28, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 15

CR1-4_AG

CR1-4_AG is a CR1-like non-LTR retrotransposon - a consensus sequence.

Submitted:
28-Feb-2003
Accepted:
28-Feb-2003
Key Words:
non-LTR retrotransposon; CR1 clade; DNA/RNA-binding; PHD finger; AP endonuclease; reverse transcriptase; CR1-4_AG
Source:
consensus
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
CR1-4_AG, a subfamily of CR1-like non-LTR retrotransposons from African malaria mosquito.
Journal:
Repbase Reports 3:(2) p. 15 (2003)
Abstract:
CR1-4_AG is a young family of CR1-like non-LTR retrotransposons. The CR1-4_AG consensus sequence was reconstructed from a multiple alignment of ~50 copies identified in the sequenced portion of the genome. These copies diverge from the consensus sequence by ~1%, which indicates that CR1-4_AG transposed less than 1 million years ago. The 3' terminus of CR1-4_AG consists of a TAAA microsatellite. CR1-4_AG encodes two proteins: the 349-aa CR1-4_AG-ORF1p (positions 363-1409) and the 965-aa CR1-4_AG-ORF2p (positions 1430-4324). CR1-4_AG-ORF1p is a DNA/RNA-binding protein that contains a PHD domain (positions 3-40). CR1-4_AG-ORF2p contains the AP endonuclease and reverse transcriptase domains.

CR1-4_AG-ORF1p:
MDCAICSTTINKDPVVCIGNLPFSECNSAFHPECIKLAATCVKEVARNRGLCWMCEKCRDSRSDLFSSIS
CLMNTLKDELKNAIRSELDQRISQLDPNRVMPQEREKIAPTVSTISLTDKTFHTSTDTPMSPTPTPVKQN
SLPHSQSRMLSDNEHINPTQAILHTGTANDHINTDTIQFIPAPEPKVWMFVTRIAPTVTEENMKMFILGR
LKCTDCSVKCVIPRGRVTSSLKYVSFKIGIPSEFGELAFSPSTWPCGFVYRQFEFHQRTQKQFTPTLPVS
CFPASNNSTARSFSTTNFMHNDVNCINVIPPTHTTQHSPSPSHHLKNANSPETHLTQNNSSGSTFLSQH

CR1-4_AG-ORF2p:
MEPVLNTFNIFYQNVRGLRTKTSECFANTAIADWDVIVLTETWLDDSFPSELLFDNNRFNTFRTDRSAAN
SNKCRGGGVLVAINANYASSLCSTNTSTIECLWVRVKVLNVSLIIGSFYLPPDQSANMDTINAFCNSLHL
TREKYKNDFFILFGDFNQPNLKWDINGKFPTLNLMLTRLSPTSQALLDELSFEGLRQLNTVLNHNNNMLD
LVFANDKVTDYMRPIELCIESIVEPDGHHPALLTYFTLPQYSVPSSKPPRQADFNFRRTNFTDLVSALNQ
INWDSIADHDDINDSVAEFSSQMNELYEQFIPRFNVRAHPPWTNSALRLAKRRRSRALKKLHRLKNSTNQ
INFARASKIYKQLNRTAYANYVRKIEINIKRHPTSFWKFAKDKESCGRLPSSMQFEGNTITGDEEFCNAF
ASYFSSVYTNNSSVPSNSTSALSFINDEVNLCTPLINDDEVESAISLLKLSYAPGPDNIPSAILINCKAA
LIPILTKLFNKSLQSKCFPRLWKSSWMFPVYKKSDKSNVCNYRGISMLCACSKLFEKIMSRHMLQAFSPL
ISNVQHGFMPKRSIETNLIYLLNFCHSYIDKGLQVDVIYTDFCAAFDKVNHFLLLSKLSKYGVHTNVVEW
LRSYLTDRCINVKIGTSLSATFHNLSGVPQGSILGPLLFIIFINDVVFAIPHVKLLLYADDLKMFLPVKY
SDDCEMLQDSLNYFSAWCFNNEMLLNVSKCSCITFSKKKNPIIYNYKINEDSVPRFSQVRDLGVILDSKL
SLSSHYQTIVTKALKLLGFVLRVSADFKDPFSLKTLYCSLVRPILEFASVVWCPHQITYIDKIEKIQKKI
TRVMFHRLPWSNQIPRPSYNVRCLLFGLETLQHRRTTAQITFMHKLLIGDFDAPDILNFICFSTPSRGLR
SRELLRSPFRSTGFGANDPLLKMIDVYNRLGLSADFNQSVSQLRQHIQVSSRAII
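
The ORF coordinates quoted in the abstract can be checked against the stated protein lengths with simple arithmetic. The Python sketch below is illustrative only and is not part of the original record. It assumes the positions are 1-based and inclusive, and the names `orf_codons`, `check_orfs`, and `ORFS` are hypothetical helpers introduced here. Under that assumption each span divides into exactly the stated number of codons, which suggests (as an inference, not a statement from the record) that the coordinates exclude the stop codon.

```python
def orf_codons(start: int, end: int) -> int:
    """Return the number of whole codons in a 1-based, inclusive nucleotide span."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}-{end} ({length} nt) is not a multiple of 3")
    return length // 3


# (start, end, stated protein length in aa), as quoted in the abstract.
ORFS = {
    "CR1-4_AG-ORF1p": (363, 1409, 349),
    "CR1-4_AG-ORF2p": (1430, 4324, 965),
}


def check_orfs(orfs: dict) -> dict:
    """Map each ORF name to True if its span matches the stated length."""
    return {
        name: orf_codons(start, end) == aa
        for name, (start, end, aa) in orfs.items()
    }


if __name__ == "__main__":
    for name, ok in check_orfs(ORFS).items():
        start, end, aa = ORFS[name]
        status = "consistent" if ok else "inconsistent"
        print(f"{name}: {end - start + 1} nt vs {aa} aa stated: {status}")
```

Both spans come out consistent: 1047 nt for 349 aa and 2895 nt for 965 aa. The same arithmetic leaves 20 nt (positions 1410-1429) between the two ORFs.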
Derived:
[1] (Consensus)