Repbase Reports

2002, Volume 2, Issue 11
November 30, 2002
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 1

CR1-1_AG

CR1-1_AG is a CR1-like non-LTR retrotransposon - a consensus sequence.

Submitted:
00- - 0
Accepted:
30-Nov-2002
Key Words:
non-LTR retrotransposon; CR1 clade; DNA/RNA-binding; endonuclease; reverse transcriptase; CR1-1_AG
Source:
Anopheles gambiae str. PEST
Organism:
Anopheles gambiae str. PEST
Taxonomy:
Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
CR1-1_AG, a family of CR1-like non-LTR retrotransposons from African malaria mosquito.
Journal:
Repbase Reports 2:(11) p.1 (2002)
Abstract:
CR1-1_AG is a family of CR1-like non-LTR retrotransposons. The CR1-1_AG consensus sequence was reconstructed from a multiple alignment of ~20 copies identified in the sequenced portion of the genome. Given the ~2% divergence of these copies from the consensus sequence, transposition of CR1-1_AG occurred less than 1 million years ago. Integrations of CR1-1_AG have not produced target site duplications. The consensus sequence encodes two proteins: a 440-aa CR1-1_AG-ORF1p (positions 425-1745) and a 941-aa CR1-1_AG-ORF2p (positions 1746-4568). CR1-1_AG-ORF1p is a DNA/RNA-binding protein composed of the PHD domain (aa positions 3-57) and gag-like zinc knuckle regions (aa positions 334-442). CR1-1_AG-ORF2p is composed of the AP endonuclease and reverse transcriptase domains. The 3' terminus is composed of the CAT microsatellite.

CR1-1_AG-ORF1p:
MECLKCSAVVGTSDDPIICSGSCGFIFHRRCITPTLNKPAVKLINENRNVVYMCDICLDQSAGLVHMDTD
ATKSNDLLAQTLRDLEANVSVWISSALERGIETLKTELCAQVERKLETTLRETLSAIEASKMSKAALRAT
SDTPQTSKTVQDVNLETWATVTKKRKRTNSGDSNVQTIINRFDEGNNKVTPKIKKINDVKEPMGKNKENN
KTLVIVPKVVQSCDRPRADLSARLDPRKQQLSEFRNGRDGQVYAQCPALANLDSIRKEVEDILGDDYSTS
LPMARVKIIGMSEKYSSSDLVDLLKSQNEGIPWKQENVIGMFESKIYKYQIHNVVLEIDHETDKCLAKLD
KINIGFDRCKISRSIHVMRCFKCGQFSHKSTDCQNKEACSKCSGEHRTSDCTSSILKCVNCVLANTSRNL
KLQVQHAANSYECPLFKKQVERRMQLSQ

CR1-1_AG-ORF2p:
RDECNFLNSRGGVELGRFREILYFNVAGLSSNYAMFRETVEKVQPLLVLISETHVTEKEAFEQFYLKGYR
VVSCLSHSRHTGGVAAYARSDVVLKVILNESLEGNWFLGVAVSRGMTVGNYSILYHSPSASDSRFVDILE
EWLDRFLDLSKLNIIVGDFNIDWLNVEKSAKLKSLMDSVNMNQKVNEFTRIARQSRTLIDQVYSSIDSIK
VTTDPLLKISDHETLVLNINDERCKTIQRKVKCWNRYSKHALCNNVSQGLQCGASDFDEAADLLWNTLKH
AMSTLVEEKTIVSRETSRWYTLDLARAKRKRDKVYKKFIRTNRDNDWSEYTKLRNSYSRDLKNRRSDFFS
NEINKHKKNSKELWKVLKSMLQPDESCVSVVKFNGVIEADDSIICNKFNSFFVNSVLDINQNIASVSEPS
YYVDSATPRCHFRFQKITLEQLKTICFNLTKTAGIGNVSSTTIQDCYHVIGEDLLMVINQSLERGCFPKS
WKESLIIPIPKVNGAANAEDFRPINMLHVLEKVLETVVKEQLVQFLNRNELLIREQSGYRQGHSCETALN
LVLARWRVLMDRRESIVAVFLDLKRAFETISRPLLLSTLRRFGIVGRELSWFESYLKERTQRTLFGSSVS
EPIENTLGVPQGSVLGPILFIMYINDMKQVLKACEINLFADDTVLFISHKEIKQAESLMNIDLNALDGWL
KYKKLALNINKTCYMVMSAGVLEEPPSIVINSELIERVRQAKYLGVILDDRLKFHAHIDWVIAKVAKKCG
VISRLAKDLDFFGKVHLYKSLISPHFDFCSSILFLGNKGQIKRLQRLQNRIMRLILGCGRRTPSAVMLNI
LQWMSVEQRIVYQTMTFIYKMLKGLLPGYLGESIVRGSDIHRHHTRRANEPRVPNLHSQSARNSLFFKGI
QRYNSLPDEIKNARNLPDFKRKCVIYVEQTV
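The arithmetic behind two statements in the abstract can be checked with a short sketch. It is illustrative only and is not part of the original report. The function names `orf_span` and `implied_min_rate` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the age calculation assumes a simple linear molecular clock (divergence = rate × time), which the report does not state explicitly.

```python
def orf_span(start: int, end: int) -> tuple[int, int, int]:
    """Return (nucleotides, whole codons, leftover nucleotides) for a
    coordinate range, assuming 1-based inclusive positions."""
    length = end - start + 1
    codons, leftover = divmod(length, 3)
    return length, codons, leftover


def implied_min_rate(divergence: float, max_age_years: float) -> float:
    """Lowest per-site, per-year substitution rate consistent with copies
    reaching `divergence` in less than `max_age_years`.
    Assumption: linear clock, divergence = rate * time."""
    return divergence / max_age_years


if __name__ == "__main__":
    # ORF coordinates as given in the abstract.
    print("ORF1p span:", orf_span(425, 1745))
    print("ORF2p span:", orf_span(1746, 4568))
    # ~2% divergence from the consensus, age bound of 1 million years.
    print("Implied minimum rate:", implied_min_rate(0.02, 1_000_000))
```

Under these assumptions, the ORF2p range covers 2823 nucleotides, which is exactly 941 codons and matches the stated protein length. The ORF1p range as printed covers 1321 nucleotides, which is 440 codons plus one nucleotide, so that boundary may be approximate. The age bound corresponds to a substitution rate of at least roughly 2 × 10^-8 per site per year.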
Derived:
[1] (Consensus)