Repbase Reports

2004, Volume 4, Issue 11
November 30, 2004
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 291

Gypsy7-I_DR

Gypsy7-I_DR is an internal portion of the Gypsy7_DR LTR retrotransposon - a consensus sequence.

Submitted:
00-Nov-2004
Accepted:
30-Nov-2004
Key Words:
LTR retrotransposon; endogenous retrovirus; Gypsy superfamily; gag; protease; reverse transcriptase; integrase; Gypsy7_DR; Gypsy7-LTR_DR; Gypsy7-I_DR
Source:
Danio rerio
Organism:
Danio rerio
Taxonomy:
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; Cyprinidae; Danio
[1] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
Gypsy7_DR LTR retrotransposon from zebrafish
Journal:
Repbase Reports 4:(11) p.291 (2004)
Abstract:
Gypsy7-I_DR is a consensus sequence of the internal portion of Gypsy7_DR LTR retrotransposons. Its long terminal repeat is deposited in Repbase as Gypsy7-LTR_DR. The internal portion encodes the 1658-aa Gypsy7_DRp polyprotein (pos. 476-5449) composed of gag, protease, reverse transcriptase, and integrase domains. Given that some Gypsy7_DR genomic copies are flanked by 100% identical LTRs, it is possible that Gypsy7_DR elements are still transpositionally active. The gag domain is similar to the Arc protein important for long-term spatial memory in vertebrates (mammals, birds). Presumably, Arc was derived some 300-400 million years ago from a gypsy-encoded gag protein. Gypsy7-I_DRp: MANVNPSPSTSVDIDPPDIATPVWPPVQQRQFSSPSNIPTYHSTPTQLDPYGRTQVHFHTTTPG VTSTVQPDPMQLCTSASTVESPPSTATQHALPGYLPTPGREIHQLTAHVQGNWDRVFDCLKRQD KAVKELTEKSSKSFSLHEAKLAKMESTHQQLLNTLTAQRKDDTETADQLTKAVKVMVTQEIQRS ESTLISEIRFMVEQAQLELQKDIQATKEHSDKNFERLSSDLNHCSTEINAIKNQLDNLQTEISD VIPPIKQVSDPPSSAPVSVSTQSSSSVTAPMPFQTPVIKSDHLKLTFPTFGRPSDDADPLLYVT RCKDFLALHPLDDPDILATFRTVLYGTARDWWEVARSAISTWSEFETAFLSAFLSEDYEDELAE RVRTRTQAEKESIRDFAFTYRAMCKRWKPTLTESELVKMILKNIKPHLASQLRSRVHTVDELVK LGLQLEKDYVQQLHYVEHVTQPSPQRIAPNRVEKPPVLCWRCKGLHPPGSCPHYSSSVQTTQSS SHPPPTGNKRYFQTQKHGGNPSNNAMSVTLPSKSLPKSTVTKSVVIPQQLIVPIYIGAWRGKAI LDTGASYTLLHESLWKEIDPQASLHPWTLGPLYLANGEAEVPLGWTNFEIILHDKVFPTQAAIL TPKALAYSVVLGLDFIYSSGLQINVVDQTYSFKSNPNEEYPFQPGHASVPVGRSQHLNKNAQTQ HSSKTLSLLSSIPPPLPFPVVSQLAPSSDDQALIEMAVAEAHLPLESKPQLLHLLQSNPKVCTL QLGRTTVLQHCIYTTHPVPVKQRPYRLTPGKQAIVEEQIEEMLKAGVIEQSCSPWASPVVLVPK KDNSLRFCVDYRKLNAMTESDAYPIPNITEILESLSGASTFSSLDLNCGFWQVPMDDKSKLMTA FITSRGLYHFNVMPFGLKNAPATFQRLMEIVLRDLLGKICYVYIDDIVIYSPTLTQHLHDIQTI LERLEKAGLTLNLKKCSFCLPEITFLGHVVSHQGVAADPKKVEVIHAYPVPQNLKDVQRFLGLA GWYHRFVPNFSRIAEPLNNLKKKGRQFKWDSLCQQAFDNLKFCLTTPPILGHPDLNIPFTVYTD ASDSGLGAVLTQRKEQGGEEVIAYASRTLTKAEVNYSTTEKECLAVVWALDKWQHYLEPRMFTV VTDHSALQWVMNSTKPASRLMRWALRLQRYDFVIEYRKGRLNVAPDALSRMYSMPGCNLYTTEK DLPDFPVTPQTIWEEQHQDTDIMKIFQALAKNEQQEQAQYTVLEDKLYHITHLADETVHYKVVI PSTLRPTVLEWYHDTPLSGHLGIYKTYKRIQDVAYWPGMWTDIKKYVKNCAKCQVTKWDNRKPA GKLQQVTTSRPNEMWGVDIMGPMPKSGKQNEYLLVFVDYFSKWVELFPMRHATAQTIATILRQE MLTRWGVPDFILSDRGAQFVSSLFTELCGKWNITPKLTTAYHPQTNMTERVNRTLKSMIAGFVE DNHKTWDTYLPELRFALNSAIQESIGMTPAELHLGRKIHSPMDKLLHRRDLSPTKPAYDMVHKI TQLQRQAKENYTKAQKRQLRSYDKNRRDVFFRERERVWVRNFPISSAQHHFSAKLAPKWKGPYR IIQQLGPVNYQVSLEDTGEDVRNVHVCNLKPCFPTAEELEAREKNCTKILPQQDQKRF
Derived:
[1] (consensus)
Download Sequence - Format:
IG, EMBL, FASTA
References:

© 2001-2020 - Genetic Information Research Institute