| 2005, Volume 5, Issue 1 |
| January 31, 2005 |
| Copyright © 2001-2007 - Genetic Information Research Institute, Mountain View, California |
| ISSN# 1534-830X |
| Page 17 |
Gypsy-22-I_DR |
An internal portion of the Gypsy-22_DR LTR retrotransposon - a consensus sequence. |
Submitted: 00-Jan-2005 |
Accepted: 31-Jan-2005 |
Key Words: LTR retrotransposon; endogenous retrovirus; Gypsy superfamily; gag; reverse transcriptase; integrase; Gypsy-22_DR; Gypsy-22-LTR_DR; Gypsy-22-I_DR |
Source: Danio rerio |
Organism: Danio rerio |
Taxonomy: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes; Cyprinidae; Danio |
| [1] |
Authors: Kapitonov,V.V. and Jurka,J. |
Title: Gypsy-22_DR, a family of LTR retrotransposons from zebrafish |
Journal: Repbase Reports 5:(1) p.17 (2005) |
Abstract: Gypsy-22-I_DR is the internal portion of Gypsy-22_DR, an LTR retrotransposon that belongs to the Gypsy superfamily. Its long terminal repeat is deposited in Repbase as Gypsy-22-LTR_DR. Gypsy-22_DR is characterized by 4-bp target site duplications. The internal portion encodes a single polyprotein, the 1686-aa Gypsy-22_DR1p (pos. 605-5662), composed of the gag, protease, reverse transcriptase, and integrase domains. The consensus sequence was built from five copies, each less than 2% diverged from it.

Gypsy-22_DR1p:
MAQFFSHSTSTYVDIDAPDAPTISTPVWSPPVQTQTLSGLPPHDQIMHNISPVHSFPSHATPVQ
TLPTALGGHILMQPLSLSQEHVSMQLCTSADVSSPSTMEQHDLPGNLPTPRREIQQVSSYVQGN
LDNLMVTMKKQEKCLHELTQKLKTSSSQHVNQITTLTAKIESNKQEIVTILTGAKQQEAADADQ
LVKAVQLMLATEFQKFESTLTSAVVDKVEKLRRDVHHDLKSIQQTLQGSLDQLTTNLQQCEEKI
SKCQTCVTQLKKDLQVHNVKDVEPQTETKAAPVTSTLSTETVSTLPNTMVKSDHLKLTFPTFGR
HTDDTDPLLYLTKCQDFLALHPLTDADLLATFRTVLYGTARDWWEVSRSNIATWKEFESAFLSA
FLSEDYEDELAERVRTRVQGDRESIRDFAFTYRALCKRWKSTLTETEIVKMILKNIKPYLASQL
RSRVNTVEDLVKLGYLLERDYEEQRRYESRMAHKQASSQKSFSNRPVEKQPIQCWRCSGPHPPG
NCPMYLTPPSQQSSTQHHPNHGKSFHAAKSGGRPTNIIVAASETPQSTKEVPNVFLPSTTMSSL
AIPQQLVVPISIGSWFGKAILDTGASYTLIHESLMQHFDTSAQLQNWSSGPLYLANGKAEIPLG
WLNITIQIHGKSFVVPAVVLPSQALAYAIILGLDFIFFSGLKIHVSERKYSFTSDPTEEHPFQP
GYASEPLVKMTPMTEKKTLRKNKLNLTLLSAVPPPQTSLGMLQTDHVDDATQIWNAVSEAQLPK
EEKQQLLQILQNNPRVCTQRTGKTKLLQHRIYTTSQVPIKQKPYRLSPVKQQVMEEQLEQMLRE
GIVEPSHSSWASPVVLVPKKNGKLRFCVDYRKVNAITENDAYPLPNITEILESLSGSTIFSTID
LNSGYWQVMMDPDSKAKTAFIVSDGLYQFNVMPFGLKNAPATFQRLMETVLGELRRKICLVYID
DIIVYSPSVTQHFCDLQTILHRLEAAGLTINLEKCKFFLPEITFLGHVVNAKGITADPSKVEAI
LSFPTPNNLKEVQRFLGLAGWYHRFVQNFSKIAEPLNALKKKGQVFKWTAQCQQSFDQLRSCLT
SPPILGHPDLKIPFIVYTDASDTGLGAILTQRKDPGSEEVIAYASRTLTGAEVNYTATEKECLA
VVWALEKWQHYLEYKLFTVVTDHSALQWVMGSTKTNSRLIRWVLRLQKFNFIIEYRKGKLNVAP
DALSRSPLTTISPVTAVYTKQQTDQHTELPVSDVVLWEEQHSDEETTKLLQAVAEEPNQLEQYE
VIEDKLYHKTYLKNDQVHYRVYVPNRLRPTLLHHYHSHPLSGHHGIYKTYKRIQAVAFWPGLWT
DVKRHVKECVKCQTIKYDNQKPAGKLQSTITSRPNQMLGVDIMGPLPRSTQQNEYLLVFVDYYS
KWVEFFPMRQANAQSVAVIFRREILTRWGVPDFILSDRGTQFISSVFKNVCEKWGVTQKLTTAY
HPQTNMTERVNRTVKSMIASYVDDNHSKWDQFLPEMRFAMNTAIQETTGVTPAELQIGRKLHGP
MDKILHGQNLIPDNTSYDVVCHIQQLKSQVQENCRRAQQRQLRNYNKKRREAGFKNKDRVWLRN
FPQSSAQHKFSAKLAPKWKGPYRVLKQLGPLNYRIALEETGEDVRTVHVCNLKECFPTAEELEV
QEKKRLRELFEETSEEEEFFGF
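
The coordinates and protein length quoted in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the Repbase entry. It assumes that the positions 605-5662 are 1-based and inclusive, and that the span does not count a stop codon. The function names `orf_length_aa` and `clean_protein` are hypothetical helpers introduced here.

```python
import re

# The 20 standard amino-acid one-letter codes.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def orf_length_aa(start: int, end: int, includes_stop: bool = False) -> int:
    """Number of amino acids encoded by a nucleotide span.

    Assumption: `start` and `end` are 1-based, inclusive coordinates.
    """
    span_nt = end - start + 1
    if span_nt <= 0 or span_nt % 3 != 0:
        raise ValueError(f"span of {span_nt} nt is not a whole number of codons")
    codons = span_nt // 3
    # If the span also covers the stop codon, that codon encodes no residue.
    return codons - 1 if includes_stop else codons


def clean_protein(wrapped: str) -> str:
    """Strip the spaces and line breaks used to wrap a sequence, then validate it."""
    seq = re.sub(r"\s+", "", wrapped).upper()
    unexpected = set(seq) - STANDARD_RESIDUES
    if unexpected:
        raise ValueError(f"unexpected residue codes: {sorted(unexpected)}")
    return seq


if __name__ == "__main__":
    # Coordinates taken from the abstract: pos. 605-5662.
    print(orf_length_aa(605, 5662))  # 1686

    # First wrapped chunk of Gypsy-22_DR1p, copied from the entry.
    first_chunk = "MAQFFSHSTSTYVDIDAPDAPTISTPVWSPPVQTQTLSGLPPHDQIMHNISPVHSFPSHATPVQ"
    print(len(clean_protein(first_chunk)))  # 64
```

Under these assumptions the span of 5058 nucleotides corresponds to 1686 codons, which matches the stated length of Gypsy-22_DR1p. Passing the full wrapped sequence to `clean_protein` and taking its length gives an independent check of the same figure.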
Derived: [1] (consensus) |