Repbase Reports

2003, Volume 3, Issue 1
January 31, 2003
Copyright © 2001-2016 - Genetic Information Research Institute
ISSN# 1534-830X
Page 1

DIRS1_DR

DIRS1_DR is a DIRS-like LTR retrotransposon; this entry is a consensus sequence.

Submitted:
31-Jan-2003
Accepted:
31-Jan-2003
Key Words:
LTR retrotransposon; gypsy; endogenous retrovirus; DIRS superfamily; reverse transcriptase RNase H; phage integrase; DIRS1; DIRS1_DR
Source:
consensus
Organism:
Danio rerio
Taxonomy:
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Actinopterygii; Neopterygii; Teleostei; Euteleostei; Ostariophysi; Cypriniformes; Cyprinoidea; Cyprinidae; Rasborinae; Danio
[2] Authors:
Kapitonov,V.V. and Jurka,J.
Title:
DIRS1_DR, a family of DIRS-like endogenous retroviruses in zebrafish.
Journal:
Repbase Reports 3:(1) p. 1 (2003)
Abstract:
DIRS1_DR is a family of DIRS1-like retrotransposons. These elements are related to gypsy-like LTR retrotransposons and endogenous retroviruses. There are ~100 copies of DIRS1a_DR in the genome, and they are ~0.3% divergent from the consensus sequence. Therefore, this family retrotransposed in the zebrafish genome very recently. The unusual structure of DIRS1_DR is depicted in Fig. 1.

GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG
<======       ======>   <---------------------------------------------
AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT
----------------------------------------------------------------------
TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA
-----------------------------------------------
......................................................................
......................................................................
GGTGCAGTCATTATGGCGCTTTCCATATTCTCCCATTCATGGCACTGAAGTTCCCCAACCGAAGGGGAAC
                                                 <======       ======> <~~
GTTCGAGGTTACAGAAGTAACCCTTCGTTCCCCGAGGAGGGGAACGGAAGTGCCATATTCCGTCGCCATA
~~~~~~~~~~~~~~~~~~~~~~~~~~<======     ======>
ATGACTGTCCCTTAGCTGTTTGAAAGTCTCTTCAGCTT
AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG
----------------------------------------------------------------------
CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC
----------------------------------------------------------------------
CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt
---------------------->   <======     ======>~~~~~~~~~~~~~~~~~~~~~~~~~~~~>

Fig. 1. Termini of DIRS1_DR. The 163-bp sub-terminal inverted repeats are underlined by a single line.

DIRS1_DR encodes three ORFs. ORF1 (positions 414-1632) codes for the gag-like protein. ORF2 (positions 1633-2597) codes for reverse transcriptase and RNase H. ORF3 (positions 2598-5129) codes for the phage integrase.
ORF1p:
MALRLCVSGCGGFLSPDDGHDHCIACLGVQHVNAVLAGGSCRHCDAMTVAQLRSRLTFARERATPVASCS
KKAAGARADLRVSAGANPPPTGSRTSRSSRRSIQASGGESDPSNQMVALTLADTGDQMSSAASEGGLSLS
DEDPDPLAPSGQVSAVKSDPEADMLAVLSRAASAVGLEMVYPPAPRPDRLDGCYVEDQKAKPSKPLVPFF
PEVHSRLTQSWRAPFSARAASASALTALDGGAARGYEAIPSVERAIAVNLCPRGASTWRGLPRLPSKACR
LSASLGARAYKAAGQAASALHAMATYQRYQAQALAELHEGGSNPSLLHELRTATDYALRTTKSAACALGR
TMSTLVVQERHLWLNLADMRDVDKVRFLDSPISQAGLFGDTVGEFTQEFKAVKEQSDAMGNVIYRRGRKP
APPAEPSTSAVPRRGRPPTSAAPPPPAPPAKRARRSPRKQAAPPAQGAVKSGKRTAKRP

ORF2p:
MRWAMSSIGVAVSPLRPPSHPPPLFLAEGARQRVLPRPRLRLRPSGRGVHLESRQPLLPRAPLSPVNGPR
SVPETGHPEKRKLALSPLEGGAPITTVLFSATKTSVKEHFFPSPDVTARVLPVRDALPSGSQTLRASPVA
HERWGDGLPSLSPPAPSPESGCGARANRSPPAFPRDPRASRISTPTPRCPTAGTSAIVAMTPLARALPAW
LARASPSRWLIRTIRLGYAIQFAKRPPKFTGVYFSRVNPLSAPVLREEIAALLAKGAIEPVPPAEMESGF
YSPYFIVPKKSGGSRPILDLRVLNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLKDAYFHVSILPRHRQ
FLRFAFEGRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLAGIRILSYLDDWLILAHSREQLIMHRDEVL
RHLRLLGLQVNREKSKLAPVQRISFLGMELDSITMVAHLSEERARLLLNCLRELDSKLVVPLKFFQRLLG
HMASAAAVTPLGLLHMRPLQHWLHDRVPRRAWHAGTHRVSVTALCRRALSPWNDPSFLQAGVPLGQASSH
VVVSTDASNTGWGAVCRGHAAAGLWKGAQLHWHINRLELLAVFLALHRFLPVLERQHVLVRTDSTAAAAY
INRMGGMRSRRMSQLARRLLLWSHPRLKSLRAIHVPGTLNRAADALSRQLLRPGEWRLHPESVQLIWARF
GEAQIDLFASPENAHCQLFFSLTEGSLGTDALAHSWPRGMRKYAFPPVSLLAQFLCKVREDEEQVLLVAP
LWPNRTWISELSLLATALPWRIPLREDLLSQGQGTIWHPRPDLWNLHVWSLDARKT

ORF3p:
MRSSSGLVCSHRPEGRVFPCLHSSTPPPISAVCVRGSSVAVQGPPLRALSVSAGLHQTRGGCPSAPSARG
HSHTQLSRRLADFSPLAGAIDYAQGRGASASPPTGASGQPRKEQTRPRAEDFFSRDGAGLDHHGSAPLRG
TRSPVAELSEGARQQTSGPTEVLSEAPGAYGIRSRRHAARVAPYETTSALASRSGPQTRMARGHTPGLGY
CAVSPRPQPLERPLVPTGRCASRTGVQPCCCFNRRFQHGLGGRVSRACGCGPLEGCPAALAYQSPRAVGS
VPRSPPLFTGAGAATRAGQDGQYGGGGVYQPHGGYALSPHVSARPPSAPLESPAAEIAARHSRPRHAQSC
SRCALTTAVTPWRMETPPRVCSADMGAIRGGPDRSVCFPRERSLPVVFFPDRGLSRHGCTGPQLASGHAQ
VCVSPSEPARAVSVQGQGGRGTGSASCAPLAQPDLDIRALTPRDGPPLADPFERGPTLSGTGHHLAPSPR
SLEPPRVVPRREEDLGNLPTAVVNTITQARAPSTRRAYALKWSLFTEWCVSRREDPRNCQISVVLSFLQE
KLDSRLSPSTLKVYVAAISAYHSAVAGGTVGKHNLVIQFLRGARRINPSRPPLMPSWDLALVLTSLRSDP
FEPLESVSLRFLSLKTALLVALASIKRVGDLEAFSVSDSCLEFGPDYSHVILRPRPGYVPKVPTTPFRDQ
VVNLQALPPEEADPALSLLCPVRALRIYVDRTQNFRSSEQLFVCYGGRQQGSAVSKQRLSHWIVDAISLA
YSSRGQPCPPGVRAHSTRSVASSWARARGASLTDICRAAGWATPNTFARFYNLRVEPVSSRVLGNPLVIE
ETTR
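
The sub-terminal inverted repeats shown in Fig. 1 can be checked with a few lines of Python. The sketch below is illustrative only: the sequence strings are transcribed from the figure, and the slice boundaries (the last 46 bases of the first 5' line, the first 47 bases of the third 5' line, the first 23 bases of the last 3' line) are assumptions inferred from the 163-bp length and the underlining, not coordinates given in the entry. The names `reverse_complement`, `five_prime_repeat` and `three_prime_repeat` are hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (case is preserved)."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


# Sequence lines transcribed from Fig. 1 (5' terminus of the consensus).
LINE_5P_A = "GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG"
LINE_5P_B = "AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT"
LINE_5P_C = "TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA"

# Sequence lines transcribed from Fig. 1 (3' terminus of the consensus).
LINE_3P_A = "AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG"
LINE_3P_B = "CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC"
LINE_3P_C = "CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt"

# Assumed extent of the single-underlined regions (inferred, see lead-in).
five_prime_repeat = LINE_5P_A[24:] + LINE_5P_B + LINE_5P_C[:47]
three_prime_repeat = LINE_3P_A + LINE_3P_B + LINE_3P_C[:23]

if __name__ == "__main__":
    print("5' repeat length:", len(five_prime_repeat))
    print("3' repeat length:", len(three_prime_repeat))
    # An inverted repeat means one copy is the reverse complement of the other.
    print("inverted repeat:",
          reverse_complement(five_prime_repeat) == three_prime_repeat)
    # The short arms marked <====== ======> in the figure form a small palindrome.
    print("short arms complementary:",
          reverse_complement("GTTCCCC") == "GGGGAAC")
```

If the transcription and the assumed boundaries are right, both lengths come out at 163 and the two underlined regions are exact reverse complements of each other.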
Derived:
[2] (Consensus)