GenomeNet

Database: UniProt
Entry: W8S1H9_9VIRU
LinkDB: W8S1H9_9VIRU
Original site: W8S1H9_9VIRU 
ID   W8S1H9_9VIRU            Unreviewed;      1242 AA.
AC   W8S1H9;
DT   14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT   14-MAY-2014, sequence version 1.
DT   22-FEB-2023, entry version 40.
DE   RecName: Full=Structural polyprotein {ECO:0000256|ARBA:ARBA00014555};
DE   AltName: Full=p130 {ECO:0000256|ARBA:ARBA00033029};
GN   Name=SP {ECO:0000313|EMBL:AHL83773.1};
GN   ORFNames=F782_44644gpSP {ECO:0000313|EMBL:AHL83773.1};
OS   Madariaga virus.
OC   Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Alsuviricetes;
OC   Martellivirales; Togaviridae; Alphavirus.
OX   NCBI_TaxID=1440170 {ECO:0000313|EMBL:AHL83773.1, ECO:0000313|Proteomes:UP000145767};
RN   [1] {ECO:0000313|EMBL:AHL83773.1, ECO:0000313|Proteomes:UP000145767}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MADV/Mesocricetus/Peru/70V1104/1970
RC   {ECO:0000313|EMBL:AHL83773.1};
RA   Das S., Halpin R.A., Ransier A., Mohan M., Fedorova N., Tsitrin T.,
RA   Stockwell T., Amedeo P., Appalla L., Bishop B., Edworthy P., Gupta N.,
RA   Hoover J., Katzel D., Li K., Schobel S., Shrivastava S., Thovarai V.,
RA   Wang S., Auguste A.J., Auguste J., Wentworth D.E., Weaver S.C.;
RL   Submitted (FEB-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Autocatalytic release of the core protein from the N-terminus
CC         of the togavirus structural polyprotein by hydrolysis of a -Trp-|-
CC         Ser- bond.; EC=3.4.21.90; Evidence={ECO:0000256|ARBA:ARBA00000840};
CC   -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC       Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Cell
CC       membrane {ECO:0000256|ARBA:ARBA00004251}; Single-pass type I membrane
CC       protein {ECO:0000256|ARBA:ARBA00004251}. Cytoplasm
CC       {ECO:0000256|ARBA:ARBA00004496}. Host cell membrane
CC       {ECO:0000256|ARBA:ARBA00004598}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004598}. Host cell membrane
CC       {ECO:0000256|ARBA:ARBA00004402}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004402}. Host cytoplasm
CC       {ECO:0000256|ARBA:ARBA00004192}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004141}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004479}. Virion membrane
CC       {ECO:0000256|ARBA:ARBA00004385}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004385}. Virion membrane
CC       {ECO:0000256|ARBA:ARBA00004563}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004563}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KJ469626; AHL83773.1; -; Genomic_RNA.
DR   Proteomes; UP000145767; Genome.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0030430; C:host cell cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0039619; C:T=4 icosahedral viral capsid; IEA:UniProtKB-KW.
DR   GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR   GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR   GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR   Gene3D; 1.10.287.2230; -; 1.
DR   Gene3D; 2.60.40.350; -; 1.
DR   Gene3D; 2.60.40.3200; Alphavirus E2 glycoprotein, A domain; 1.
DR   Gene3D; 2.60.40.4310; Alphavirus E2 glycoprotein, domain B; 1.
DR   Gene3D; 2.60.40.2400; Alphavirus E2 glycoprotein, domain C; 1.
DR   Gene3D; 2.60.98.10; Tick-borne Encephalitis virus Glycoprotein, domain 1; 3.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR002548; Alpha_E1_glycop.
DR   InterPro; IPR000936; Alpha_E2_glycop.
DR   InterPro; IPR002533; Alpha_E3_glycop.
DR   InterPro; IPR042304; Alphavir_E2_A.
DR   InterPro; IPR042305; Alphavir_E2_B.
DR   InterPro; IPR042306; Alphavir_E2_C.
DR   InterPro; IPR000336; Flavivir/Alphavir_Ig-like_sf.
DR   InterPro; IPR036253; Glycoprot_cen/dimer_sf.
DR   InterPro; IPR038055; Glycoprot_E_dimer_dom.
DR   InterPro; IPR014756; Ig_E-set.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR000930; Peptidase_S3.
DR   Pfam; PF01589; Alpha_E1_glycop; 1.
DR   Pfam; PF00943; Alpha_E2_glycop; 1.
DR   Pfam; PF01563; Alpha_E3_glycop; 1.
DR   Pfam; PF00944; Peptidase_S3; 1.
DR   PRINTS; PR00798; TOGAVIRIN.
DR   SUPFAM; SSF81296; E set domains; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   SUPFAM; SSF56983; Viral glycoprotein, central and dimerisation domains; 1.
DR   PROSITE; PS51690; ALPHAVIRUS_CP; 1.
PE   4: Predicted;
KW   Capsid protein {ECO:0000256|ARBA:ARBA00022561};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Fusion of virus membrane with host endosomal membrane
KW   {ECO:0000256|ARBA:ARBA00022510};
KW   Fusion of virus membrane with host membrane
KW   {ECO:0000256|ARBA:ARBA00022506};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Host cell membrane {ECO:0000256|ARBA:ARBA00022511};
KW   Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW   Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW   Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW   T=4 icosahedral capsid protein {ECO:0000256|ARBA:ARBA00022973};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW   ECO:0000256|SAM:Phobius};
KW   Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW   Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW   Virion {ECO:0000256|ARBA:ARBA00022844};
KW   Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296}.
FT   TRANSMEM        689..711
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        782..802
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1211..1238
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          112..261
FT                   /note="Peptidase S3"
FT                   /evidence="ECO:0000259|PROSITE:PS51690"
FT   REGION          1..104
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..21
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        69..101
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   ACT_SITE        138
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR600936-1"
FT   ACT_SITE        160
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR600936-1"
FT   ACT_SITE        212
FT                   /note="Charge relay system"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR600936-1"
SQ   SEQUENCE   1242 AA;  137384 MW;  F5D15F32DF684155 CRC64;
     MFPYPTLNYP PMAPVNPMAY RDPNPPRRRW RPFRPPLAAQ IEDLRRSIAN LTFKQRAPNP
     PAGPPAKRKK PAPKPKPAAP KKKRQPPPAK KQKRKQKPGK RQRMCMKLES DKTFPIMLNG
     QVNGYACVVG GRVFKPLHVE GKIDNEQLAA IKLKKASIYD LEYGDVPQCM KSDTLQYTSE
     KPPGFYNWHH GAVQYENNRF TVPRGVGGKG DSGRPILDNR GRVVAIVLGG ANEGSRTALS
     VVTWNQKGVT VKDTPEGSEP WSLTTVMCVL ANITFPCDQP PCMPCCYEKN PHETLSMLEQ
     NYDSQAYDLL LDAAVKCNGR RTRRDLETHF TQYKLARPYI ADCSNCGHGR CDSPIAIEDI
     RGDAHAGYIR IQTSAMFGLK SDGVDLAYMS FMNGKTLKAI KIEHLYARTS APCSLVSYHG
     YYILAQCPPG DTVTVGFQDG ANKHMCTIAH KVEFKPVGRE KYRHPPEHGV ELPCTKYTHK
     RADQGHYVEM HQPGLVADHS LLSMSSTKVK ITVPSGSQVK YYCKCPDVKE GTTGSDYTTA
     CTDLKQCRAY LIDNKKWVYN SGKLPRGEGE TFKGKLHVPF VPVASKCTAT LAPEPLVEHK
     HRFLILHLHP EHPTLLTTRA LGSNARPTRQ WIEQPTTVNF TVTGEGFEYT WGNHPPKRVW
     AQESGEGNPH GWPHEIVIYY YNRYPMTTVI GLCTCVAIIM VSCVTSVWLL CRTRNLCITP
     YRLAPNAQVP ILLAVLCCVK PTRADDTLQV LNYLWNNNQN FFWMQTLIPL AALIVCMRML
     RCLLCCGPAF LLVCGALGAA AYEHTAVMPN KVGIPYKALV ERPGYAPVHL QIQLVTTKII
     PSANLEYITC KYKTKVPSPV VKCCGSTQCS AKSLPDYQCQ VFTGVYPFMW GGAYCFCDTE
     NTQMSEVYIE RAEECSVDQA KAYKVHTGTV QAVVNITYGS VSWRSADVYV NGETPAKIGD
     AKLTIGPLSS AWSPFDSKVV VYGHEVYNYD FPEYGTGKAG SFGDLQSRTP TSNDLYANTN
     LKLQRPQPGV VHTPYTQAPS GFERWKKDRG APLNDIAPFG CTIALDPLRA ENCAVGNIPL
     SIDIPDAAFT RIAETPTVSD LECKVTECTY ASDFGGIATI SYKASKAGNC PIHSPSGIAV
     IKENDVTLAD SGAFTFHFST ASIHPAFKMQ VCTSVVTCKG DCKPPKDHIV DYPAQHTETY
     TSAVSATAWS WLKVLVGSTS AFIVLGLIAT AVVALVLFNH RH
//
DBGET integrated database retrieval system