ID S4X3P3_WNV Unreviewed; 1112 AA.
AC S4X3P3;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 24-JAN-2024, entry version 49.
DE SubName: Full=Polyprotein {ECO:0000313|EMBL:AGP25111.1};
DE Flags: Fragment;
OS West Nile virus (WNV).
OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
OC Amarillovirales; Flaviviridae; Orthoflavivirus; Orthoflavivirus nilense.
OX NCBI_TaxID=11082 {ECO:0000313|EMBL:AGP25111.1};
OH NCBI_TaxID=7158; Aedes.
OH NCBI_TaxID=8495; Alligator.
OH NCBI_TaxID=34610; Amblyomma variegatum (Tropical bont tick).
OH NCBI_TaxID=8782; Aves (birds).
OH NCBI_TaxID=53527; Culex.
OH NCBI_TaxID=9796; Equus caballus (Horse).
OH NCBI_TaxID=9606; Homo sapiens (Human).
OH NCBI_TaxID=34627; Hyalomma marginatum.
OH NCBI_TaxID=308735; Mansonia uniformis.
OH NCBI_TaxID=308737; Mimomyia.
OH NCBI_TaxID=9940; Ovis aries (Sheep).
OH NCBI_TaxID=34630; Rhipicephalus.
OH NCBI_TaxID=34861; Sciurus niger (Eastern fox squirrel).
RN [1] {ECO:0000313|EMBL:AGP25111.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=2004Hou3 {ECO:0000313|EMBL:AGP25111.1};
RA Gorchakov R.V., Murray K.O.;
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004477}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004477}. Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004367}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004367}; Lumenal side
CC {ECO:0000256|ARBA:ARBA00004367}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004153}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004153}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00023443}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00023443}; Lumenal side
CC {ECO:0000256|ARBA:ARBA00023443}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}. Secreted
CC {ECO:0000256|ARBA:ARBA00004613}. Virion membrane
CC {ECO:0000256|ARBA:ARBA00004385}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004385}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KC928260; AGP25111.1; -; Genomic_RNA.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0019028; C:viral capsid; IEA:InterPro.
DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR CDD; cd12149; Flavi_E_C; 1.
DR CDD; cd17038; Flavi_M; 1.
DR Gene3D; 1.10.10.930; -; 1.
DR Gene3D; 1.20.1280.260; -; 1.
DR Gene3D; 2.60.40.350; -; 1.
DR Gene3D; 1.10.8.970; Flavivirus envelope glycoprotein M-like; 1.
DR Gene3D; 2.60.260.50; Flavivirus polyprotein propeptide domain; 1.
DR Gene3D; 2.60.98.10; Tick-borne Encephalitis virus Glycoprotein, domain 1; 1.
DR Gene3D; 3.30.67.10; Viral Envelope Glycoprotein, domain 2; 1.
DR Gene3D; 3.30.387.10; Viral Envelope Glycoprotein, domain 3; 1.
DR InterPro; IPR000069; Env_glycoprot_M_flavivir.
DR InterPro; IPR038302; Env_glycoprot_M_sf_flavivir.
DR InterPro; IPR013755; Flav_gly_cen_dom_subdom1.
DR InterPro; IPR001122; Flavi_capsidC.
DR InterPro; IPR037172; Flavi_capsidC_sf.
DR InterPro; IPR027287; Flavi_E_Ig-like.
DR InterPro; IPR026470; Flavi_E_Stem/Anchor_dom.
DR InterPro; IPR038345; Flavi_E_Stem/Anchor_dom_sf.
DR InterPro; IPR011998; Flavi_Glycoprot_E_cen/dimer.
DR InterPro; IPR001157; Flavi_NS1.
DR InterPro; IPR002535; Flavi_propep.
DR InterPro; IPR038688; Flavi_propep_sf.
DR InterPro; IPR000336; Flavivir/Alphavir_Ig-like_sf.
DR InterPro; IPR036253; Glycoprot_cen/dimer_sf.
DR InterPro; IPR038055; Glycoprot_E_dimer_dom.
DR InterPro; IPR013756; GlyE_cen_dom_subdom2.
DR InterPro; IPR014756; Ig_E-set.
DR NCBIfam; TIGR04240; flavi_E_stem; 1.
DR Pfam; PF01003; Flavi_capsid; 1.
DR Pfam; PF21659; Flavi_E_stem; 1.
DR Pfam; PF02832; Flavi_glycop_C; 1.
DR Pfam; PF00869; Flavi_glycoprot; 1.
DR Pfam; PF01004; Flavi_M; 1.
DR Pfam; PF00948; Flavi_NS1; 1.
DR Pfam; PF01570; Flavi_propep; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF101257; Flavivirus capsid protein C; 1.
DR SUPFAM; SSF56983; Viral glycoprotein, central and dimerisation domains; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Fusion of virus membrane with host endosomal membrane
KW {ECO:0000256|ARBA:ARBA00022510};
KW Fusion of virus membrane with host membrane
KW {ECO:0000256|ARBA:ARBA00022506};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Host endoplasmic reticulum {ECO:0000256|ARBA:ARBA00023184};
KW Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius};
KW Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW Virion {ECO:0000256|ARBA:ARBA00022844};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296}.
FT TRANSMEM 44..66
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 107..126
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 250..269
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 276..292
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 743..764
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 771..791
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 6..123
FT /note="Capsid protein C flavivirus"
FT /evidence="ECO:0000259|Pfam:PF01003"
FT DOMAIN 133..214
FT /note="Flavivirus polyprotein propeptide"
FT /evidence="ECO:0000259|Pfam:PF01570"
FT DOMAIN 218..290
FT /note="Envelope glycoprotein M flavivirus"
FT /evidence="ECO:0000259|Pfam:PF01004"
FT DOMAIN 293..589
FT /note="Envelope glycoprotein E central and dimerisation"
FT /evidence="ECO:0000259|Pfam:PF00869"
FT DOMAIN 592..689
FT /note="Glycoprotein E immunoglobulin-like"
FT /evidence="ECO:0000259|Pfam:PF02832"
FT DOMAIN 692..786
FT /note="Flavivirus envelope glycoprotein E Stem/Anchor"
FT /evidence="ECO:0000259|Pfam:PF21659"
FT DOMAIN 794..1112
FT /note="Non-structural protein NS1 flavivirus"
FT /evidence="ECO:0000259|Pfam:PF00948"
FT NON_TER 1112
FT /evidence="ECO:0000313|EMBL:AGP25111.1"
SQ SEQUENCE 1112 AA; 121411 MW; 969006DB20D6FFD9 CRC64;
MSKKPGGPGK SRAVNMLKRG MPRVLSLIGL KRAMLSLIDG KGPIRFVLAL LAFFRFTAIA
PTRAVLDRWR GVNKQTAMKH LLSFKKELGT LTSAINRRSS KQKKRGGKTG IAVMIGLIAS
VGAVTLSNFQ GKVMMTVNAT DVTDVITIPT AAGKNLCIVR AMDVGYMCDD TITYECPVLS
AGNDPEDIDC WCTKSAVYVR YGRCTKTRHS RRSRRSLTVQ THGESTLANK KGAWMDSTKA
TRYLVKTESW ILRNPGYALV AAVIGWMLGS NTMQRVVFVV LLLLVAPAYS FNCLGMSNRD
FLEGVSGATW VDLVLEGDSC VTIMSKDKPT IDVKMMNMEA ANLAEVRSYC YLATVSDLST
KAACPTMGEA HNDKRADPAF VCKQGVVDRG WGNGCGLFGK GSIDTCAKFA CSTKAIGRTI
LKENIKYEVA IFVHGPTTVE SHGNYSTQAG ATQAGRFSIT PAAPSYTLKL GEYGEVTVDC
EPRSGIDTNA YYVMTVGTKT FLVHREWFMD LNLPWSSAGS TVWRNRETLM EFEEPHATKQ
SVIALGSQEG ALHQALAGAI PVEFSSNTVK LTSGHLKCRV KMEKLQLKGT TYGVCSKAFK
FLGTPADTGH GTVVLELQYT GTDGPCKVPI SSVVSLNDLT PVGRLVTVNP FVSVATANAK
VLIELEPPFG DSYIVVGRGE QQINHHWHKS GSSIGKAFTT TLKGAQRLAA LGDTAWDFGS
VGGVFTSVGK AVHQVFGGAF RSLFGGMSWI TQGLLGALLL WMGINARDRS IALTFLAFGG
VLLFLSVNVH ADTGCAIDIS RQELRCGSGV FIHNDVEAWM DRYKYYPETP QGLAKIIQKA
HKEGVCGLRS VSRLEHQMWE AVKDELNTLL KENGVDLSVV VEKQEGMYKS APKRLTATTE
KLEIGWKAWG KSILFAPELA NNTFVVDGPE TKECPTQNRA WNSLEVEDFG FGLTSTRMFL
KVRESNTTEC DSKIIGTAVK SNLAIHSDLS YWIESRLNDT WKLERAVLGE VKSCTWPETH
TLWGDGILES DLIIPVTLAG PRSNHNRRPG YKTQNQGPWD EGRVEIDFDY CPGTTVTLSE
SCGHRGPATR TTTESGKLIT DWCCRSCTLP PL
//