ID A0A1U8BWG5_MESAU Unreviewed; 650 AA.
AC A0A1U8BWG5;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Hepatocyte growth factor activator {ECO:0000313|RefSeq:XP_012967878.1};
GN Name=Hgfac {ECO:0000313|RefSeq:XP_012967878.1};
OS Mesocricetus auratus (Golden hamster).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Cricetinae; Mesocricetus.
OX NCBI_TaxID=10036 {ECO:0000313|Proteomes:UP000189706, ECO:0000313|RefSeq:XP_012967878.1};
RN [1] {ECO:0000313|RefSeq:XP_012967878.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00479}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_012967878.1; XM_013112424.2.
DR AlphaFoldDB; A0A1U8BWG5; -.
DR STRING; 10036.ENSMAUP00000014508; -.
DR GeneID; 101842778; -.
DR KEGG; maua:101842778; -.
DR CTD; 3083; -.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG3627; Eukaryota.
DR OrthoDB; 4629979at2759; -.
DR Proteomes; UP000189706; Unplaced.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00061; FN1; 1.
DR CDD; cd00062; FN2; 1.
DR CDD; cd00108; KR; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.10.10; Fibronectin, type II, collagen-binding; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 2.40.20.10; Plasminogen Kringle 4; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000083; Fibronectin_type1.
DR InterPro; IPR000562; FN_type2_dom.
DR InterPro; IPR036943; FN_type2_sf.
DR InterPro; IPR000001; Kringle.
DR InterPro; IPR013806; Kringle-like.
DR InterPro; IPR018056; Kringle_CS.
DR InterPro; IPR038178; Kringle_sf.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF43; HEPATOCYTE GROWTH FACTOR ACTIVATOR; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00039; fn1; 1.
DR Pfam; PF00040; fn2; 1.
DR Pfam; PF00051; Kringle; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR PRINTS; PR00013; FNTYPEII.
DR PRINTS; PR00018; KRINGLE.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00058; FN1; 1.
DR SMART; SM00059; FN2; 1.
DR SMART; SM00130; KR; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57440; Kringle-like; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01253; FN1_1; 1.
DR PROSITE; PS51091; FN1_2; 1.
DR PROSITE; PS00023; FN2_1; 1.
DR PROSITE; PS51092; FN2_2; 1.
DR PROSITE; PS00021; KRINGLE_1; 1.
DR PROSITE; PS50070; KRINGLE_2; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Kringle {ECO:0000256|ARBA:ARBA00022572, ECO:0000256|PROSITE-
KW ProRule:PRU00121};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000189706};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..34
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 35..650
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010582084"
FT DOMAIN 99..146
FT /note="Fibronectin type-II"
FT /evidence="ECO:0000259|PROSITE:PS51092"
FT DOMAIN 154..192
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 194..234
FT /note="Fibronectin type-I"
FT /evidence="ECO:0000259|PROSITE:PS51091"
FT DOMAIN 235..273
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 279..361
FT /note="Kringle"
FT /evidence="ECO:0000259|PROSITE:PS50070"
FT DOMAIN 403..641
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 32..97
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..70
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..97
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 163..180
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 182..191
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 244..261
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 263..272
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 650 AA; 69654 MW; F88761B31E8148C9 CRC64;
MGQRAWVPSP CPISKPCPFL LLLLLLVEPR GTQPQAGRNH TEPPGPNVTA TPVTPTIPVA
SGNLNTSIAS APEAAPEGPH GGSPLSPSSS PPGGQVLTES GQPCRFPFRY GGRMLHSCTS
EGSAYRKWCA TTHNYDRDRA WGYCAEATLS VEAVLDPCAS GPCLNGGTCS STHDHVTYHC
ACSLAFTGKD CGTEKCFDET RYEYFEVGDH WARVSQGSVE QCSCTAGQAR CEGTHHTACL
SSPCLNGGTC HLIVGTGISV CACPLGYAGR FCNIVPTELC ILGNGTEYRG VASTTASGLS
CLAWNSDLLY QELHVDSVGA AALLGLGPHA YCRNPDKDER PWCYVVKDNA VSWEYCHLTA
CESLVRTQSQ PPEVLVTLAA SAPTARPTCG KRHKKRTFLR PRIIGGSSSL PGSHPWLAAI
YIGNSFCAGS LVHTCWVVSA AHCFANNPPR DSVTVVLGQH FFNRTTDVTQ TFGIEKYVPY
SLYSVFNPND HDLVLIRLKK KGDRCAVRSQ FVQPICLPEA GSSFPVGHKC QIAGWGHMDE
NASDYSSSLR EALVPLVADH KCSSPEVYGA DISPNMLCAG YFDCKSDACQ GDSGGPLACE
KNGVAYLYGI ISWGDGCGRL NKPGVYTRVS NYVDWINDRI RPPKRPAAAS
//