ID G1M8F9_AILME Unreviewed; 652 AA.
AC G1M8F9;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 84.
DE SubName: Full=HGF activator {ECO:0000313|Ensembl:ENSAMEP00000015629.2};
GN Name=HGFAC {ECO:0000313|Ensembl:ENSAMEP00000015629.2};
OS Ailuropoda melanoleuca (Giant panda).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000015629.2, ECO:0000313|Proteomes:UP000008912};
RN [1] {ECO:0000313|Ensembl:ENSAMEP00000015629.2, ECO:0000313|Proteomes:UP000008912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20010809; DOI=10.1038/nature08696;
RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA Wang J.;
RT "The sequence and de novo assembly of the giant panda genome.";
RL Nature 463:311-317(2010).
RN [2] {ECO:0000313|Ensembl:ENSAMEP00000015629.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00479}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G1M8F9; -.
DR STRING; 9646.ENSAMEP00000015629; -.
DR Ensembl; ENSAMET00000016281.2; ENSAMEP00000015629.2; ENSAMEG00000014815.2.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG3627; Eukaryota.
DR GeneTree; ENSGT00940000159778; -.
DR HOGENOM; CLU_006842_18_1_1; -.
DR TreeFam; TF329901; -.
DR Proteomes; UP000008912; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00061; FN1; 1.
DR CDD; cd00062; FN2; 1.
DR CDD; cd00108; KR; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.10.10.10; Fibronectin, type II, collagen-binding; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 2.40.20.10; Plasminogen Kringle 4; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000083; Fibronectin_type1.
DR InterPro; IPR000562; FN_type2_dom.
DR InterPro; IPR036943; FN_type2_sf.
DR InterPro; IPR000001; Kringle.
DR InterPro; IPR013806; Kringle-like.
DR InterPro; IPR018056; Kringle_CS.
DR InterPro; IPR038178; Kringle_sf.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF43; HEPATOCYTE GROWTH FACTOR ACTIVATOR; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF00039; fn1; 1.
DR Pfam; PF00040; fn2; 1.
DR Pfam; PF00051; Kringle; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR PRINTS; PR00013; FNTYPEII.
DR PRINTS; PR00018; KRINGLE.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00058; FN1; 1.
DR SMART; SM00059; FN2; 1.
DR SMART; SM00130; KR; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF57440; Kringle-like; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01253; FN1_1; 1.
DR PROSITE; PS51091; FN1_2; 1.
DR PROSITE; PS00023; FN2_1; 1.
DR PROSITE; PS51092; FN2_2; 1.
DR PROSITE; PS00021; KRINGLE_1; 1.
DR PROSITE; PS50070; KRINGLE_2; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Kringle {ECO:0000256|ARBA:ARBA00022572, ECO:0000256|PROSITE-
KW ProRule:PRU00121};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..652
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030171429"
FT DOMAIN 99..146
FT /note="Fibronectin type-II"
FT /evidence="ECO:0000259|PROSITE:PS51092"
FT DOMAIN 156..194
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 196..236
FT /note="Fibronectin type-I"
FT /evidence="ECO:0000259|PROSITE:PS51091"
FT DOMAIN 237..275
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 281..363
FT /note="Kringle"
FT /evidence="ECO:0000259|PROSITE:PS50070"
FT DOMAIN 405..643
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 34..102
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..67
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 165..182
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 184..193
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 246..263
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 265..274
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 652 AA; 69903 MW; AE5E04038F37AC6A CRC64;
MGRWAWGPSL CAPPGMALLL LLLLVPQGAQ PLAGRNLTEP PEPNATVTPG TPTIPVTSVS
PGTPATSAPE AQGPRGRGLT PPPRAAPSGS SPGGPVLTED GRPCRLPFRY GGRMFHSCTS
EGSAHRKWCA TTHNYDRDRA WGYCAQVSVP REGPAAPDPC ASSPCLNGGS CSNTQDPESY
HCTCPTAFTG KDCGTEKCFD ETRYEHLEVG DRWARVSQGQ VEQCECAGGQ IRCEGARHTA
CLSSPCLNGG TCHLIVATGT TVCSCPPGHA GRLCNVVPAQ RCFVGNGTDY RGVASTAASG
LSCLAWNSDL LYQELHVDSV GAAALLGLGP HAYCRNPDKD ERPWCYVVKD SALSWEYCRL
VACESLARIP PLPAEVLLSR AEPAPVGRQA CGKRHKKRSF LRPRIIGGSS SLPGSHPWLA
AIYIGNNFCA GSLVHTCWVV SAAHCFSNSP RRESVLVVLG QHFFNQTTDV TQTFGIEKYI
PYPMYSVFNP SDHDLVLIRL KKKGDRCAVR SQFVQPICLP EPSSPFPAGH KCQIAGWGHQ
DENVSGYSSS LREALVPLVA DHKCSSPEVY GADISPNMLC AGYFDCKSDA CQGDSGGPLA
CEKNGVAYLY GIISWGDGCG RLNKPGVYTR VAKYVDWIKD RIWPPKRPVD SS
//