ID A0A493TDH0_ANAPP Unreviewed; 738 AA.
AC A0A493TDH0;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE SubName: Full=Macrophage stimulating 1 {ECO:0000313|Ensembl:ENSAPLP00000023966.1};
GN Name=MST1 {ECO:0000313|Ensembl:ENSAPLP00000023966.1};
OS Anas platyrhynchos platyrhynchos (Northern mallard).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Anas.
OX NCBI_TaxID=8840 {ECO:0000313|Ensembl:ENSAPLP00000023966.1, ECO:0000313|Proteomes:UP000016666};
RN [1] {ECO:0000313|Proteomes:UP000016666}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Hou Z.-C., Zhou Z.-K., Zhu F., Hou S.-S.;
RT "A new Pekin duck reference genome.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSAPLP00000023966.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00121}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A493TDH0; -.
DR Ensembl; ENSAPLT00000021779.1; ENSAPLP00000023966.1; ENSAPLG00000004404.2.
DR GeneTree; ENSGT00940000159461; -.
DR Proteomes; UP000016666; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00108; KR; 4.
DR CDD; cd01099; PAN_AP_HGF; 1.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 3.50.4.10; Hepatocyte Growth Factor; 1.
DR Gene3D; 2.40.20.10; Plasminogen Kringle 4; 4.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR024174; HGF/MST1.
DR InterPro; IPR000001; Kringle.
DR InterPro; IPR013806; Kringle-like.
DR InterPro; IPR018056; Kringle_CS.
DR InterPro; IPR038178; Kringle_sf.
DR InterPro; IPR003609; Pan_app.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR PANTHER; PTHR24261:SF12; HEPATOCYTE GROWTH FACTOR-LIKE PROTEIN-RELATED; 1.
DR PANTHER; PTHR24261; PLASMINOGEN-RELATED; 1.
DR Pfam; PF00051; Kringle; 4.
DR Pfam; PF00024; PAN_1; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PIRSF; PIRSF001152; HGF_MST1; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR PRINTS; PR00018; KRINGLE.
DR SMART; SM00130; KR; 4.
DR SMART; SM00473; PAN_AP; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF57414; Hairpin loop containing domain-like; 1.
DR SUPFAM; SSF57440; Kringle-like; 4.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS00021; KRINGLE_1; 3.
DR PROSITE; PS50070; KRINGLE_2; 4.
DR PROSITE; PS50948; PAN; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00121};
KW Kringle {ECO:0000256|ARBA:ARBA00022572, ECO:0000256|PROSITE-
KW ProRule:PRU00121}; Reference proteome {ECO:0000313|Proteomes:UP000016666};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..738
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019852963"
FT DOMAIN 50..137
FT /note="Apple"
FT /evidence="ECO:0000259|PROSITE:PS50948"
FT DOMAIN 141..218
FT /note="Kringle"
FT /evidence="ECO:0000259|PROSITE:PS50070"
FT DOMAIN 222..300
FT /note="Kringle"
FT /evidence="ECO:0000259|PROSITE:PS50070"
FT DOMAIN 312..391
FT /note="Kringle"
FT /evidence="ECO:0000259|PROSITE:PS50070"
FT DOMAIN 399..478
FT /note="Kringle"
FT /evidence="ECO:0000259|PROSITE:PS50070"
FT DOMAIN 511..736
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 19..54
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 223..300
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00121"
FT DISULFID 244..283
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00121"
FT DISULFID 272..295
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00121"
FT DISULFID 334..373
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00121"
FT DISULFID 362..385
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00121"
SQ SEQUENCE 738 AA; 82304 MW; B0CDC4C3FEAF5817 CRC64;
MPVPRVLLAV AVAVALGAGR AGTGGGTAGS GQPRGGPIRP RAPSAQLPLC SPGSRSPLND
FQRLRATELL PLPPEAPPPP EPGPAEQCAQ RCATSPACRA FHHERQSQLC QLLRWTQHSP
GVRLQKNIHC DLYQKKDYLR DCIVADGISY RGTRATTEKG LRCQHWQATT PHDHRFLPSP
RNGLEENYCR NPDRDKRGPW CYTVDPNVRH QSCGIKKCQD AICMTCNGED YRGFVDHTES
GTECQRWDLQ HPHKHPYHPD KYPEKGLDDN YCRNPDGSEQ PWCYTVDPAR EREYCRIRVC
KKRPRPLNVT TNCFRGKGEG YRGRVNVTVS GIPCQRWDAQ APHRHHFVPE KYPCKDLQEN
YCRNPDGSEA PWCFTTRPSV RVAFCFHIRR CDDELGAQEC YHGHGETYRG HVSKTRKGIT
CQRWDARTPH VPQISPATHP EAHLEENYCR NPDNDSHGPW CYTMDPRMPF DYCAIKPCSG
NAVPSILESA EAVTFEQCGR RDERLQLKGR IVGGQPGNSP WTVSIRNRAG VHFCGGSLVN
EQWVISIRQC FSSCDADLSG YEVHLGLLFK DPGPADPDLQ AIPIVRILCG PSESHLVLLK
LARPAVLNKR VALICLPPER YVVPAGTICE IAGWGETRGT ADSRVLNVAQ LPVLAHGECQ
AALRGRLKES ELCTAPLRSG VGACEGDYGG PLACLTADCW VLEGVITPSR VCARTDQPAL
FIRVSLYVDW IHKVMKMV
//