GenomeNet

Database: UniProt
Entry: A0A401T147_CHIPU
LinkDB: A0A401T147_CHIPU
Original site: A0A401T147_CHIPU 
ID   A0A401T147_CHIPU        Unreviewed;       812 AA.
AC   A0A401T147;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   RecName: Full=Hepatocyte growth factor activator {ECO:0008006|Google:ProtNLM};
GN   ORFNames=chiPu_0014870 {ECO:0000313|EMBL:GCC36376.1};
OS   Chiloscyllium punctatum (Brownbanded bambooshark) (Hemiscyllium punctatum).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC   Elasmobranchii; Galeomorphii; Galeoidea; Orectolobiformes; Hemiscylliidae;
OC   Chiloscyllium.
OX   NCBI_TaxID=137246 {ECO:0000313|EMBL:GCC36376.1, ECO:0000313|Proteomes:UP000287033};
RN   [1] {ECO:0000313|EMBL:GCC36376.1, ECO:0000313|Proteomes:UP000287033}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA   Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA   Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA   Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA   Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT   "Shark genomes provide insights into elasmobranch evolution and the origin
RT   of vertebrates.";
RL   Nat. Ecol. Evol. 2:1761-1771(2018).
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00479}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GCC36376.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BEZZ01000820; GCC36376.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A401T147; -.
DR   STRING; 137246.A0A401T147; -.
DR   Proteomes; UP000287033; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00054; EGF_CA; 2.
DR   CDD; cd00062; FN2; 1.
DR   CDD; cd00108; KR; 2.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.10.10.10; Fibronectin, type II, collagen-binding; 1.
DR   Gene3D; 2.10.25.10; Laminin; 4.
DR   Gene3D; 2.40.20.10; Plasminogen Kringle 4; 2.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000083; Fibronectin_type1.
DR   InterPro; IPR000562; FN_type2_dom.
DR   InterPro; IPR036943; FN_type2_sf.
DR   InterPro; IPR000001; Kringle.
DR   InterPro; IPR013806; Kringle-like.
DR   InterPro; IPR018056; Kringle_CS.
DR   InterPro; IPR038178; Kringle_sf.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24264:SF54; HEPATOCYTE GROWTH FACTOR ACTIVATOR-LIKE; 1.
DR   PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR   Pfam; PF00008; EGF; 4.
DR   Pfam; PF00040; fn2; 1.
DR   Pfam; PF00051; Kringle; 2.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   PRINTS; PR00013; FNTYPEII.
DR   PRINTS; PR00018; KRINGLE.
DR   SMART; SM00181; EGF; 4.
DR   SMART; SM00179; EGF_CA; 2.
DR   SMART; SM00058; FN1; 2.
DR   SMART; SM00059; FN2; 1.
DR   SMART; SM00130; KR; 2.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 2.
DR   SUPFAM; SSF57440; Kringle-like; 3.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS00022; EGF_1; 4.
DR   PROSITE; PS01186; EGF_2; 2.
DR   PROSITE; PS50026; EGF_3; 4.
DR   PROSITE; PS01253; FN1_1; 1.
DR   PROSITE; PS00023; FN2_1; 1.
DR   PROSITE; PS51092; FN2_2; 1.
DR   PROSITE; PS00021; KRINGLE_1; 2.
DR   PROSITE; PS50070; KRINGLE_2; 2.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW   Kringle {ECO:0000256|ARBA:ARBA00022572, ECO:0000256|PROSITE-
KW   ProRule:PRU00121};
KW   Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000287033};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW   ECO:0000256|RuleBase:RU363034}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..812
FT                   /note="Hepatocyte growth factor activator"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019240535"
FT   DOMAIN          57..104
FT                   /note="Fibronectin type-II"
FT                   /evidence="ECO:0000259|PROSITE:PS51092"
FT   DOMAIN          109..147
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          190..228
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          234..316
FT                   /note="Kringle"
FT                   /evidence="ECO:0000259|PROSITE:PS50070"
FT   DOMAIN          318..356
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          399..437
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          443..525
FT                   /note="Kringle"
FT                   /evidence="ECO:0000259|PROSITE:PS50070"
FT   DOMAIN          565..803
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   REGION          21..50
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        32..47
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        118..135
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        137..146
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        199..216
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        218..227
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        327..344
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        346..355
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        408..425
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        427..436
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   812 AA;  91820 MW;  E9FA64C763A881B3 CRC64;
     MLAQYSIILS WCCLAGGLTQ KSPSDSAWRL KSGIPRHGDE QREPPHPLQH PRQVFTDNGW
     PCVFPFRFAS KLHRTCIPHR MTNRKWCATT PNFDKDEQWG YCEWIQPLTH DYCESDPCWH
     GAACVNIPRH NSYQCICKDP FTGRHCEKEK CFERQHLKYY DFGKTWMRSL HGDIEKCICT
     EGGIHSEIVR SKACEINPCL HGGTCHEVED SGESVCECSE NYVGEFCEID TAANCYENNG
     SSYRGAVAQT ASGTDCLAWT SNFISPEFNI KDVEEPLRLG LGDHPYCRNT DNEQEPWCYT
     LTDDHISWDF CDIPQCPKED PCTSHPCKNG GTCIRDPLLN SYHCRCHDEF TGVNCQEVKC
     FDSSHYRYFL IGERWVRIVG DEVENCVCTG SGPECDSVQY EECRVNQCLH DGQCRAVQDA
     EEIICQCRAG FAGKYCDIDK NATCYEGNGM SYKGPEDESA LGIECLPWNE DFLHDQLNIS
     SLEGALQLGL GDHSYCRNPD GDEKPWCYIM LDNHVSWEHC NIPKCGTMSF WSKRIPPIAR
     FTVSIFPQQS PSCGKQAQKG PKPRIVGGSS ALPGSHPWLA AIYIGRNFCT GSLIFSCWVV
     SAAHCFIDSP LTSSIRVVLG QHHFNKTSKN TQVFEIEKYI LHSKYNVFDE TIHDIALLKL
     KKENKRCATK TRFVQTLCLP SVTDSFPDCT KCQIAGWGHT HENATTYPCT LQEATVRLIP
     QELCSSNFLY GTEVTPNMIC AGEMMKSVDA CQGDSGGPLV CYKGDTAYLY GIVSWGEGCA
     KMNKPGVYTR VTKYINWIYR KIKHRPKYNQ KG
//
DBGET integrated database retrieval system