ID W5LR38_ASTMX Unreviewed; 405 AA.
AC W5LR38;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 24-JAN-2024, entry version 53.
DE SubName: Full=Hepsin {ECO:0000313|Ensembl:ENSAMXP00000025312.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000025312.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000025312.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; W5LR38; -.
DR STRING; 7994.ENSAMXP00000025312; -.
DR Ensembl; ENSAMXT00000025330.2; ENSAMXP00000025312.2; ENSAMXG00000024609.2.
DR eggNOG; KOG3627; Eukaryota.
DR GeneTree; ENSGT00940000159697; -.
DR HOGENOM; CLU_006842_19_2_1; -.
DR InParanoid; W5LR38; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000024609; Expressed in liver and 7 other cell types or tissues.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0070008; F:serine-type exopeptidase activity; IEA:InterPro.
DR GO; GO:0072378; P:blood coagulation, fibrin clot formation; IEA:Ensembl.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 3.10.250.10; SRCR-like domain; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR015352; Hepsin-SRCR_dom.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR036772; SRCR-like_dom_sf.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24253:SF149; HEPSIN; 1.
DR PANTHER; PTHR24253; TRANSMEMBRANE PROTEASE SERINE; 1.
DR Pfam; PF09272; Hepsin-SRCR; 2.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF56487; SRCR-like; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Serine protease {ECO:0000256|RuleBase:RU363034}.
FT DOMAIN 144..388
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 405 AA; 44804 MW; 8C5825858EBFA6FD CRC64;
KIEKKIWMAL CSPWRVAVAV GLTVVVLGAL GAAIWALVTY LRTAEDTGLY DVQVSAADQR
LRVFDSVQRR WRHVCSSNAN QLLAAISCEE MGFVSHEFFC VKESELTYGK KISTALYPCK
CEKGQVVEVL CQECGRRMLP EERIVGGADA RQGSWPWQVS LQYDGVHQCG GSIISDRWIV
SAAHCFPERY RHVSRWRVLM GSIYNTPIHK NVVIAEVKTV VYHSSYLPFV DANIDDNSRD
IAVLALTKPL QFNDYIQPVC LPTYGQRLVD GQIGTVTGWG NVEYYGTQAN ILQEANIPII
SDAVCNAPDY YDNQVTSTMF CAGYEKGGTD SCQGDSGGPF VAADCLSKTS RYRLLGVVSW
GTGCAMAKKP GVYTRVSRFL PWISSAMRTY ENSPGVHKMA RAATA
//