GenomeNet

Database: UniProt
Entry: W5LR38_ASTMX
LinkDB: W5LR38_ASTMX
Original site: W5LR38_ASTMX 
ID   W5LR38_ASTMX            Unreviewed;       405 AA.
AC   W5LR38;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   24-JAN-2024, entry version 53.
DE   SubName: Full=Hepsin {ECO:0000313|Ensembl:ENSAMXP00000025312.2};
OS   Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC   Characoidei; Characidae; Astyanax.
OX   NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000025312.2, ECO:0000313|Proteomes:UP000018467};
RN   [1] {ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA   Jeffery W., Warren W., Wilson R.K.;
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX   PubMed=25329095; DOI=10.1038/ncomms6307;
RA   McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA   Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA   Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA   Yoshizawa M., Warren W.C.;
RT   "The cavefish genome reveals candidate genes for eye loss.";
RL   Nat. Commun. 5:5307-5307(2014).
RN   [3] {ECO:0000313|Ensembl:ENSAMXP00000025312.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JUL-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; W5LR38; -.
DR   STRING; 7994.ENSAMXP00000025312; -.
DR   Ensembl; ENSAMXT00000025330.2; ENSAMXP00000025312.2; ENSAMXG00000024609.2.
DR   eggNOG; KOG3627; Eukaryota.
DR   GeneTree; ENSGT00940000159697; -.
DR   HOGENOM; CLU_006842_19_2_1; -.
DR   InParanoid; W5LR38; -.
DR   Proteomes; UP000018467; Unassembled WGS sequence.
DR   Bgee; ENSAMXG00000024609; Expressed in liver and 7 other cell types or tissues.
DR   GO; GO:0016020; C:membrane; IEA:InterPro.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0070008; F:serine-type exopeptidase activity; IEA:InterPro.
DR   GO; GO:0072378; P:blood coagulation, fibrin clot formation; IEA:Ensembl.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 3.10.250.10; SRCR-like domain; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR015352; Hepsin-SRCR_dom.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR036772; SRCR-like_dom_sf.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24253:SF149; HEPSIN; 1.
DR   PANTHER; PTHR24253; TRANSMEMBRANE PROTEASE SERINE; 1.
DR   Pfam; PF09272; Hepsin-SRCR; 2.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF56487; SRCR-like; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW   Serine protease {ECO:0000256|RuleBase:RU363034}.
FT   DOMAIN          144..388
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   405 AA;  44804 MW;  8C5825858EBFA6FD CRC64;
     KIEKKIWMAL CSPWRVAVAV GLTVVVLGAL GAAIWALVTY LRTAEDTGLY DVQVSAADQR
     LRVFDSVQRR WRHVCSSNAN QLLAAISCEE MGFVSHEFFC VKESELTYGK KISTALYPCK
     CEKGQVVEVL CQECGRRMLP EERIVGGADA RQGSWPWQVS LQYDGVHQCG GSIISDRWIV
     SAAHCFPERY RHVSRWRVLM GSIYNTPIHK NVVIAEVKTV VYHSSYLPFV DANIDDNSRD
     IAVLALTKPL QFNDYIQPVC LPTYGQRLVD GQIGTVTGWG NVEYYGTQAN ILQEANIPII
     SDAVCNAPDY YDNQVTSTMF CAGYEKGGTD SCQGDSGGPF VAADCLSKTS RYRLLGVVSW
     GTGCAMAKKP GVYTRVSRFL PWISSAMRTY ENSPGVHKMA RAATA
//
DBGET integrated database retrieval system