GenomeNet

Database: UniProt
Entry: G3WVD4_SARHA
LinkDB: G3WVD4_SARHA
Original site: G3WVD4_SARHA 
ID   G3WVD4_SARHA            Unreviewed;       557 AA.
AC   G3WVD4;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   27-MAR-2024, entry version 70.
DE   SubName: Full=Hyaluronan binding protein 2 {ECO:0000313|Ensembl:ENSSHAP00000019389.2};
GN   Name=HABP2 {ECO:0000313|Ensembl:ENSSHAP00000019389.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000019389.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000019389.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000019389.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3WVD4; -.
DR   STRING; 9305.ENSSHAP00000019389; -.
DR   Ensembl; ENSSHAT00000019546.2; ENSSHAP00000019389.2; ENSSHAG00000016467.2.
DR   eggNOG; KOG1217; Eukaryota.
DR   eggNOG; KOG3627; Eukaryota.
DR   GeneTree; ENSGT00940000157814; -.
DR   HOGENOM; CLU_006842_18_2_1; -.
DR   InParanoid; G3WVD4; -.
DR   TreeFam; TF329901; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00054; EGF_CA; 2.
DR   CDD; cd00108; KR; 1.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 2.40.20.10; Plasminogen Kringle 4; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000001; Kringle.
DR   InterPro; IPR013806; Kringle-like.
DR   InterPro; IPR018056; Kringle_CS.
DR   InterPro; IPR038178; Kringle_sf.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24264:SF40; HYALURONAN-BINDING PROTEIN 2; 1.
DR   PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR   Pfam; PF00008; EGF; 2.
DR   Pfam; PF00051; Kringle; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   PRINTS; PR00018; KRINGLE.
DR   SMART; SM00181; EGF; 3.
DR   SMART; SM00179; EGF_CA; 2.
DR   SMART; SM00130; KR; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 2.
DR   SUPFAM; SSF57440; Kringle-like; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS01186; EGF_2; 3.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS00021; KRINGLE_1; 1.
DR   PROSITE; PS50070; KRINGLE_2; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW   Kringle {ECO:0000256|ARBA:ARBA00022572, ECO:0000256|PROSITE-
KW   ProRule:PRU00121};
KW   Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW   ECO:0000256|RuleBase:RU363034}.
FT   DOMAIN          73..109
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          150..188
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          193..276
FT                   /note="Kringle"
FT                   /evidence="ECO:0000259|PROSITE:PS50070"
FT   DOMAIN          313..551
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DISULFID        99..108
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        159..176
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        178..187
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   557 AA;  62460 MW;  EAFB81253F269326 CRC64;
     MITRASRLRI LPLIILVGHG ALGFFSLTSP LASIDKDWTP YDYEYSAEYY DHEINNISDY
     EEYIDWTFIE NDTTDLCQPN PCMHGGDCVI NGDNFKCNCP APFTGGKCQA VRNACRKNPC
     VRGECLIIQT PPYYKCACNH PYGNPHCSKV FTACRPNPCK NGGICKRNRR RSKFTCSCPD
     GFRGKFCEIG PEDCYEDKGL NYRGKVSKTI NHNTCLHWNS HLLLREAYNV FMEDAELHGI
     GEHNYCRNPD GDKSPWCFIQ LDKRTLSWEF CDVTSCSSSD VVESLWKPTE PSASSEMFGT
     CGLPEIESKI KRIYGGFKST PGKHPWQASL QAISPLTVSS PNGHLCGGTL IEPCWVLTAA
     HCVMLKAKQI RVVLGVQDLL KSESHEQSFR VEKIFVHPDY QEEDDIPYND IALLKLKAVK
     GQCAQESKYV KTACLSEVPF PSQTECYISG WGETSTGRGS RYLLDAKVQL ISKSHCNAPN
     QYNNLIDETM FCAGGQGIDS CQGDSGGPLT CERDGKYYLY GIVSWGFKCG KKPGVYTLVT
     KYHNWIKDTI QEESGSY
//
DBGET integrated database retrieval system