ID G3W598_SARHA Unreviewed; 462 AA.
AC G3W598;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 72.
DE SubName: Full=Procollagen C-endopeptidase enhancer {ECO:0000313|Ensembl:ENSSHAP00000010603.2};
GN Name=PCOLCE {ECO:0000313|Ensembl:ENSSHAP00000010603.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000010603.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000010603.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000010603.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00059}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003768870.1; XM_003768822.2.
DR AlphaFoldDB; G3W598; -.
DR STRING; 9305.ENSSHAP00000010603; -.
DR Ensembl; ENSSHAT00000010696.2; ENSSHAP00000010603.2; ENSSHAG00000009152.2.
DR GeneID; 100918291; -.
DR KEGG; shr:100918291; -.
DR CTD; 5118; -.
DR eggNOG; ENOG502QTZ9; Eukaryota.
DR GeneTree; ENSGT00940000159264; -.
DR HOGENOM; CLU_034096_0_0_1; -.
DR TreeFam; TF316506; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR CDD; cd00041; CUB; 2.
DR CDD; cd03576; NTR_PCOLCE; 1.
DR Gene3D; 2.40.50.120; -; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 2.
DR InterPro; IPR000859; CUB_dom.
DR InterPro; IPR001134; Netrin_domain.
DR InterPro; IPR018933; Netrin_module_non-TIMP.
DR InterPro; IPR035814; NTR_PCOLCE.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR008993; TIMP-like_OB-fold.
DR PANTHER; PTHR24251; OVOCHYMASE-RELATED; 1.
DR PANTHER; PTHR24251:SF24; PROCOLLAGEN C-ENDOPEPTIDASE ENHANCER 1; 1.
DR Pfam; PF00431; CUB; 2.
DR Pfam; PF01759; NTR; 1.
DR SMART; SM00643; C345C; 1.
DR SMART; SM00042; CUB; 2.
DR SUPFAM; SSF49854; Spermadhesin, CUB domain; 2.
DR SUPFAM; SSF50242; TIMP-like; 1.
DR PROSITE; PS01180; CUB; 2.
DR PROSITE; PS50189; NTR; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00059}; Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..462
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5029601652"
FT DOMAIN 42..154
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 164..278
FT /note="CUB"
FT /evidence="ECO:0000259|PROSITE:PS01180"
FT DOMAIN 335..454
FT /note="NTR"
FT /evidence="ECO:0000259|PROSITE:PS50189"
FT REGION 282..335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 295..311
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 316..335
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 164..191
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00059"
SQ SEQUENCE 462 AA; 49808 MW; DF8F203765BEB7F9 CRC64;
MLPLIPLFVL GPLFTSFTLL SLTQAQGAPA QSPNYTRPVF LCGGDVTGES GYVASEGFPN
HYPPNKECIW TIMVPEGQTV FLSFRVFDLE LDPSCRYDSL EIFAGAGTSG QRLGHFCGTF
RPGPVVAPGN QVTLRMRADE GTGGRGFLLW YSGRATNGNE HQFCGGRLEK PQGTLTTPNW
PESDYPPGVS CSWLIIAPPE QVISLTFGKF DLEPDTYCRY DSVSIFNGAQ SDDSKRVGKY
CGDTAPSSIT SEGNELLVQF VSDLSVTADG FFASYKAQPR GGGKEGTFSM GNNQPVPPSP
QPLPKVKPPT KPQAPVQENL KPTLATESDT PKTTCPKQCR RTGTLQSNFC ASDFVVTGTV
KSMVRGPGEG ITVTITLTGV YKTGALVLSS TSIGSPLKLY VLCRQCPPIK KGTSYLLMGS
VDEEGRAVLH PNSFVVLYRP NQDQILTNMS KKKCPSQLPA RA
//