ID A0A3P8SGZ9_AMPPE Unreviewed; 718 AA.
AC A0A3P8SGZ9;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Proteoglycan 4b {ECO:0000313|Ensembl:ENSAPEP00000011828.1};
OS Amphiprion percula (Orange clownfish) (Lutjanus percula).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Pomacentridae; Amphiprion.
OX NCBI_TaxID=161767 {ECO:0000313|Ensembl:ENSAPEP00000011828.1, ECO:0000313|Proteomes:UP000265080};
RN [1] {ECO:0000313|Ensembl:ENSAPEP00000011828.1, ECO:0000313|Proteomes:UP000265080}
RP NUCLEOTIDE SEQUENCE.
RA Lehmann R.;
RT "Finding Nemo's genes: A chromosome-scale reference assembly of the genome
RT of the orange clownfish Amphiprion percula.";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSAPEP00000011828.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8SGZ9; -.
DR STRING; 161767.ENSAPEP00000011828; -.
DR Ensembl; ENSAPET00000012138.1; ENSAPEP00000011828.1; ENSAPEG00000008449.1.
DR GeneTree; ENSGT00530000063751; -.
DR OMA; RRITDVW; -.
DR Proteomes; UP000265080; Chromosome 2.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00094; HX; 1.
DR Gene3D; 4.10.410.20; -; 2.
DR Gene3D; 2.110.10.10; Hemopexin-like domain; 1.
DR InterPro; IPR000585; Hemopexin-like_dom.
DR InterPro; IPR036375; Hemopexin-like_dom_sf.
DR InterPro; IPR018487; Hemopexin-like_repeat.
DR InterPro; IPR018486; Hemopexin_CS.
DR InterPro; IPR036024; Somatomedin_B-like_dom_sf.
DR InterPro; IPR001212; Somatomedin_B_dom.
DR PANTHER; PTHR22917; HEMOPEXIN DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR22917:SF1; PROTEOGLYCAN 4; 1.
DR Pfam; PF00045; Hemopexin; 1.
DR Pfam; PF01033; Somatomedin_B; 2.
DR SMART; SM00120; HX; 2.
DR SMART; SM00201; SO; 2.
DR SUPFAM; SSF50923; Hemopexin-like domain; 1.
DR SUPFAM; SSF90188; Somatomedin B domain; 2.
DR PROSITE; PS00024; HEMOPEXIN; 1.
DR PROSITE; PS51642; HEMOPEXIN_2; 1.
DR PROSITE; PS00524; SMB_1; 2.
DR PROSITE; PS50958; SMB_2; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000265080};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..718
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017938390"
FT DOMAIN 21..64
FT /note="SMB"
FT /evidence="ECO:0000259|PROSITE:PS50958"
FT DOMAIN 65..104
FT /note="SMB"
FT /evidence="ECO:0000259|PROSITE:PS50958"
FT REPEAT 531..578
FT /note="Hemopexin"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01011"
FT REGION 110..485
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 110..146
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 193..306
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 326..404
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..437
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 438..457
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..485
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 718 AA; 77848 MW; B3D3ECB801156AC6 CRC64;
MSSTAFYTVI LLACALKFSA SQTSCKGRCG AEYYRGYMCQ CDYNCLAYGE CCRDFESQCT
TKNSCLGRCG EGFKRGRLCS CDPDCVKYKQ CCPDYKRHCD AEEEISGSSS ATAPVKTNSC
DNINNNKPKE PTLNDATEQP STFSEGNDAD EYVIPPDGLT DELSDDTNSK IYPVDDFSNN
GMEEMEASPI PETSSGYGPT TADLLSQGST DPTAVTDTME FTTEPVTVLS QTETPPDDAD
LSTLDSTTGE APTESTDASD FTSSPDVGTA PPQPTTAADA DSTQPTATSE PQTEPSTPTS
IPELETEGQS EIPPTDAVFP SEEPEATTVP VSTVPDPSTL DSTATPEETT SNPNTEDNSS
DVTTSPPSSL ADVEDDSTNI SPEGSELDAA TTDLPSSTAA VQDDTTHDVP PEITTADPLK
VTPKPTNTPT SEPTTKPQDK PDPYKPLPDK PTSKPETKPL DVEQTSNIED TRGFQGDDSN
DTNLCSGRPV GAVTTLRNGT VAVFRGHYFW FLDRNRVPGP ARGITQVWGV PSPVDTVFTR
CNCQGKTYIF KGNQYWRFEN DALDPGYPKA VSTGFDGLRG HITAALSVPQ HQTRRESVYF
FKRGGYVQKY SYQFGTSPTC SRKPQYAIYT IRNRVVRQAV SVLEPAINIR TTWRGFPSTI
TAAVSIPNNR EPEGYKYYVF SRTKSYNVRM NGERPVIAAP KANTAPQSND IFKCPKKV
//