ID A0A2Y9T0U1_PHYMC Unreviewed; 1685 AA.
AC A0A2Y9T0U1;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Collagen alpha-5(IV) chain isoform X1 {ECO:0000313|RefSeq:XP_023983993.1};
GN Name=COL4A5 {ECO:0000313|RefSeq:XP_023983993.1};
OS Physeter macrocephalus (Sperm whale) (Physeter catodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Physeteridae; Physeter.
OX NCBI_TaxID=9755 {ECO:0000313|Proteomes:UP000248484, ECO:0000313|RefSeq:XP_023983993.1};
RN [1] {ECO:0000313|RefSeq:XP_023983993.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_023983993.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_023983993.1; XM_024128225.2.
DR Ensembl; ENSPCTT00005024971; ENSPCTP00005022682; ENSPCTG00005014781.
DR KEGG; pcad:102975365; -.
DR Proteomes; UP000248484; Chromosome 21.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 19.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119,
KW ECO:0000313|RefSeq:XP_023983993.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000248484};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1685
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5015862959"
FT DOMAIN 1461..1685
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 49..1459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..105
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 139..153
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..213
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..279
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 289..303
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..402
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..462
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 705..728
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 752..766
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 817..837
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 849..880
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1040..1054
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1123..1166
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1232..1270
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1394..1408
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1685 AA; 161097 MW; A91CDBD0C45BDFE7 CRC64;
MKLRGVSLAA GLFLLALSLW GQPAEAAACY GCSPGSKCDC SGIKGEKGER GFPGLEGHPG
LPGFPGPEGP PGPRGQKGSD GIPGPPGPKG IRGPPGLPGF PGTPGLPGMP GHDGAPGPQG
IPGCNGTKGE RGFPGSPGFP GLQGPPGPPG IPGMKGEPGS IIMSSLPGPK GNPGFPGPPG
IQGPAGPTGI PGPIGPLGPP GLMGPPGPPG LPGPKGNMGL NFQGPKGEKG EQGLQGPPGP
PGQISEQKRP IDVEFQKGDQ GLPGDRGPPG PPGIRGPPGP PGGVKGEKGE QGEPGKRGKP
GKDGENGQPG IPGLPGDPGY PGEPGRDGEK GQKGDIGPTG PPGLVIPRPG TGVTVGEKGN
IGLPGLPGEK GERGFPGIQG PPGLPGPPGL AVTGPPGPPG FPGERGQKGD EGPPGISIPG
SPGLDGQPGA PGLPGPPGPP GPHIPPSDEV CEAGPPGPPG SPGDRGLQGE QGVKGDKGDT
CFNCIGTGVS GPPGEPGLPG LPGPPGSLGF PGQKGEKGHA GATGPKGLTG IPGAPGAPGF
PGPKGEPGDI LTFPGMKGDK GDLGSPGAPG LPGLPGTPGQ DGLPGLPGPK GEPGGIAFKG
ERGPPGNPGF PGLPGNRGPM GPLGFGPPGP PGEKGIQGVA GNPGQPGIPG PKGDPGQTIA
QPGKPGLPGN PGRDGEVGLP GEPGLPGQPG LPGIPGSKGE PGIPGIGLPG PPGPKGFPGI
PGPPGAPGTP GRIGLEGPSG PPGFPGLKGE PGFGLPGPPG PPGLPGFKGI LGPKGDRGFP
GPQGPPGQAG LDGLPGPKGD IGPNGQPGTM GPPGLPGTGV QGPPGPPGIP GPIGPPGLHG
IPGEKGDPGP PGFDVPGPPG ERGSPGIPGA PGPMGPPGSP GLPGKAGASG FPGAKGEMGM
MGPPGPTGPL GIPGRSGVPG LKGDDGLQGQ PGLPGPAGEK GSKGEPGLPG LPGPMDPDLL
GSKGEKGDPG LPGIPGVAGP KGYQGLPGDP GQPGLSGQPG LPGPSGPKGN PGLPGKPGLT
GPPGLKGNIG DMGFPGPQGA KGSPGPPGVP GQPGSPGLPG QKGEKGDPGI SGIGLPGLPG
PKGEPGLPGY PGNPGIKGAM GDTGLPGLPG TPGAKGQPGL PGFPGTPGLP GPKGINGPPG
NPGLPGEPGP VGGGGRPGPP GPPGEKGKPG QDGIPGPAGQ KGEPGQPGFG IPGPPGLPGL
SGQKGDGGLP GIPGNPGLPG PKGEPGFHGF PGLQGPPGPP GSPGPALEGP KGNPGPQGPP
GRPGLPGPEG PRGLPGIGGI KGERGNPGQP GQPGLSGLKG DQGPPGLQGN PGRPGLNGMK
GDPGLPGVPG FPGMKGPSGE PGSTGPEGDP GLIGPPGPPG LPGPSGQSIV IKGDAGPPGV
PGQPGLKGLP GLPGPQGLPG PIGPPGDPGR NGLPGFDGAG GHKGDPGLPG QPGIRGLDGP
PGPDGLQGPP GPPGTSSVAH GFLITRHSQT TDAPQCPQGT IQVYEGFSLL YVQGNKRAHG
QDLGTAGSCL RRFSTMPFMF CNINNVCNFA SRNDYSYWLS TPEPMPMSME PLKGQSIQPF
ISRCAVCEAP AVVIAVHSQT IQIPRCPQGW DSLWIGYSFM MHTSAGAEGS GQALASPGSC
LEEFRSAPFI ECHGRGTCNY YANSYSFWLA TVDVSDMFSK PQSETLKAGD LRTRISRCQV
CMKRT
//