ID A0A2K6C3K6_MACNE Unreviewed; 1691 AA.
AC A0A2K6C3K6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Collagen type IV alpha 5 chain {ECO:0000313|Ensembl:ENSMNEP00000018246.1};
GN Name=COL4A5 {ECO:0000313|Ensembl:ENSMNEP00000018246.1};
OS Macaca nemestrina (Pig-tailed macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9545 {ECO:0000313|Ensembl:ENSMNEP00000018246.1, ECO:0000313|Proteomes:UP000233120};
RN [1] {ECO:0000313|Ensembl:ENSMNEP00000018246.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_011754403.1; XM_011756101.1.
DR STRING; 9545.ENSMNEP00000018246; -.
DR Ensembl; ENSMNET00000042481.1; ENSMNEP00000018246.1; ENSMNEG00000033014.1.
DR GeneID; 105490456; -.
DR KEGG; mni:105490456; -.
DR CTD; 1287; -.
DR GeneTree; ENSGT00940000162034; -.
DR OrthoDB; 2882192at2759; -.
DR Proteomes; UP000233120; Unplaced.
DR Bgee; ENSMNEG00000033014; Expressed in adult mammalian kidney and 12 other cell types or tissues.
DR GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0031594; C:neuromuscular junction; IEA:Ensembl.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0038063; P:collagen-activated tyrosine kinase receptor signaling pathway; IEA:Ensembl.
DR GO; GO:0007528; P:neuromuscular junction development; IEA:Ensembl.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1077; COLLAGEN ALPHA-3(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 18.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Lectin {ECO:0000256|ARBA:ARBA00022734};
KW Reference proteome {ECO:0000313|Proteomes:UP000233120};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1691
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014384270"
FT DOMAIN 1467..1691
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 49..1465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..105
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 139..153
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..213
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..279
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..303
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..402
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..444
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 607..629
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 705..766
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 806..833
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 849..878
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 997..1018
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1039..1054
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1122..1145
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1232..1279
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1400..1414
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1691 AA; 161719 MW; 4D8C42EB1408819E CRC64;
MKLRGVSLAA GLFLLALSLW GQPAEAAACY GCSPGSKCDC SGIKGEKGER GFPGLEGHPG
LPGFPGPEGP PGPRGQKGDD GIPGPPGPKG IRGPPGLPGF PGTPGLPGMP GHDGAPGPQG
IPGCNGTKGE RGFPGSPGFP GLQGPPGPPG IPGMKGEPGS IIMSSLPGPK GNPGYPGPPG
IQGLPGPTGI PGPIGPPGPP GLMGPPGPPG LPGPKGNMGL NFQGPKGEKG EQGLQGPPGP
PGQISEQKRP IDVEFQKGDQ GLPGDRGPPG PPGIRGPPGP PGGEKGEKGE QGEPGKRGKP
GKDGENGQPG IPGLPGDPGY PGEPGRDGEK GQKGDIGPPG PPGLVIPRPG TGVTIGEKGN
IGLPGLPGEK GERGFPGIQG PPGLPGPPGA AVVGPPGPPG FPGERGQKGD EGPPGISIPG
PPGLEGQPGA PGLPGPPGPP GPHIPRSDEI CEAGPPGPPG SPGDKGLQGE QGMKGDKGDT
CFNCIGTGIS GPPGQPGLPG LPGPPGSLGF PGQKGEKGQA GATGSKGLPG IPGAPGAPGF
PGSKGEPGDI LTFPGMKGDK GELGSPGAPG LPGLPGTPGQ DGLPGLPGPK GEPGGITFKG
ERGPPGNPGL PGLPGNIGPM GPPGFGPPGP VGEKGIQGVA GNPGQPGIPG PKGDPGQTIT
QPGKPGLPGN PGRDGEVGLP GDPGLPGQPG LPGIPGSKGE PGIPGIGLPG PPGPKGFPGI
PGPPGAPGTP GRIGLEGPPG PPGFPGPKGE PGFALPGPPG PPGLPGFKGT LGPKGDRGFP
GPPGPPGRTG IDGLPGPKGD VGPNGQPGPM GPPGLPGIGV QGPPGPPGIP GPIGQPGLHG
IPGEKGDPGP PGLDVPGPPG ERGSPGIPGA PGSIGPPGSP GLPGKAGASG FPGTKGEMGM
MGPPGPPGPL GIPGRSGVPG LKGDDGLQGQ PGLPGPAGEK GSKGEPGLPG PPGPMDPNLL
GSKGEKGEPG LPGIPGVSGP KGYQGLPGDP GQPGLSGQPG LPGPPGPKGN PGLPGQPGLI
GPPGLKGTIG DMGFPGPQGV EGPPGPPGAP GQPGSPGLPG QKGDKGDPGI SSIGLPGLPG
PKGEPGLPGY PGNPGIKGSV GDPGLPGLPG TPGAKGQPGL PGFPGTPGPP GPKGISGPPG
NPGLPGETGP VGGGGRPGQP GPPGEKGKPG QDGIPGPAGQ KGEPGQPGFG NPGPPGLPGL
SGQKGDGGLP GIPGNPGLPG PKGEPGFHGF PGVQGPPGPP GSPGPALEGP KGNPGPQGPP
GRPGPTGFQG LPGPEGPPGL PGNGGIKGEK GNPGQPGLPG LPGLKGDQGP PGLQGNPGRP
GLNGMKGDPG LPGVPGFPGM KGPSGIPGSA GPEGEPGLTG PPGPPGLPGP SGQSIIIKGD
AGPPGIPGQP GLKGLPGPQG PQGLPGPTGP PGDPGRNGLP GFDGAGGRKG DPGLPGQPGT
RGLDGPPGPD GLQGPPGPPG TSSIAHGFLI TRHSQTTDAP QCPQGTLQIY EGFSLLYVQG
NKRAHGQDLG TAGSCLRRFS TMPFMFCNIN NVCNFASRND YSYWLSTPEP MPVSMQPLKG
QSIQPFISRC AVCEAPAVVI AVHSQTIQIP RCPQGWDSLW IGYSFMMHTS AGAEGSGQAL
ASPGSCLEEF RSAPFIECHG RGTCNYYANS YSFWLATVDV SDMFSKPQSE TLKAGDLRTR
ISRCQVCMKR T
//