ID A0A0D9R530_CHLSB Unreviewed; 1692 AA.
AC A0A0D9R530;
DT 27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT 27-MAY-2015, sequence version 1.
DT 24-JAN-2024, entry version 48.
DE SubName: Full=Collagen type IV alpha 4 chain {ECO:0000313|Ensembl:ENSCSAP00000003719.1};
GN Name=COL4A4 {ECO:0000313|Ensembl:ENSCSAP00000003719.1};
OS Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Chlorocebus.
OX NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000003719.1, ECO:0000313|Proteomes:UP000029965};
RN [1] {ECO:0000313|Ensembl:ENSCSAP00000003719.1, ECO:0000313|Proteomes:UP000029965}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAP00000003719.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AQIB01002322; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002323; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002324; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002325; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002326; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002327; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002328; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002329; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002330; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01002331; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007964687.1; XM_007966496.1.
DR STRING; 60711.ENSCSAP00000003719; -.
DR Ensembl; ENSCSAT00000005500.1; ENSCSAP00000003719.1; ENSCSAG00000007607.1.
DR KEGG; csab:103217992; -.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153991; -.
DR OMA; ISCNVTY; -.
DR OrthoDB; 2882192at2759; -.
DR BioGRID-ORCS; 103217992; 0 hits in 9 CRISPR screens.
DR Proteomes; UP000029965; Chromosome 10.
DR Bgee; ENSCSAG00000007607; Expressed in liver and 3 other cell types or tissues.
DR GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:Ensembl.
DR GO; GO:0032836; P:glomerular basement membrane development; IEA:Ensembl.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF714; COLLAGEN ALPHA-4(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 14.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Reference proteome {ECO:0000313|Proteomes:UP000029965};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1467..1692
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 61..173
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..258
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 367..390
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 404..1459
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..79
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 531..545
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 614..644
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 877..911
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1117..1131
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1220..1243
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1260..1280
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1298..1314
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1334..1377
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1692 AA; 164310 MW; 0FE91E8BEB08B19C CRC64;
MSSLHVVLMR CSFRLTKSLA TGPWSLILIL FSVQYVYGSG KKYVGPCGGR DCSVCHCVPE
KGSRGPPGPP GPQGPIGPLG APGPTGLSGE KGMRGDRGPP GAAGDKGDKG PTGVPGFPGL
DGIPGHPGPP GPRGKPGKCG YNGSRGDPGF PGGRGALGPG GPPGHPGEKG EKGNSVFILG
AIKGIQGDRG DPGLPGLPGS WGAGGPAGPA GYPGEPGLVG PPGQPGRPGL KGNPGVGVKG
QMGDPGEVGQ QGSPGPTLLV EPPDFCLYKG EKGIKGIPGM IGLPGPPGRK GESGIGAKGE
KGIPGFPGPW GDPGSYGSPG FPGLKGELGL VGDPGPFGFL GPKGDPGDRG HPGPPGVLVT
PPLPLKGPPG DPGFPGRYGE TGDVGPPGPP GLLGRPGEAC AGMIGPPGPQ GFPGLPGLPG
EAGIPGRPDS APGKPGKPGS PGLPGAPGLQ GLPGSSVTYC SVGNPGPQGI KGKVGPPGGR
GSKGEKGNEG LCACEPGPMG PPGPPGLPGR QGSKGDLGLP GWLGTKGDPG PPGAEGPPGL
PGKPGASGPP GNKGAKGDVV ISRVKGHKGE RGPDGPPGFP GQPGSHGLDG RAGEKGDPGL
PGDHEDAIPG GKGFPGPLGP PGKAGPVGPP GLGFPGPPGE RGHPGVPGHP GVRGPDGLKG
QKGDTVSCNV TYPGRQGPPG FDGRPGPKGF PGPQGAPGLS GSDGHKGRPG TPGTSEIPGP
PGFRGDIGDP GFGGEKGSSP VGPPGPPGSP GVNGQKGIPG DPAFGHLGPL GKRGLSGVPG
IKGPRGDPGY PGAEGPAGIP GFPGLKGPKG REGHAGFPGV PGPPGHSCER GAPGIPGQPG
LPGDPGSPGA PGGKGLPGDV GPPGPAGMKG LPGLPGRPGA HGPPGLPGIP GPFGDDGLPG
PPGPKGPRGL PGFPGFPGER GKPGAEGCPG TKGEPGEKGM SGFPGDRGVR GAKGAIGPPG
DEGEMAIISK KGKPGEPGPP GDDGFPGERG DKGTPGMQGR RGEPGRYGPP GFHRGEPGEK
GQPGPPGPPG PPGSTGLRGF IGFPGLPGDQ GEPGSPGPPG FSGIDGARGP KGNKGDPAPA
SHFGPPGPKG EPGSSGCPGH FGASGEQGLP GVQGPRGSPG WPGPPGSSGP PGCPGDQGMP
GLRGQPGEMG DPGPRGLQGD PGIPGPPGIK GPSGSPGLNG LHGLKGQKGT KGASGLHDVG
PPGPVGMPGL KGERGDPGSP GISPPGPCGE QGLPGPPGRS GPPGPAGATG RAPKDIPDPG
PSGDQGPPGP DGPRGAPGPP GLPGSVDLLR GEPGDCGLPG PPGLPGPPGP PGYKGFPGCD
GKDGQKGPMG FPGPQGPHGF PGPPGEKGLP GPPGRKGPTG LPGPRGEPGP PADADDCPRI
PGLPGVPGLR GPEGAMGLPG MRGPPGPGCK GEPGLDGRRG MDGIPGSPGP PGRKGDTGED
GYPGGPGPPG PTGDPGPKGF GAGYLSGFLL VLHSQTDQEP TCPLGMPRLW TGYSLLYLEG
QEKAHNQDLG LAGSCLPVFS TLPFAYCNIH QVCHYAQRND RSYWLASAAP LPMMPLSEEA
IRPYVSRCAV CEAPAQAVAV HSQDQSIPPC PQTWRSLWIG YSFLMHTGAG DQGGGQALMS
PGSCLEDFRA APFLECQGRQ GTCHFFANEY SFWLTTVKAD LQFSPAPAPD TLKESQAQRQ
KISRCQVCLK YS
//