ID A0A2K5V539_MACFA Unreviewed; 1688 AA.
AC A0A2K5V539;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 24-JAN-2024, entry version 28.
DE SubName: Full=Collagen type IV alpha 6 chain {ECO:0000313|Ensembl:ENSMFAP00000019848.2};
GN Name=COL4A6 {ECO:0000313|Ensembl:ENSMFAP00000019848.2};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000019848.2, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000019848.2, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000019848.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSMFAT00000070398.2; ENSMFAP00000019848.2; ENSMFAG00000032821.2.
DR VEuPathDB; HostDB:ENSMFAG00000032821; -.
DR GeneTree; ENSGT00940000153991; -.
DR Proteomes; UP000233100; Chromosome X.
DR Bgee; ENSMFAG00000032821; Expressed in lung and 4 other cell types or tissues.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0030246; F:carbohydrate binding; IEA:UniProtKB-KW.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF533; COLLAGEN ALPHA-6(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 20.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Lectin {ECO:0000256|ARBA:ARBA00022734};
KW Reference proteome {ECO:0000313|Proteomes:UP000233100};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1688
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030051317"
FT DOMAIN 1464..1688
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 58..91
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 108..315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 401..740
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 784..876
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 912..1111
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1184..1458
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 194..216
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 427..441
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 606..620
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1688 AA; 162890 MW; 361E7A4B414C7595 CRC64;
MLINKLWLLL VTLCLTEELA GAGEKSYGKP CGGQDCSGSC QCFPEKGARG RPGPIGIQGP
SGPQGFTGST GLSGLKGERG SPGLLGPYGP KGDKGPMGVP GFLGINGIPG HPGQPGPRGP
PGLDGCNGTQ GAVGFPGPDG YPGLLGPPGL PGQKGSKGDP VLAPGSFKGM KGDPGLPGLD
GITGPQGAPG SPGAVGPAGP PGLQGPPGPP GPPGPDGNMG LGFQGEKGVK GDVGLPGPAG
PPPSTGELEF MGFPKGKKGS KGEPGPKGFP GISGPPGFPG LGTTGEKGEK GIPGLPGPRG
PMGSEGVQGP PGQQGKKGTL GFPGLNGFQG IKGEKGDIGL PGPDVFIDID GAVISGNPGD
PGVPGLPGLK GDEGIQGLRG PSGVAGLPAL SGVPGALGPQ GFPGLKGDQG NPGRTTIGAA
GLPGRDGLPG PPGPPGPPGP EFEAETLHNK EPGFPGLRGE QGPKGNPGLK GIKGDSGFCA
CDGGVPNTGP PGEPGPPGPR GLIGLPGLKG ARGDRGSGGA QGPAGAPGLV GSPGPSGPKG
KKGEPILSTI SGMPGDRGDS GSQGFPGVIG KPGNDGVPGL PGLPGLPGDG GQGFPGEKGL
PGLPGEKGHP GPPGLPGIGL PGLPGPRGLP GDKGKDGLPG QQGPPGSKGI TLPCIIPGSY
GPSGFPGTPG FPGPKGSRGL PGTPGQPGSS GNKGKPGSPG LVHLPELPGF PGPRGEKGLP
GFPGLPGKDG LPGTIGSPGL PGSKGATGDI FGAENGAPGE QGLQGLTGDK GLLGDSGLPG
LKGVYGKPGL LGPKGERGSP GTPGPVGQPG TPGSSGPYGI KGKSGIPGAP GFPGTSGHPG
KKGTRGEKGP PGSIVKKGLP GLKGLPGNPG LIGLKGSPGS PGVAGLPALS GPKGEKGSVG
FVGFPGIPGL PGIPGTRGLK GIPGSTGKMG PSGHAGTPGE KGDRGNPGPV GIPGPRRPMS
NLWLKGDKGS QGSAGSDGFP GPRGDKGEAG QPGPPGLPGA PGLPGTIKGV SGKPGPPGFM
GIRGLPGLKG SSGITGFPGM PGESGSQGIR GSSGLPGTSG LPGLKGDNGQ TLEISGSPGP
KGQPGESGFK GTKGRDGPIG NIGFPGNKGE DGKVGVSGDV GLHGAPGFPG VAGMRGEPGL
PGSSGHQGAI GPLGPPGLIG PKGFPGFPGL HGLNGLPGTK GTHGTPGPSI TGVPGPAGLP
GPKGEKGYPG IGIGAPGKPG LRGQKGDRGF PGLQGPAGLP GAPGISLPSL IAGQPGDPGR
PGLDGERGRP GPPGPPGPTG PSSNQGNTGD PGFPGIPGPK GPKGDQGIPG FSGLPGELGL
KGMRGEPGFM GTPGKVGPPG DPGFPGMKGK AGPRGSSGPQ GAPGQTPTAE AVQVPPGPLG
LPGIDGIPGL TGDPGAQGPV GLQGSKGLPG IPGKDGPSGL PGPPGALGDP GLPGLQGPPG
FEGAPGQQGP FGMPGMPGQS ARVGYTLVKH SQSEQVPLCP IGMSQLWVGY SLLFVEGQEK
AHNQDLGFAG SCLPRFSTMP FIYCNINEVC HYARRNDKSY WLSTTAPIPM MPVSQTQIPQ
YISRCSVCEA PSQAIAVHSQ DITIPQCPLG WRSLWIGYSF LMHTAAGAEG GGQSLVSPGS
CLEDFRATPF IECSGARGTC HYFANKYSFW LTTVEERQQF GELPVSETLK AGQLHTRVSR
CQVCMKSL
//