ID G3NKM8_GASAC Unreviewed; 1648 AA.
AC G3NKM8;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000005891.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000005891.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000005891.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 69293.ENSGACP00000005891; -.
DR Ensembl; ENSGACT00000005908.1; ENSGACP00000005891.1; ENSGACG00000004458.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000166749; -.
DR InParanoid; G3NKM8; -.
DR OMA; NQGQDGI; -.
DR TreeFam; TF316865; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000004458; Expressed in pharyngeal gill and 1 other cell type or tissue.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 12.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1424..1648
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 33..83
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 111..212
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 242..634
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 720..1063
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1083..1291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1308..1413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 361..376
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1171..1201
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1260..1282
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1648 AA; 158920 MW; A47EBC8F8EB78E84 CRC64;
CSGKSLCHCD GVKGQKGESG YFGYDGLPGV IGYPGNEGNQ GPPGDKGDLG SAGGSGLKGS
RGLPGRPGFP GPPGLPGLPG HEGHIGLPGL RGCNGTKGDM GYPGFSGVPG YAGLQGQQGP
RGAKGDTFVF PPTRDIKGAT GPPGRDGEDG LGGDPGEPGL SGPPGSAGLP GTPGFPGLQG
EKGQEGPSPV IPGPSGQKGD RGPPGIPGQK GIKLYAEGPK ELKGEPGDVG LKGCQGFPGV
PGTLGVKGEK GGRGDPGSLG KPGKDGVFGE LGQQGDPGDR GYLGPPGIEG ATGEKGERGS
PGFPGEVSRP GTAERGLIIS AAEVQGQRGV PGPRGPEGVQ GEQGFIGFPG LQGEPGVAST
SPPGPQGPPG LAGPKGDPGG PGPLVFGPLG EVGALGAGGA EGEPGEPGIP GDGENFVPGG
VGRPGAKGVK GPLGFPGARG EEGPAGEKGQ VCPDIQGPAG KPGRRGEDGA PGSVGSPGAK
GQTGYNGVQG PKGQQGLIGN KGLQGPQGPI GDPGIMQRVG VKGEEGDRGE SGPPGPDGRD
GSPGFTGRKG ALGQKGNPGP LGQKGNMGRT GDTGPRGLMG QLGPSGPLGI GDPGPVGSKG
NVGVLGPRGD PGNPGEKGSP REIMTSPGDP GPKGEKGLPG IPGAKGVTGS VGAPGFVGPM
GEKGDDGFSV EGPLGFPGFK GLKGPPGTPG VPNFGIKGRA GPSGPPGIPG PKGNIYRGIC
FPGQQGQLGP QGEDGSVGPQ GDPGPEGSDG VPGPVGQVGY PGPKGHSGLN GLPGPGGPVG
RCFPGAPGPD GEEGQTGIFG FTGMRGLKGD RGDPGRPGFG APGPQGSYGV PGFDGTPGPV
GVIGLKGLRG TNGEPGSTGT RGDLGPPGAL GSPGDDGRPG EPGRDGAKGQ TGADGSGGSQ
GPQGPPGDRG EPGPKGVRTA VDIYGEPGEP GAPGSTGVTG SKGESGLTGP QGLDGRTGRP
GDGGPQGPPG DTGRGGWRGP PGPCGLNGNQ GDMGMPGFKG EKGCTGPCGK PGPDGIPGPA
GLKGSKGESS PSGPGSKGPP GAKGNPGCSG GPGTKGEPGD VGDLGPSGAC GPCGPKGALG
APGCRGSPGT QGEKGCDGPP GINVLPGPAG LTGTKGAPGV PGPAGSRGNQ GQDGIPGPPG
EKGAMGAPGS RAGPRGADGK TGSPGYPGEP GSPGACGPPG PNGPGGNPGC YGSPGPPGPS
GPVGKEGVCI EGSKGNNGIP GQRGPKGQSG LKGPPGSPGP GFKGEKGSKG ATGFQGPPGY
PGEQGPPGKP GEDRPPGPRG PVGQMGPTGV TGPVVLSCVS GIKGEAGQAG FVGSQGPPGP
PGRRGQKGED VPLVPGDQGP AGPPGLKGEK GPPGARGIPG GPGLKGVRGS AGRRGPGGRP
GVMGDQGDQG VDGLKGVPGP KGSLGAKGAP GRPGPCGNPP DTNGFMFTRH SQNLLVPECP
AGSAELYSGY SLLFINGNNR AHGQDLGSLG SCLPRFTTMP FLFCNTDSTC RYASRNDYSY
WLSTDQPLPS SVPLISGDSL RNYISRCSVC EARANVIAVH SQTSLVPDCP SGWDALWFGY
SFVMETGVGA EGSGQPLASP GSCLENFRKI PFIECHGRGT CNYYTDSYSY WLAALNPADM
FSKPTPQTDS GDFPARLISR CRVCMKQL
//