ID A0A3P8WC61_CYNSE Unreviewed; 1092 AA.
AC A0A3P8WC61;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Cynoglossus semilaevis (Tongue sole).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Carangaria; Pleuronectiformes; Pleuronectoidei; Cynoglossidae;
OC Cynoglossinae; Cynoglossus.
OX NCBI_TaxID=244447 {ECO:0000313|Ensembl:ENSCSEP00000025043.1, ECO:0000313|Proteomes:UP000265120};
RN [1] {ECO:0000313|Ensembl:ENSCSEP00000025043.1, ECO:0000313|Proteomes:UP000265120}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=24487278;
RA Chen S., Zhang G., Shao C., Huang Q., Liu G., Zhang P., Song W., An N.,
RA Chalopin D., Volff J.N., Hong Y., Li Q., Sha Z., Zhou H., Xie M., Yu Q.,
RA Liu Y., Xiang H., Wang N., Wu K., Yang C., Zhou Q., Liao X., Yang L.,
RA Hu Q., Zhang J., Meng L., Jin L., Tian Y., Lian J., Yang J., Miao G.,
RA Liu S., Liang Z., Yan F., Li Y., Sun B., Zhang H., Zhang J., Zhu Y., Du M.,
RA Zhao Y., Schartl M., Tang Q., Wang J.;
RT "Whole-genome sequence of a flatfish provides insights into ZW sex
RT chromosome evolution and adaptation to a benthic lifestyle.";
RL Nat. Genet. 46:253-260(2014).
RN [2] {ECO:0000313|Ensembl:ENSCSEP00000025043.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8WC61; -.
DR Ensembl; ENSCSET00000025374.1; ENSCSEP00000025043.1; ENSCSEG00000015995.1.
DR GeneTree; ENSGT00940000153991; -.
DR Proteomes; UP000265120; Chromosome 20.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1104; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 14.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000265120};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 869..1092
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 197..735
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 794..866
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..62
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..697
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1092 AA; 106840 MW; C5DF2054E651019B CRC64;
MTVILQGAGH AGLPGPKGYP GVVGPKGNPG TAGPRGPGLP GLRGSPGPTG DPGPPGPSGF
LGRPGPPGEV DKCCHENEIG SPGPKGEPGF PGNPGRPGRN GLQGHPGLQG PKGMKGDNGN
FAVGLPGPPG LDGNPGDPGL RGISLDGPHG SPGLTGLPGR KGEPGDVFSS GHGIFGLPGF
PGLSGVVGNK GLRGLSGIPG TPGIQGNPGR PGPKGEPGVN GDPGAIGFPG PQCAKCDIIE
TPGPPGPPGN PGQRGFLGHQ GPKGLKGDVG FPGHGPKGLQ GFPGPSGVIG LPGPGGIAGP
VGDPGSAGFP GEKGERGSPG RAGARSLPGR TGSPGPRGKQ GERGFNGRPG QPGDLGLPGV
KGLPGPPGLS GINVTKGPAG LPGRRGLQGS LGQTGTRGQR GIPGERGPDG RPGPPGLKGS
IGAPGKSGLP GITGNPGIKG LPGPRGFPGR IGDPGDPGPT GHHGIPGLPG STGAKGEIGE
SVGHPGPPGP KGLPGDHGPC ASRAHPGDPG DPGPTGRPGS PGQPGIQGGQ GRRGQPGYTG
PRGPYGSAGV PGIPGDKGGV GPPGPSGPYG PSGALGPPGL DGLNGLRGAK GIKGSNGKDT
PGPPGPDGFP GVKGWRGHPG LPGSSHLGPK GLQGPLGHAG RPGFPGEPGY PGKECHRPPQ
GFPGDTGVEG PPGPPGPPGE PGTPREGISP KGDPGPPGLP GSSGLIGVRG IPGPPGFQGD
PGINGPKGEK GSIGLMGVPG RKGQQGFPGP MGLKGRQGCP GNEGLKGASG DIIRLIQVAP
MPGPPGPPGF PGPVGFPGTH GLPGDLGRKG QKGSVGSIGP PGVQGPAGPD GRVGDAGEPG
LTGFTGPQGV PGAPGDPGQP GQRGSSRLGF LLVMHSQSVQ VPQCPDGSTQ LWVGYSLVYL
EGQGQAHAQD LGQAGSCLPV FSTMPFSYCN KAACHYSSRN DKSCWLSTTA PLPMMPLFGQ
EIVSHISRCA VCETVSPVAA FHSQDHTVPM CPPGWRSLWT GYSFLLHAGA GNGGGGQSLT
SSGSCLKDFR THPFIECQGV RGSCHYFANL YSFWLTTVNQ AEQFVTPRSG TIKTDDKQRA
KSSHCHVCLR GK
//