ID H3C2U6_TETNG Unreviewed; 1495 AA.
AC H3C2U6;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 75.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000002564.1, ECO:0000313|Proteomes:UP000007303};
RN [1] {ECO:0000313|Proteomes:UP000007303}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|Ensembl:ENSTNIP00000002564.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 99883.ENSTNIP00000002564; -.
DR Ensembl; ENSTNIT00000000154.1; ENSTNIP00000002564.1; ENSTNIG00000013921.1.
DR GeneTree; ENSGT00940000168406; -.
DR InParanoid; H3C2U6; -.
DR OMA; FDPNQDK; -.
DR TreeFam; TF344135; -.
DR Proteomes; UP000007303; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF588; COLLAGEN ALPHA-2(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 12.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1273..1495
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..163
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 215..276
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 298..697
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 736..762
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 884..979
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 998..1081
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1143..1198
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..85
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..153
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 458..478
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1495 AA; 148633 MW; D0AE276B58B7BABF CRC64;
LQGDFGLPGT TGLPGPPGLP GLQGNRGFYG PKFPHQGPPG DPGPRGQKGT MVKVLKGVKG
EQGEIGPMGP PGNSTYQQNR SPYGPPGFQG QKGLKGEPGD RADNKGETGM VGFSGPRGPS
GANGSAGIRG DLGDPGPRGR DGPKGDRGEP EDRWPGSTAG GRGVTVCAFY SNVGTTGTKG
GRGNRGVGGK VGAMGVEGIK GIKVWDLLGP FGFPGEEGDK GSRGEPGRPA LFAGPEGSPG
RQGSPGPPGP KGTWSEWFKG APGGRGRPGS AGFKGPKGSL SCWSLEGDHG QCECAIVNAP
PGPPGPAGDQ GDAGMTGEFG KRGDVGDPGP QGEDGFSGPN GLIGQPGPKG QKGEQLVVKE
KGFQGESGDP GLSGEPGKAG APGSNGVPGF PGSRGFQGEA TQGHQGEKGF PGLSGQPGVA
GLRGEPGQDF IGVKGQRGLP GDAGFTGYEG VPGTPGSPGP CNAVPGPRGP PGPSGSNGLP
GIPGPLGEKG FKGQPGARGE KGELGEQGIG GQPGLPGFPG PRGDPGFSGI RGSEGVPGSD
GITGQQGLKG EKGAHGEVLG ASPGPPGDLG LPGIRGNKGP RGDPGTQGYE GMSGVPGMIG
SKGEQGPLGL QGEQGRPGPP GVYGYPGDPG RAGPPGPLGA VGQPGFTGER GTEGDAGSPG
PVGLKGTLGA YGEIGPIGES GPTGDPGLPG PDGQPGVPGM KGYKGSPGMS GFEGMRGEAG
VKGWSGQQGI SGGFGQTGPK GLPGVKGQKG DRGLGGPPGE KPHISPVMII HMKGNKGEPG
YQGYQGFPGP RGAKGFHGVP GHGGLHGLPG DPNNVKGHRG DLGAQGLPGI RGMPGVSGKR
GISGFKGMQG PKGIKGSTGA YGASGELGIK GTKGEKGQQI DLPGSTGFRG ELGLPGDPGA
KGQKGFTGTV GERGSPGFEG IQGKKGESGN SGIPGFAGVD GQKGTPGNQG QRGSPGIPGI
DGHPGLPGYP GNTDEKNIQI RTGFPGLKGL YGLHGLKGQK GLQGPPGLEI MGPTGDPGGK
GPKGDGGERN DQQGSPGSLG LKGFQGPQGD PGDPGLPGPR GLTGFDATPG PKVTKGFPGT
RGLPGLDGVH GEKGVPGISG FPGPVGLKGS KGGQGSFGYR GPDGVYGKKA GVKGEQGLMG
VPGVTGVQGD RGSTGPKGNV GATGFQGPPG NQGPRPFPPK IPGEKGSQGP WGTPGPHGVK
GQMGPQGFPG DAGLMGPRGQ KGMPGINGRP GVPGFRGDVG QRGHAGLQGM EGSRGRPGAS
GLPGMPGRSV SVGYLLVKHS QTEQIPMCPV GMAKLWSGYS LLYMEGQEKA HNQDLGLAGS
CLPRFSTMPF LYCNPGDICY YASRNDKSYW LSTTAPLPMM PVEDVEIKPY ISRCSVCEAP
SVAIAVHSQD ITIPQCPVGW RSLWIGYSFL MHTAAGNEGG GQSLSSPGSC LEDFRTTPFI
ECNGAKGTCH YFANKHSFWL SSVDKSFHTE PKAETLKAGQ LLLRIGRCQV CMKNL
//