ID H3D8T5_TETNG Unreviewed; 1493 AA.
AC H3D8T5;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000016926.1, ECO:0000313|Proteomes:UP000007303};
RN [1] {ECO:0000313|Proteomes:UP000007303}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|Ensembl:ENSTNIP00000016926.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSTNIT00000017140.1; ENSTNIP00000016926.1; ENSTNIG00000013921.1.
DR GeneTree; ENSGT00940000168406; -.
DR HOGENOM; CLU_002023_1_0_1; -.
DR Proteomes; UP000007303; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF588; COLLAGEN ALPHA-2(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 15.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1271..1493
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 1..151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 215..276
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 297..704
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 738..769
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 914..979
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 996..1080
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1141..1196
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 71..85
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..151
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 470..485
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1493 AA; 148364 MW; 71F22DC797A1903A CRC64;
LQGDFGLPGT TGLPGPPGLP GLQGNRGFYG PKFPHQGPPG DPGPRGQKGT MVKVLKGVKG
EQGEIGPMGP PGNSTYQQNR SPYGPPGFQG QKGLKGEPGD RADNKGETGM VGFSGPRGPS
GANGSAGIRG DLGDPGPRGR DGPKGDRGEP GELILLEDRW PGSTAVCAFY SNVGTTGTKG
GRGNRGVGGK VGAMGVEGIK GIKVWDLLGP FGFPGEEGDK GSRGEPGRPA LFAGPEGSPG
RQGSPGPPGP KGTWSEWFKG APGGRGRPGS AGFKGPKGSL SCWSLEGDHG QCECAIVNAP
PGPPGPAGDQ GDAGMTGEFG KRGDVGDPGP QGEDGFSGPN GLIGQPGPKG QKGEQLVVKE
KGFQGESGDP GLSGEPGKAG APGSNGVPGF PGSRGFQGEA TQGHQGEKGF PGLSGQPGVA
GLRGEPGQDF IGVKGQRGLP GDAGFTGYEG VPGTPGSPDS SLPFAGPCNA VPGPRGPPGP
SGSNGLPGIP GPLGEKGFKG QPGARGEKGE LGEQGIGGQP GLPGFPGPRG DPGFSGIRGS
EGVPGSDGIT GQQGLKGEKG AHGEVLGASP GPPGDLGLPG IRGNKGPRGD PGTQGYEGMS
GVPGMIGSKG EQGPLGLQGE QGRPGPPGVY GYPGDPGRAG PPGPLGAVGQ PGFTGERGTE
GDAGSPGPVG LKGTLGAYGE IGPIGESGPT GDPGLPGPDG QPGVPGMKGY KGSPGMSGFE
GMRGEAGVKG WSGQQGISGG FGQTGPKGLP GVKGQKGDRG LGGPPGEKPH ISPVMIIHMK
GNKGEPGYQG YQGFPGPRGA KGFHGVPGHG GLHGLPGDPN NVKGHRGDLG AQGLPGIRGM
PGVSGKRGIS GFKGMQGPKG IKGSTGAYGA SGELGIKGTK GEKGQQIDLP GSTGFRGELG
LPGDPGAKGQ KGFTGTVGER GSPGFEGIQG KKGESGNSGI PGFAGVDGQK GTPGNQGQRG
SPGIPGIDGH PGLPGYPGNT GFPGLKGLYG LHGLKGQKGL QGPPGLEIMG PTGDPGGKGP
KGDGGERNDQ QGSPGSLGLK GFQGPQGDPG DPGLPGPRGL TGFDATPGPK VTKGFPGTRG
LPGLDGVHGE KGVPGISGFP GPVGLKGSKG GQGSFGYRGP DGVYGKKAGV KGEQGLMGVP
GVTGVQGDRG STGPKGNVGA TGFQGPPGNQ GPRPFPPKIP GEKGSQGPWG TPGPHGVKGQ
MGPQGFPGDA GLMGPRGQKG MPGINGRPGV PGFRGDVGQR GHAGLQGMEG SRGRPGASGL
PGMPGRSVSV GYLLVKHSQT EQIPMCPVGM AKLWSGYSLL YMEGQEKAHN QDLGLAGSCL
PRFSTMPFLY CNPGDICYYA SRNDKSYWLS TTAPLPMMPV EDVEIKPYIS RCSVCEAPSV
AIAVHSQDIT IPQCPVGWRS LWIGYSFLMH TAAGNEGGGQ SLSSPGSCLE DFRTTPFIEC
NGAKGTCHYF ANKHSFWLSS VDKSFHTEPK AETLKAGQLL LRIGRCQVCM KNL
//