GenomeNet

Database: UniProt
Entry: H3C2U6_TETNG
LinkDB: H3C2U6_TETNG
Original site: H3C2U6_TETNG 
ID   H3C2U6_TETNG            Unreviewed;      1495 AA.
AC   H3C2U6;
DT   18-APR-2012, integrated into UniProtKB/TrEMBL.
DT   18-APR-2012, sequence version 1.
DT   27-MAR-2024, entry version 75.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS   Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS   nigroviridis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX   NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000002564.1, ECO:0000313|Proteomes:UP000007303};
RN   [1] {ECO:0000313|Proteomes:UP000007303}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=15496914; DOI=10.1038/nature03025;
RA   Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA   Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA   Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA   Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA   Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA   Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA   Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA   McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA   Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA   Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA   Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT   "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT   early vertebrate proto-karyotype.";
RL   Nature 431:946-957(2004).
RN   [2] {ECO:0000313|Ensembl:ENSTNIP00000002564.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 99883.ENSTNIP00000002564; -.
DR   Ensembl; ENSTNIT00000000154.1; ENSTNIP00000002564.1; ENSTNIG00000013921.1.
DR   GeneTree; ENSGT00940000168406; -.
DR   InParanoid; H3C2U6; -.
DR   OMA; FDPNQDK; -.
DR   TreeFam; TF344135; -.
DR   Proteomes; UP000007303; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF588; COLLAGEN ALPHA-2(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 12.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          1273..1495
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          1..163
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          215..276
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          298..697
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          736..762
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          884..979
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          998..1081
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1143..1198
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        71..85
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        134..153
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        458..478
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1495 AA;  148633 MW;  D0AE276B58B7BABF CRC64;
     LQGDFGLPGT TGLPGPPGLP GLQGNRGFYG PKFPHQGPPG DPGPRGQKGT MVKVLKGVKG
     EQGEIGPMGP PGNSTYQQNR SPYGPPGFQG QKGLKGEPGD RADNKGETGM VGFSGPRGPS
     GANGSAGIRG DLGDPGPRGR DGPKGDRGEP EDRWPGSTAG GRGVTVCAFY SNVGTTGTKG
     GRGNRGVGGK VGAMGVEGIK GIKVWDLLGP FGFPGEEGDK GSRGEPGRPA LFAGPEGSPG
     RQGSPGPPGP KGTWSEWFKG APGGRGRPGS AGFKGPKGSL SCWSLEGDHG QCECAIVNAP
     PGPPGPAGDQ GDAGMTGEFG KRGDVGDPGP QGEDGFSGPN GLIGQPGPKG QKGEQLVVKE
     KGFQGESGDP GLSGEPGKAG APGSNGVPGF PGSRGFQGEA TQGHQGEKGF PGLSGQPGVA
     GLRGEPGQDF IGVKGQRGLP GDAGFTGYEG VPGTPGSPGP CNAVPGPRGP PGPSGSNGLP
     GIPGPLGEKG FKGQPGARGE KGELGEQGIG GQPGLPGFPG PRGDPGFSGI RGSEGVPGSD
     GITGQQGLKG EKGAHGEVLG ASPGPPGDLG LPGIRGNKGP RGDPGTQGYE GMSGVPGMIG
     SKGEQGPLGL QGEQGRPGPP GVYGYPGDPG RAGPPGPLGA VGQPGFTGER GTEGDAGSPG
     PVGLKGTLGA YGEIGPIGES GPTGDPGLPG PDGQPGVPGM KGYKGSPGMS GFEGMRGEAG
     VKGWSGQQGI SGGFGQTGPK GLPGVKGQKG DRGLGGPPGE KPHISPVMII HMKGNKGEPG
     YQGYQGFPGP RGAKGFHGVP GHGGLHGLPG DPNNVKGHRG DLGAQGLPGI RGMPGVSGKR
     GISGFKGMQG PKGIKGSTGA YGASGELGIK GTKGEKGQQI DLPGSTGFRG ELGLPGDPGA
     KGQKGFTGTV GERGSPGFEG IQGKKGESGN SGIPGFAGVD GQKGTPGNQG QRGSPGIPGI
     DGHPGLPGYP GNTDEKNIQI RTGFPGLKGL YGLHGLKGQK GLQGPPGLEI MGPTGDPGGK
     GPKGDGGERN DQQGSPGSLG LKGFQGPQGD PGDPGLPGPR GLTGFDATPG PKVTKGFPGT
     RGLPGLDGVH GEKGVPGISG FPGPVGLKGS KGGQGSFGYR GPDGVYGKKA GVKGEQGLMG
     VPGVTGVQGD RGSTGPKGNV GATGFQGPPG NQGPRPFPPK IPGEKGSQGP WGTPGPHGVK
     GQMGPQGFPG DAGLMGPRGQ KGMPGINGRP GVPGFRGDVG QRGHAGLQGM EGSRGRPGAS
     GLPGMPGRSV SVGYLLVKHS QTEQIPMCPV GMAKLWSGYS LLYMEGQEKA HNQDLGLAGS
     CLPRFSTMPF LYCNPGDICY YASRNDKSYW LSTTAPLPMM PVEDVEIKPY ISRCSVCEAP
     SVAIAVHSQD ITIPQCPVGW RSLWIGYSFL MHTAAGNEGG GQSLSSPGSC LEDFRTTPFI
     ECNGAKGTCH YFANKHSFWL SSVDKSFHTE PKAETLKAGQ LLLRIGRCQV CMKNL
//
DBGET integrated database retrieval system