GenomeNet

Database: UniProt
Entry: H2STS1_TAKRU
ID   H2STS1_TAKRU            Unreviewed;      1159 AA.
AC   H2STS1;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 3.
DT   27-MAR-2024, entry version 62.
DE   SubName: Full=Collagen type IV alpha 1 chain {ECO:0000313|Ensembl:ENSTRUP00000015808.3};
GN   Name=COL4A1 {ECO:0000313|Ensembl:ENSTRUP00000015808.3};
OS   Takifugu rubripes (Japanese pufferfish) (Fugu rubripes).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Takifugu.
OX   NCBI_TaxID=31033 {ECO:0000313|Ensembl:ENSTRUP00000015808.3, ECO:0000313|Proteomes:UP000005226};
RN   [1] {ECO:0000313|Ensembl:ENSTRUP00000015808.3, ECO:0000313|Proteomes:UP000005226}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21551351;
RA   Kai W., Kikuchi K., Tohari S., Chew A.K., Tay A., Fujiwara A., Hosoya S.,
RA   Suetake H., Naruse K., Brenner S., Suzuki Y., Venkatesh B.;
RT   "Integration of the genetic map and genome assembly of fugu facilitates
RT   insights into distinct features of genome evolution in teleosts and
RT   mammals.";
RL   Genome Biol. Evol. 3:424-442(2011).
RN   [2] {ECO:0000313|Ensembl:ENSTRUP00000015808.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; H2STS1; -.
DR   STRING; 31033.ENSTRUP00000080847; -.
DR   Ensembl; ENSTRUT00000015878.3; ENSTRUP00000015808.3; ENSTRUG00000006458.3.
DR   GeneTree; ENSGT00940000157678; -.
DR   Proteomes; UP000005226; Chromosome 1.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF854; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 8.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005226};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          935..1159
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          1..572
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          702..723
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          741..832
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          860..935
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        16..35
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        48..62
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        387..401
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        907..924
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1159 AA;  115945 MW;  EE60A25C4DB56987 CRC64;
     MTRPNALSCV MSGRHLEGAR GERGLKGDQG EKGDMGIEGE SLFGPPGQPG IPGLPGPPGE
     PIDPNECDVG TGAPGPPGPP GLQGELGQKG DKGDTCFQCE SSGLPGLPGP QGPKGDHGPP
     GSGFPGSPGL LGAPGPVGEP GDIFVAPGLK GDKGLTGVPG SPGMPGIKGE PGRTGIPGIP
     GLKGESAKEG IKGERGPSGD PGTIGPPGER GPPGLPGFGR PGEHGDKGSQ GRSGSPGIPG
     PPGAKGEPGQ GVGSPGPQGV PGPQGEPGRP GVQGERGLPG DAGIPGFPGQ KGEIGPSGIG
     LPGLTGPKGI SGIPGDGEPG QGLPGPKGSQ GIPGLTGFPG EKGSIGLPGI PGQDGHAGPP
     GPQGVKGEVG PPGPPGLNGL QGPPGKGTPG LPGPQGPPGE PGPFGREGVK GERGYQGPPG
     LDIPGPKGDK GAPGFPGVSG TKGLPGVEGL QGRDGLIGTQ GPKGEMGVIG TPGVPGFPGL
     PGQPGSPGWR GDPGVTGPRG AIGESGIKGE RGDPGLQGPA GNMSDVDMEH MKGEKGDIGV
     LGEPGFTGQK GTRGMPGDPG QRGTDGEPGL PGQPERRLWF SWRAWQHGTT WTERRSWRDG
     ITRSDHTTSK TLHVGKSSVK LLVLVSLCTG TMGPKGTKGN FGTPGFPGLP GEKGVRGFEG
     MPGKPGLEGI KGDKGSIGYT GQNSASLSFI LLLYLSQPGR PGEKGVPGLP GPAGERGVDG
     QPGKCSQENC LIHTCLSGEA GLQGPPGSPG QKGEAGVDGI PGSSGERGDP GEKGSPGLHG
     TPGHPGVPGT KGDKGLPGTP GPRGDLGERG LPGVSLEGPK GDRGETGQPG EIGTLSSCCQ
     PAVKFKHLSS IHHYVCLPQT GSQGLPGPPG LQGRDGLKGD KGEQGNPGFQ GELGQKGETG
     GNDGPPGPRG YPGPTGPDGV PGQVGPPGPS SMDHGFLVTR HSQTVDVPQC PEGTSLIFDG
     YSLLYVQGNE RSHGQDLGTA GSCLRKFSPM PFLFCNINNV CNFASRNDYS YWLTSPEPMP
     MSMAPITGHS IKPFISRCAV CEAPAMVIAV HSQTIMIPPC PYGWDSLWIG YSFVMHTSAG
     AEGSGQALAS PGSCLEEFRS APFIECHGRG TCNYYANSYS FWLATIEDSD MFTKPVPTTL
     KAGNLRTHIS RCQVCMKRT
//