GenomeNet

Database: UniProt
Entry: H0ZCV9_TAEGU
LinkDB: H0ZCV9_TAEGU
Original site: H0ZCV9_TAEGU 
ID   H0ZCV9_TAEGU            Unreviewed;      1678 AA.
AC   H0ZCV9;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 2.
DT   27-MAR-2024, entry version 86.
DE   SubName: Full=Collagen type IV alpha 4 chain {ECO:0000313|Ensembl:ENSTGUP00000008420.2};
GN   Name=COL4A4 {ECO:0000313|Ensembl:ENSTGUP00000008420.2};
OS   Taeniopygia guttata (Zebra finch) (Poephila guttata).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Estrildidae;
OC   Estrildinae; Taeniopygia.
OX   NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000008420.2, ECO:0000313|Proteomes:UP000007754};
RN   [1] {ECO:0000313|Ensembl:ENSTGUP00000008420.2, ECO:0000313|Proteomes:UP000007754}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=20360741; DOI=10.1038/nature08819;
RA   Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W.,
RA   Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A.,
RA   Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P.,
RA   Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., London S.E.,
RA   Li Y., Lin Y.C., George J., Sweedler J., Southey B., Gunaratne P.,
RA   Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., Itoh Y., Whitney O.,
RA   Pfenning A.R., Howard J., Volker M., Skinner B.M., Griffin D.K., Ye L.,
RA   McLaren W.M., Flicek P., Quesada V., Velasco G., Lopez-Otin C.,
RA   Puente X.S., Olender T., Lancet D., Smit A.F., Hubley R., Konkel M.K.,
RA   Walker J.A., Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z.,
RA   Eichler E.E., Stapley J., Slate J., Ekblom R., Birkhead T., Burke T.,
RA   Burt D., Scharff C., Adam I., Richard H., Sultan M., Soldatov A.,
RA   Lehrach H., Edwards S.V., Yang S.P., Li X., Graves T., Fulton L.,
RA   Nelson J., Chinwalla A., Hou S., Mardis E.R., Wilson R.K.;
RT   "The genome of a songbird.";
RL   Nature 464:757-762(2010).
RN   [2] {ECO:0000313|Ensembl:ENSTGUP00000008420.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 59729.ENSTGUP00000008420; -.
DR   Ensembl; ENSTGUT00000008508.2; ENSTGUP00000008420.2; ENSTGUG00000008147.2.
DR   GeneTree; ENSGT00940000153991; -.
DR   HOGENOM; CLU_002023_1_0_1; -.
DR   InParanoid; H0ZCV9; -.
DR   OMA; ISCNVTY; -.
DR   TreeFam; TF316865; -.
DR   Proteomes; UP000007754; Chromosome 9.
DR   GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:Ensembl.
DR   GO; GO:0032836; P:glomerular basement membrane development; IEA:Ensembl.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF714; COLLAGEN ALPHA-4(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 14.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007754};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..29
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           30..1678
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5025609493"
FT   DOMAIN          1453..1678
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          57..147
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          180..213
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          265..292
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          460..593
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          671..917
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          967..1006
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1037..1449
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        549..563
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1678 AA;  164891 MW;  9225532DA6A8854C CRC64;
     MALTKDSFER IKWLITAAWW LLIVFSTQGI DGGGYAYIDP CGGQDCSVCR CFPEKGSRGQ
     PGELGAQGPI GSLGSTGPAG LPGEKGQRGE NGKPGPAGGK GDKGPTGVPG FPGLDGVPGL
     PGREGARGKP GLDGCNGSRG DPGFPGENGY MGPRGPYGIP GQKGEKGNSV YILHFGKGLP
     GERGDPGPPG MPGPRGSRGT TGPSGYPGHP GLPGIPGYPG LPGEQGNPGI GVDGQKGEPG
     DIGLPGPPGS PLLVGPPGAQ LFKGQKGQKG LPGVTGYRGP RGPKGILGRG EKGEKGLPGF
     PGLRGHPGSY GPAGFPGMKG ETGFAGFPGQ PGYPGIQGDP GEKGPPGPPG AVGTPLLPIK
     GPRGDPGFPG PVGEMGSVGP IGPSGLLGRP GYDGTSLPGL PGVSGAPGPQ GFQGDPGLPG
     TGELIPGRPG FPGPPGLPGQ PGRQGLPGLP SVICTDRGIP GEPGPKGQIG LPGRKGAKGE
     KGSQGFCSCS AGPPGPRGVQ GPPGTQGRKG QMGYPGGHGE KGDPGLSGAV GSPGLPGTPG
     SAGQLGEKGE KGDPGRVRIK GLKGERGATG LPGFPGQRGL DGRDGELGLP GEKGAEGDSG
     VAILGDKGFP GLPGPPGVKG QMGLPGLGLP GSPGARGSPG DFGDMGNVGP PGPKGQKGET
     VCITSPYPGY PGPPGFKGAQ GQKGLKGLPG HPGPNGFDGP KGHRGKPGAG IPGPEGFRGT
     AGDPGDEGER GSTIDGKNGL PGPPGRDGQK GLPGDTTYGP PGIPGNRGLP GPTGAQGARG
     NPGIPGLQGQ PGTPGFPGAK GFRGPEGDVG APGCPGSPGS PCVPGLPGPP GLRGVMGLPG
     PQGLPGFKGQ RGDRGLAGPP GIKGLKGSLG SRGPPGPPGS RGPPGLPGNK GPIGFPGETG
     SKGIQGPQGF PGLPGIQGPM GNFGVKGEEG KMGLPGPSGE CGDMGIKGER GPPGDSGFVN
     LRLEKGQMGD PGFPGEHGLR GERGEKGNTG FRGTRGFPGK NGVPGLPGDQ GDTGVMGFPG
     SRGLPGPKGL QGITGFQGEP GDQGDVGLPG SPGYPGLTGS KGCKGKRGDP TPVLGPHGQK
     GSLGDPGLPG LCGFPGEKGL PGIQGQPGRP GSKGDPGYPG LPGLPGATGP QGLPGEPGEK
     GKPGILGPPG LQGLPGSQGR KGLPGLPGLD GLDGLKGQKG SAGAPGQSET GPPGYSGEIG
     PKGDRGEPGW PGISIPGPPG ERGFPGFPGK RGPVGPTGPM GRSPDSASPG PPGDQGLPGL
     DGIRGDPGNP GPPGETIFVR GDPGDSGIRG APGNPGPRGQ QGARGPPGSQ GREGPKGPMG
     VHGPQGPPGA LGQPGDQGFP GRPGPRGPTG DPGEPGKVDD SCPTIPGPPG EAGQRGEDGS
     AGLPGPIGQP GPQGIKGEEG SYGLPGQDGL PGAPGPPGDQ GSRGEQGYAG PQGPPGQTGI
     PGQPGPQIRS ASGFLLVLHS QSDREPLCPQ GMPKLWTGYS LLYLEGQERA HNQDLGLAGS
     CLPVFSTMPF AYCNINQVCY YASRNDKSYW LSSAAPLPTA PLAEEEIRPY ISRCAVCQAP
     AQPVALHSQD QSIPPCPPSW RSLWIGYSFL MHTGSGDQGG GQSLMSPGSC LEDFRSAPFI
     ECQGQRGTCQ YFANEYSFWL TTVMPELQFA SAPLSGTLKE GQEQRKKISR CQVCLKHG
//
DBGET integrated database retrieval system