ID H0ZCV9_TAEGU Unreviewed; 1678 AA.
AC H0ZCV9;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 86.
DE SubName: Full=Collagen type IV alpha 4 chain {ECO:0000313|Ensembl:ENSTGUP00000008420.2};
GN Name=COL4A4 {ECO:0000313|Ensembl:ENSTGUP00000008420.2};
OS Taeniopygia guttata (Zebra finch) (Poephila guttata).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Estrildidae;
OC Estrildinae; Taeniopygia.
OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000008420.2, ECO:0000313|Proteomes:UP000007754};
RN [1] {ECO:0000313|Ensembl:ENSTGUP00000008420.2, ECO:0000313|Proteomes:UP000007754}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20360741; DOI=10.1038/nature08819;
RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W.,
RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A.,
RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P.,
RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., London S.E.,
RA Li Y., Lin Y.C., George J., Sweedler J., Southey B., Gunaratne P.,
RA Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., Itoh Y., Whitney O.,
RA Pfenning A.R., Howard J., Volker M., Skinner B.M., Griffin D.K., Ye L.,
RA McLaren W.M., Flicek P., Quesada V., Velasco G., Lopez-Otin C.,
RA Puente X.S., Olender T., Lancet D., Smit A.F., Hubley R., Konkel M.K.,
RA Walker J.A., Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z.,
RA Eichler E.E., Stapley J., Slate J., Ekblom R., Birkhead T., Burke T.,
RA Burt D., Scharff C., Adam I., Richard H., Sultan M., Soldatov A.,
RA Lehrach H., Edwards S.V., Yang S.P., Li X., Graves T., Fulton L.,
RA Nelson J., Chinwalla A., Hou S., Mardis E.R., Wilson R.K.;
RT "The genome of a songbird.";
RL Nature 464:757-762(2010).
RN [2] {ECO:0000313|Ensembl:ENSTGUP00000008420.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 59729.ENSTGUP00000008420; -.
DR Ensembl; ENSTGUT00000008508.2; ENSTGUP00000008420.2; ENSTGUG00000008147.2.
DR GeneTree; ENSGT00940000153991; -.
DR HOGENOM; CLU_002023_1_0_1; -.
DR InParanoid; H0ZCV9; -.
DR OMA; ISCNVTY; -.
DR TreeFam; TF316865; -.
DR Proteomes; UP000007754; Chromosome 9.
DR GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:Ensembl.
DR GO; GO:0032836; P:glomerular basement membrane development; IEA:Ensembl.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF714; COLLAGEN ALPHA-4(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 14.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007754};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1678
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5025609493"
FT DOMAIN 1453..1678
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 57..147
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 180..213
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 265..292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 460..593
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 671..917
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 967..1006
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1037..1449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 549..563
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1678 AA; 164891 MW; 9225532DA6A8854C CRC64;
MALTKDSFER IKWLITAAWW LLIVFSTQGI DGGGYAYIDP CGGQDCSVCR CFPEKGSRGQ
PGELGAQGPI GSLGSTGPAG LPGEKGQRGE NGKPGPAGGK GDKGPTGVPG FPGLDGVPGL
PGREGARGKP GLDGCNGSRG DPGFPGENGY MGPRGPYGIP GQKGEKGNSV YILHFGKGLP
GERGDPGPPG MPGPRGSRGT TGPSGYPGHP GLPGIPGYPG LPGEQGNPGI GVDGQKGEPG
DIGLPGPPGS PLLVGPPGAQ LFKGQKGQKG LPGVTGYRGP RGPKGILGRG EKGEKGLPGF
PGLRGHPGSY GPAGFPGMKG ETGFAGFPGQ PGYPGIQGDP GEKGPPGPPG AVGTPLLPIK
GPRGDPGFPG PVGEMGSVGP IGPSGLLGRP GYDGTSLPGL PGVSGAPGPQ GFQGDPGLPG
TGELIPGRPG FPGPPGLPGQ PGRQGLPGLP SVICTDRGIP GEPGPKGQIG LPGRKGAKGE
KGSQGFCSCS AGPPGPRGVQ GPPGTQGRKG QMGYPGGHGE KGDPGLSGAV GSPGLPGTPG
SAGQLGEKGE KGDPGRVRIK GLKGERGATG LPGFPGQRGL DGRDGELGLP GEKGAEGDSG
VAILGDKGFP GLPGPPGVKG QMGLPGLGLP GSPGARGSPG DFGDMGNVGP PGPKGQKGET
VCITSPYPGY PGPPGFKGAQ GQKGLKGLPG HPGPNGFDGP KGHRGKPGAG IPGPEGFRGT
AGDPGDEGER GSTIDGKNGL PGPPGRDGQK GLPGDTTYGP PGIPGNRGLP GPTGAQGARG
NPGIPGLQGQ PGTPGFPGAK GFRGPEGDVG APGCPGSPGS PCVPGLPGPP GLRGVMGLPG
PQGLPGFKGQ RGDRGLAGPP GIKGLKGSLG SRGPPGPPGS RGPPGLPGNK GPIGFPGETG
SKGIQGPQGF PGLPGIQGPM GNFGVKGEEG KMGLPGPSGE CGDMGIKGER GPPGDSGFVN
LRLEKGQMGD PGFPGEHGLR GERGEKGNTG FRGTRGFPGK NGVPGLPGDQ GDTGVMGFPG
SRGLPGPKGL QGITGFQGEP GDQGDVGLPG SPGYPGLTGS KGCKGKRGDP TPVLGPHGQK
GSLGDPGLPG LCGFPGEKGL PGIQGQPGRP GSKGDPGYPG LPGLPGATGP QGLPGEPGEK
GKPGILGPPG LQGLPGSQGR KGLPGLPGLD GLDGLKGQKG SAGAPGQSET GPPGYSGEIG
PKGDRGEPGW PGISIPGPPG ERGFPGFPGK RGPVGPTGPM GRSPDSASPG PPGDQGLPGL
DGIRGDPGNP GPPGETIFVR GDPGDSGIRG APGNPGPRGQ QGARGPPGSQ GREGPKGPMG
VHGPQGPPGA LGQPGDQGFP GRPGPRGPTG DPGEPGKVDD SCPTIPGPPG EAGQRGEDGS
AGLPGPIGQP GPQGIKGEEG SYGLPGQDGL PGAPGPPGDQ GSRGEQGYAG PQGPPGQTGI
PGQPGPQIRS ASGFLLVLHS QSDREPLCPQ GMPKLWTGYS LLYLEGQERA HNQDLGLAGS
CLPVFSTMPF AYCNINQVCY YASRNDKSYW LSSAAPLPTA PLAEEEIRPY ISRCAVCQAP
AQPVALHSQD QSIPPCPPSW RSLWIGYSFL MHTGSGDQGG GQSLMSPGSC LEDFRSAPFI
ECQGQRGTCQ YFANEYSFWL TTVMPELQFA SAPLSGTLKE GQEQRKKISR CQVCLKHG
//