ID U3K7K5_FICAL Unreviewed; 1747 AA.
AC U3K7K5;
DT 13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 2.
DT 24-JAN-2024, entry version 52.
DE SubName: Full=Collagen type IV alpha 2 chain {ECO:0000313|Ensembl:ENSFALP00000011009.2};
GN Name=COL4A2 {ECO:0000313|Ensembl:ENSFALP00000011009.2};
OS Ficedula albicollis (Collared flycatcher) (Muscicapa albicollis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Muscicapidae; Ficedula.
OX NCBI_TaxID=59894 {ECO:0000313|Ensembl:ENSFALP00000011009.2, ECO:0000313|Proteomes:UP000016665};
RN [1] {ECO:0000313|Ensembl:ENSFALP00000011009.2, ECO:0000313|Proteomes:UP000016665}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23103876; DOI=10.1038/nature11584;
RA Ellegren H., Smeds L., Burri R., Olason P.I., Backstrom N., Kawakami T.,
RA Kunstner A., Makinen H., Nadachowska-Brzyska K., Qvarnstrom A., Uebbing S.,
RA Wolf J.B.;
RT "The genomic landscape of species divergence in Ficedula flycatchers.";
RL Nature 491:756-760(2012).
RN [2] {ECO:0000313|Ensembl:ENSFALP00000011009.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 59894.ENSFALP00000011009; -.
DR Ensembl; ENSFALT00000011054.2; ENSFALP00000011009.2; ENSFALG00000010548.2.
DR eggNOG; KOG0613; Eukaryota.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000157234; -.
DR HOGENOM; CLU_278180_0_0_1; -.
DR Proteomes; UP000016665; Chromosome 1.
DR GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0071560; P:cellular response to transforming growth factor beta stimulus; IEA:Ensembl.
DR GO; GO:0038063; P:collagen-activated tyrosine kinase receptor signaling pathway; IEA:Ensembl.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:Ensembl.
DR GO; GO:0035987; P:endodermal cell differentiation; IEA:Ensembl.
DR GO; GO:0016525; P:negative regulation of angiogenesis; IEA:Ensembl.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF588; COLLAGEN ALPHA-2(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 15.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000016665};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1747
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5032755452"
FT DOMAIN 1493..1715
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 116..172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 198..285
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..988
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1126..1490
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 214..228
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..276
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 473..487
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1389..1403
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1747 AA; 172483 MW; 4B302A8B642A3FF8 CRC64;
MDISQLPSAR IFLCQNTGLL LILMEVVLSA VKVDAGGKKY TGPCGGRDCS GGCQCFPEKG
SRGQPGPLGS QGFTGPPGLM GIPGVQGPKG HKGERGYPGI TGPKGEVGQR GVTGFPGADG
IPGHPGQPGP RGKPGHDGCN GTAGDPGDLG IPGVSGFPGS IGVQGPKGQK GEPYIVAPEI
ISRHRGDPGD SGFIGFPGPP GTLGIQGPIG PRGQRGRPGP PGPPGLPGPQ GNRGLGFYGE
KGEQGPPGPP GPPGLPTREL MGVPSDKHKG ERGDPGPRGE TGIPGVLLFP LEKGEEGVMG
YPGQRGLPGI TGFPGLGGER GFPGFDGPPG GPGPRGSKGE LGEMGPPGPD SYFPSRIPRK
GARGDPGFPG ASGLPGDRGD EGDRGLPGLP GFSDADGDKP GLPGEMGPKG EKGEIGSPAY
FAGPPGLPGK DGSPGIRGPP GPVGAPGPLF GLKGREGTPG RFGTQGFPGP RGRKGPKGEE
GDCTTCLLTE ELRRGPTGPR GPPGFPGSSG QPGRKGEPGD QGPHGLPGFP GAKGFSGPAG
FPGRKGEKGD SLHITTKGTK GIRGDPGLPG IRGEDGFPGR DGLDGLPGQP GLPGDAIRGF
PGDPGYPGEL GPKGFPGEAG LPGEGFPGPK GFRGPPGDQG QSGSPGTPGL PGLPGEPGQP
DCGQVTEDFP RGDATEPIWS GAGCVRPPKG SQGRPGVPGA TGAKGARGFP GDPGPMGYPG
LNGTRGDPGR EGFPGPPGFT GPRGDRGPNG LPGLQGHPGL TGKSGAPGAP GPKGLPGEVF
GAAAGSRGDV GLPGFPGLKG APGDQGVPGT RGADGSPGLP GAKGDPGPQG LPGLMGLPGT
PGTHGFPGPP GNRGPDGGPG SQGPLGPPGA RGEDGEQGFP GPVGLKGLSG DKGDVGHTGL
PGIRGVTGPP GIRGMDGFPG DKGLQGSPGI DGFKGMTGLK GRPGIKGIKG EFGPLGTRGD
KGSQGARGFK GDRGDQGPPG DPPKLMPSMM MEVKGEKGDV GERGTKGFFG LKGSKGMPGL
PGRTGTPGSP GHPSYVAGVK GDIGAKGLTG VKGYPGPAGS PGIRGFPGTT GVRGEKGIPG
ISGHFGTPGS HGEIGDRGDT INLPGMPGLK GEIGVPGLTG IRGGVGPKGE GGDPGFPGIE
GLKGTQGVPG SVGQEGLPGL VGPPGQQGSP GTPGFPGEKG TAGWPGLPGQ AGQPGLRGIS
GLHGLPGTKG LPGSPGPDGY GSAGFPGAVG DKGEAGEPSR VEGSRGPPGQ KGDRGVPGVP
GPFGIPGQEG FPGPPGISNI SGYPGDKGSP GLDGVPGYPG PQGQPGIPAP PGSKGESGQT
GRTGEIGPKG SRGDPGSAGR PGLPGFPGPK GPRGEQGVIG FMGTVGFPGD LGPIGPKGDR
GVTGFQGPPG SPGLPPLPPR LVAEQGSPGP RGNIGPQGSP GDMGPQGPPG DPGFRGSPGE
PGLQGRGGIP APPGSRGEQG GMGFQGPVGF EGQPGRPGSP GPPGMPGRSV SMGYLLVKHS
QSDQEPMCPV GMNKLWSGYS LLYFEGQEKA HNQDLGLAGS CLSRFSTMPF LYCNPGDICY
YASRNDKSYW LSTTAPLPMM PVAEEDIKPY ISRCSVCEAP AVAIAVHSQE ASIPHCPEGW
RSLWIGYSFL MHTAAGDEGG GQSLVSPGSC LEDFRATPFI ECNGARGTCH YFANKYSFWL
TTIDQPFQSK PSGDTLKAGL IRSHISRCQV YVPQAKGSLL HHAVDMSSVA HHPRRAAIGF
CIAPFPL
//