Database: UniProt
Entry: U3K7K5_FICAL
ID   U3K7K5_FICAL            Unreviewed;      1747 AA.
AC   U3K7K5;
DT   13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 2.
DT   24-JAN-2024, entry version 52.
DE   SubName: Full=Collagen type IV alpha 2 chain {ECO:0000313|Ensembl:ENSFALP00000011009.2};
GN   Name=COL4A2 {ECO:0000313|Ensembl:ENSFALP00000011009.2};
OS   Ficedula albicollis (Collared flycatcher) (Muscicapa albicollis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Passeriformes; Muscicapidae; Ficedula.
OX   NCBI_TaxID=59894 {ECO:0000313|Ensembl:ENSFALP00000011009.2, ECO:0000313|Proteomes:UP000016665};
RN   [1] {ECO:0000313|Ensembl:ENSFALP00000011009.2, ECO:0000313|Proteomes:UP000016665}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23103876; DOI=10.1038/nature11584;
RA   Ellegren H., Smeds L., Burri R., Olason P.I., Backstrom N., Kawakami T.,
RA   Kunstner A., Makinen H., Nadachowska-Brzyska K., Qvarnstrom A., Uebbing S.,
RA   Wolf J.B.;
RT   "The genomic landscape of species divergence in Ficedula flycatchers.";
RL   Nature 491:756-760(2012).
RN   [2] {ECO:0000313|Ensembl:ENSFALP00000011009.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- FUNCTION: Type IV collagen is the major structural component of
CC       glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC       together with laminins, proteoglycans and entactin/nidogen.
CC       {ECO:0000256|ARBA:ARBA00003696}.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 59894.ENSFALP00000011009; -.
DR   Ensembl; ENSFALT00000011054.2; ENSFALP00000011009.2; ENSFALG00000010548.2.
DR   eggNOG; KOG0613; Eukaryota.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000157234; -.
DR   HOGENOM; CLU_278180_0_0_1; -.
DR   Proteomes; UP000016665; Chromosome 1.
DR   GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0071560; P:cellular response to transforming growth factor beta stimulus; IEA:Ensembl.
DR   GO; GO:0038063; P:collagen-activated tyrosine kinase receptor signaling pathway; IEA:Ensembl.
DR   GO; GO:0006351; P:DNA-templated transcription; IEA:Ensembl.
DR   GO; GO:0035987; P:endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0016525; P:negative regulation of angiogenesis; IEA:Ensembl.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF588; COLLAGEN ALPHA-2(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 15.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000016665};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..29
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           30..1747
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5032755452"
FT   DOMAIN          1493..1715
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          116..172
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          198..285
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          312..988
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1126..1490
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        214..228
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        262..276
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        473..487
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1389..1403
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1747 AA;  172483 MW;  4B302A8B642A3FF8 CRC64;
     MDISQLPSAR IFLCQNTGLL LILMEVVLSA VKVDAGGKKY TGPCGGRDCS GGCQCFPEKG
     SRGQPGPLGS QGFTGPPGLM GIPGVQGPKG HKGERGYPGI TGPKGEVGQR GVTGFPGADG
     IPGHPGQPGP RGKPGHDGCN GTAGDPGDLG IPGVSGFPGS IGVQGPKGQK GEPYIVAPEI
     ISRHRGDPGD SGFIGFPGPP GTLGIQGPIG PRGQRGRPGP PGPPGLPGPQ GNRGLGFYGE
     KGEQGPPGPP GPPGLPTREL MGVPSDKHKG ERGDPGPRGE TGIPGVLLFP LEKGEEGVMG
     YPGQRGLPGI TGFPGLGGER GFPGFDGPPG GPGPRGSKGE LGEMGPPGPD SYFPSRIPRK
     GARGDPGFPG ASGLPGDRGD EGDRGLPGLP GFSDADGDKP GLPGEMGPKG EKGEIGSPAY
     FAGPPGLPGK DGSPGIRGPP GPVGAPGPLF GLKGREGTPG RFGTQGFPGP RGRKGPKGEE
     GDCTTCLLTE ELRRGPTGPR GPPGFPGSSG QPGRKGEPGD QGPHGLPGFP GAKGFSGPAG
     FPGRKGEKGD SLHITTKGTK GIRGDPGLPG IRGEDGFPGR DGLDGLPGQP GLPGDAIRGF
     PGDPGYPGEL GPKGFPGEAG LPGEGFPGPK GFRGPPGDQG QSGSPGTPGL PGLPGEPGQP
     DCGQVTEDFP RGDATEPIWS GAGCVRPPKG SQGRPGVPGA TGAKGARGFP GDPGPMGYPG
     LNGTRGDPGR EGFPGPPGFT GPRGDRGPNG LPGLQGHPGL TGKSGAPGAP GPKGLPGEVF
     GAAAGSRGDV GLPGFPGLKG APGDQGVPGT RGADGSPGLP GAKGDPGPQG LPGLMGLPGT
     PGTHGFPGPP GNRGPDGGPG SQGPLGPPGA RGEDGEQGFP GPVGLKGLSG DKGDVGHTGL
     PGIRGVTGPP GIRGMDGFPG DKGLQGSPGI DGFKGMTGLK GRPGIKGIKG EFGPLGTRGD
     KGSQGARGFK GDRGDQGPPG DPPKLMPSMM MEVKGEKGDV GERGTKGFFG LKGSKGMPGL
     PGRTGTPGSP GHPSYVAGVK GDIGAKGLTG VKGYPGPAGS PGIRGFPGTT GVRGEKGIPG
     ISGHFGTPGS HGEIGDRGDT INLPGMPGLK GEIGVPGLTG IRGGVGPKGE GGDPGFPGIE
     GLKGTQGVPG SVGQEGLPGL VGPPGQQGSP GTPGFPGEKG TAGWPGLPGQ AGQPGLRGIS
     GLHGLPGTKG LPGSPGPDGY GSAGFPGAVG DKGEAGEPSR VEGSRGPPGQ KGDRGVPGVP
     GPFGIPGQEG FPGPPGISNI SGYPGDKGSP GLDGVPGYPG PQGQPGIPAP PGSKGESGQT
     GRTGEIGPKG SRGDPGSAGR PGLPGFPGPK GPRGEQGVIG FMGTVGFPGD LGPIGPKGDR
     GVTGFQGPPG SPGLPPLPPR LVAEQGSPGP RGNIGPQGSP GDMGPQGPPG DPGFRGSPGE
     PGLQGRGGIP APPGSRGEQG GMGFQGPVGF EGQPGRPGSP GPPGMPGRSV SMGYLLVKHS
     QSDQEPMCPV GMNKLWSGYS LLYFEGQEKA HNQDLGLAGS CLSRFSTMPF LYCNPGDICY
     YASRNDKSYW LSTTAPLPMM PVAEEDIKPY ISRCSVCEAP AVAIAVHSQE ASIPHCPEGW
     RSLWIGYSFL MHTAAGDEGG GQSLVSPGSC LEDFRATPFI ECNGARGTCH YFANKYSFWL
     TTIDQPFQSK PSGDTLKAGL IRSHISRCQV YVPQAKGSLL HHAVDMSSVA HHPRRAAIGF
     CIAPFPL
//