ID M3VVC6_FELCA Unreviewed; 1683 AA.
AC M3VVC6;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 3.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=Collagen type IV alpha 5 chain {ECO:0000313|Ensembl:ENSFCAP00000000704.5};
GN Name=COL4A5 {ECO:0000313|Ensembl:ENSFCAP00000000704.5,
GN ECO:0000313|VGNC:VGNC:61064};
OS Felis catus (Cat) (Felis silvestris catus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Feliformia; Felidae; Felinae; Felis.
OX NCBI_TaxID=9685 {ECO:0000313|Ensembl:ENSFCAP00000000704.5, ECO:0000313|Proteomes:UP000011712};
RN [1] {ECO:0000313|Ensembl:ENSFCAP00000000704.5, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000000704.5,
RC ECO:0000313|Proteomes:UP000011712};
RX PubMed=17975172; DOI=10.1101/gr.6380007;
RA Pontius J.U., Mullikin J.C., Smith D.R., Lindblad-Toh K., Gnerre S.,
RA Clamp M., Chang J., Stephens R., Neelam B., Volfovsky N., Schaffer A.A.,
RA Agarwala R., Narfstrom K., Murphy W.J., Giger U., Roca A.L., Antunes A.,
RA Menotti-Raymond M., Yuhki N., Pecon-Slattery J., Johnson W.E., Bourque G.,
RA Tesler G., O'Brien S.J.;
RT "Initial sequence and comparative analysis of the cat genome.";
RL Genome Res. 17:1675-1689(2007).
RN [2] {ECO:0000313|Ensembl:ENSFCAP00000000704.5, ECO:0000313|Proteomes:UP000011712}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000000704.5,
RC ECO:0000313|Proteomes:UP000011712};
RA Hillier L.W., Warren W., Obrien S., Wilson R.K.;
RT "Sequence assembly of the Felis catus genome version 6.2.";
RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSFCAP00000000704.5}
RP IDENTIFICATION.
RC STRAIN=breed Abyssinian {ECO:0000313|Ensembl:ENSFCAP00000000704.5};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AANG04001034; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 9685.ENSFCAP00000047412; -.
DR PaxDb; 9685-ENSFCAP00000000704; -.
DR Ensembl; ENSFCAT00000000758.6; ENSFCAP00000000704.5; ENSFCAG00000000758.6.
DR VGNC; VGNC:61064; COL4A5.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000162034; -.
DR HOGENOM; CLU_002023_0_0_1; -.
DR Proteomes; UP000011712; Chromosome X.
DR Bgee; ENSFCAG00000000758; Expressed in eyeball of camera-type eye and 11 other cell types or tissues.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1077; COLLAGEN ALPHA-3(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 20.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000011712};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1683
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5016368991"
FT DOMAIN 1459..1683
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 49..547
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 559..1457
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..105
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 139..153
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..213
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..279
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 289..303
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..402
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..462
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 703..726
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 738..764
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 804..831
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 862..878
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1038..1052
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1121..1164
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1230..1268
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1392..1406
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1683 AA; 161037 MW; 33AB6FF4918FD0C5 CRC64;
MQLRGISLAA GLFLLALSLW GQPAEAAACY GCSPGSKCDC SGVKGEKGER GFPGLEGHPG
LPGFPGPEGP PGPRGQKGDD GIPGPPGPKG IRGPPGLPGF PGTPGLPGMP GHDGAPGPQG
IPGCNGTKGE RGFPGSPGFP GLQGPPGPPG IPGMKGEPGS IIMSSLPGPK GNPGYPGPPG
IQGPAGPTGL PGPIGPPGPP GLMGPPGPPG LPGPKGNMGL NFQGPKGEKG EQGLQGPPGP
PGQISEQKRP IDVEFQKGDQ GLPGDRGPPG PPGIRGPPGP PGGVKGEKGE QGEPGKRGKP
GKDGENGQPG IPGLPGDPGS PGEPGRDGEK GQKGDIGPTG PPGLVIPRLG TGVTVGEKGS
IGLPGLPGEK GERGFPGIQG PPGLPGPPGT AVMGPPGPPG FPGERGQKGD EGPPGLSIPG
SPGLDGQPGA PGLPGPPGPP GPQIPPSDDI CEAGPPGPPG SPGDRGLQGE QGMKGDKGDT
CFNCIGTGIS GPPGQPGLPG LPGPPGSLGF PGQKGEKGHA GPTGPKGLTG IPGVPGPPGF
PGSKGEPGDI LTFPGMKGDK GELGSPGAPG LPGLPGTPGQ DGLPGLPGPK GEPGGIAFKG
ERGPPGNPGL PGLPGNRGPM GPVGFGPPGP VGEKGIQGVA GNPGQPGIPG PKGDPGQTIT
QPGKPGLPGN PGRDGEVGLD PGLPGQPGLP GIPGSKGEPG IPGIGLPGPP GPKGFPGIPG
PPGAPGTPGR IGLEGPSGPP GFPGPKGEPG LGLPGPPGPP GLPGFKGTLG PKGDRGFPGP
PGPPGRAGLD GLPGPKGDIG PNGQPGPMGP PGLPGIGVQG PPGPPGIPGP VGQPGLHGIP
GEKGDPGPPG FDVLGPPGER GSPGIPGAPG PMGPPGSPGI PGKAGASGFP GAKGEMGMMG
PPGPPGPLGI PGRSGVPGLK GDNGLQGQPG PPGPVGEKGG KGEPGLPGPP GPMDPDLLGS
KGEKGDPGLP GIPGVSGPKG YQGLPGDPGQ PGLSGQPGLP GPSGPKGNPG LPGKPGLTGP
PGLKGSIGDM GFPGPQGVKG SPGPPGVPGQ PGSPGLPGQK GEKGDPGISG IGLPGLPGPK
GEPGLPGYPG NPGIKGSMGD TGLPGLPGTP GAKGHPGLPG FPGTPGLPGP KGISGPPGNP
GLPGEPGPVG GGGRPGPPGP PGEKGSPGQD GIPGPAGQKG EPGQPGFGIP GPPGLPGLSG
QKGDGGLPGI PGNPGLPGPK GEPGFQGFPG VQGPPGPPGS PGPALEGPKG DPGPQGPPGR
PGLPGPEGPR GLPGNGGIKG ERGNPGQPGQ PGLPGLKGDQ GPPGLQGNPG RPGLNGMKGD
PGLPGVPGFP GMKGPSGIPG STGPEGDPGL VGPPGPPGLP GPSGQSIIIK GDVGPPGIPG
QPGLKGLPGL PGPQGLPGPI GPPGDPGRNG LPGFDGAGGR KGDPGLPGQP GTRGLDGPPG
PDGLQGPPGP PGTSSIAHGF LITRHSQTTD APQCPHGTVQ IYEGFSLLYV QGNKRAHGQD
LGTAGSCLRR FSTMPFMFCN INNVCNFASR NDYSYWLSTP EPMPMSMEPL KGRSIQPFIS
RCAVCEAPSM VIAVHSQTIQ IPHCPQGWDS LWIGYSFMMH TSAGAEGSGQ ALASPGSCLE
EFRSAPFIEC HGRGTCNYYA NSYSFWLATV DMSDMFSKPQ SETLKAGDLR TRISRCQVCM
KRT
//