ID A0A3P9NHK4_POERE Unreviewed; 1664 AA.
AC A0A3P9NHK4;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Collagen type IV alpha 5 chain {ECO:0000313|Ensembl:ENSPREP00000009066.1};
GN Name=COL4A5 {ECO:0000313|Ensembl:ENSPREP00000009066.1};
OS Poecilia reticulata (Guppy) (Acanthophacelus reticulatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=8081 {ECO:0000313|Ensembl:ENSPREP00000009066.1, ECO:0000313|Proteomes:UP000242638};
RN [1] {ECO:0000313|Proteomes:UP000242638}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Guanapo {ECO:0000313|Proteomes:UP000242638};
RA Kuenstner A., Dreyer C.;
RT "The genomic landscape of the Guanapo guppy.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPREP00000009066.1}
RP IDENTIFICATION.
RC STRAIN=Guanapo {ECO:0000313|Ensembl:ENSPREP00000009066.1};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 8081.ENSPREP00000009066; -.
DR Ensembl; ENSPRET00000009176.1; ENSPREP00000009066.1; ENSPREG00000006199.1.
DR GeneTree; ENSGT00940000162034; -.
DR OMA; EHGSCHY; -.
DR Proteomes; UP000242638; Unassembled WGS sequence.
DR Bgee; ENSPREG00000006199; Expressed in caudal fin and 1 other cell type or tissue.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1077; COLLAGEN ALPHA-3(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 15.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000242638};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1664
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017975601"
FT DOMAIN 1440..1664
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 53..1439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 102..119
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 182..217
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..250
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..288
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 372..386
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 399..428
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..489
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 653..667
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 800..817
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 844..868
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 986..1000
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1191..1225
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1352..1376
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1664 AA; 157570 MW; B06B2C62C014850F CRC64;
MTRGLRRLRQ PAWVVIWLVL YATIQLSDSA ACNGCGGSKC DCSGVKGEKG ERGFPGLTGQ
PGVPGFPGPE GPIGTRGEKG SDGPSGPSGP KGIRGPSGLP GFPGTPGLPG LPGQDGPPGP
RGVPGCNGTK GEPGFPGSSG IPGRQGFPGP PGLPGEKGDR GDVFYTNNYG LKGAVGLPGL
PGSPGSPGPT GLPGPTGPPG IPGYEGIPGP PGPPGPKGNM GLNFEGPKGD KGDQGLPGPP
GPPGPGQGEQ VRPPPTEIQR GDKGPSGGQK GEKGEPGEAG KRGKQGKDGD PGAIGYPGLK
GEPGSPGLPG RDGERGQKGD RGFPGPPGPV IRPTNGAAVG PKGEPGFPGN PGAKGERGPQ
GFSGPPGTPG VPGIGGSGPP GPPGFPGDKG QKGDPGTPSN ALPGPPGRPG NPGNPGPQGP
PGPPGYSSPV DNCLPGEPGL PGIQGQKGFP GEQGQKGQKG ETCVNCIDGG NPIPGPPGPP
GPPGFPGSPG PQGLKGEPGF QGITGLSGPP GVPGSVGAPG FPGEKGEPGD TFGGGGVKGE
KGDSGFPGPP GLPGLDGRPG RDGAPGTPGP KGAPGSLLLK GERGPPGDTG PPGIPGDRGP
PGGPGYGVPG APGEKGAQGN SGIPGIPGQP GVKGEPGPTV TEKGQPGPKG IDGNPGSPGP
PGPRGTDGQP GSPGFPGPKG EPGQPGVGLP GPAGVKGFPG IPGQPGAPGK VGRPGIDGFP
GPPGFPGPKG EAGFGLPGPP GQPGLAGAKG FPGQKGDPGF PGNPGGPGRP GFDGGPGLKG
EPGSPGQPGP RGPPGSATSG VQGPPGPPGP PGPMGPTGYP GGSGEKGDPG PPGLDIPGSP
GDRGNPGFPG PPGPIGIPGP PGGPGRDGAP GFPGPKGDMG SMGPPGPSGG PGSPGGPGAP
GLKGEPGFPG SNGIPGGPGS KGEKGDPGSP GLPGPIGPVD IRGAKGDPGT PGSPGSPGPK
GIAGLPGDPG LPGQDGRSGI PGPPGLKGEP GIPGSPGGPG FPGPKGAMGD MGIPGPMGSK
GTAGFQGRPG SPGQPGIPGL QGPKGDPGSA GVGPPGPQGQ KGDPGQPGFP GSPGGKGGPG
TPGLPGLPGL PGAKGDVGLA GFQGPPGFPG PKGLDGGPGA PGLSGAPGRP GESGRPGPPG
QVGEKGQPGR DGIPGPAGVK GEPGIPGFGV PGPSGLPGSP GEKGNPGLPG PTGGPGFPGP
KGDTGFPGPP GSPGVSGPPG PPGLPFQGIK GNQGPPGPPG RAGVPGPEGP RGLPGGGGVK
GDKGNPGNPG LPGQPGQKGD PGFPGAPGPS GGPGGPGPKG DIGFPGVSGF PGSKGEPGLP
GPSGLPGDPG VDGAPGPPGD PGSSSPVDYV KGAPGPPGPP GSPGATGPPG SPGAPGSSGL
PGPSGDTGSP GPPGFNGPPG KKGDTGPAGQ PGQKGNTGPP GSDGPQGPPG FPGSGSAAHG
FLLTRHSQSE ITPECPSGTN FIYDGYSLLY VQGNERAHGQ DLGTAGSCLR RFSPMPFMFC
NINNVCNFAA RNDYSYWLST LEPMPMSMEA VTGDNIKPYI SRCAVCEAPA MVIAVHSQTI
QIPSCPANWR SLWIGFSFMM HTSAGAEGSG QALASPGSCL EDFRSTPFIE CHGRGSCNYF
SNSYSFWLAT VEKLDMFKKP QSETLKAGNL LTRVSRCVVC MKTT
//