ID I3M3Z8_ICTTR Unreviewed; 1671 AA.
AC I3M3Z8;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Collagen type IV alpha 1 chain {ECO:0000313|Ensembl:ENSSTOP00000003864.3};
GN Name=COL4A1 {ECO:0000313|Ensembl:ENSSTOP00000003864.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000003864.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000003864.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01004021; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01004022; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01004023; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 43179.ENSSTOP00000003864; -.
DR Ensembl; ENSSTOT00000004305.3; ENSSTOP00000003864.3; ENSSTOG00000004254.3.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000157678; -.
DR HOGENOM; CLU_002023_0_0_1; -.
DR InParanoid; I3M3Z8; -.
DR TreeFam; TF344135; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005587; C:collagen type IV trimer; IEA:Ensembl.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:Ensembl.
DR GO; GO:0048407; F:platelet-derived growth factor binding; IEA:Ensembl.
DR GO; GO:0071711; P:basement membrane organization; IEA:Ensembl.
DR GO; GO:0007420; P:brain development; IEA:Ensembl.
DR GO; GO:0001569; P:branching involved in blood vessel morphogenesis; IEA:Ensembl.
DR GO; GO:0071230; P:cellular response to amino acid stimulus; IEA:Ensembl.
DR GO; GO:0038063; P:collagen-activated tyrosine kinase receptor signaling pathway; IEA:Ensembl.
DR GO; GO:0007528; P:neuromuscular junction development; IEA:Ensembl.
DR GO; GO:0061333; P:renal tubule morphogenesis; IEA:Ensembl.
DR GO; GO:0061304; P:retinal blood vessel morphogenesis; IEA:Ensembl.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023:SF1019; COLLAGEN; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 15.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1447..1671
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 47..1464
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..123
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 140..154
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 193..217
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..301
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 367..382
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 790..819
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1341..1376
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1417..1436
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1448..1462
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1671 AA; 160632 MW; 00F784DC9A052DB7 CRC64;
XXXXXXXXXX XXXVALLLHE ERSRAAAKGG CDGSGCGKCD CHGVKGQKGE RGLPGLQGVI
GFPGMQGPEG PQGPPGQKGD TGEPGLPGTK GTRGPPGAAG YPGNPGLPGI PGQDGPPGPP
GIPGCNGTKG ERGPLGPPGL PGFSGNPGPP GLPGMKGDPG EILGFGYVPG MLLKGERGFP
GAPGIPGSPG LPGLQGPVGP PGFTGPPGPP GPPGPPGEKG QMGSSFQGPK GDKGDQGVSG
PPGMPGQAQV KEKGDFAPTG EKGQKGEPGF QGIPGIGEKG EPGKPGPRGK PGKDGDKGEK
GSPGFPGEPG YPGLPGPQGP QGDKGAAGPP GPPGIVIGTG PLGEKGERGY PGAPGLRGEP
GPKGYPGLQG QPGPPGFPVS GQPGAPGFPG ERGEKGDQGY PGTSLPGLSG KDGVPGPPGL
PGPPGQPGHT NGIVECQPGP PGDQGPPGTP GQPGLTGEVG EKGQKGDSCL ICDTEGLRGP
PGPQGPPGEI GFPGQPGAKG DRGLPGRDGL EGLPGPQGAP GLMGPPGAKG EPGEIYFDMR
LKGDKGDPGF PGQPGMPGRA GSPGRDGHPG LPGPKGSPGS VGLKGERGPP GGAGFPGSRG
DIGPPGPPGF GPIGPVGDKG QPGFPGNPGS PGLPGPKGEA GKVVPLPGPP GAEGLPGSPG
FPGPQGDRGF PGTPGRPGLP GEKGSVGQPG IGFPGPPGPK GVDGLPGDVG TPGSPGRPGF
NGLPGSPGLQ GQKGEPGIGL PGLKGLPGLP GIPGTPGEKG NIGGPGVPGE HGAIGPPGLQ
GIRGDPGPPG VQGPAGPPGA PGIGPPGVMG PPGGQGPPGS SGPPGVKGEK GFPGFPGLDM
PGPKGDKGSQ GLPGLTGQSG LPGLPGQQGT PGIPGFPGPK GEMGVMGTPG QPGSPGPAGA
PGLPGEKGDH GFPGSSGPRG DPGFKGDKGD VGLPGKPGSM EKVDMGSMKG QKGDQGEKGQ
IGPTGDKGSR GDPGTPGVPG KDGQAGHPGQ PGPKGDPGVS GTPGAPGLPG PKGAVGGMGL
PGTPGEKGVP GIPGPQGVPG LPGEKGAKGE KGQAGLPGVG IPGRPGDKGD QGAAGFPGSP
GEKGEKGSIG IPGMPGSPGP KGSPGSVGYP GSPGLPGEKG DKGLPGLDGV PGVKGEAGLP
GQPGPTGPAG QKGEPGSDGI PGSAGEKGEP GLPGRGFPGF PGSKGDKGSK GEVGFPGLAG
SPGIPGPKGE QGFMGPPGPQ GQPGLPGTPG RPVEGPKGDR GPQGQPGLPG LPGPMGPPGL
PGLDGLKGDK GNPGWPGAPG IPGPKGDPGF QGMPGIGGSP GLTGSKGDMG PPGVPGFQGQ
KGLPGLQGLK GDQGDQGVPG PKGLPGPPGP PGPYDIIKGE PGLPGPEGPP GLKGLQGPPG
PKGQQGVAGS VGLPGPPGAP GFDGAPGQKG ETGPFGPPGP RGFPGPPGPD GLPGSMGPPG
TPSVDHGFLV TRHSQTTDDP SCPPGTKILY HGYSLLYVQG NERAHGQDLG TAGSCLRKFS
TMPFLFCNIN NVCNFASRND YSYWLSTPEP MPMSMAPITG DNIRPFISRC AVCEAPAMVM
AVHSQTIQIP QCPNGWSSLW IGYSFVMHTS AGAEGSGQAL ASPGSCLEEF RSAPFIECHG
RGTCNYYANA YSFWLATIER SEMFKKPTPS TLKAGELRTH VSRCQVCMRR T
//