ID G1KCR9_ANOCA Unreviewed; 505 AA.
AC G1KCR9;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 29-SEP-2021, sequence version 3.
DT 27-MAR-2024, entry version 79.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0000259|PROSITE:PS51403};
OS Anolis carolinensis (Green anole) (American chameleon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Iguania; Dactyloidae; Anolis.
OX NCBI_TaxID=28377 {ECO:0000313|Ensembl:ENSACAP00000004252.4, ECO:0000313|Proteomes:UP000001646};
RN [1] {ECO:0000313|Ensembl:ENSACAP00000004252.4, ECO:0000313|Proteomes:UP000001646}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JBL SC #1 {ECO:0000313|Ensembl:ENSACAP00000004252.4,
RC ECO:0000313|Proteomes:UP000001646};
RG The Genome Sequencing Platform;
RA Di Palma F., Alfoldi J., Heiman D., Young S., Grabherr M., Johnson J.,
RA Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Anolis carolinensis (Green Anole Lizard).";
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACAP00000004252.4}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Type IV collagen is the major structural component of
CC glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork
CC together with laminins, proteoglycans and entactin/nidogen.
CC {ECO:0000256|ARBA:ARBA00003696}.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G1KCR9; -.
DR STRING; 28377.ENSACAP00000004252; -.
DR Ensembl; ENSACAT00000004351.4; ENSACAP00000004252.4; ENSACAG00000004217.4.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000161675; -.
DR HOGENOM; CLU_002023_1_0_1; -.
DR InParanoid; G1KCR9; -.
DR OrthoDB; 2882192at2759; -.
DR Proteomes; UP000001646; Chromosome 3.
DR Bgee; ENSACAG00000004217; Expressed in lung and 5 other cell types or tissues.
DR GO; GO:0005604; C:basement membrane; IBA:GO_Central.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR GO; GO:0090729; F:toxin activity; IEA:UniProtKB-KW.
DR GO; GO:0030198; P:extracellular matrix organization; IBA:GO_Central.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 2.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000001646};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Toxin {ECO:0000256|ARBA:ARBA00022656}.
FT DOMAIN 280..504
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 37..155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 211..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 505 AA; 52873 MW; B809B5A70BCD4EB0 CRC64;
KNGRLSCWLT RTTRSMWRGR GAIGAKGDPG IPGLDIPGFP GQFGLPGIPG PQGNDGPPGN
KGQPGRPGVP GIPGPKGGKG LPGVLGRPGL PGPPGDRGDM GVPGDNGRKG SKGLPGQNGI
KGPSGLSGDD GPVGEKGNQG RDGIPGHPGE KGEQGFHSFS TCFFLITGAL KRRPEVYVQV
PPTPSPLLPT FMPTLPLSEI DQNMGMAGVP GRPGAKGLRG DPGPRGPPGI PGQTGPSGKP
GMAVAGPKGN RGLPGLLGFP GIQGLPGFPG TSLPGPPKRG FMFSRHSQTT KIPSCPSGTV
QIYSGYSLLF VQGNEQSHGQ DLGTVGSCLQ RFTTMPFLVC NPNNICRFAS RNDYSYWLST
AEPMPSDMKP ISGKALEPYI SRCIVCEGPA MVIAVHSQTT SVPPCPQEWQ SLWRGFSFVM
YTGAGSEASG QALASPGSCL ENFYAIPFIE CHGRGTCSYY SNSYSFWLAS LNTRRMFRKP
LPQTLKAGEL EKIISRCQVC MRKSN
//