ID A0A226NFS0_CALSU Unreviewed; 694 AA.
AC A0A226NFS0;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=C1q domain-containing protein {ECO:0000259|PROSITE:PS50871};
GN ORFNames=ASZ78_011935 {ECO:0000313|EMBL:OXB66401.1};
OS Callipepla squamata (Scaled quail).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC Callipepla.
OX NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB66401.1, ECO:0000313|Proteomes:UP000198323};
RN [1] {ECO:0000313|EMBL:OXB66401.1, ECO:0000313|Proteomes:UP000198323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texas {ECO:0000313|EMBL:OXB66401.1,
RC ECO:0000313|Proteomes:UP000198323};
RC TISSUE=Leg muscle {ECO:0000313|EMBL:OXB66401.1};
RA Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA Decker J.E., Seabury C.M.;
RT "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXB66401.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFN01000064; OXB66401.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226NFS0; -.
DR STRING; 9009.A0A226NFS0; -.
DR Proteomes; UP000198323; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR001073; C1q_dom.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR15427:SF46; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR Pfam; PF00386; C1q; 1.
DR Pfam; PF01391; Collagen; 2.
DR PRINTS; PR00007; COMPLEMNTC1Q.
DR SMART; SM00110; C1Q; 1.
DR SUPFAM; SSF49842; TNF-like; 1.
DR PROSITE; PS50871; C1Q; 1.
PE 4: Predicted;
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Reference proteome {ECO:0000313|Proteomes:UP000198323};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..694
FT /note="C1q domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012646643"
FT DOMAIN 561..694
FT /note="C1q"
FT /evidence="ECO:0000259|PROSITE:PS50871"
FT REGION 44..548
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..89
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 207..223
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 494..522
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 694 AA; 67266 MW; 4C4B12DBF98C7CCF CRC64;
MLPDAAVLLL LLLLVGLRAV AAGGAAGYAQ VKYMQPMVKG PLGPPFREGK GQYLDMPPLL
PMDLKGEPGP PGKPGPRGPP GPPGYPGKPG TGKPGMHGQP GPAGPPGFSG IGKPGIPGLP
GKAGMKGMPG AKGEPGMRGE QGPRGLPGPP GLPGPAGISV NGKPGPQGGP GLPGFRGEPG
PKGEPGPRGE RGMKGENGVG KPGLPGPRGN GGPPGPAGPP GPVSVGKPGL DGLPGAPGEK
GDMGPPGGPG VSGEPGPVGP RGPPGIDGIG IPGAAGVPGI QGPMGPKGEP GIRGPPGLPG
ATGYGKPGLP GLKGDRGQPG VPGAIGDKGE PGVDGEPGEQ GPAGVIGPPG PPGSMGLPGK
HGLPGPKGDV GPSGPPGMPG MRGDQGPNGF AGKPGVPGER GLPGLQGPPG PTGPKGEPGF
IGLPGVPGLT GGPGPKGDGG IPGQPGLRGP SGIPGLQGPA GPMGPQGLPG LKGEPGLPGV
PGEGKMGEPG MAGPIGPPGM PGTPGLNGPP GPPGPPGPPG APGVFDETGI AGLHLPDGGV
EGAVLGNGKP GKPQYGRGEL SARIAPAFTA ILTSPFPASG MPVKFDRTLY NGHNGYNPVT
GIFTCPISGI YYFAYHVHVK GTNVWVALYK NNVPATYTYD EYKKGYLDQA SGSAVLELKE
NDQVWVQMPS DQANGLYSTE YIHSSFSGFL LCPT
//