ID A0A3Q2GU24_HORSE Unreviewed; 255 AA.
AC A0A3Q2GU24;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=C1q domain-containing protein {ECO:0000259|PROSITE:PS50871};
GN Name=LOC111772277 {ECO:0000313|Ensembl:ENSECAP00000023814.1};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000023814.1, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000023814.1, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000023814.1,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000023814.1}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000023814.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000023814; -.
DR PaxDb; 9796-ENSECAP00000023814; -.
DR Ensembl; ENSECAT00000046008.2; ENSECAP00000023814.1; ENSECAG00000039587.2.
DR Ensembl; ENSECAT00000046595.2; ENSECAP00000035861.1; ENSECAG00000034709.2.
DR Ensembl; ENSECAT00000051227.1; ENSECAP00000037394.1; ENSECAG00000033918.1.
DR GeneTree; ENSGT00940000161639; -.
DR OMA; GFMIYAD; -.
DR OrthoDB; 4210331at2759; -.
DR Proteomes; UP000002281; Chromosome 29.
DR Bgee; ENSECAG00000033918; Expressed in prefrontal cortex and 6 other cell types or tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.40; -; 1.
DR InterPro; IPR001073; C1q_dom.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR PANTHER; PTHR22923; CEREBELLIN-RELATED; 1.
DR PANTHER; PTHR22923:SF96; COMPLEMENT C1Q-LIKE PROTEIN 3; 1.
DR Pfam; PF00386; C1q; 1.
DR Pfam; PF01391; Collagen; 1.
DR PRINTS; PR00007; COMPLEMNTC1Q.
DR SMART; SM00110; C1Q; 1.
DR SUPFAM; SSF49842; TNF-like; 1.
DR PROSITE; PS50871; C1Q; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..255
FT /note="C1q domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5041087713"
FT DOMAIN 122..255
FT /note="C1q"
FT /evidence="ECO:0000259|PROSITE:PS50871"
FT REGION 39..109
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 74..91
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 255 AA; 26687 MW; 529FBAF4B2191BC1 CRC64;
MVLLLVILIP VLVSSAGTSA HYEMLGTCRM VCDPYGGTKA PSTAATPDRG LMQSLPTFIQ
GPKGEAGRPG KAGPRGPPGE PGPPGPVGPP GEKGEPGRQG LPGPPGAPGL NAAGAISAAT
YSTVPKIAFY AGLKRQHEGY EVLKFDDVVT NLGNHYDPTT GKFTCSIPGI YFFTYHVLMR
GGDGTSMWAD LCKNNQVRAS AIAQDADQNY DYASNSVVLH LEPGDEVYIK LDGGKAHGGN
NNKYSTFSGF IIYAD
//