ID F6SQN2_HORSE Unreviewed; 791 AA.
AC F6SQN2;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 2.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=HHIP like 1 {ECO:0000313|Ensembl:ENSECAP00000022244.2};
GN Name=HHIPL1 {ECO:0000313|VGNC:VGNC:18776};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000022244.2, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000022244.2, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022244.2,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000022244.2}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022244.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the HHIP family.
CC {ECO:0000256|ARBA:ARBA00010658}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00196}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000022244; -.
DR PaxDb; 9796-ENSECAP00000022244; -.
DR Ensembl; ENSECAT00000026640.4; ENSECAP00000022244.2; ENSECAG00000024687.4.
DR VGNC; VGNC:18776; HHIPL1.
DR GeneTree; ENSGT00940000162083; -.
DR HOGENOM; CLU_012344_2_1_1; -.
DR InParanoid; F6SQN2; -.
DR OMA; YSVEVRY; -.
DR TreeFam; TF329059; -.
DR Proteomes; UP000002281; Chromosome 24.
DR Bgee; ENSECAG00000024687; Expressed in articular cartilage of joint and 15 other cell types or tissues.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR Gene3D; 3.10.250.10; SRCR-like domain; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR018143; Folate_rcpt-like.
DR InterPro; IPR012938; Glc/Sorbosone_DH.
DR InterPro; IPR011041; Quinoprot_gluc/sorb_DH.
DR InterPro; IPR001190; SRCR.
DR InterPro; IPR017448; SRCR-like_dom.
DR InterPro; IPR036772; SRCR-like_dom_sf.
DR PANTHER; PTHR19328; HEDGEHOG-INTERACTING PROTEIN; 1.
DR PANTHER; PTHR19328:SF32; HHIP-LIKE PROTEIN 1; 1.
DR Pfam; PF03024; Folate_rec; 1.
DR Pfam; PF07995; GSDH; 1.
DR Pfam; PF00530; SRCR; 1.
DR PRINTS; PR00258; SPERACTRCPTR.
DR SMART; SM00202; SR; 1.
DR SUPFAM; SSF50952; Soluble quinoprotein glucose dehydrogenase; 1.
DR SUPFAM; SSF56487; SRCR-like; 1.
DR PROSITE; PS00420; SRCR_1; 1.
DR PROSITE; PS50287; SRCR_2; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00196}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..791
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018614378"
FT DOMAIN 682..785
FT /note="SRCR"
FT /evidence="ECO:0000259|PROSITE:PS50287"
FT REGION 613..689
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 637..653
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 754..764
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00196"
SQ SEQUENCE 791 AA; 87488 MW; 862B627C3A0DFD9F CRC64;
XXXXXXGRRA GAGALLALLA LRALGAAAHP QCLDFRPPFR PPEPLRFCAQ YSAFGCCAPE
QDAALAHRFG ALAARVDADE WAACAGYALD LLCQECSPYA AHLYDAEDPS TPLRTLPGLC
EDYCLDMWQT CRGLIRHLSP DRELWALEDN RAKFCHYLSL DDTDYCFPRL LVNENLNSNL
GRVVADAKGC LQLCLEEVAN GLRNPVAMVH AHDGTHRFFV AEQVGLVWAY LPDRSRLEKP
FLNVSQAVLT SPWEGDERGF LGIALHPGFR HNGKLYVYYS VGVDFDEWIR ISEFRVSEDD
MNTVDHRSER IILEIEEPAS NHNGGQLLFG DDGYLYIFTG DGGMAGDPFG KFGNAQNKSA
LLGKVLRIDV DRNERGPLYR IPPDNPFVGD PAARPEVYAL GVRNMWRCSF DRGDPASGAG
RGRLFCGDVG QNKFEEVDLV ERGRNYGWRA REGFECYDRK LCNNASLDDV LPIFAYPHKL
GKSVTGGYVY RGCEYPNLNG LYIFGDFMSG RLMSLRENPG TGQWRYSEIC MGRGQTCEFP
GLINNYYPYI ISFAEDEAGE LYFMSTGVPS ATVARGVVYK VIDPSRRAPP GKCRIQPTQV
KVRSRLVRFV PKEKFIRRTE STPRPTARAP TNAPRRSRPT AAPPAPTPRP ARPTRRPGGR
RGGGRRRGRP GTADPAPSNG AVRLVRPAGL SPGRGRVEVF AGGRWGTVCD DAWDTKAAAV
VCRQLGFAHV VRATKRAEFG EGRALPILLD DVRCKGGERT LLECTHAGVG THNCDHQEDA
GVVCSREDPD L
//