GenomeNet

Database: UniProt
Entry: F6SQN2_HORSE
LinkDB: F6SQN2_HORSE
Original site: F6SQN2_HORSE 
ID   F6SQN2_HORSE            Unreviewed;       791 AA.
AC   F6SQN2;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   10-APR-2019, sequence version 2.
DT   27-MAR-2024, entry version 61.
DE   SubName: Full=HHIP like 1 {ECO:0000313|Ensembl:ENSECAP00000022244.2};
GN   Name=HHIPL1 {ECO:0000313|VGNC:VGNC:18776};
OS   Equus caballus (Horse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX   NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000022244.2, ECO:0000313|Proteomes:UP000002281};
RN   [1] {ECO:0000313|Ensembl:ENSECAP00000022244.2, ECO:0000313|Proteomes:UP000002281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022244.2,
RC   ECO:0000313|Proteomes:UP000002281};
RX   PubMed=19892987; DOI=10.1126/science.1178158;
RG   Broad Institute Genome Sequencing Platform;
RG   Broad Institute Whole Genome Assembly Team;
RA   Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA   Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA   Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA   Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA   Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA   Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA   Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA   Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA   Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA   Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT   "Genome sequence, comparative analysis, and population genetics of the
RT   domestic horse.";
RL   Science 326:865-867(2009).
RN   [2] {ECO:0000313|Ensembl:ENSECAP00000022244.2}
RP   IDENTIFICATION.
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022244.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   -!- SIMILARITY: Belongs to the HHIP family.
CC       {ECO:0000256|ARBA:ARBA00010658}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00196}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 9796.ENSECAP00000022244; -.
DR   PaxDb; 9796-ENSECAP00000022244; -.
DR   Ensembl; ENSECAT00000026640.4; ENSECAP00000022244.2; ENSECAG00000024687.4.
DR   VGNC; VGNC:18776; HHIPL1.
DR   GeneTree; ENSGT00940000162083; -.
DR   HOGENOM; CLU_012344_2_1_1; -.
DR   InParanoid; F6SQN2; -.
DR   OMA; YSVEVRY; -.
DR   TreeFam; TF329059; -.
DR   Proteomes; UP000002281; Chromosome 24.
DR   Bgee; ENSECAG00000024687; Expressed in articular cartilage of joint and 15 other cell types or tissues.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0016020; C:membrane; IEA:InterPro.
DR   Gene3D; 3.10.250.10; SRCR-like domain; 1.
DR   Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR   InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR   InterPro; IPR018143; Folate_rcpt-like.
DR   InterPro; IPR012938; Glc/Sorbosone_DH.
DR   InterPro; IPR011041; Quinoprot_gluc/sorb_DH.
DR   InterPro; IPR001190; SRCR.
DR   InterPro; IPR017448; SRCR-like_dom.
DR   InterPro; IPR036772; SRCR-like_dom_sf.
DR   PANTHER; PTHR19328; HEDGEHOG-INTERACTING PROTEIN; 1.
DR   PANTHER; PTHR19328:SF32; HHIP-LIKE PROTEIN 1; 1.
DR   Pfam; PF03024; Folate_rec; 1.
DR   Pfam; PF07995; GSDH; 1.
DR   Pfam; PF00530; SRCR; 1.
DR   PRINTS; PR00258; SPERACTRCPTR.
DR   SMART; SM00202; SR; 1.
DR   SUPFAM; SSF50952; Soluble quinoprotein glucose dehydrogenase; 1.
DR   SUPFAM; SSF56487; SRCR-like; 1.
DR   PROSITE; PS00420; SRCR_1; 1.
DR   PROSITE; PS50287; SRCR_2; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00196}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..791
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018614378"
FT   DOMAIN          682..785
FT                   /note="SRCR"
FT                   /evidence="ECO:0000259|PROSITE:PS50287"
FT   REGION          613..689
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        637..653
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        754..764
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00196"
SQ   SEQUENCE   791 AA;  87488 MW;  862B627C3A0DFD9F CRC64;
     XXXXXXGRRA GAGALLALLA LRALGAAAHP QCLDFRPPFR PPEPLRFCAQ YSAFGCCAPE
     QDAALAHRFG ALAARVDADE WAACAGYALD LLCQECSPYA AHLYDAEDPS TPLRTLPGLC
     EDYCLDMWQT CRGLIRHLSP DRELWALEDN RAKFCHYLSL DDTDYCFPRL LVNENLNSNL
     GRVVADAKGC LQLCLEEVAN GLRNPVAMVH AHDGTHRFFV AEQVGLVWAY LPDRSRLEKP
     FLNVSQAVLT SPWEGDERGF LGIALHPGFR HNGKLYVYYS VGVDFDEWIR ISEFRVSEDD
     MNTVDHRSER IILEIEEPAS NHNGGQLLFG DDGYLYIFTG DGGMAGDPFG KFGNAQNKSA
     LLGKVLRIDV DRNERGPLYR IPPDNPFVGD PAARPEVYAL GVRNMWRCSF DRGDPASGAG
     RGRLFCGDVG QNKFEEVDLV ERGRNYGWRA REGFECYDRK LCNNASLDDV LPIFAYPHKL
     GKSVTGGYVY RGCEYPNLNG LYIFGDFMSG RLMSLRENPG TGQWRYSEIC MGRGQTCEFP
     GLINNYYPYI ISFAEDEAGE LYFMSTGVPS ATVARGVVYK VIDPSRRAPP GKCRIQPTQV
     KVRSRLVRFV PKEKFIRRTE STPRPTARAP TNAPRRSRPT AAPPAPTPRP ARPTRRPGGR
     RGGGRRRGRP GTADPAPSNG AVRLVRPAGL SPGRGRVEVF AGGRWGTVCD DAWDTKAAAV
     VCRQLGFAHV VRATKRAEFG EGRALPILLD DVRCKGGERT LLECTHAGVG THNCDHQEDA
     GVVCSREDPD L
//
DBGET integrated database retrieval system