ID W5QH50_SHEEP Unreviewed; 520 AA.
AC W5QH50;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE RecName: Full=Histidine-rich glycoprotein {ECO:0000256|ARBA:ARBA00039613};
DE AltName: Full=Histidine-proline-rich glycoprotein {ECO:0000256|ARBA:ARBA00041330};
GN Name=HRG {ECO:0000313|Ensembl:ENSOARP00000022050.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000022050.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000022050.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000022050.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000022050.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01009184; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5QH50; -.
DR SMR; W5QH50; -.
DR STRING; 9940.ENSOARP00000022050; -.
DR PaxDb; 9940-ENSOARP00000022050; -.
DR Ensembl; ENSOART00000022355.1; ENSOARP00000022050.1; ENSOARG00000020523.1.
DR eggNOG; ENOG502S50D; Eukaryota.
DR HOGENOM; CLU_575637_0_0_1; -.
DR OMA; GKGHFPF; -.
DR Proteomes; UP000002356; Chromosome 1.
DR Bgee; ENSOARG00000020523; Expressed in adult mammalian kidney and 16 other cell types or tissues.
DR GO; GO:0004869; F:cysteine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0042730; P:fibrinolysis; IEA:UniProtKB-KW.
DR CDD; cd00042; CY; 1.
DR Gene3D; 3.10.450.10; -; 2.
DR InterPro; IPR000010; Cystatin_dom.
DR InterPro; IPR046350; Cystatin_sf.
DR PANTHER; PTHR13814; FETUIN; 1.
DR PANTHER; PTHR13814:SF3; HISTIDINE-RICH GLYCOPROTEIN; 1.
DR Pfam; PF00031; Cystatin; 1.
DR SMART; SM00043; CY; 2.
DR SUPFAM; SSF54403; Cystatin/monellin; 2.
PE 4: Predicted;
KW Blood coagulation {ECO:0000256|ARBA:ARBA00023084};
KW Fibrinolysis {ECO:0000256|ARBA:ARBA00023281};
KW Hemostasis {ECO:0000256|ARBA:ARBA00022696};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT SIGNAL 1..34
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 35..520
FT /note="Histidine-rich glycoprotein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004870657"
FT DOMAIN 30..144
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 154..259
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT REGION 274..443
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..320
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 332..346
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 353..367
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 368..396
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 416..443
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 520 AA; 58741 MW; 461002E9AE4BB2D2 CRC64;
AVSQRKGCFS HFNKMRALPA ALLSILLITQ QCSCAVSPTG CDAVEPVAAK ALDLINKGRW
DGYLFQLLRV ADAHLERVES TAVYYLVLDV KESDCPVLFR KHWDDCEPDV SRHPSEIVIG
QCKVIAITRL AGSEDLRVND FNCTTSSVSS ALTNTIDSPV LFDFFEDTEL YREQADNALE
KYKRENSDFA PFRVDKVMRA VRARGGKGTS YFLDFSVRNC SSHHFPRHSH IFGFCRADLF
YDVEASDLET PKTIVTNCEV FNLKEHRNFS GVQHHLGRPF HSGEHEHFPA RRPPFKPGGS
KDHGHPHESH NFRCPSPLEH KNHSDSPPFQ ASVPLLFPPP GLRCPHPPFG TKGKHRPPHD
HSSDGHHPHG HHPHGHHPHG HHPRGHHPHG HHPHGHHPHD HDFYDHGPCD PPPHSQGPQD
HHRQGRGPPP WHSKKRGPGE GHLRFHWRPI GYIHRLPSLK KGEVLPLPEA NFPSFSLPNH
NNPLQPEIQA FPQSASESCP GTFNVRFLHI SKFFAYTLPK
//