GenomeNet

Database: UniProt/TrEMBL
Entry: F7DST6_HORSE
LinkDB: F7DST6_HORSE
Original site: F7DST6_HORSE 
ID   F7DST6_HORSE            Unreviewed;       246 AA.
AC   F7DST6;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   27-JUL-2011, sequence version 1.
DT   31-JAN-2018, entry version 40.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSECAP00000012520};
GN   Name=LOC100049983 {ECO:0000313|Ensembl:ENSECAP00000012520};
OS   Equus caballus (Horse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX   NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000012520, ECO:0000313|Proteomes:UP000002281};
RN   [1] {ECO:0000313|Ensembl:ENSECAP00000012520, ECO:0000313|Proteomes:UP000002281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000012520,
RC   ECO:0000313|Proteomes:UP000002281};
RX   PubMed=19892987; DOI=10.1126/science.1178158;
RG   Broad Institute Genome Sequencing Platform;
RG   Broad Institute Whole Genome Assembly Team;
RA   Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA   Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H.,
RA   Distl O., Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N.,
RA   Penedo M.C.T., Raison J.M., Sharpe T., Vogel J., Andersson L.,
RA   Antczak D.F., Biagi T., Binns M.M., Chowdhary B.P., Coleman S.J.,
RA   Della Valle G., Fryc S., Guerin G., Hasegawa T., Hill E.W., Jurka J.,
RA   Kiialainen A., Lindgren G., Liu J., Magnani E., Mickelson J.R.,
RA   Murray J., Nergadze S.G., Onofrio R., Pedroni S., Piras M.F.,
RA   Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A., Searle S., Skow L.,
RA   Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J., Vaudin M.,
RA   White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT   "Genome sequence, comparative analysis, and population genetics of the
RT   domestic horse.";
RL   Science 326:865-867(2009).
RN   [2] {ECO:0000313|Ensembl:ENSECAP00000012520}
RP   IDENTIFICATION.
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000012520};
RG   Ensembl;
RL   Submitted (JUL-2011) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family.
CC       {ECO:0000256|SAAS:SAAS00559343}.
CC   -!- CAUTION: The sequence shown here is derived from an Ensembl
CC       automatic analysis pipeline and should be considered as
CC       preliminary data. {ECO:0000313|Ensembl:ENSECAP00000012520}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   RefSeq; XP_001487997.1; XM_001487947.3.
DR   STRING; 9796.ENSECAP00000012520; -.
DR   MEROPS; S01.258; -.
DR   PaxDb; F7DST6; -.
DR   Ensembl; ENSECAT00000015566; ENSECAP00000012520; ENSECAG00000014506.
DR   GeneID; 100049983; -.
DR   KEGG; ecb:100049983; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   eggNOG; COG5640; LUCA.
DR   GeneTree; ENSGT00760000118862; -.
DR   InParanoid; F7DST6; -.
DR   KO; K01312; -.
DR   OMA; TRAGQFC; -.
DR   OrthoDB; EOG091G0DF7; -.
DR   TreeFam; TF331065; -.
DR   Proteomes; UP000002281; Chromosome 4.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0006508; P:proteolysis; IBA:GO_Central.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; SSF50494; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Complete proteome {ECO:0000313|Proteomes:UP000002281};
KW   Disulfide bond {ECO:0000256|SAAS:SAAS00037407};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     15       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        16    246       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5013220561.
FT   DOMAIN       24    244       Peptidase S1. {ECO:0000259|PROSITE:
FT                                PS50240}.
SQ   SEQUENCE   246 AA;  26225 MW;  A0A343ECCA7177D0 CRC64;
     MKLLIFLALL GAAVASSTDD DDKIVGGYTC QANSVPYQVS LNVGYHICGG SLISNQWVVS
     AAHCYQSRFQ VRLGEHNIAV TEGNEQFINS AKVIRHPSYN SRTYDNDILL IKLSSPASIN
     SKVSAISLPA SFPAAGTQCL ISGWGNTLSS GSNYPNLLQC LNAPILSDSS CRSSYPNQIT
     SNMFCAGFLE GGKDSCQGDS GGPVACSGVL QGIVSWGYGC AQRNKPGVYT KVYNYVNWIR
     QTIAAN
//
DBGET integrated database retrieval system