GenomeNet

Database: UniProt/TrEMBL
Entry: G3VR43_SARHA
LinkDB: G3VR43_SARHA
Original site: G3VR43_SARHA 
ID   G3VR43_SARHA            Unreviewed;       247 AA.
AC   G3VR43;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   28-MAR-2018, entry version 40.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSSHAP00000005648};
GN   Name=LOC100929943 {ECO:0000313|Ensembl:ENSSHAP00000005648};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000005648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000005648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E.,
RA   Miller J., Walenz B., Knight J., Qi J., Zhao F., Wang Q.,
RA   Bedoya-Reina O.C., Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A.,
RA   Woodbridge P., Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S.,
RA   Helgen K.M., Lesk A.M., Pringle T.H., Patterson N., Zhang Y.,
RA   Kreiss A., Woods G.M., Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered
RT   marsupial Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000005648}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2011) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family.
CC       {ECO:0000256|SAAS:SAAS00559343}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AEFK01205049; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_003772482.1; XM_003772434.1.
DR   ProteinModelPortal; G3VR43; -.
DR   STRING; 9305.ENSSHAP00000005648; -.
DR   MEROPS; S01.120; -.
DR   Ensembl; ENSSHAT00000005702; ENSSHAP00000005648; ENSSHAG00000004936.
DR   GeneID; 100929943; -.
DR   KEGG; shr:100929943; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   eggNOG; COG5640; LUCA.
DR   GeneTree; ENSGT00760000118862; -.
DR   InParanoid; G3VR43; -.
DR   KO; K01312; -.
DR   OMA; TRAGQFC; -.
DR   OrthoDB; EOG091G0DF7; -.
DR   TreeFam; TF331065; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; SSF50494; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   3: Inferred from homology;
KW   Complete proteome {ECO:0000313|Proteomes:UP000007648};
KW   Disulfide bond {ECO:0000256|SAAS:SAAS00037407};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     19       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        20    247       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5003457731.
FT   DOMAIN       25    245       Peptidase S1. {ECO:0000259|PROSITE:
FT                                PS50240}.
SQ   SEQUENCE   247 AA;  26510 MW;  1386A603F2D3439A CRC64;
     MKAIIFLALL GAAVAYSASD DDDKIVGGYT CAPNSLPYQV SLNAGYHFCG GSLINEQWVV
     SAAHCYKSRI QVRLGEHNID VIEGGEQFID SAKVIRHPNY NSYMIDNDIM LIKLKTPATL
     SSRVSTISLP KYCAAVGTSC LISGWGNTLS SGVNYPELLQ CLNAPLLSDA TCRKAYPGQI
     TDNMICLGYL EGGKDSCQGD SGGPVVCNGE LQGIVSWGYG CAQKGKPGVY TKVCNYVNWI
     KKTIAEN
//
DBGET integrated database retrieval system