ID T0NK57_CAMFR Unreviewed; 666 AA.
AC T0NK57;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 22-FEB-2023, entry version 33.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN ORFNames=CB1_000556025 {ECO:0000313|EMBL:EQB77943.1};
OS Camelus ferus (Wild bactrian camel) (Camelus bactrianus ferus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX NCBI_TaxID=419612 {ECO:0000313|EMBL:EQB77943.1, ECO:0000313|Proteomes:UP000030684};
RN [1] {ECO:0000313|EMBL:EQB77943.1, ECO:0000313|Proteomes:UP000030684}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=bactrian camel {ECO:0000313|Proteomes:UP000030684};
RX PubMed=23149746;
RG Bactrian Camels Genome Sequencing and Analysis Consortium;
RA Jirimutu, Wang Z., Ding G., Chen G., Sun Y., Sun Z., Zhang H., Wang L.,
RA Hasi S., Zhang Y., Li J., Shi Y., Xu Z., He C., Yu S., Li S., Zhang W.,
RA Batmunkh M., Ts B., Narenbatu, Unierhu, Bat-Ireedui S., Gao H.,
RA Baysgalan B., Li Q., Jia Z., Turigenbayila, Subudenggerile, Narenmanduhu,
RA Wang Z., Wang J., Pan L., Chen Y., Ganerdene Y., Dabxilt, Erdemt, Altansha,
RA Altansukh, Liu T., Cao M., Aruuntsever, Bayart, Hosblig, He F., Zha-ti A.,
RA Zheng G., Qiu F., Sun Z., Zhao L., Zhao W., Liu B., Li C., Chen Y.,
RA Tang X., Guo C., Liu W., Ming L., Temuulen, Cui A., Li Y., Gao J., Li J.,
RA Wurentaodi, Niu S., Sun T., Zhai Z., Zhang M., Chen C., Baldan T.,
RA Bayaer T., Li Y., Meng H.;
RT "Genome sequences of wild and domestic bactrian camels.";
RL Nat. Commun. 3:1202-1202(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB016834; EQB77943.1; -; Genomic_DNA.
DR AlphaFoldDB; T0NK57; -.
DR Proteomes; UP000030684; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 5.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24253:SF100; POLYSERASE-2; 1.
DR PANTHER; PTHR24253; TRANSMEMBRANE PROTEASE SERINE; 1.
DR Pfam; PF00089; Trypsin; 4.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 4.
DR PROSITE; PS50240; TRYPSIN_DOM; 3.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000030684};
KW Serine protease {ECO:0000256|RuleBase:RU363034}.
FT DOMAIN 1..66
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 260..375
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 440..666
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 68..89
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 70..89
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 666 AA; 71524 MW; 52B77A2202C8CB1F CRC64;
MLCAGYAEGH RDTCQGDSGG PLVCEEGGRW FQAGITSFGF GCGRRNRPGV FTAVAPYEAW
IREQVMGSEP GPTFPSQSPQ PQSVLSEPTD ENCTIALPEC GKAPRPGAWP WEAQVMVPGS
RPCYGALVSE SWVLAPASCF LGRISSDHQP GDLDNWRVLL PSRPRAEHVA RLVPHENASW
DDASDLALLQ LREPVNLGAA PRPTHGPWIS HVTRGAYLED QLAWDWGPEG EETETHICPP
HTEHGACGLQ PEPAPVGVLW PWLAEVHVAG EHVCTGILVA PGWVLAATHC VLRPGSTTVP
YIEVYLGRAG ASPLPQSHQV SRSVISIRLP RHLGLRPPLA LLELSSRVEP SPSALSICLH
PGGIPLGSSC WVLGWKDPQD RGESPESWGA PGPWAMAQRM RLGLRWPEAV ATALLLGLFQ
NGMGAEGSEA SCGVAVQARV AGGSNASPGH WPWQVSINHD GIHVCGGSLV SEQWVLSAAH
CFPRDYHKDH YEVKLGAHEL DYYNSQVEVR TVAQVISHPS YLQEGSEGDI ALLQLSSPVT
FSRYIRPICL PAANASFPNG LQCTVTGWGH VAPSVSLQAP RPLQKLEVPL ISRETCNCLY
NIDAKPNEPH VIQQDMVCAG YVNGGKDACQ GDSGGPLSCP VEGLWYLAGI DPVAAQSTTL
TSDPLF
//