ID A0A151MSQ0_ALLMI Unreviewed; 263 AA.
AC A0A151MSQ0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Chymotrypsinogen B2 isoform B {ECO:0000313|EMBL:KYO27532.1};
GN Name=CTRB2 {ECO:0000313|EMBL:KYO27532.1};
GN ORFNames=Y1Q_0003683 {ECO:0000313|EMBL:KYO27532.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO27532.1};
RN [1] {ECO:0000313|EMBL:KYO27532.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO27532.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO27532.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03005169; KYO27532.1; -; Genomic_DNA.
DR RefSeq; XP_006262259.1; XM_006262197.2.
DR AlphaFoldDB; A0A151MSQ0; -.
DR STRING; 8496.A0A151MSQ0; -.
DR GeneID; 102565040; -.
DR KEGG; amj:102565040; -.
DR eggNOG; KOG3627; Eukaryota.
DR OrthoDB; 4629979at2759; -.
DR PhylomeDB; A0A151MSQ0; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24250; CHYMOTRYPSIN-RELATED; 1.
DR PANTHER; PTHR24250:SF29; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00089; Trypsin; 1.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|RuleBase:RU363034};
KW Protease {ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Serine protease {ECO:0000256|RuleBase:RU363034};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..15
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 16..263
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012113659"
FT DOMAIN 34..261
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 263 AA; 28343 MW; C81907CB683A2812 CRC64;
MALLWLLSCL TLASAAHGCG IPAIHPIISG YTRIVNGEEA IPGSWPWQAS LQEKSGWHFC
GGSLVGEDWV VTAAHCGVTT SNVVVLGEHD RGSSTEQVQK LAVKKVFTHP KWNPQTIDYD
IALIKLATPA KLSRTVSPVC LGETRDHFYS GELCVTTGWG KTRYNAFLTP NKLQQTALPL
LSNLQCEEYW GNQITNQMIC AGAAGSSSCM GDSGGPLVCP KNGAWYLVGI VSWGSSKCST
TIPAVYARVT EFHEWIALTM ASN
//