GenomeNet

Database: UniProt
Entry: A0A151MSQ0_ALLMI
LinkDB: A0A151MSQ0_ALLMI
Original site: A0A151MSQ0_ALLMI 
ID   A0A151MSQ0_ALLMI        Unreviewed;       263 AA.
AC   A0A151MSQ0;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   SubName: Full=Chymotrypsinogen B2 isoform B {ECO:0000313|EMBL:KYO27532.1};
GN   Name=CTRB2 {ECO:0000313|EMBL:KYO27532.1};
GN   ORFNames=Y1Q_0003683 {ECO:0000313|EMBL:KYO27532.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO27532.1};
RN   [1] {ECO:0000313|EMBL:KYO27532.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO27532.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO27532.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03005169; KYO27532.1; -; Genomic_DNA.
DR   RefSeq; XP_006262259.1; XM_006262197.2.
DR   AlphaFoldDB; A0A151MSQ0; -.
DR   STRING; 8496.A0A151MSQ0; -.
DR   GeneID; 102565040; -.
DR   KEGG; amj:102565040; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   OrthoDB; 4629979at2759; -.
DR   PhylomeDB; A0A151MSQ0; -.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24250; CHYMOTRYPSIN-RELATED; 1.
DR   PANTHER; PTHR24250:SF29; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW   Serine protease {ECO:0000256|RuleBase:RU363034};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..15
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           16..263
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012113659"
FT   DOMAIN          34..261
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   263 AA;  28343 MW;  C81907CB683A2812 CRC64;
     MALLWLLSCL TLASAAHGCG IPAIHPIISG YTRIVNGEEA IPGSWPWQAS LQEKSGWHFC
     GGSLVGEDWV VTAAHCGVTT SNVVVLGEHD RGSSTEQVQK LAVKKVFTHP KWNPQTIDYD
     IALIKLATPA KLSRTVSPVC LGETRDHFYS GELCVTTGWG KTRYNAFLTP NKLQQTALPL
     LSNLQCEEYW GNQITNQMIC AGAAGSSSCM GDSGGPLVCP KNGAWYLVGI VSWGSSKCST
     TIPAVYARVT EFHEWIALTM ASN
//
DBGET integrated database retrieval system