GenomeNet

Database: UniProt/TrEMBL
Entry: A0A151NUB2_ALLMI
LinkDB: A0A151NUB2_ALLMI
Original site: A0A151NUB2_ALLMI 
ID   A0A151NUB2_ALLMI        Unreviewed;       247 AA.
AC   A0A151NUB2;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   27-MAR-2024, entry version 25.
DE   SubName: Full=Anionic trypsin-1-like {ECO:0000313|EMBL:KYO40159.1};
GN   ORFNames=Y1Q_0007563 {ECO:0000313|EMBL:KYO40159.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO40159.1};
RN   [1] {ECO:0000313|EMBL:KYO40159.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO40159.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO40159.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03002068; KYO40159.1; -; Genomic_DNA.
DR   RefSeq; XP_006276577.1; XM_006276515.2.
DR   AlphaFoldDB; A0A151NUB2; -.
DR   STRING; 8496.A0A151NUB2; -.
DR   GeneID; 102569978; -.
DR   KEGG; amj:102569978; -.
DR   eggNOG; KOG3627; Eukaryota.
DR   OrthoDB; 2910936at2759; -.
DR   PhylomeDB; A0A151NUB2; -.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24264:SF15; RIKEN CDNA 2210010C04 GENE; 1.
DR   PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW   Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW   ECO:0000256|RuleBase:RU363034}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..15
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           16..247
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5013085334"
FT   DOMAIN          23..245
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   247 AA;  26522 MW;  1014DFBADB9E4CEA CRC64;
     MAMWLVLLVL GVAAAAPHGA DRIVGGHECA AHSQPWQVSL NSGYHFCGGS LITDQWVVSA
     AHCWYNPSAM QVILGDHNIQ VFEHTEYLMR IETIVWHPDY NYQTMDHDIM LIKLAHPVQF
     DANVQAVALP TACAEGGTFC VVSGWGNILS DGVFNPYNLQ CADVPLLSTQ ECENAYPGQI
     TSTMLCAGFP EGGKDACQGD SGGPLVCHGE LQGIVSWGVG CALAGYPGVY TKVCALLPWI
     QETIATN
//
DBGET integrated database retrieval system