ID A0A195BUG4_9HYME Unreviewed; 783 AA.
AC A0A195BUG4;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=trypsin {ECO:0000256|ARBA:ARBA00038868};
DE EC=3.4.21.4 {ECO:0000256|ARBA:ARBA00038868};
GN ORFNames=ALC53_01280 {ECO:0000313|EMBL:KYM92217.1};
OS Atta colombica.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Atta.
OX NCBI_TaxID=520822 {ECO:0000313|EMBL:KYM92217.1, ECO:0000313|Proteomes:UP000078540};
RN [1] {ECO:0000313|EMBL:KYM92217.1, ECO:0000313|Proteomes:UP000078540}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Treedump-2 {ECO:0000313|EMBL:KYM92217.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KYM92217.1};
RA Nygaard S., Hu H., Boomsma J., Zhang G.;
RT "Atta colombica WGS genome.";
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.; EC=3.4.21.4;
CC Evidence={ECO:0000256|ARBA:ARBA00036320};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ976403; KYM92217.1; -; Genomic_DNA.
DR STRING; 520822.A0A195BUG4; -.
DR Proteomes; UP000078540; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00190; Tryp_SPc; 2.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 5.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001314; Peptidase_S1A.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR InterPro; IPR033116; TRYPSIN_SER.
DR PANTHER; PTHR24264:SF65; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR Pfam; PF00089; Trypsin; 4.
DR PRINTS; PR00722; CHYMOTRYPSIN.
DR SMART; SM00020; Tryp_SPc; 3.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 4.
DR PROSITE; PS50240; TRYPSIN_DOM; 3.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
DR PROSITE; PS00135; TRYPSIN_SER; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW Membrane {ECO:0000313|EMBL:KYM92217.1};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW Reference proteome {ECO:0000313|Proteomes:UP000078540};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW ECO:0000256|RuleBase:RU363034};
KW Transmembrane {ECO:0000313|EMBL:KYM92217.1}.
FT DOMAIN 1..237
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 233..506
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT DOMAIN 546..741
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
SQ SEQUENCE 783 AA; 87470 MW; B4988A715A21DF47 CRC64;
MNVNWSGTGR LAVPAFPSGI AITSSTSQSS IVGVGRQFMW FMIRVTFGEH DRCIEKSPET
RYVVRVMTGD FSFLNFENDI ALLRLNERVP LSDTIRPICL PTMLDNEYVD AKAIVSGWGT
LKEDGKPSCL LQEVEVPVMS LQACRNTSYS ARMISENMLC AGYLEGQKDS CQGDSGGPLI
TEREDKKYEL IGVVSWGNGC ARPGYPVCGR PNRKVARLLG GEYTESHEFP WLANIHIKSK
LLVSGILIND RYILTAASQL IGATAHEIKV SLGEYDRCNL DISSVNISIE FIILHPEFNL
ESNTHDLALI RLSRPTKFEK RISPICLPNP GSTYLGQVGT LVGWIKLKDQ TDNAACRPRK
LGLPILGQKE CIKSGINAMN LHDDYGCIGI VGTNSLVCEN DVGSSVQYRS YAGIYDLIVI
FYNFSKDDLK VSVGAHNSCK WDAKSIIFSV KSIFPHPDYN KNTNFADIML VKLIMRITFN
KLVRPIYLPK LGDVTQMISW ILQVTKHDSK YCSTCAKSSL TFTSKSRLRN FFIIECGLTE
GISNRIIGGK VSIPHIFPWV VAILNRNNFH CGGTLINNQY ILTAGHCVQW TNHADLSVGV
GMHDIKNLND GYIVAIDEII LHEDFKSDYL HDTNDIALIR LQQPIKIDEN VRPVCLPLKG
QFCTFLIIFK LFHLIKNSDY TGQYVKITGW GRVQVEGKAS RFLRQATLKV MSFAACKNTS
FGDHITESMI CAYNDNTDAC QVLCHDQFXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXV
//