GenomeNet

Database: UniProt
Entry: A0A195BUG4_9HYME
LinkDB: A0A195BUG4_9HYME
Original site: A0A195BUG4_9HYME 
ID   A0A195BUG4_9HYME        Unreviewed;       783 AA.
AC   A0A195BUG4;
DT   05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT   05-OCT-2016, sequence version 1.
DT   27-MAR-2024, entry version 19.
DE   RecName: Full=trypsin {ECO:0000256|ARBA:ARBA00038868};
DE            EC=3.4.21.4 {ECO:0000256|ARBA:ARBA00038868};
GN   ORFNames=ALC53_01280 {ECO:0000313|EMBL:KYM92217.1};
OS   Atta colombica.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC   Formicidae; Myrmicinae; Atta.
OX   NCBI_TaxID=520822 {ECO:0000313|EMBL:KYM92217.1, ECO:0000313|Proteomes:UP000078540};
RN   [1] {ECO:0000313|EMBL:KYM92217.1, ECO:0000313|Proteomes:UP000078540}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Treedump-2 {ECO:0000313|EMBL:KYM92217.1};
RC   TISSUE=Whole body {ECO:0000313|EMBL:KYM92217.1};
RA   Nygaard S., Hu H., Boomsma J., Zhang G.;
RT   "Atta colombica WGS genome.";
RL   Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Preferential cleavage: Arg-|-Xaa, Lys-|-Xaa.; EC=3.4.21.4;
CC         Evidence={ECO:0000256|ARBA:ARBA00036320};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KQ976403; KYM92217.1; -; Genomic_DNA.
DR   STRING; 520822.A0A195BUG4; -.
DR   Proteomes; UP000078540; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd00190; Tryp_SPc; 2.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 5.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   InterPro; IPR033116; TRYPSIN_SER.
DR   PANTHER; PTHR24264:SF65; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24264; TRYPSIN-RELATED; 1.
DR   Pfam; PF00089; Trypsin; 4.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 3.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 4.
DR   PROSITE; PS50240; TRYPSIN_DOM; 3.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU363034};
KW   Membrane {ECO:0000313|EMBL:KYM92217.1};
KW   Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU363034};
KW   Reference proteome {ECO:0000313|Proteomes:UP000078540};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825,
KW   ECO:0000256|RuleBase:RU363034};
KW   Transmembrane {ECO:0000313|EMBL:KYM92217.1}.
FT   DOMAIN          1..237
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          233..506
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   DOMAIN          546..741
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
SQ   SEQUENCE   783 AA;  87470 MW;  B4988A715A21DF47 CRC64;
     MNVNWSGTGR LAVPAFPSGI AITSSTSQSS IVGVGRQFMW FMIRVTFGEH DRCIEKSPET
     RYVVRVMTGD FSFLNFENDI ALLRLNERVP LSDTIRPICL PTMLDNEYVD AKAIVSGWGT
     LKEDGKPSCL LQEVEVPVMS LQACRNTSYS ARMISENMLC AGYLEGQKDS CQGDSGGPLI
     TEREDKKYEL IGVVSWGNGC ARPGYPVCGR PNRKVARLLG GEYTESHEFP WLANIHIKSK
     LLVSGILIND RYILTAASQL IGATAHEIKV SLGEYDRCNL DISSVNISIE FIILHPEFNL
     ESNTHDLALI RLSRPTKFEK RISPICLPNP GSTYLGQVGT LVGWIKLKDQ TDNAACRPRK
     LGLPILGQKE CIKSGINAMN LHDDYGCIGI VGTNSLVCEN DVGSSVQYRS YAGIYDLIVI
     FYNFSKDDLK VSVGAHNSCK WDAKSIIFSV KSIFPHPDYN KNTNFADIML VKLIMRITFN
     KLVRPIYLPK LGDVTQMISW ILQVTKHDSK YCSTCAKSSL TFTSKSRLRN FFIIECGLTE
     GISNRIIGGK VSIPHIFPWV VAILNRNNFH CGGTLINNQY ILTAGHCVQW TNHADLSVGV
     GMHDIKNLND GYIVAIDEII LHEDFKSDYL HDTNDIALIR LQQPIKIDEN VRPVCLPLKG
     QFCTFLIIFK LFHLIKNSDY TGQYVKITGW GRVQVEGKAS RFLRQATLKV MSFAACKNTS
     FGDHITESMI CAYNDNTDAC QVLCHDQFXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
     XXV
//
DBGET integrated database retrieval system