ID A0A195EZU7_9HYME Unreviewed; 903 AA.
AC A0A195EZU7;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Cysteine proteinase CG12163 {ECO:0008006|Google:ProtNLM};
GN ORFNames=ALC56_12006 {ECO:0000313|EMBL:KYN33748.1};
OS Trachymyrmex septentrionalis.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Myrmicinae; Trachymyrmex.
OX NCBI_TaxID=34720 {ECO:0000313|EMBL:KYN33748.1, ECO:0000313|Proteomes:UP000078541};
RN [1] {ECO:0000313|EMBL:KYN33748.1, ECO:0000313|Proteomes:UP000078541}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tsep2-gDNA-1 {ECO:0000313|EMBL:KYN33748.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:KYN33748.1};
RA Nygaard S., Hu H., Boomsma J., Zhang G.;
RT "Trachymyrmex septentrionalis WGS genome.";
RL Submitted (MAR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ981897; KYN33748.1; -; Genomic_DNA.
DR RefSeq; XP_018350403.1; XM_018494901.1.
DR AlphaFoldDB; A0A195EZU7; -.
DR STRING; 34720.A0A195EZU7; -.
DR GeneID; 108753372; -.
DR OrthoDB; 1085298at2759; -.
DR Proteomes; UP000078541; Unassembled WGS sequence.
DR GO; GO:0004869; F:cysteine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00042; CY; 1.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.10.450.10; -; 2.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR000010; Cystatin_dom.
DR InterPro; IPR046350; Cystatin_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR018073; Prot_inh_cystat_CS.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF959; CATHEPSIN F; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00031; Cystatin; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00043; CY; 2.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54403; Cystatin/monellin; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00287; CYSTATIN; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000078541};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..903
FT /note="Cysteine proteinase CG12163"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008271131"
FT DOMAIN 186..298
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 459..565
FT /note="Cystatin"
FT /evidence="ECO:0000259|SMART:SM00043"
FT DOMAIN 598..655
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 684..901
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 903 AA; 101900 MW; ADD02D371C554BB3 CRC64;
MAERSRAGHL ALLYLACLVA QMVRSNPVNI PLNRVSSQLS EKAWRALNDH SPTHHLYAHG
NLINAQELDE PPYKVYKFTY DLNPICGTES CPREACTIDL KQDEYGDIET LNDSIQCMYF
YPADAQNDMS QEQQSYQEQE DTQVTQENLA EQVNNNRSVE LDHEIQETGD ENVRPFIAMR
ASGNNYCPGC PYELNPNLPG LLAFGDQALR SMDEQTTSDY KHKLMSIVRV TRSVPVSSNV
IEYQLLLLIG ESECLKNALE REQECPLRTT IPTKLCLVTF EQRPWQQSSL KITKNNCTDP
ENVSSATYQT QRSNLESESF VTVKYNEGQP FHTEMADVYE GLKEILDNYT LAPSTPSKEP
EEAEVTESAI MKIVLNKSQD DEKISSFTDK TKEFKEFLED FDLPVKLEDA APNTDGVMKE
VKEEKFLRKK VSVEDLKLEK VKDRGNVNNR RKRSGPVPNL VGASTATSIN NPDVQLFVQK
ALQKVSAESD SPNEPLIVEI TEASVQVVAG KLYKIKAKLG DSDCSKGVKD NCKLLEGSEV
KDCLIQVWSR PWQDEGSPEI TVTCPPPENS RRKRSLRGIN YSKKMLQISE DIKAERLFDN
FVTTYNRTYS SPEEKNLRFK IFRENLNFIE MLRENEQGTG IYGVNMFADV SQKEFRTRYL
GLRPDLQSEN EIPLPKAEIP DIDLPPSFDW RQKGVVTPVK NQGSCGSCWA FSVTGNVEGQ
YAIKHGQLLS LSEQELVDCD DLDEGCNGGL PDNAYRAIEQ LGGLEQESDY PYEAENEKCH
FKQNLVKVEL TSAVNVTSNE TQIAQWLVQN GPIAIGINAN AMQFYMGGVS HPLKFLCNPK
NLDHGVLIVG YGTSRYPLFH KKLPYWIIKN SWGKSWGEQG YYRVYRGDGT CGLNTMASSA
VVV
//