ID A0A212FPU5_DANPL Unreviewed; 553 AA.
AC A0A212FPU5;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Digestive cysteine proteinase 2 {ECO:0000313|RefSeq:XP_032524140.1};
GN Name=LOC116775358 {ECO:0000313|RefSeq:XP_032524140.1};
GN ORFNames=KGM_213403 {ECO:0000313|EMBL:OWR55754.1};
OS Danaus plexippus plexippus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR55754.1, ECO:0000313|Proteomes:UP000007151};
RN [1] {ECO:0000313|EMBL:OWR55754.1, ECO:0000313|Proteomes:UP000007151}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR55754.1};
RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT "The monarch butterfly genome yields insights into long-distance
RT migration.";
RL Cell 147:1171-1185(2011).
RN [2] {ECO:0000313|EMBL:OWR55754.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR55754.1};
RA Zhan S., Reppert S.M.;
RT "MonarchBase: the monarch butterfly genome database.";
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|RefSeq:XP_032524140.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGBW02000859; OWR55754.1; -; Genomic_DNA.
DR RefSeq; XP_032524140.1; XM_032668249.1.
DR EnsemblMetazoa; XM_032668249.1; XP_032524140.1; LOC116775358.
DR KEGG; dpl:KGM_213403; -.
DR eggNOG; KOG1543; Eukaryota.
DR OrthoDB; 5472948at2759; -.
DR Proteomes; UP000007151; Unassembled WGS sequence.
DR Proteomes; UP000596680; Chromosome 24.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF975; 26-29KD-PROTEINASE; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000007151};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..553
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5041057863"
FT DOMAIN 248..304
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 335..552
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 553 AA; 62656 MW; AD902CBE06151025 CRC64;
MFVYTLLCFY LGSVVGLRID KDNPPQWSDV YTVKGLLNIP YAELHEPFYA WFDSKNGKSR
IDYYGTMVKT YQLSASVYPQ YGTSIKIAPV TTEHVLNQDT CLQVNGTEGE NINIQTVLPD
MTDFKFVGTE TMKDSDTFKW RMVTSVGDKV NKYTMWVKYR KSLRGDNIAI PVRYEMKGFN
SLLGSHYDHY YLDYTDFDNS DIEPDVFKVD SSFKCSSFPG PGFRHMATFN PMKEFVHPAS
DEHVHHEFDR FVNKHNKQYA SEVEKTKRIN IFRQNLRLIH SHNRAHRGFS LAVNHLADHT
DEELAARRGR RYTGHNAGLP FPYGEAELAD MSVKLPPEFD WRLFGAVTPV KDQSVCGSCW
SFGTVGAVEG ALFLSNGGHL VRLSQQALVD CSWGFGNNGC DGGEDYRAYQ WIMRHGLPTE
DDYGGYLGQD GYCHMENVTV ATKMKGWVNV TAKNENALKL AIFKHGPVSV AIDASHKTFS
FYSNGVYFEP KCKNSVEELD HAVLAVGFGV LNGHKYWLVK NSWSNMWGND GYVLMSARDD
NCGVQAAPTY VII
//