Database: UniProt
Entry: A0A091TFH5_PHALP
ID   A0A091TFH5_PHALP        Unreviewed;       430 AA.
AC   A0A091TFH5;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   RecName: Full=Legumain {ECO:0000256|ARBA:ARBA00021147};
DE            EC=3.4.22.34 {ECO:0000256|ARBA:ARBA00012628};
DE   AltName: Full=Protease, cysteine 1 {ECO:0000256|ARBA:ARBA00030799};
DE   Flags: Fragment;
GN   ORFNames=N335_08463 {ECO:0000313|EMBL:KFQ72646.1};
OS   Phaethon lepturus (White-tailed tropicbird).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Eurypygimorphae; Phaethontiformes;
OC   Phaethontidae; Phaethon.
OX   NCBI_TaxID=97097 {ECO:0000313|EMBL:KFQ72646.1, ECO:0000313|Proteomes:UP000053638};
RN   [1] {ECO:0000313|EMBL:KFQ72646.1, ECO:0000313|Proteomes:UP000053638}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N335 {ECO:0000313|EMBL:KFQ72646.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of proteins and small molecule substrates at
CC         -Asn-|-Xaa- bonds.; EC=3.4.22.34;
CC         Evidence={ECO:0000256|ARBA:ARBA00000810};
CC   -!- SIMILARITY: Belongs to the peptidase C13 family.
CC       {ECO:0000256|ARBA:ARBA00009941}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KK449347; KFQ72646.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A091TFH5; -.
DR   MEROPS; C13.004; -.
DR   PhylomeDB; A0A091TFH5; -.
DR   Proteomes; UP000053638; Unassembled WGS sequence.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR   GO; GO:0051603; P:proteolysis involved in protein catabolic process; IEA:InterPro.
DR   CDD; cd21115; legumain_C; 1.
DR   Gene3D; 1.10.132.130; -; 1.
DR   Gene3D; 3.40.50.1460; -; 1.
DR   InterPro; IPR043577; AE.
DR   InterPro; IPR048501; Legum_prodom.
DR   InterPro; IPR046427; Legumain_prodom_sf.
DR   InterPro; IPR001096; Peptidase_C13.
DR   PANTHER; PTHR12000; HEMOGLOBINASE FAMILY MEMBER; 1.
DR   PANTHER; PTHR12000:SF42; LEGUMAIN; 1.
DR   Pfam; PF20985; Legum_prodom; 1.
DR   Pfam; PF01650; Peptidase_C13; 1.
DR   PIRSF; PIRSF500139; AE; 1.
DR   PIRSF; PIRSF019663; Legumain; 1.
DR   PRINTS; PR00776; HEMOGLOBNASE.
PE   3: Inferred from homology;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053638};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..430
FT                   /note="Legumain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001879948"
FT   DOMAIN          333..428
FT                   /note="Legumain prodomain"
FT                   /evidence="ECO:0000259|Pfam:PF20985"
FT   ACT_SITE        148
FT                   /evidence="ECO:0000256|PIRSR:PIRSR019663-1"
FT   ACT_SITE        189
FT                   /note="Nucleophile"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR019663-1"
FT   NON_TER         430
FT                   /evidence="ECO:0000313|EMBL:KFQ72646.1"
SQ   SEQUENCE   430 AA;  49049 MW;  5569AC24EB597FB6 CRC64;
     MILKAVLLLS CALGISAFPM EEPEDGGKHW VVIVAGSNGW YNYRHQADVC HAYQIVHRNG
     IPDEQIIVMM YDDIADNDEN PTKGIVINRP NGTDVYAGVP KDYTKEDVTP KNFLAVLRGD
     AEAVKGVGSG KVLKSGPKDH VFVYFTDHGA PGLLAFPDDD LHVKDLNKTI WYMYHHKKYR
     KMVFYIEACE SGSMMNHLAD NINVYATTAA NPRESSYACY YDDERQTYLG DWYSVNWMED
     SDMEDLRKET LHKQFQLVKK RTNTSHVMQY GNRSISSMKV MQFQGMGKKA IPISLPPVER
     YDLTPSPDVP FAIMKRKLMA TNDIYEAKKI AAEMKTHLEV KEFIQESMRK IVTLVTGSKE
     QTNQILSDRL TISNYDCYQS AVNHFKAHCF NWHLPLYEYA LRQLYALVNV CEGGYPIDRI
     CLAMNQVCLG
//
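
The entry above uses the UniProtKB flat-file layout: each annotation line starts with a two-letter line code followed by three spaces, sequence lines start with five spaces and carry residues in blocks of ten, and `//` ends the entry. A minimal sketch of reading that layout is shown below, assuming the record is held in a string. The function name `parse_flatfile` and the abbreviated `RECORD` (only the AC, SQ, and sequence lines are copied from the entry) are illustrative choices, not part of any UniProt or DBGET tool.

```python
import re
from collections import defaultdict


def parse_flatfile(text):
    """Group annotation lines by two-letter code and join the sequence lines."""
    fields = defaultdict(list)
    residues = []
    for line in text.splitlines():
        if line.startswith("//"):
            # End-of-entry marker.
            break
        if line.startswith("     "):
            # Sequence line: drop the spaces between the blocks of ten residues.
            residues.append(line.replace(" ", ""))
        elif len(line) > 5 and line[2:5] == "   ":
            # Annotation line: the code is columns 1-2, the content starts at column 6.
            fields[line[:2]].append(line[5:].rstrip())
    return dict(fields), "".join(residues)


# Abbreviated copy of the entry: AC, SQ, and the sequence block only.
RECORD = """\
AC   A0A091TFH5;
SQ   SEQUENCE   430 AA;  49049 MW;  5569AC24EB597FB6 CRC64;
     MILKAVLLLS CALGISAFPM EEPEDGGKHW VVIVAGSNGW YNYRHQADVC HAYQIVHRNG
     IPDEQIIVMM YDDIADNDEN PTKGIVINRP NGTDVYAGVP KDYTKEDVTP KNFLAVLRGD
     AEAVKGVGSG KVLKSGPKDH VFVYFTDHGA PGLLAFPDDD LHVKDLNKTI WYMYHHKKYR
     KMVFYIEACE SGSMMNHLAD NINVYATTAA NPRESSYACY YDDERQTYLG DWYSVNWMED
     SDMEDLRKET LHKQFQLVKK RTNTSHVMQY GNRSISSMKV MQFQGMGKKA IPISLPPVER
     YDLTPSPDVP FAIMKRKLMA TNDIYEAKKI AAEMKTHLEV KEFIQESMRK IVTLVTGSKE
     QTNQILSDRL TISNYDCYQS AVNHFKAHCF NWHLPLYEYA LRQLYALVNV CEGGYPIDRI
     CLAMNQVCLG
//
"""

fields, sequence = parse_flatfile(RECORD)

# Length declared on the SQ line, to compare against the residues actually read.
declared = int(re.search(r"(\d+) AA;", fields["SQ"][0]).group(1))
print("declared length:", declared, "parsed length:", len(sequence))

# FT ACT_SITE positions are 1-based, so subtract 1 to index the string.
for position in (148, 189):
    print("ACT_SITE", position, "->", sequence[position - 1])
```

Run against this entry, the parsed length agrees with the 430 AA stated on the SQ line, and the two annotated active-site positions fall on a histidine (148) and a cysteine (189), the latter being the residue the FT block labels as the nucleophile.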