GenomeNet

Database: UniProt
Entry: A0A644F3R9_GIAIC
LinkDB: A0A644F3R9_GIAIC
Original site: A0A644F3R9_GIAIC 
ID   A0A644F3R9_GIAIC        Unreviewed;       714 AA.
AC   A0A644F3R9;
DT   22-APR-2020, integrated into UniProtKB/TrEMBL.
DT   22-APR-2020, sequence version 1.
DT   27-MAR-2024, entry version 12.
DE   SubName: Full=Cathepsin B-like cysteine proteinase {ECO:0000313|EMBL:KAE8303245.1};
GN   ORFNames=GL50803_00119224 {ECO:0000313|EMBL:KAE8303245.1};
OS   Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC   Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX   NCBI_TaxID=184922 {ECO:0000313|EMBL:KAE8303245.1, ECO:0000313|Proteomes:UP000001548};
RN   [1] {ECO:0000313|EMBL:KAE8303245.1, ECO:0000313|Proteomes:UP000001548}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548};
RX   PubMed=17901334; DOI=10.1126/science.1143837;
RA   Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA   Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA   Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA   Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA   Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA   Svard S.G., Sogin M.L.;
RT   "Genomic minimalism in the early diverging intestinal parasite Giardia
RT   lamblia.";
RL   Science 317:1921-1926(2007).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAE8303245.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACB03000003; KAE8303245.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A644F3R9; -.
DR   InParanoid; A0A644F3R9; -.
DR   Proteomes; UP000001548; Chromosome 3.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR018466; Kre9/Knh1-like_N.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF10342; Kre9_KNH; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000001548}.
FT   DOMAIN          1..194
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   REGION          475..516
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   714 AA;  77611 MW;  46DF317D2E6714E8 CRC64;
     MVTARRCLQL NDSRLVSLED LVTCDHTKYL NIQNNGCRGG NSLASLKFGE TTGMVYDTCE
     DYWNRTYPYP TETCKTVCKD KRPKDRTIKN KAPYRLSGVD AMMRDIYQNG PIAVSMYLAN
     DFPSKDKKGI YSSGPNTKLG GGHAVMIVGW GEENGVPYWD CANTYGTNWG DQGYFKIKRG
     SNELKIETWP GSALPIDTTP KPEPQNGTIT VEAAKTYIAG YEIAIPYKEV ATPAELQLKS
     EAGAVTSLVK LTTSSGTARV TIPRDTSAGQ YYLTTSTGDA TSARFSLASY FTLAFDKTSY
     QGQEGESVTL RFAAPLQVPC KLMDGASVLM ELSKGAQSCV APSSMSAGRH ALTLTTSDTV
     PVLSASAVLT IEKKHNDEAV ITISNPVAGA QVTALAEGLQ ISYASTGTSA KLLLQAYCGE
     RLLDVIAAGL PPSGSIEVRL PSSYGNCSAA YLRLRSETSP FAYADVGPLH VTSYGYPADD
     LPAPSDKKDP VPDPEPQPDP DPEPEPEPEP IGDVPLLITE PSTQSVWVPG GAVTIRWTSN
     LTASTEMTIL LYEKVGSKSY LRHTFTRSAP NTGLYADTLP ASVPSGPNYF VRMRSNSPMF
     TNTSDDFEVK ASRYLVRDVP DAITVRDPLS FTIHRHGFVF PLPRRRFSVT LMQAGKEVCA
     LDGVEGVAGL NTIYLDSCAE RMQELRDKQY YLKLCSQKME CTVTPTFTVL DDTH
//
DBGET integrated database retrieval system