ID V6TZQ8_GIAIN Unreviewed; 690 AA.
AC V6TZQ8;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Cysteine protease, papain family {ECO:0000313|EMBL:ESU42495.1};
DE Flags: Fragment;
GN ORFNames=GSB_153226 {ECO:0000313|EMBL:ESU42495.1};
OS Giardia intestinalis (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=5741 {ECO:0000313|EMBL:ESU42495.1, ECO:0000313|Proteomes:UP000018040};
RN [1] {ECO:0000313|Proteomes:UP000018040}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=GS {ECO:0000313|Proteomes:UP000018040};
RA Adam R., Dahlstrom E., Martens C., Bruno D., Barbian K., Porcella S.F.,
RA Nash T.;
RT "Genome sequencing of Giardia lamblia Genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of Genotypes A1 and E (WB and
RT Pig).";
RL Submitted (FEB-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESU42495.1, ECO:0000313|Proteomes:UP000018040}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GS {ECO:0000313|EMBL:ESU42495.1,
RC ECO:0000313|Proteomes:UP000018040};
RX PubMed=24307482; DOI=10.1093/gbe/evt197;
RA Adam R.D., Dahlstrom E.W., Martens C.A., Bruno D.P., Barbian K.D.,
RA Ricklefs S.M., Hernandez M.M., Narla N.P., Patel R.B., Porcella S.F.,
RA Nash T.E.;
RT "Genome sequencing of Giardia lamblia genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of genotypes A1 and E (WB and
RT Pig).";
RL Genome Biol. Evol. 5:2498-2511(2013).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESU42495.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHHH01000080; ESU42495.1; -; Genomic_DNA.
DR AlphaFoldDB; V6TZQ8; -.
DR EnsemblProtists; ESU42495; ESU42495; GSB_153226.
DR VEuPathDB; GiardiaDB:DHA2_151075; -.
DR VEuPathDB; GiardiaDB:GL50581_259; -.
DR VEuPathDB; GiardiaDB:GL50803_00114915; -.
DR VEuPathDB; GiardiaDB:QR46_1625; -.
DR Proteomes; UP000018040; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR Pfam; PF00112; Peptidase_C1; 2.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000313|EMBL:ESU42495.1};
KW Membrane {ECO:0000256|SAM:Phobius}; Protease {ECO:0000313|EMBL:ESU42495.1};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 646..666
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 218..580
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ESU42495.1"
SQ SEQUENCE 690 AA; 75288 MW; 418E960D5C9A20FE CRC64;
VRNVGAVLGV TASLPLSRPP LLSAARCCEC PHSRTTPASS PALVIQKGAP MLMLALSLLH
AALAARDPFS IHLECGRAFE RFVGLYGKAY DTPGERLEAE RSFCAHLEHL RRLRREGHEH
VEFGITPRMD RRPSTGLLPP RSLGGGEWGD AAAGTCSRDS KSNKYTNCNV GAYALGPDLK
GENTAPNSQS TTDKSKKQDD FFLVDVRDMN VKDRQRTLPQ SVDLREVGFV TRAKDQGLCG
SCYEFTTIEV LENLMLVDAD LYTQAKYVDP KPFPANYFCN SSTTASDAGS KAGADCSSTL
ITVSNRTTID YSNIPENWKS YSASTLRLSV QFLLDNLIGG NYCNGGNYYR AVSDFVNNLS
KLAKESECIY REYATSEKAK PDEGSKCSDT SIGDFNPAFV SMPMKKKNGA SGTANAGEDV
YEYTTNNDGT GASANTYGVY QILRNDGVPS SGLITEEMDK GKFAAWERNV MNVLARGVGL
AAAMHTESGV DTAVDGDKYE SVVSAALSFN LYKGGVFPDV KCVRATTNHQ VMLVGYGLYR
GRAVWILRNS WGTGWGVSGY YYIGRGSNSL CHELTVEYTM PRFYGPDRDG VHPFFLSDPS
EIQKVALADA VRASPFSSKI KRCVNGLDTV DAYGGTMCEA VTVSTWNYVL VGVLAIVLFF
LGRYCMQYYF CAPEYIFHSP VNLSLPGRSV
//