ID V9HES4_9CLOT Unreviewed; 1000 AA.
AC V9HES4;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 24-JAN-2024, entry version 34.
DE RecName: Full=Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase-like domain-containing protein {ECO:0000259|SMART:SM00047};
GN ORFNames=CSBG_00370 {ECO:0000313|EMBL:EEH96744.2};
OS Clostridium sp. 7_2_43FAA.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=457396 {ECO:0000313|EMBL:EEH96744.2, ECO:0000313|Proteomes:UP000017809};
RN [1] {ECO:0000313|EMBL:EEH96744.2, ECO:0000313|Proteomes:UP000017809}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=7_2_43FAA {ECO:0000313|EMBL:EEH96744.2,
RC ECO:0000313|Proteomes:UP000017809};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Allen-Vercoe E., Strauss J.,
RA Ambrose C., Young S.K., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Alvarado L., Arachchi H.M., Berlin A., Chapman S.B.,
RA Gearin G., Goldberg J., Griggs A., Gujja S., Hansen M., Heiman D.,
RA Howarth C., Larimer J., Lui A., MacDonald P.J.P., McCowen C.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Clostridium sp. 7_2_43FAA.";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEH96744.2}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACDK02000037; EEH96744.2; -; Genomic_DNA.
DR RefSeq; WP_008680706.1; NZ_JH815222.1.
DR AlphaFoldDB; V9HES4; -.
DR STRING; 457396.CSBG_00370; -.
DR GeneID; 65400268; -.
DR eggNOG; COG2247; Bacteria.
DR eggNOG; COG4193; Bacteria.
DR HOGENOM; CLU_299714_0_0_9; -.
DR OrthoDB; 9816557at2; -.
DR Proteomes; UP000017809; Unassembled WGS sequence.
DR GO; GO:0004040; F:amidase activity; IEA:InterPro.
DR Gene3D; 1.10.530.10; -; 1.
DR Gene3D; 3.40.50.12090; -; 2.
DR InterPro; IPR007253; Cell_wall-bd_2.
DR InterPro; IPR002901; MGlyc_endo_b_GlcNAc-like_dom.
DR PANTHER; PTHR30032:SF8; AMIDASE ENHANCER; 1.
DR PANTHER; PTHR30032; N-ACETYLMURAMOYL-L-ALANINE AMIDASE-RELATED; 1.
DR Pfam; PF04122; CW_binding_2; 3.
DR Pfam; PF01832; Glucosaminidase; 1.
DR SMART; SM00047; LYZ2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017809};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1000
FT /note="Mannosyl-glycoprotein endo-beta-N-
FT acetylglucosamidase-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004776572"
FT DOMAIN 453..589
FT /note="Mannosyl-glycoprotein endo-beta-N-
FT acetylglucosamidase-like"
FT /evidence="ECO:0000259|SMART:SM00047"
FT REGION 37..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 55..84
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 85..99
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 100..136
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1000 AA; 109827 MW; FF6D349321271292 CRC64;
MSFKKIKKIV AQLVIVAFIV PNISAKAIIN NPGSVIEEKS SNKETDSITN NAKSSQDNEK
EDSKKDNSNG EELLTDEKDS AKDNEAQNSI SNNSSDEKTS VEQSKTESNK KVEEIKENSE
TSNKEDSIKE EANKKESANT LKGTKYKKVK VINPALRSYS VISSENLERE EFKLKQEALE
NLEVLEPMTD KNYELTIANS DGSYEFVEAY DDLDEAVNAA KSLDPSTKEN QDPAVINYSG
QVVYSTNQMA RVIKYSNNSV ISGTVNMYTE PSLQNENAFT YVNPGYMDDA PILDVSGTSA
KVMINGYEGW INNNKASGTY DMLVVPLSQV KNPSYYTVQN GELVHFISYD LTGVTGYNKI
LGKAPFFLKE GVRYLSYDGK YFYQYNHNSQ TDIANKLDIL IKDYRSGIRS NSINSGSPYY
IYYLNLPFRS QTVYSANDLN RYINAFINGE IDGIKRAGSK LKDLGQVFKD TESTYGVNAL
LALGVSINES AYGMSTIAQT KNNLFGLKAY DSAPGESADS FATPKDSVID FTKNYISRGY
ADPADWRYFG GFLGNKNRGT NVKYASDPFW GEKATSFAYE IDKYLSGGNS NLKDTNSKQI
GVATSNTSVI KKDGTLLYNV TNDTNQYGGY INTPFVVSNL QQVTISGKTY YEINPERNTP
IGLGGPDNKY HGSYNWNDRG YILASNVSMT NEYLPPIEVK SGLNRYSTAV ELSKSSFSSA
ETIVISNGYA IPDGLAATPI ASYYKGPLLL VEKSSIPSAT QNEIKRLKAK NVIIVGGTGV
VTPAVENQLR NLGITKITRL GGINRYETAL QVAKYIDQNL YDVENVVISN GYGEADSLSI
APVSGRDRMP IILVESNSIP SSVNSWLKSE ELNNAYLIGG TGVLSNNVLN QINSMTKQDI
TGNRLGGANR YETNAKVIER FYGNVLNKVY VTEGLELADA LTSGPVAAVN ESPVVIAEAQ
LTSTQKSVLD KKTTNTIIQV GGVVPKTAIN ELRKLLSRSQ
//