ID G5IJQ9_9CLOT Unreviewed; 530 AA.
AC G5IJQ9;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=Carbohydrate-binding domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=HMPREF9473_03737 {ECO:0000313|EMBL:EHI58279.1};
OS Hungatella hathewayi WAL-18680.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX NCBI_TaxID=742737 {ECO:0000313|EMBL:EHI58279.1, ECO:0000313|Proteomes:UP000005384};
RN [1] {ECO:0000313|EMBL:EHI58279.1, ECO:0000313|Proteomes:UP000005384}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WAL-18680 {ECO:0000313|EMBL:EHI58279.1,
RC ECO:0000313|Proteomes:UP000005384};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Finegold S.M., Summanen P.H.,
RA Molitoris D.R., Song M., Daigneault M., Allen-Vercoe E., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Clostridium hathewayi WAL-18680.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHI58279.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLN01000104; EHI58279.1; -; Genomic_DNA.
DR RefSeq; WP_006781728.1; NZ_JH379028.1.
DR AlphaFoldDB; G5IJQ9; -.
DR PATRIC; fig|742737.3.peg.3718; -.
DR HOGENOM; CLU_021406_1_0_9; -.
DR Proteomes; UP000005384; Unassembled WGS sequence.
DR InterPro; IPR025584; Cthe_2159.
DR Pfam; PF14262; Cthe_2159; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005384}.
FT REGION 29..67
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 482..530
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..59
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 504..522
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 530 AA; 54760 MW; 4C09080ACF70C56B CRC64;
MMTKHRRWKT GAVCLVAAAV VLSGCHSGKT VTAESQTESN AENNSEKNSE NNGAGTSGNP
TDAGAGKIDS FMIGETEVKL EDDDYYSDWS GDDVTYIKLA DAAVSIGGNG AEADGNVVRI
TDGGTYVLSG SWSDGSVQVD AGDKGTVRLV LNGVEIHSEE SAPIYVMKAD KTVISLEDGT
VNTLSDTEGL VYTDEEKEEP NATLFSKKDL TINGTGTLVV NAAFNHGING KDNVILMSGN
YQVTSARSAF KGKDLLAVLD GSYEITAGND GMHSDGNLGI FGGTIEILES VEGLEGGTIQ
IADGDISIKS SDDGINASSD EEDVTPTLYI SGGTVAVDAE GDGIDSNGNF YMTDGDVTVY
GPVSNGDGAL DYDGVFEMSG GQLAAFGMSG MAQGVSAGST QNALMMTYPK EQAAGTEVVL
KNGAGEELYR WSGAKAFNSL VIAVPELTTG ESYVLETGES QLEFVLESAM TYLNENGVTE
AASMRPMGGK GGMGGGPGGQ RPDGSGDERP EGLPEGGKGR RPEGLPEGTP
//