GenomeNet

Database: UniProt
Entry: G5IJQ9_9CLOT
LinkDB: G5IJQ9_9CLOT
Original site: G5IJQ9_9CLOT 
ID   G5IJQ9_9CLOT            Unreviewed;       530 AA.
AC   G5IJQ9;
DT   25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT   25-JAN-2012, sequence version 1.
DT   24-JAN-2024, entry version 33.
DE   RecName: Full=Carbohydrate-binding domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=HMPREF9473_03737 {ECO:0000313|EMBL:EHI58279.1};
OS   Hungatella hathewayi WAL-18680.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX   NCBI_TaxID=742737 {ECO:0000313|EMBL:EHI58279.1, ECO:0000313|Proteomes:UP000005384};
RN   [1] {ECO:0000313|EMBL:EHI58279.1, ECO:0000313|Proteomes:UP000005384}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WAL-18680 {ECO:0000313|EMBL:EHI58279.1,
RC   ECO:0000313|Proteomes:UP000005384};
RG   The Broad Institute Genome Sequencing Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Finegold S.M., Summanen P.H.,
RA   Molitoris D.R., Song M., Daigneault M., Allen-Vercoe E., Young S.K.,
RA   Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA   Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA   Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA   Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A.,
RA   Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA   Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Clostridium hathewayi WAL-18680.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHI58279.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADLN01000104; EHI58279.1; -; Genomic_DNA.
DR   RefSeq; WP_006781728.1; NZ_JH379028.1.
DR   AlphaFoldDB; G5IJQ9; -.
DR   PATRIC; fig|742737.3.peg.3718; -.
DR   HOGENOM; CLU_021406_1_0_9; -.
DR   Proteomes; UP000005384; Unassembled WGS sequence.
DR   InterPro; IPR025584; Cthe_2159.
DR   Pfam; PF14262; Cthe_2159; 1.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000005384}.
FT   REGION          29..67
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          482..530
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        29..59
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        504..522
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   530 AA;  54760 MW;  4C09080ACF70C56B CRC64;
     MMTKHRRWKT GAVCLVAAAV VLSGCHSGKT VTAESQTESN AENNSEKNSE NNGAGTSGNP
     TDAGAGKIDS FMIGETEVKL EDDDYYSDWS GDDVTYIKLA DAAVSIGGNG AEADGNVVRI
     TDGGTYVLSG SWSDGSVQVD AGDKGTVRLV LNGVEIHSEE SAPIYVMKAD KTVISLEDGT
     VNTLSDTEGL VYTDEEKEEP NATLFSKKDL TINGTGTLVV NAAFNHGING KDNVILMSGN
     YQVTSARSAF KGKDLLAVLD GSYEITAGND GMHSDGNLGI FGGTIEILES VEGLEGGTIQ
     IADGDISIKS SDDGINASSD EEDVTPTLYI SGGTVAVDAE GDGIDSNGNF YMTDGDVTVY
     GPVSNGDGAL DYDGVFEMSG GQLAAFGMSG MAQGVSAGST QNALMMTYPK EQAAGTEVVL
     KNGAGEELYR WSGAKAFNSL VIAVPELTTG ESYVLETGES QLEFVLESAM TYLNENGVTE
     AASMRPMGGK GGMGGGPGGQ RPDGSGDERP EGLPEGGKGR RPEGLPEGTP
//
DBGET integrated database retrieval system