ID G5II33_9CLOT Unreviewed; 348 AA.
AC G5II33;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHI58849.1};
GN ORFNames=HMPREF9473_03161 {ECO:0000313|EMBL:EHI58849.1};
OS Hungatella hathewayi WAL-18680.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX NCBI_TaxID=742737 {ECO:0000313|EMBL:EHI58849.1, ECO:0000313|Proteomes:UP000005384};
RN [1] {ECO:0000313|EMBL:EHI58849.1, ECO:0000313|Proteomes:UP000005384}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WAL-18680 {ECO:0000313|EMBL:EHI58849.1,
RC ECO:0000313|Proteomes:UP000005384};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Finegold S.M., Summanen P.H.,
RA Molitoris D.R., Song M., Daigneault M., Allen-Vercoe E., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Clostridium hathewayi WAL-18680.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHI58849.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLN01000083; EHI58849.1; -; Genomic_DNA.
DR RefSeq; WP_006781140.1; NZ_JH379027.1.
DR AlphaFoldDB; G5II33; -.
DR PATRIC; fig|742737.3.peg.3134; -.
DR HOGENOM; CLU_969065_0_0_9; -.
DR OrthoDB; 1928949at2; -.
DR Proteomes; UP000005384; Unassembled WGS sequence.
DR Gene3D; 2.10.270.10; Cholin Binding; 1.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR Pfam; PF19085; Choline_bind_2; 1.
DR Pfam; PF19127; Choline_bind_3; 1.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR PROSITE; PS51170; CW; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005384};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..348
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003478821"
FT REPEAT 247..266
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 267..286
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 288..307
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 190..220
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 348 AA; 39317 MW; E7CF35E6BFFF1591 CRC64;
MKKKLALLLA ISTLTIIGTT NAFAHDLVIE GEERTGKSYP TISGSYYTYD ENIGSDVKKD
VTIHLIPKNE RFRVSENDYS DGTGVAIYYL ALGDRNSSST NESYNGGSSA FLYDYVDSLW
YEQGVYYDFN SLAEIGNFSM DRDTAFFPVV FKEYSEYDWS LEELNPYIFK LVDAPLLEID
VATDSNASGD ISTDIPNTGG SSNHSGSSSS GGTGRKISSV NPANVNGEWI KDDNGWWFKK
MDGSYPKNEW IMNNNIWYFF DEAGYMKTNW IEINGVKYFL NPDGAMVSND WTFQDGKWYY
LNSTGAMQAN CWIKWKGLWY YLTADGTLAV DTVTPDNYRV DLNGVWIE
//