ID G5IGB9_9CLOT Unreviewed; 987 AA.
AC G5IGB9;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHI59481.1};
GN ORFNames=HMPREF9473_02547 {ECO:0000313|EMBL:EHI59481.1};
OS Hungatella hathewayi WAL-18680.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX NCBI_TaxID=742737 {ECO:0000313|EMBL:EHI59481.1, ECO:0000313|Proteomes:UP000005384};
RN [1] {ECO:0000313|EMBL:EHI59481.1, ECO:0000313|Proteomes:UP000005384}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WAL-18680 {ECO:0000313|EMBL:EHI59481.1,
RC ECO:0000313|Proteomes:UP000005384};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Finegold S.M., Summanen P.H.,
RA Molitoris D.R., Song M., Daigneault M., Allen-Vercoe E., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Clostridium hathewayi WAL-18680.";
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHI59481.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLN01000057; EHI59481.1; -; Genomic_DNA.
DR RefSeq; WP_006780526.1; NZ_JH379027.1.
DR AlphaFoldDB; G5IGB9; -.
DR PATRIC; fig|742737.3.peg.2563; -.
DR HOGENOM; CLU_299311_0_0_9; -.
DR OrthoDB; 9805284at2; -.
DR Proteomes; UP000005384; Unassembled WGS sequence.
DR Gene3D; 2.10.270.10; Cholin Binding; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 4.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF01473; Choline_bind_1; 1.
DR Pfam; PF19127; Choline_bind_3; 1.
DR Pfam; PF13306; LRR_5; 3.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR PROSITE; PS51170; CW; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000005384};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..987
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039354621"
FT REPEAT 888..907
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 38..104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 53..74
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 987 AA; 106804 MW; 1FDFBDB32EA108AB CRC64;
MKRNHKRVIS TLLIMLLIST QPAVMAWADS VVPADHLLEG DENTGKKTGY ATPTDADQKD
EDALKPDDTP KGDEIPSTEI PSTEIPSHEN LTDEKGPENE NSGKTEKVVI RWEFVDDDNL
SGGELSLIGV SPENRADFDT VVSMLPEQVR AEIEEAGKVT LPITDWSCPE YQQDKDGEWP
FTGKYEFIAE LPEGYVCESP ISVLVTLGGA MVNTINDRFT VDGLKYKELG PDTVQLMGYD
GAKPVGTLII PDKVRKPSNG REYQVINISN GAFQDCSGLT GDLVIPDTVT KIGNRAFSKC
GFTGQLVLPQ TLVRIEHDTF AGTAFSGQLI LPEKLNYIGV YAFLDCNFTG DLIIPDEVTD
VGYGAFEGNN FTGTLILPKK LKTIDREGFT LCGFTGELNI PDTVTDIGMF AFYKCGFTGD
LILPDGLTSI GTSAFEGCSE FTGRLSIPDG ITSIGKDAFK NTSFDGFDTT NQEIANLLYA
SGVDKDKIKV GDQPYQPSQP PKAPGFQVGD MDYQIIGSDT VALTGYHGNS DTDIIIPDMV
TDIVSGRTYP VTHIGSDAFW KKAITGSLHL PNTLISIEEG AFAENKFTGS LLLPESLVSI
GVGAFYDSGF TGDLTIPANV SYIGPSSFEK AGFTGDLTIE GKLTKLEGYE FIGCGFTGAL
VLPDTLTSIG DLTFQDCGFT GSLQLPKLVT EIGEKAFYGC DSLDSVYLGP NLQKLGAQAF
PESLPLSTDS PRVQLLINTY LNQDAIADTS WDGKEDVPDG AVVTIKQDTT VTGDRRIGTE
AVITIPSGVI LTVDGNLTVD GNLVVDGTIS VEGTLSINGS LSGSSTLIVR VNGRIVGDTS
GIRVVYVSHG SSGNSSSSTV NPDILIGTWE RTEDGIWKFH QARGTYAVNR WGIVDGLWYY
FDKEGRMLTG WQYINNQWYY LCREEDSKTN TGLKEGAMAT GWHFDPVYQA WFYLDTSGAM
AVGEKVIDGK QYYFNPESDG TRGAMQQ
//