Database: UniProt
Entry: G5IGB9_9CLOT
ID   G5IGB9_9CLOT            Unreviewed;       987 AA.
AC   G5IGB9;
DT   25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT   25-JAN-2012, sequence version 1.
DT   27-MAR-2024, entry version 43.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHI59481.1};
GN   ORFNames=HMPREF9473_02547 {ECO:0000313|EMBL:EHI59481.1};
OS   Hungatella hathewayi WAL-18680.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX   NCBI_TaxID=742737 {ECO:0000313|EMBL:EHI59481.1, ECO:0000313|Proteomes:UP000005384};
RN   [1] {ECO:0000313|EMBL:EHI59481.1, ECO:0000313|Proteomes:UP000005384}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=WAL-18680 {ECO:0000313|EMBL:EHI59481.1,
RC   ECO:0000313|Proteomes:UP000005384};
RG   The Broad Institute Genome Sequencing Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Finegold S.M., Summanen P.H.,
RA   Molitoris D.R., Song M., Daigneault M., Allen-Vercoe E., Young S.K.,
RA   Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA   Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA   Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA   Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Montmayeur A.,
RA   Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA   Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Clostridium hathewayi WAL-18680.";
RL   Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EHI59481.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADLN01000057; EHI59481.1; -; Genomic_DNA.
DR   RefSeq; WP_006780526.1; NZ_JH379027.1.
DR   AlphaFoldDB; G5IGB9; -.
DR   PATRIC; fig|742737.3.peg.2563; -.
DR   HOGENOM; CLU_299311_0_0_9; -.
DR   OrthoDB; 9805284at2; -.
DR   Proteomes; UP000005384; Unassembled WGS sequence.
DR   Gene3D; 2.10.270.10; Cholin Binding; 1.
DR   Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 4.
DR   InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR   InterPro; IPR026906; LRR_5.
DR   InterPro; IPR032675; LRR_dom_sf.
DR   PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR   PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR   Pfam; PF01473; Choline_bind_1; 1.
DR   Pfam; PF19127; Choline_bind_3; 1.
DR   Pfam; PF13306; LRR_5; 3.
DR   SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR   PROSITE; PS51170; CW; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000005384};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..987
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5039354621"
FT   REPEAT          888..907
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REGION          38..104
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        53..74
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        88..104
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   987 AA;  106804 MW;  1FDFBDB32EA108AB CRC64;
     MKRNHKRVIS TLLIMLLIST QPAVMAWADS VVPADHLLEG DENTGKKTGY ATPTDADQKD
     EDALKPDDTP KGDEIPSTEI PSTEIPSHEN LTDEKGPENE NSGKTEKVVI RWEFVDDDNL
     SGGELSLIGV SPENRADFDT VVSMLPEQVR AEIEEAGKVT LPITDWSCPE YQQDKDGEWP
     FTGKYEFIAE LPEGYVCESP ISVLVTLGGA MVNTINDRFT VDGLKYKELG PDTVQLMGYD
     GAKPVGTLII PDKVRKPSNG REYQVINISN GAFQDCSGLT GDLVIPDTVT KIGNRAFSKC
     GFTGQLVLPQ TLVRIEHDTF AGTAFSGQLI LPEKLNYIGV YAFLDCNFTG DLIIPDEVTD
     VGYGAFEGNN FTGTLILPKK LKTIDREGFT LCGFTGELNI PDTVTDIGMF AFYKCGFTGD
     LILPDGLTSI GTSAFEGCSE FTGRLSIPDG ITSIGKDAFK NTSFDGFDTT NQEIANLLYA
     SGVDKDKIKV GDQPYQPSQP PKAPGFQVGD MDYQIIGSDT VALTGYHGNS DTDIIIPDMV
     TDIVSGRTYP VTHIGSDAFW KKAITGSLHL PNTLISIEEG AFAENKFTGS LLLPESLVSI
     GVGAFYDSGF TGDLTIPANV SYIGPSSFEK AGFTGDLTIE GKLTKLEGYE FIGCGFTGAL
     VLPDTLTSIG DLTFQDCGFT GSLQLPKLVT EIGEKAFYGC DSLDSVYLGP NLQKLGAQAF
     PESLPLSTDS PRVQLLINTY LNQDAIADTS WDGKEDVPDG AVVTIKQDTT VTGDRRIGTE
     AVITIPSGVI LTVDGNLTVD GNLVVDGTIS VEGTLSINGS LSGSSTLIVR VNGRIVGDTS
     GIRVVYVSHG SSGNSSSSTV NPDILIGTWE RTEDGIWKFH QARGTYAVNR WGIVDGLWYY
     FDKEGRMLTG WQYINNQWYY LCREEDSKTN TGLKEGAMAT GWHFDPVYQA WFYLDTSGAM
     AVGEKVIDGK QYYFNPESDG TRGAMQQ
//