ID E7G5H8_9FIRM Unreviewed; 505 AA.
AC E7G5H8;
DT 05-APR-2011, integrated into UniProtKB/TrEMBL.
DT 05-APR-2011, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE RecName: Full=WG repeat-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=HMPREF9488_00016 {ECO:0000313|EMBL:EFW06479.1};
OS Coprobacillus cateniformis.
OC Bacteria; Bacillota; Erysipelotrichia; Erysipelotrichales;
OC Coprobacillaceae; Coprobacillus.
OX NCBI_TaxID=100884 {ECO:0000313|EMBL:EFW06479.1, ECO:0000313|Proteomes:UP000003157};
RN [1] {ECO:0000313|EMBL:EFW06479.1, ECO:0000313|Proteomes:UP000003157}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=29_1 {ECO:0000313|EMBL:EFW06479.1,
RC ECO:0000313|Proteomes:UP000003157};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Sibley C.D.,
RA White A., Strauss J., Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., White J.,
RA Yandava C., Nusbaum C., Birren B.;
RT "The Genome Sequence of Coprobacillus sp. strain 29_1.";
RL Submitted (DEC-2010) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFW06479.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADKX01000001; EFW06479.1; -; Genomic_DNA.
DR RefSeq; WP_008787144.1; NZ_WSNW01000001.1.
DR AlphaFoldDB; E7G5H8; -.
DR STRING; 100884.GCA_000269565_01685; -.
DR GeneID; 78229555; -.
DR eggNOG; ENOG502ZZ9D; Bacteria.
DR HOGENOM; CLU_577102_0_0_9; -.
DR OrthoDB; 1654251at2; -.
DR Proteomes; UP000003157; Unassembled WGS sequence.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000003157};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..505
FT /note="WG repeat-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038594536"
SQ SEQUENCE 505 AA; 58208 MW; 9A8E9B0A4F20C44B CRC64;
MKKLLVLVLA MMLLVGCSQQ KKDTTFMVTR DNTLYALYNQ NGERLTEYSY KTFEEVSGIG
YIVTDANDQK GVISDKGDEI IKPGTYETLE AVDEMLYATK KVEVKKEKKD DKDKKEEKQT
TPTQTFIKNN LYVLNNKGEV LYSADEKTGI MKSGLPIILK DNTYIVLYHN GEILYNDIQI
VRYANQYKNS TSVILGLEKN ENYYYFDKQD EKNNIELTIN EKGTFQFLAQ NDKGVVLNDE
ATKSMIYIDF EHKKYYQNTI AIKEASFDST GNIVLTNDNK TFVYEVGKAP VLMTSYYMSA
YTYVARSTDI YGPHHIFKDG KSTGDFENCQ LYPVAYHVYY EIFPVYIRDK GYQYYNFDNK
KVIDKTFLAA EPFDANGRAI VKSKEEGYSL IDETGKVLTK DVYNQIKYIG SSYYAVYNEN
GTFGILDKDA GEIFPMEYTS LPTEAIVNYD SHDYLILGKN GRSFVYDIED EMKEIFSHEG
SVTLSEKGYF KVDNQYFTFE GEEIK
//