GenomeNet

Database: UniProt
Entry: E7G5H8_9FIRM
LinkDB: E7G5H8_9FIRM
Original site: E7G5H8_9FIRM 
ID   E7G5H8_9FIRM            Unreviewed;       505 AA.
AC   E7G5H8;
DT   05-APR-2011, integrated into UniProtKB/TrEMBL.
DT   05-APR-2011, sequence version 1.
DT   24-JAN-2024, entry version 35.
DE   RecName: Full=WG repeat-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=HMPREF9488_00016 {ECO:0000313|EMBL:EFW06479.1};
OS   Coprobacillus cateniformis.
OC   Bacteria; Bacillota; Erysipelotrichia; Erysipelotrichales;
OC   Coprobacillaceae; Coprobacillus.
OX   NCBI_TaxID=100884 {ECO:0000313|EMBL:EFW06479.1, ECO:0000313|Proteomes:UP000003157};
RN   [1] {ECO:0000313|EMBL:EFW06479.1, ECO:0000313|Proteomes:UP000003157}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=29_1 {ECO:0000313|EMBL:EFW06479.1,
RC   ECO:0000313|Proteomes:UP000003157};
RG   The Broad Institute Genome Sequencing Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Sibley C.D.,
RA   White A., Strauss J., Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Alvarado L., Arachchi H.M.,
RA   Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C., Freedman E.,
RA   Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S., Heilman E.,
RA   Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA   Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA   Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., White J.,
RA   Yandava C., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Coprobacillus sp. strain 29_1.";
RL   Submitted (DEC-2010) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EFW06479.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADKX01000001; EFW06479.1; -; Genomic_DNA.
DR   RefSeq; WP_008787144.1; NZ_WSNW01000001.1.
DR   AlphaFoldDB; E7G5H8; -.
DR   STRING; 100884.GCA_000269565_01685; -.
DR   GeneID; 78229555; -.
DR   eggNOG; ENOG502ZZ9D; Bacteria.
DR   HOGENOM; CLU_577102_0_0_9; -.
DR   OrthoDB; 1654251at2; -.
DR   Proteomes; UP000003157; Unassembled WGS sequence.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000003157};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..505
FT                   /note="WG repeat-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5038594536"
SQ   SEQUENCE   505 AA;  58208 MW;  9A8E9B0A4F20C44B CRC64;
     MKKLLVLVLA MMLLVGCSQQ KKDTTFMVTR DNTLYALYNQ NGERLTEYSY KTFEEVSGIG
     YIVTDANDQK GVISDKGDEI IKPGTYETLE AVDEMLYATK KVEVKKEKKD DKDKKEEKQT
     TPTQTFIKNN LYVLNNKGEV LYSADEKTGI MKSGLPIILK DNTYIVLYHN GEILYNDIQI
     VRYANQYKNS TSVILGLEKN ENYYYFDKQD EKNNIELTIN EKGTFQFLAQ NDKGVVLNDE
     ATKSMIYIDF EHKKYYQNTI AIKEASFDST GNIVLTNDNK TFVYEVGKAP VLMTSYYMSA
     YTYVARSTDI YGPHHIFKDG KSTGDFENCQ LYPVAYHVYY EIFPVYIRDK GYQYYNFDNK
     KVIDKTFLAA EPFDANGRAI VKSKEEGYSL IDETGKVLTK DVYNQIKYIG SSYYAVYNEN
     GTFGILDKDA GEIFPMEYTS LPTEAIVNYD SHDYLILGKN GRSFVYDIED EMKEIFSHEG
     SVTLSEKGYF KVDNQYFTFE GEEIK
//
DBGET integrated database retrieval system