GenomeNet

Database: UniProt
Entry: S0GJD2_9BACT
LinkDB: S0GJD2_9BACT
Original site: S0GJD2_9BACT 
ID   S0GJD2_9BACT            Unreviewed;       798 AA.
AC   S0GJD2;
DT   18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT   18-SEP-2013, sequence version 1.
DT   27-MAR-2024, entry version 30.
DE   RecName: Full=Capsule biosynthesis protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=C803_04543 {ECO:0000313|EMBL:EOS14402.1};
OS   Parabacteroides goldsteinii dnLKV18.
OC   Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Tannerellaceae;
OC   Parabacteroides.
OX   NCBI_TaxID=1235789 {ECO:0000313|EMBL:EOS14402.1, ECO:0000313|Proteomes:UP000014140};
RN   [1] {ECO:0000313|EMBL:EOS14402.1, ECO:0000313|Proteomes:UP000014140}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=dnLKV18 {ECO:0000313|Proteomes:UP000014140};
RG   The Broad Institute Genomics Platform;
RG   The Broad Institute Genome Sequencing Center for Infectious Disease;
RA   Earl A., Xavier R., Kuhn K., Stappenbeck T., Walker B., Young S., Zeng Q.,
RA   Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Parabacteroides goldsteinii dnLKV18.";
RL   Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EOS14402.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ASSQ01000021; EOS14402.1; -; Genomic_DNA.
DR   RefSeq; WP_010800051.1; NZ_KE159521.1.
DR   AlphaFoldDB; S0GJD2; -.
DR   PATRIC; fig|1235789.3.peg.4561; -.
DR   HOGENOM; CLU_011447_1_0_10; -.
DR   Proteomes; UP000014140; Unassembled WGS sequence.
DR   GO; GO:0015159; F:polysaccharide transmembrane transporter activity; IEA:InterPro.
DR   Gene3D; 3.10.560.10; Outer membrane lipoprotein wza domain like; 6.
DR   Gene3D; 3.30.1950.10; wza like domain; 1.
DR   InterPro; IPR049712; Poly_export.
DR   InterPro; IPR003715; Poly_export_N.
DR   InterPro; IPR019554; Soluble_ligand-bd.
DR   PANTHER; PTHR33619; POLYSACCHARIDE EXPORT PROTEIN GFCE-RELATED; 1.
DR   Pfam; PF02563; Poly_export; 1.
DR   Pfam; PF10531; SLBB; 6.
PE   4: Predicted;
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..798
FT                   /note="Capsule biosynthesis protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5005710847"
FT   DOMAIN          120..183
FT                   /note="Polysaccharide export protein N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF02563"
FT   DOMAIN          210..255
FT                   /note="Soluble ligand binding"
FT                   /evidence="ECO:0000259|Pfam:PF10531"
FT   DOMAIN          296..345
FT                   /note="Soluble ligand binding"
FT                   /evidence="ECO:0000259|Pfam:PF10531"
FT   DOMAIN          378..424
FT                   /note="Soluble ligand binding"
FT                   /evidence="ECO:0000259|Pfam:PF10531"
FT   DOMAIN          470..515
FT                   /note="Soluble ligand binding"
FT                   /evidence="ECO:0000259|Pfam:PF10531"
FT   DOMAIN          573..626
FT                   /note="Soluble ligand binding"
FT                   /evidence="ECO:0000259|Pfam:PF10531"
FT   DOMAIN          690..737
FT                   /note="Soluble ligand binding"
FT                   /evidence="ECO:0000259|Pfam:PF10531"
SQ   SEQUENCE   798 AA;  87308 MW;  6FD9EAD41A964611 CRC64;
     MNIVIRNVLL GGLLWLAVPL SAQVSQDLID KAKAAGMTDD QIRQEINKRM GQSGAEQTTR
     TASDAVVTDR TVAIPDGKVI PSLDAQREAN RPAGDLSGTV FGREIFSNKN LSFEPDLNVP
     TPKGYVLSAG DELLINVWGD SELNLKLKVS PEGTILIPNL GPVSVSGLTI GAAENRIRQE
     LGRIMSTLSG DTDGANTFVS VSLSQIRSIK VNIVGEVVAP GTYTLPSFAT LFNALYAAGG
     VNEIGSLRGI KVYRNSKEVA SLDVYDYLLN GKYNTNIRLE ENDMVIVSPY DQLAVVQGKV
     KRNRIFELKK GETLKQLLNM AGGFTGDAYR KDVRVKRKAG SRYQIATVTE DKYPTFAMMD
     GDSLLVDSVI PFYENRLTVT GAVWRPGEYE LNGTVHTVRQ LVDQAAGLKG DEFAGRAQIT
     RLNPDFTTTV IAVDIRGILN GTAPDMELKP EDQLYIPSFF DLREPYTIKV SGAVNYIDTV
     LPYRNNLTVE DAIMMAGGLK ESAATVNVEV ARRIKDTKTY ENTNRTAEVF NFELNDNLGL
     IPVNGKNSDT VFTLEPFDEV YVRFSPGYQE QQVVKVNGEI TFAGDYVLAE KNSRLSDIIA
     KAGGITPDAY VKGASLKRQL TEDEMRRLET LLQLSANKQS RDSVALSLEN IKDYSVGIDL
     EKALANPGSA HDVVLRDGDE LYIPQFQSTV KINGAVTYPN SVTYTNGMSV GDCLSQAGGY
     NDIARKYPIV IYMNGKVATT KKSFIFFKRY PKVEPGCEIV VPTKTQRDRK TSLAEVLSIA
     SSTTSMAAMV TSIINTLK
//
DBGET integrated database retrieval system