ID S0GJD2_9BACT Unreviewed; 798 AA.
AC S0GJD2;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=Capsule biosynthesis protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=C803_04543 {ECO:0000313|EMBL:EOS14402.1};
OS Parabacteroides goldsteinii dnLKV18.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Tannerellaceae;
OC Parabacteroides.
OX NCBI_TaxID=1235789 {ECO:0000313|EMBL:EOS14402.1, ECO:0000313|Proteomes:UP000014140};
RN [1] {ECO:0000313|EMBL:EOS14402.1, ECO:0000313|Proteomes:UP000014140}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=dnLKV18 {ECO:0000313|Proteomes:UP000014140};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Kuhn K., Stappenbeck T., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Parabacteroides goldsteinii dnLKV18.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS14402.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASSQ01000021; EOS14402.1; -; Genomic_DNA.
DR RefSeq; WP_010800051.1; NZ_KE159521.1.
DR AlphaFoldDB; S0GJD2; -.
DR PATRIC; fig|1235789.3.peg.4561; -.
DR HOGENOM; CLU_011447_1_0_10; -.
DR Proteomes; UP000014140; Unassembled WGS sequence.
DR GO; GO:0015159; F:polysaccharide transmembrane transporter activity; IEA:InterPro.
DR Gene3D; 3.10.560.10; Outer membrane lipoprotein wza domain like; 6.
DR Gene3D; 3.30.1950.10; wza like domain; 1.
DR InterPro; IPR049712; Poly_export.
DR InterPro; IPR003715; Poly_export_N.
DR InterPro; IPR019554; Soluble_ligand-bd.
DR PANTHER; PTHR33619; POLYSACCHARIDE EXPORT PROTEIN GFCE-RELATED; 1.
DR Pfam; PF02563; Poly_export; 1.
DR Pfam; PF10531; SLBB; 6.
PE 4: Predicted;
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..798
FT /note="Capsule biosynthesis protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005710847"
FT DOMAIN 120..183
FT /note="Polysaccharide export protein N-terminal"
FT /evidence="ECO:0000259|Pfam:PF02563"
FT DOMAIN 210..255
FT /note="Soluble ligand binding"
FT /evidence="ECO:0000259|Pfam:PF10531"
FT DOMAIN 296..345
FT /note="Soluble ligand binding"
FT /evidence="ECO:0000259|Pfam:PF10531"
FT DOMAIN 378..424
FT /note="Soluble ligand binding"
FT /evidence="ECO:0000259|Pfam:PF10531"
FT DOMAIN 470..515
FT /note="Soluble ligand binding"
FT /evidence="ECO:0000259|Pfam:PF10531"
FT DOMAIN 573..626
FT /note="Soluble ligand binding"
FT /evidence="ECO:0000259|Pfam:PF10531"
FT DOMAIN 690..737
FT /note="Soluble ligand binding"
FT /evidence="ECO:0000259|Pfam:PF10531"
SQ SEQUENCE 798 AA; 87308 MW; 6FD9EAD41A964611 CRC64;
MNIVIRNVLL GGLLWLAVPL SAQVSQDLID KAKAAGMTDD QIRQEINKRM GQSGAEQTTR
TASDAVVTDR TVAIPDGKVI PSLDAQREAN RPAGDLSGTV FGREIFSNKN LSFEPDLNVP
TPKGYVLSAG DELLINVWGD SELNLKLKVS PEGTILIPNL GPVSVSGLTI GAAENRIRQE
LGRIMSTLSG DTDGANTFVS VSLSQIRSIK VNIVGEVVAP GTYTLPSFAT LFNALYAAGG
VNEIGSLRGI KVYRNSKEVA SLDVYDYLLN GKYNTNIRLE ENDMVIVSPY DQLAVVQGKV
KRNRIFELKK GETLKQLLNM AGGFTGDAYR KDVRVKRKAG SRYQIATVTE DKYPTFAMMD
GDSLLVDSVI PFYENRLTVT GAVWRPGEYE LNGTVHTVRQ LVDQAAGLKG DEFAGRAQIT
RLNPDFTTTV IAVDIRGILN GTAPDMELKP EDQLYIPSFF DLREPYTIKV SGAVNYIDTV
LPYRNNLTVE DAIMMAGGLK ESAATVNVEV ARRIKDTKTY ENTNRTAEVF NFELNDNLGL
IPVNGKNSDT VFTLEPFDEV YVRFSPGYQE QQVVKVNGEI TFAGDYVLAE KNSRLSDIIA
KAGGITPDAY VKGASLKRQL TEDEMRRLET LLQLSANKQS RDSVALSLEN IKDYSVGIDL
EKALANPGSA HDVVLRDGDE LYIPQFQSTV KINGAVTYPN SVTYTNGMSV GDCLSQAGGY
NDIARKYPIV IYMNGKVATT KKSFIFFKRY PKVEPGCEIV VPTKTQRDRK TSLAEVLSIA
SSTTSMAAMV TSIINTLK
//