ID C4KBY5_THASP Unreviewed; 768 AA.
AC C4KBY5;
DT 07-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 07-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Glycosyl transferase group 1 {ECO:0000313|EMBL:ACR01866.1};
GN OrderedLocusNames=Tmz1t_3271 {ECO:0000313|EMBL:ACR01866.1};
OS Thauera aminoaromatica.
OC Bacteria; Pseudomonadota; Betaproteobacteria; Rhodocyclales; Zoogloeaceae;
OC Thauera.
OX NCBI_TaxID=164330 {ECO:0000313|EMBL:ACR01866.1, ECO:0000313|Proteomes:UP000002186};
RN [1] {ECO:0000313|Proteomes:UP000002186}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MZ1T {ECO:0000313|Proteomes:UP000002186};
RG US DOE Joint Genome Institute;
RA Lucas S., Copeland A., Lapidus A., Glavina del Rio T., Dalin E., Tice H.,
RA Bruce D., Goodwin L., Pitluck S., Sims D., Brettin T., Detter J.C., Han C.,
RA Larimer F., Land M., Hauser L., Kyrpides N., Mikhailova N., Sayler G.S.;
RT "Complete sequence of chromosome of Thauera sp. MZ1T.";
RL Submitted (MAY-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ACR01866.1, ECO:0000313|Proteomes:UP000002186}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MZ1T {ECO:0000313|EMBL:ACR01866.1,
RC ECO:0000313|Proteomes:UP000002186};
RX PubMed=23407619; DOI=10.4056/sigs.2696029;
RA Jiang K., Sanseverino J., Chauhan A., Lucas S., Copeland A., Lapidus A.,
RA Del Rio T.G., Dalin E., Tice H., Bruce D., Goodwin L., Pitluck S., Sims D.,
RA Brettin T., Detter J.C., Han C., Chang Y.J., Larimer F., Land M.,
RA Hauser L., Kyrpides N.C., Mikhailova N., Moser S., Jegier P., Close D.,
RA Debruyn J.M., Wang Y., Layton A.C., Allen M.S., Sayler G.S.;
RT "Complete genome sequence of Thauera aminoaromatica strain MZ1T.";
RL Stand. Genomic Sci. 6:325-335(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP001281; ACR01866.1; -; Genomic_DNA.
DR AlphaFoldDB; C4KBY5; -.
DR STRING; 85643.Tmz1t_3271; -.
DR CAZy; GT4; Glycosyltransferase Family 4.
DR KEGG; tmz:Tmz1t_3271; -.
DR eggNOG; COG0297; Bacteria.
DR eggNOG; COG0438; Bacteria.
DR HOGENOM; CLU_363666_0_0_4; -.
DR Proteomes; UP000002186; Chromosome.
DR GO; GO:0016757; F:glycosyltransferase activity; IEA:InterPro.
DR GO; GO:1901576; P:organic substance biosynthetic process; IEA:UniProt.
DR CDD; cd03801; GT4_PimA-like; 1.
DR CDD; cd03794; GT4_WbuB-like; 1.
DR Gene3D; 3.40.50.2000; Glycogen Phosphorylase B; 4.
DR InterPro; IPR001296; Glyco_trans_1.
DR InterPro; IPR028098; Glyco_trans_4-like_N.
DR PANTHER; PTHR45947; SULFOQUINOVOSYL TRANSFERASE SQD2; 1.
DR PANTHER; PTHR45947:SF3; SULFOQUINOVOSYL TRANSFERASE SQD2; 1.
DR Pfam; PF13579; Glyco_trans_4_4; 2.
DR Pfam; PF00534; Glycos_transf_1; 2.
DR SUPFAM; SSF53756; UDP-Glycosyltransferase/glycogen phosphorylase; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002186};
KW Transferase {ECO:0000313|EMBL:ACR01866.1}.
FT DOMAIN 15..188
FT /note="Glycosyltransferase subfamily 4-like N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13579"
FT DOMAIN 205..277
FT /note="Glycosyl transferase family 1"
FT /evidence="ECO:0000259|Pfam:PF00534"
FT DOMAIN 422..563
FT /note="Glycosyltransferase subfamily 4-like N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13579"
FT DOMAIN 585..737
FT /note="Glycosyl transferase family 1"
FT /evidence="ECO:0000259|Pfam:PF00534"
FT REGION 302..375
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 331..345
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 768 AA; 83018 MW; B96F8F76CD631AEC CRC64;
MRILHILDHS IPLHSGYTFR TAAILREQRA LGWETFHLTS PKQGETKAMV EEIEGLRFHR
TGVPTPASSG IAELRQIRAV QARIEQLAAE LRPDILHAHS PVLNAIPAIR AGRKLGIPVV
YEIRAFWEDA AVDHGTTTEG SLRYRATKAL ETWAIKRADH VFTICEGLRA DIVGRGIPAA
KVTVIPNAVD IESFQLSGDA DPALREQLGL AGTTVVGFVG SFYAYEGLDL LLEAFPALLQ
KRPELRLLLV GGGPQDENLK AQALRLGVAD KVVFTGPRAA QGRQPLLRPD RPARLSAPLH
ATHRARHPAQ AARGHGAGSA LRRLRRRRPQ GADPRRRDRQ AVQGRQRRGA RRRHRRPARA
PRALAGDARG GAAVRRGRAQ LDEQRGELHA GVSQPGREAS RCGMNFSALR VLLVGPLPPP
AGGMANQTRQ LAELLRGEGA SVEVVQVNAP YRPAWVERLK GVRALFRLLP YLLRLWQAAG
RANLMHVMAN SGWSWHLFAA PAVWIGWLRG VPVVVNYRGG EAAGFLARSA AVVRATLRRA
SSLVLPSGFL LEVFARHGMA GRIVSNIVDL ERFHPATAPR AAGEGPHLVV ARNLEALYGN
DTALRAFARL CAAWPAARLS IAGSGPEAAA LAALAQELGI AERVRFTGRL DGEQMAALYR
DADLMLNPSR VDNMPNAILE ALASGLPVVT TDVGGIPFIV TQGHTAMLVP PDDPQAMADA
AHKVLADTGL RAALVAAGCA EVQRYRWSSV RAQLLSAYAD AIAAKRAT
//