ID A0A2S6NMP5_RHOGL Unreviewed; 1532 AA.
AC A0A2S6NMP5;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Glycosyltransferase 2-like domain-containing protein {ECO:0000259|Pfam:PF00535};
GN ORFNames=CCS01_03295 {ECO:0000313|EMBL:PPQ37663.1};
OS Rhodopila globiformis (Rhodopseudomonas globiformis).
OC Bacteria; Pseudomonadota; Alphaproteobacteria; Rhodospirillales;
OC Acetobacteraceae; Rhodopila.
OX NCBI_TaxID=1071 {ECO:0000313|EMBL:PPQ37663.1, ECO:0000313|Proteomes:UP000239724};
RN [1] {ECO:0000313|EMBL:PPQ37663.1, ECO:0000313|Proteomes:UP000239724}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 161 {ECO:0000313|EMBL:PPQ37663.1,
RC ECO:0000313|Proteomes:UP000239724};
RX PubMed=29423563;
RA Imhoff J.F., Rahn T., Kunzel S., Neulinger S.C.;
RT "New insights into the metabolic potential of the phototrophic purple
RT bacterium Rhodopila globiformis DSM 161(T) from its draft genome sequence
RT and evidence for a vanadium-dependent nitrogenase.";
RL Arch. Microbiol. 0:0-0(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PPQ37663.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NHRY01000046; PPQ37663.1; -; Genomic_DNA.
DR OrthoDB; 9783791at2; -.
DR Proteomes; UP000239724; Unassembled WGS sequence.
DR GO; GO:0016757; F:glycosyltransferase activity; IEA:InterPro.
DR CDD; cd03789; GT9_LPS_heptosyltransferase; 1.
DR Gene3D; 3.40.50.2000; Glycogen Phosphorylase B; 2.
DR InterPro; IPR001173; Glyco_trans_2-like.
DR InterPro; IPR002201; Glyco_trans_9.
DR InterPro; IPR029044; Nucleotide-diphossugar_trans.
DR PANTHER; PTHR30160; TETRAACYLDISACCHARIDE 4'-KINASE-RELATED; 1.
DR Pfam; PF01075; Glyco_transf_9; 1.
DR Pfam; PF00535; Glycos_transf_2; 1.
DR SUPFAM; SSF53448; Nucleotide-diphospho-sugar transferases; 1.
DR SUPFAM; SSF53756; UDP-Glycosyltransferase/glycogen phosphorylase; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000239724};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 874..1049
FT /note="Glycosyltransferase 2-like"
FT /evidence="ECO:0000259|Pfam:PF00535"
FT REGION 1..31
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..19
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1532 AA; 165635 MW; 7BED17CE2B5C0D71 CRC64;
MKRQASVTKP AEPRRRVARR AGGAVAAAPM KDAPPKPVVL IEIDPALDGS VLHNRFDIMI
RGRAIADVPI IEIRLEVGEQ VLATAWYGQS GSAGAAVLPG GRPARQRAFQ FHLPQLPPGA
AAPCAFRIVA RTEPGFEARE DFTLSLDTQA TPAVSLVSGR SRAGLSVIAV RASVVMHVER
AVIDPGGGLS VHGWAVAVVP ILAVQAYADG DLVGQAALDA ERQDVAAAFA AYPDAGRSGF
VLVVPLRRQD RQVDAVRVRV VCPDGFGHDA IVPVERSPRQ LALAGAGMAA AAGETMDAPA
AVPMGQQPGY HVSTGLHFAH GPAPGFALAP PVDPSAWLPD PLAPEAGRFA AGQPTGIELY
CDEVQLSADG TLRVNGWAVC GAGITQVRVL LDDEDVGLAA YGYDRPDVGQ VFDAVPMSHL
SGFKFDHKVR DGFSGEYRVR IVVSNTRNQQ REKSVIGVVP PSGAWSLAAA PLAPAPGQAT
DFRFEIDSPA VTHGVAAETI TGRLTIDGWL LTRAGVSAFE VFLDDQRLGE AHFGLARQDV
GAAFPDWPTA LRSGFAFHCP PRSLRDGEHT VSLKVRTDDG QAISRSFRIT VRKSGDEIEQ
NGIRRRVPRA ESDMLLAFLD SMRYRPAFRF ILRPDGGTEP LRATLSALRT QAYADWTMLV
LAEDADAALA VRAVIDDEAP HLAERVSFRL RSDDAWDEPL ARGRDGRDVL YMLLSPGDEP
GADALLELAV AAGRNPGADL LYADEVRRSP VSGQLEPFLK PDFSPDLMLS TNYVGRPWVA
TALLLERTGV SAAGLVADGD YDLALRCAER AAGVHHSPRL LCQRAAMELD SAAQEQAALQ
RALERRGIHG EVGATPIHGT WRVRRRVAAL GKVSIIIPTC AANGHIETVI TSLRAKTAYR
NFEIIAIDNI PESNAAWKAW LALHADKVVS IPGSFNWSTF NNRATAAAEG DYFLFMNDDM
EVTHEDWLDA MLEHAQRPDV GVVGPQLLYG DGKVQHAGMF LSNNGIGRHA FRFATHDDPG
YFGLALTQRN VIAVTGACML VRRAVFERLG GFDESHQITN NDLDFCLRAH RAGLLAVFTP
YASLIHYELA SRAGLHDVFD LAQFNAAWKT AFAAGDPYFN PHLSRHADDY RPDDEPVQWI
VPGAPLFHRQ EIQRILVVKL DHIGDFVTAL PAVRRLKAVF PQARLTVLAG PASRALAALE
PGIDEFIPFE FFHARSQLGE RAVTGDDYQD LARRLRPYRF DLAIDLRKHP STRDVLRHTG
ARFLAGFDHQ GQYPFLDIAL DWDGDRSLQR KRSHVVDDLV ALVDAIGRAG EADRAVMQPP
APMPLAELPR AVRALFRRRV VAVHPGAGNV TKQWPEAHFS ALIDLLIERH DVAILLIGGP
DETDIADRLL AQGLHPERMA SMAGRTGLAD LPRLLGNCVL YIGNDSGPKH IASAIGIATI
GIHSGVVDPV EWGPIGVNAV ALRRAMTCSP CYLANAADCP RALACLRFLE PSLVYEAASL
MLARPVETVD RTSVAPLNRA GPRTKKRSGA SA
//