ID A0A0G3EHW4_9BACT Unreviewed; 982 AA.
AC A0A0G3EHW4;
DT 16-SEP-2015, integrated into UniProtKB/TrEMBL.
DT 16-SEP-2015, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=Trehalose utilization {ECO:0008006|Google:ProtNLM};
DE Flags: Precursor;
GN ORFNames=L21SP4_01145 {ECO:0000313|EMBL:AKJ64395.1};
OS Kiritimatiella glycovorans.
OC Bacteria; Kiritimatiellota; Kiritimatiellia; Kiritimatiellales;
OC Kiritimatiellaceae; Kiritimatiella.
OX NCBI_TaxID=1307763 {ECO:0000313|EMBL:AKJ64395.1, ECO:0000313|Proteomes:UP000035268};
RN [1] {ECO:0000313|Proteomes:UP000035268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=L21-Fru-AB {ECO:0000313|Proteomes:UP000035268};
RA Spring S., Bunk B., Sproer C., Klenk H.-P.;
RT "Description and complete genome sequence of the first cultured
RT representative of the subdivision 5 of the Verrucomicrobia phylum.";
RL Submitted (FEB-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:AKJ64395.1, ECO:0000313|Proteomes:UP000035268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=L21-Fru-AB {ECO:0000313|EMBL:AKJ64395.1,
RC ECO:0000313|Proteomes:UP000035268};
RX PubMed=27300277; DOI=10.1038/ismej.2016.84;
RA Spring S., Bunk B., Sproer C., Schumann P., Rohde M., Tindall B.J.,
RA Klenk H.P.;
RT "Characterization of the first cultured representative of Verrucomicrobia
RT subdivision 5 indicates the proposal of a novel phylum.";
RL ISME J. 10:2801-2816(2016).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP010904; AKJ64395.1; -; Genomic_DNA.
DR RefSeq; WP_052881729.1; NZ_CP010904.1.
DR AlphaFoldDB; A0A0G3EHW4; -.
DR STRING; 1307763.L21SP4_01145; -.
DR KEGG; vbl:L21SP4_01145; -.
DR OrthoDB; 9785923at2; -.
DR Proteomes; UP000035268; Chromosome.
DR Gene3D; 3.40.50.880; -; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR029062; Class_I_gatase-like.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR029010; ThuA-like.
DR PANTHER; PTHR40469:SF2; GALACTOSE-BINDING DOMAIN-LIKE SUPERFAMILY PROTEIN; 1.
DR PANTHER; PTHR40469; SECRETED GLYCOSYL HYDROLASE; 1.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF06283; ThuA; 1.
DR SUPFAM; SSF48371; ARM repeat; 2.
DR SUPFAM; SSF52317; Class I glutamine amidotransferase-like; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000035268};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..982
FT /note="Trehalose utilization"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005184005"
FT DOMAIN 61..283
FT /note="ThuA-like"
FT /evidence="ECO:0000259|Pfam:PF06283"
FT DOMAIN 856..962
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|Pfam:PF00754"
SQ SEQUENCE 982 AA; 108711 MW; 50C061DE8A70CF73 CRC64;
MRMNPRVFFF TALFVAVLTS PPVPSVAAPA PDVLPEVPAD HVAKIRAALP SPAPAPETPR
RILLFWRCEG FYHSAIPWAN RAIQEMGAMN KAWTCAVSKD MAVFTPERLA EYDVVVFNST
TRLQPTDEQL QALLDFVRGG GGIVGIHAAT DNFYSDPEAA QMMGGLFNKH PWHFKGMWSF
VLDDPGHRLN QAFEELTFEA SDEIYQFKDP YSRERVRVLT RVDLSQASNL EVQGRERDDL
DHAITWVRSE GSGRVFYFGF GHNNAIYWNR PLMRHLYDGL RFAAGDLEVD TTPSAQRDDL
DRIAGWAYEQ SRMPFERLRI RWNEADDAGR AKLEDQFTEA LRNTRSTLDG RREICRLLGH
SGSERACAAL AEALRHPDLR DEACIALGVH PSAEADAALV DFLADSGDAH AISVINAAGR
RRVNAAVPQL ARRLASEDEA LVKASSYALA TIASPPAIET LMEAYTAEEN SILEPALLDA
AYRLAEAGSA ENARRLFEGL TGRGSPQSRA AALPGLVSLR GREMIPDLFK ALREGSDPVA
ETAARILPEL LTPSTVRPLA RTLDSLPDDR VPMALEVLAR VAPDETLPIL RSMLDTDEPS
SASMALAIIG RFGEREDLAR CFDWAAHEDE GASGPAREVL SYDHLPGTDK FLLKKLDPDT
APEDTALAIE LLSKREHPEL LDRLRDPAWY DDHITASAAL NALKEHATRD DLGPVIQLFF
AVNNRTAPKL AGVIRKIAQE YKDQKAVLDG YRRALDHARE MHSTARMRIL MQLVDYLDIP
AHLKGEAWMA LIRECEDKAL RLEAIQLLAR SAPSASALDF ISGLHGDADL TAVIERAHRS
IEKALSGPPE LTASHGGGTL KALFTPETED RWTSHQSREP GMWLLIDFRV PRRVGSITLD
ASGSKNDFPN QYEVYTDDEQ EASAAPRLRG EGSTVTKIDL GGVETQFVKI VNQSEAHQWW
SIHDLRIDGE SLSSMKNHDA GK
//