ID A0A1G0Y5K5_9BACT Unreviewed; 926 AA.
AC A0A1G0Y5K5;
DT 15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2017, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Malectin domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A2X46_07460 {ECO:0000313|EMBL:OGV41179.1};
OS Lentisphaerae bacterium GWF2_57_35.
OC Bacteria; Lentisphaerota.
OX NCBI_TaxID=1798576 {ECO:0000313|EMBL:OGV41179.1, ECO:0000313|Proteomes:UP000178598};
RN [1] {ECO:0000313|EMBL:OGV41179.1, ECO:0000313|Proteomes:UP000178598}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=27774985; DOI=10.1038/ncomms13219;
RA Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U., Brodie E.L.,
RA Williams K.H., Hubbard S.S., Banfield J.F.;
RT "Thousands of microbial genomes shed light on interconnected biogeochemical
RT processes in an aquifer system.";
RL Nat. Commun. 7:13219-13219(2016).
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004115}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004115}.
CC -!- SIMILARITY: Belongs to the malectin family.
CC {ECO:0000256|ARBA:ARBA00009141}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OGV41179.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MHBK01000068; OGV41179.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1G0Y5K5; -.
DR STRING; 1798576.A2X46_07460; -.
DR Proteomes; UP000178598; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.1220; -; 1.
DR Gene3D; 2.60.40.420; Cupredoxins - blue copper proteins; 1.
DR Gene3D; 2.60.120.430; Galactose-binding lectin; 2.
DR InterPro; IPR024361; BACON.
DR InterPro; IPR014755; Cu-Rt/internalin_Ig-like.
DR InterPro; IPR008972; Cupredoxin.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR021720; Malectin_dom.
DR InterPro; IPR039155; MLEC.
DR PANTHER; PTHR13460:SF0; MALECTIN; 1.
DR PANTHER; PTHR13460; UNCHARACTERIZED; 1.
DR Pfam; PF19190; BACON_2; 1.
DR Pfam; PF11721; Malectin; 2.
DR SUPFAM; SSF49503; Cupredoxins; 2.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..926
FT /note="Malectin domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5009570049"
FT DOMAIN 432..500
FT /note="BACON"
FT /evidence="ECO:0000259|Pfam:PF19190"
FT DOMAIN 524..655
FT /note="Malectin"
FT /evidence="ECO:0000259|Pfam:PF11721"
FT DOMAIN 783..901
FT /note="Malectin"
FT /evidence="ECO:0000259|Pfam:PF11721"
SQ SEQUENCE 926 AA; 96834 MW; 0D2A2F5D2879BCB6 CRC64;
MKTAMKRAMT GALALSACVL TTTSSWGVQT VNLRAETFTK TLTLPGGGVT NVLMWGFAQD
AGAPTAPGPA ITVPPGETEL VINLVNNLSD PISIVIPGQE GFERDAPHST FTDLEGRTRA
RSFVKETLGG GGTGTYRWTG LRPGTYLYHS GSHAALQVQM GLYGLLKQDA VAGQAYGIPY
ASETSWIFGE IDFDVHDAVQ AGTYGTTVKS MIHSIPEIYL LNGEPFQVGE LALVAGQTQL
LRLINACFDE RIPILNGQYA TLIAEDGRAY PYPKQENAIN LPSLKTRDAL LAVDQPGVVK
FYDLRLRAVA SFPVVLDVTP PTIVSATAID ATTVEVLFSE PVELVSAQTV GNYAIDRGVT
VSSAVLGADT RTVTLTTSTL TGGDYILTVN NVTDRAVPPN AILPDSQAAF AFTPPPAILS
AAPLALAPAS QAGVNAANQS FEVWNSGAGT LTYTIMENVT WLSVAPASGT STGEHDTINV
TYNTAALAAG VYNTVITVTA PPPATNSPQT IAVTLTVTPP PSTALRINSG GNSVSNWAAD
TGFLNGSLFS TTRAITNAGS VPQEVYQTER WGNPVDYSFS SVADGMYTVK LHFAEIFATT
PGARVFSVDL EGQRVLTNLD IFAQAGGRDR ALVRTFVVTV AGGNGLQIHA QATVNNAKFS
GIEVIPAGTP PPAIVVDRSA VTVPEGGATN FLVRLAQTPL QPTTVTVSRI GGDADIAVVA
GSPATFNATN WNIGQTIVLA AAQDADTLNG EALIQCAAFG YSSALVTATE ADNDVLPPTA
RLINCGGGAV SNWTSDAGLF TGGSPFSTTR TIANIDGAPM QVYQTERWGS PLNYSIPGVS
NGAYVVNLHF AEIYATMPGS RVFNVDIEGQ RVLTDFDIFV EAGGQDRALI KTFNVTVTGG
NGLQISGQAT VNNAKFSGIE IIPVVP
//