ID A0A1F8S4X2_9CHLR Unreviewed; 1241 AA.
AC A0A1F8S4X2;
DT 15-FEB-2017, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2017, sequence version 1.
DT 28-JUN-2023, entry version 19.
DE RecName: Full=PKD domain-containing protein {ECO:0000259|PROSITE:PS50093};
GN ORFNames=A2V85_11230 {ECO:0000313|EMBL:OGO54947.1};
OS Chloroflexi bacterium RBG_16_72_14.
OC Bacteria; Chloroflexota.
OX NCBI_TaxID=1797663 {ECO:0000313|EMBL:OGO54947.1, ECO:0000313|Proteomes:UP000176218};
RN [1] {ECO:0000313|EMBL:OGO54947.1, ECO:0000313|Proteomes:UP000176218}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=27774985; DOI=10.1038/ncomms13219;
RA Anantharaman K., Brown C.T., Hug L.A., Sharon I., Castelle C.J.,
RA Probst A.J., Thomas B.C., Singh A., Wilkins M.J., Karaoz U., Brodie E.L.,
RA Williams K.H., Hubbard S.S., Banfield J.F.;
RT "Thousands of microbial genomes shed light on interconnected biogeochemical
RT processes in an aquifer system.";
RL Nat. Commun. 7:13219-13219(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OGO54947.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MGOE01000121; OGO54947.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1F8S4X2; -.
DR STRING; 1797663.A2V85_11230; -.
DR Proteomes; UP000176218; Unassembled WGS sequence.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 2.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR012938; Glc/Sorbosone_DH.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR006558; LamG-like.
DR InterPro; IPR022409; PKD/Chitinase_dom.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR InterPro; IPR011041; Quinoprot_gluc/sorb_DH.
DR PANTHER; PTHR19328:SF29; CALX-BETA DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR19328; HEDGEHOG-INTERACTING PROTEIN; 1.
DR Pfam; PF07995; GSDH; 2.
DR Pfam; PF13385; Laminin_G_3; 2.
DR Pfam; PF18911; PKD_4; 1.
DR SMART; SM00560; LamGL; 1.
DR SMART; SM00089; PKD; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF49299; PKD domain; 1.
DR SUPFAM; SSF50952; Soluble quinoprotein glucose dehydrogenase; 1.
DR PROSITE; PS50093; PKD; 1.
PE 4: Predicted;
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..40
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 41..1241
FT /note="PKD domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5009538398"
FT DOMAIN 745..829
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT REGION 39..69
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1241 AA; 126512 MW; 4921EBEEE979D006 CRC64;
MLRQPAAVPA TGPDRRVRRR LLAVLAAAGL LATSGSWAVA ADPSPGPSPA GGGTPFLREG
AAGGTSSLQV PDEGVAQAAV APGFTETAVI TGLTFPTNVR FAADGRVFVA EKSGLIKVFS
SLADTTPTVF ADLRTAVDDY WDRGLLGLEL APNFPADPYV YAAYTYDAPI GGTAPVWNDA
CPSPPGPTTD GCVVSARVIR MQASGDTMAS QQVLINDWCQ QFPSHSIGDL RFGPDGALYV
SGGDGASFTT VDTGQLGGSQ GSPTPVNPCT DPVNEGGALR SQDLRTMPST GGQTASYRTT
VLDDAPVVYW RLGETTTIIV GDEVGSLTGW YGGTSTKGVP GAIAGDANGA VSLDGSTGYV
GVPDYASLDL ANGPFSIELW VKRKTTGGVQ SVIDRGPGSY QVYFATDGRL TVGRNGGGTL
ARESGATTDT TAFHHFVITK SGTTTRVYKD GADVTAVGTD LTLANTSTNL WLGRWNDGTA
FANIVLDDVS IYASALSAAR VLAHYQAGIG GGGGGTPDPV TLDGTLIRVD PATGDALATN
PNAASPDANA RRIVAQGTRN PFRFTFRPGT GELWVGDVGW GTFEEIDRIV SPTAGVMNFG
WPCYEGAGQQ SGYAGTSICA GLYADGTAPA TAPYYAYDHA AKVVAGETCP TGSSAIAGLA
FYAGTSYPTE YRGALFFADN SRDCLWAMLP GANGLPDPGN IRTILAPAAN PVAVVAGPGG
DLFYVDFDGG SIRRIAYPGA GNQAPTAAMT ATPSSGPAPL DVAFSGAGSS DPEGGALTYA
WDLDGDGAYD DATGVTASWT YTVAGSVTAG LRVTDPQAAT GTTSTLISVG SAPNTPPVPV
IDTPAAGLTW AVGDTIGFTG HATDAEDGTL AAASLSWQLV LQHCPSNCHS HQVQTFAGVA
AGSFTAPDHD YPSHLDLVLT ATDAAGASAS TTLSLDPRTV TLSFRTEPTG LALVASGMQQ
PTPFALTAIE GGTVSVGAPS PQTMGGVTYA FDSWSDGGAA GHDVVAGADM TLTATYSMTT
PPVSSYPDTV VADGPVAYWR LGETAGIVAN DTVGTRSGWY GGTLTRGVTG ALAADDDGAV
SLDGASGYVG VPSSSAPRLG NGPLSVEFWV KRKSATGTHP VIDAGPGAYQ ITFSSSTGKL
TVSRNGGGII VAESTATTDT TTWHHYVFTK DGSAVKLYRD GVDVTGAVTN RTLANATKNF
WIGRWDDGTR YGNVVVDEVA LYATVLTPAE VSAHFVAGTG G
//