ID F4KWZ2_HALH1 Unreviewed; 1824 AA.
AC F4KWZ2;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 24-JAN-2024, entry version 57.
DE SubName: Full=PKD domain containing protein {ECO:0000313|EMBL:AEE53592.1};
GN OrderedLocusNames=Halhy_5769 {ECO:0000313|EMBL:AEE53592.1};
OS Haliscomenobacter hydrossis (strain ATCC 27775 / DSM 1100 / LMG 10767 / O).
OC Bacteria; Bacteroidota; Saprospiria; Saprospirales; Haliscomenobacteraceae;
OC Haliscomenobacter.
OX NCBI_TaxID=760192 {ECO:0000313|EMBL:AEE53592.1, ECO:0000313|Proteomes:UP000008461};
RN [1] {ECO:0000313|EMBL:AEE53592.1, ECO:0000313|Proteomes:UP000008461}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 27775 / DSM 1100 / LMG 10767 / O
RC {ECO:0000313|Proteomes:UP000008461};
RX PubMed=21886862; DOI=10.4056/sigs.1964579;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Daligault H., Lapidus A., Zeytun A., Nolan M., Lucas S., Del Rio T.G.,
RA Tice H., Cheng J.F., Tapia R., Han C., Goodwin L., Pitluck S., Liolios K.,
RA Pagani I., Ivanova N., Huntemann M., Mavromatis K., Mikhailova N., Pati A.,
RA Chen A., Palaniappan K., Land M., Hauser L., Brambilla E.M., Rohde M.,
RA Verbarg S., Goker M., Bristow J., Eisen J.A., Markowitz V., Hugenholtz P.,
RA Kyrpides N.C., Klenk H.P., Woyke T.;
RT "Complete genome sequence of Haliscomenobacter hydrossis type strain (O).";
RL Stand. Genomic Sci. 4:352-360(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=DSM 1100;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Lucas S., Han J., Lapidus A., Bruce D., Goodwin L., Pitluck S., Peters L.,
RA Kyrpides N., Mavromatis K., Ivanova N., Ovchinnikova G., Pagani I.,
RA Daligault H., Detter J.C., Han C., Land M., Hauser L., Markowitz V.,
RA Cheng J.-F., Hugenholtz P., Woyke T., Wu D., Verbarg S., Frueling A.,
RA Brambilla E., Klenk H.-P., Eisen J.A.;
RT "Complete sequence of chromosome of Haliscomenobacter hydrossis DSM 1100.";
RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP002691; AEE53592.1; -; Genomic_DNA.
DR RefSeq; WP_013768121.1; NC_015510.1.
DR STRING; 760192.Halhy_5769; -.
DR GeneID; 78195380; -.
DR KEGG; hhy:Halhy_5769; -.
DR eggNOG; COG3291; Bacteria.
DR eggNOG; COG4935; Bacteria.
DR HOGENOM; CLU_237529_0_0_10; -.
DR OrthoDB; 9765926at2; -.
DR Proteomes; UP000008461; Chromosome.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.40.740; -; 3.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.60.120.290; Spermadhesin, CUB domain; 1.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002884; P_dom.
DR InterPro; IPR022409; PKD/Chitinase_dom.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR InterPro; IPR035914; Sperma_CUB_dom_sf.
DR InterPro; IPR025667; SprB_repeat.
DR InterPro; IPR026341; T9SS_type_B.
DR NCBIfam; TIGR04131; Bac_Flav_CTERM; 1.
DR Pfam; PF13585; CHU_C; 1.
DR Pfam; PF01483; P_proprotein; 1.
DR Pfam; PF18911; PKD_4; 1.
DR Pfam; PF13573; SprB; 6.
DR SMART; SM00089; PKD; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF49299; PKD domain; 1.
DR PROSITE; PS51829; P_HOMO_B; 1.
DR PROSITE; PS50093; PKD; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000008461};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1824
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003310346"
FT DOMAIN 194..255
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 288..472
FT /note="P/Homo B"
FT /evidence="ECO:0000259|PROSITE:PS51829"
SQ SEQUENCE 1824 AA; 197456 MW; 2A2646BD0522BA03 CRC64;
MVKMIRILVF LPVLCFSTAT FAQFIPLADT TISACGGFLT DSGERNFNYR ANENLVMTIC
PDPTVGSRVR LNFPAINLGR GDRLCFYDGK DTLAPLLLCI DFSLSNRPFS VQSSAANPDS
CLTIAFRSDA QEEAAGWVAN IECIQECQNF TAAVLFSDPE ALPQDTNTIN LCPGGNLFLR
ARGLYTQNNN LYAQADSTST FSWDFGDGVV RTGRTVEHNY PQSGGYTVKL SAIDARGCRS
TNVAEKKIRV APSPRFSADA SLLPEICAGD TLLLVAAPSN VPGATVRVTA QTASFQTDFI
KGENQFIPDN DGRKLESSLA VDGFFVGQNL SEVDNLKGIC INIEHSQMRD LNIALRCPNG
SRVILQNFNG PGGNLVELGQ ANPNDDASPR PGRGYEYCWK TPVSNNNWSD YVDSFNLDSL
PIGEYRPFQS LSQLLGCPLN GEWVLELEDL NSSENGFIFS WSLQFDPVVF PKVETFNPGV
ADLIWRFHPT ILFQDDNSIS VSPSEGGFAS YRLIVQDSFQ CNFDTTITLR VKPQNDSTCG
TCLLTFDKLA DTSLCQGDSL QLSFLPNERL NQVLGFTSFP QYRFNFVQHP PGTPYESIIP
VSNIAQGVLT DPRNQIESVC IDLNSDWNSD LDIFLRAPSG ELLELSTGNG GSDDNFISTC
FTARASDPIN AGMAPFTGDW LPEGSWSDLQ GATIEGDWAL LITDAFGTLP QEINELLSWS
IRFRSVDTLT YEWSPAAGLS CTDCPNPVAL VPNNTRINVK AQSSYGCTFE DDFLVVVRDT
FAAPQVSCAQ ITARSIRFSW PATNARQYSI RFAINGRDSL LPIPISDTFY TLRNLMPDDE
VRIGVQAFSP DSLNFCRSGI GVASCRIMPC SLSVNLRSTK NISCTTASDG AFDFDISGGI
GTINYELTGP IGLIQPFRSD GLDFGTYQLI ALDEGLCRDT LDFNIGQNDS LILTLTLDRA
LRCAGDRNAI ISSSIAGGSG PFRYTWNNGL PSSGLSNIGA GTYALSISDV NGCRGTQRIT
ISEPDSIVLA MIPLDITCFG ASDGRLTAMS FGGMGNLSYR WSNGAVTNNL TNLRANNYCV
TVTDANGCVV SDCRDIKAPI QIKLDSTRIR QPSCANRSDG QATVFVSGGT GALSYQWNDP
QGQINRSANS LAAGQYQVMV SDSNNCSITQ SISIVAPTAL SLQINARSIK CKGGNDGQAS
IKVSGGTLPY RYTWETVNNN DTLASLLSAG SYGITVTDLN NCTVEGRAII TEPANGLTLS
VEQTRQGCYG LKQNMARALP IGGSGSPYTY LWNNGQTSQE LSGLDSLNYS LTVTDATGCT
QAGLIKLQDL PAMEPNTIIS QPSCFGGSNG AIGINLIVGR PNADLDQYRF RWSNGETGQI
IRGLRGEETY SVTITDPQGC IATGTRTVRQ PRAITFDIAG DTLSCSGGNT ATATVRNILA
DTRNFTFLWD SRARNQNTQR ATGLSAGIYT VTVTDDFGCF GIGAVQIREP SPVQVLVQAF
GPVCFGDTSG RAAISPSGGT PGYRYNWSNG ATTAFIGNIP GGLYRVTVSD AKGCSSVNNV
NIPIPNPINI NLRTQDPTCN GDLDGLITTA PSGGKAPYLL SINQSNYRPI LSAIGLKAGK
YDVFVKDANG CITIEQAVIN EKPALLIDLN QTSYTIQLGD TIQLNAEVIN NQGNVRYTWE
EPDAGTLSCL DCPRPTVRTQ NGIDYELLAI DEKGCEASAK VRVFVRKQRI VLVPTGFSPN
GDQNNDLLLV HGQNGVKIKS FKVYDRWGEE VYSRENFDVN DTTTGWDGSY RNQVLNPGIY
LWFLEAVYPD GYSEIKRGQT TLLR
//