ID R4Z431_9ACTN Unreviewed; 1243 AA.
AC R4Z431;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Tandem-95 repeat protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BN381_70139 {ECO:0000313|EMBL:CCM65440.1};
OS Candidatus Microthrix parvicella RN1.
OC Bacteria; Actinomycetota; Acidimicrobiia; Acidimicrobiales;
OC Microthrixaceae; Microthrix.
OX NCBI_TaxID=1229780 {ECO:0000313|EMBL:CCM65440.1, ECO:0000313|Proteomes:UP000018291};
RN [1] {ECO:0000313|EMBL:CCM65440.1, ECO:0000313|Proteomes:UP000018291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RN1 {ECO:0000313|EMBL:CCM65440.1,
RC ECO:0000313|Proteomes:UP000018291};
RX PubMed=23446830; DOI=10.1038/ismej.2013.6;
RA Jon McIlroy S., Kristiansen R., Albertsen M., Michael Karst S.,
RA Rossetti S., Lund Nielsen J., Tandoi V., James Seviour R., Nielsen P.H.;
RT "Metabolic model for the filamentous 'Candidatus Microthrix parvicella'
RT based on genomic and metagenomic analyses.";
RL ISME J. 7:1161-1172(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCM65440.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CANL01000067; CCM65440.1; -; Genomic_DNA.
DR AlphaFoldDB; R4Z431; -.
DR STRING; 1229780.BN381_70139; -.
DR eggNOG; COG1404; Bacteria.
DR eggNOG; COG2931; Bacteria.
DR eggNOG; COG3291; Bacteria.
DR HOGENOM; CLU_266447_0_0_11; -.
DR OrthoDB; 954626at2; -.
DR Proteomes; UP000018291; Unassembled WGS sequence.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.120.380; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR022409; PKD/Chitinase_dom.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR InterPro; IPR001119; SLH_dom.
DR InterPro; IPR010221; VCBS_dom.
DR NCBIfam; NF012211; tand_rpt_95; 3.
DR NCBIfam; TIGR01965; VCBS_repeat; 4.
DR Pfam; PF17963; Big_9; 4.
DR Pfam; PF18911; PKD_4; 1.
DR Pfam; PF00395; SLH; 3.
DR SMART; SM00089; PKD; 1.
DR SUPFAM; SSF55486; Metalloproteases ('zincins'), catalytic domain; 1.
DR SUPFAM; SSF49299; PKD domain; 1.
DR PROSITE; PS50093; PKD; 1.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018291}.
FT DOMAIN 479..567
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 1064..1123
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1124..1187
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1188..1243
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 797..827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1243 AA; 127557 MW; 7785336AD96A1F06 CRC64;
MVVPRQSGCG RNSGTDPGGS MSPRIYPVAA ALALLATSFV VIPAEAGAQQ GEPVPSAAPT
TGDEAIEALG AGLPQAASDA GLSPAELTDR FTTDPSLQAT PDGILTYDDP APPAPPQART
GETDSAAFVP GSTFNLSSKP GANRTIYLDF LGGTVTDTYW NATYTGGAPI VVPPWDIDGN
SATLSIAEQK IIQTAWHVVA EDFAPWDVNV TTANPGLAAL ERTNAADQVF GQRVMYSNSN
HGMCTGCAGV AIIGAFDATG SYLGPAWAFT DTLYGEPKYY GDVSSHELGH TFTLGHDGTS
GAAYYAGHDD WSPIMGSTAW RPIVQWSKGE YDDANRQEDD LSLIAANGVD LKPDDHGDTL
ATGTPITPAT PVKGLINTSA DIDQFTFDGS GPFTASLSLW DEHPNLDARL RLLNSAGSQV
AASDPQKLGA PSLSVPSLAG GRYSIEVDGV GWGDPLGTGY TDYGSLGAYA LSTSFNQAPI
ANGTITPTTV QYQGTLSYNA SLSSDPDGTI TAYNWNFGDG TTSTSASGTK AYSAPGSYTA
TLKVTDNKGA TGTKTFAVTV NEPEAVDDTA ATTEHGIVSV FVLANDIPSP APSATIAAIT
QPAKGSASIF SMPSKGIRFD AGTDFDGLDT GETEDATFTY TRQQDGYTDT ATVTVTVSGE
NDAPVAADSS KTTDEDTALT FAAPMSDVDV EPLTITKTTQ PSAGSVTISG TNLVFNPGND
FQALRAGQTS QVTFTYTASD GDLTATGTIT VTVTGLNDSP VAVNDTAATP QGTSASLDPT
TNDIDPEGDS LGVASVSAPN AGSATRVSPH EVRFTPSPAD NALDDGESAN RTFDYVVTDG
AASDTGTVTV TVTGVNDRPV AANGSKTTDE DTVLTFAAPM SDVDVEPLTI TKTTQPSAGS
VTISGTNLVF NPGNAFQALG AGQTQQVSFT YTATDGDLTA TGTITITVTG VNDAPVAAND
SYSAVAGQGL TFDPVTGHGA DTDVDTAVST LTVPALPSGG TGTISRVGAT GFTYNPPPAA
NLLRIGTSLI DTFTYTVSDG HLTDTATITF TVTRELPAEC TSAPPHGFTD VTAGDWYDGP
VAWVKAMGIT SGTTPTLYGP HTRTTRAQMA TFLWGLRGRP TGNANPGFSD VPSDAYYAEA
VAYLVGSGVT SGTSATRFSP NAPVTRAQMA TFLWALAGQP ATAASNPFND VPAGKYYTVP
ATWAAEVGIT TGTSPGVFSP DSAVTRAQMA TFLRHYVCTV GYA
//