ID R4Z0G3_9ACTN Unreviewed; 1260 AA.
AC R4Z0G3;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Nephrocystin-3 {ECO:0000256|ARBA:ARBA00040387};
GN ORFNames=BN381_390021 {ECO:0000313|EMBL:CCM64384.1};
OS Candidatus Microthrix parvicella RN1.
OC Bacteria; Actinomycetota; Acidimicrobiia; Acidimicrobiales;
OC Microthrixaceae; Microthrix.
OX NCBI_TaxID=1229780 {ECO:0000313|EMBL:CCM64384.1, ECO:0000313|Proteomes:UP000018291};
RN [1] {ECO:0000313|EMBL:CCM64384.1, ECO:0000313|Proteomes:UP000018291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RN1 {ECO:0000313|EMBL:CCM64384.1,
RC ECO:0000313|Proteomes:UP000018291};
RX PubMed=23446830; DOI=10.1038/ismej.2013.6;
RA Jon McIlroy S., Kristiansen R., Albertsen M., Michael Karst S.,
RA Rossetti S., Lund Nielsen J., Tandoi V., James Seviour R., Nielsen P.H.;
RT "Metabolic model for the filamentous 'Candidatus Microthrix parvicella'
RT based on genomic and metagenomic analyses.";
RL ISME J. 7:1161-1172(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCM64384.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CANL01000033; CCM64384.1; -; Genomic_DNA.
DR AlphaFoldDB; R4Z0G3; -.
DR STRING; 1229780.BN381_390021; -.
DR eggNOG; COG0457; Bacteria.
DR HOGENOM; CLU_006232_0_0_11; -.
DR OrthoDB; 135224at2; -.
DR Proteomes; UP000018291; Unassembled WGS sequence.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 1.
DR InterPro; IPR024983; CHAT_dom.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR InterPro; IPR019734; TPR_repeat.
DR PANTHER; PTHR45641:SF1; NEPHROCYSTIN-3; 1.
DR PANTHER; PTHR45641; TETRATRICOPEPTIDE REPEAT PROTEIN (AFU_ORTHOLOGUE AFUA_6G03870); 1.
DR Pfam; PF12770; CHAT; 1.
DR Pfam; PF13424; TPR_12; 3.
DR SMART; SM00028; TPR; 7.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50005; TPR; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018291};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW TPR repeat {ECO:0000256|ARBA:ARBA00022803, ECO:0000256|PROSITE-
KW ProRule:PRU00339}; Wnt signaling pathway {ECO:0000256|ARBA:ARBA00022687}.
FT DOMAIN 113..359
FT /note="CHAT"
FT /evidence="ECO:0000259|Pfam:PF12770"
FT REPEAT 974..1007
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 1014..1047
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 1054..1087
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT REPEAT 1134..1167
FT /note="TPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
SQ SEQUENCE 1260 AA; 138086 MW; 20A2C8798C6E299A CRC64;
MADRLIVDLF GDGRVGVSRQ LHGEVAPTPG PDPVELVVPL SEKDLGELAW YLERYLVAPF
GVYEDRGPEI ADRLVGWGEA LFGSVFGGGG ARDAYRSVRD RDRGVGLEID IRSDDPGLLG
LPWELMVDPE RSRRLVTTAE SFNRMLLTAN LEPMAQVAGE GLRVLMVIAR PAGLADVDYG
MIARPLLERL EAVRGRVTLE VLRPPTLEQL EQRLGEAADA GVPFHVVHFD GHGALGEAPG
GGSPSMYDGA EGQLLFEDSG GAGSLVSATQ FAQVVRAGGV PVVVLNACQS GALGEVIESA
VATRVLKEGA VSSVVAMSYS VYAVAAAEFM AAFYDGLFAG DPVGVAVTKG RRRLERRPGR
PSPKGDMALA DWVVPVHYVR GDVSFPHLKA APAAAGRHDF ELDAILDKID NGPEGADVLA
ARDGVFVGRD DSFFELEAVC AHERVGIIVG PAGTGKTEVA KAFGRWRRDT GALDHPDGVF
FMSFEPGVAS FGLDGVINTI GTRLFPTQFH QLDREHREEV VVSVLRDHRF LVVWDNFESV
HSMPDPHQAT ATLNDNERTR LKKFVDQIGA PDGKTTLLIT SRSPEPWLET TPRRLGLGGL
GERAADAYTD HLLNRLPNAQ ARRKQRAFGE LKQWLHGHPL SMRLILPQLE VTDPANLLDA
LKGNSELPAG FEADQGRTES LGASIHYSLV QLDDTTRQLL PAVALFESVV DAEVLRVFSA
SEQVPERFAG IDTGQWIQAL DAAAATGLLT PLGANTYSIH PALPAYLHAQ WKLDATDQFE
TEHNTATNAL LGAHAAFGTW LHTEIETGRA GLAFALIDWQ RRTMTKLIRH ALNTSQWDAC
QNVLQPLDDY FDVRGLDEEA RAVVDLVRTA TEKPAGQSPV LDSEDGKLWL FAVGSEANRQ
LRRQQLDAAY QTYDDLRVAL ENSPKSSTQQ RYLANQYHQL GIVAQKRGDL DQADGWYRKS
LTTFEELDNR PGMATSYHQL GNVAQDRGDL DQADDWYRKS LTIREELGNR PGMAESYHQL
GIVAQKRGDL DEADDWYRKS LTILEELGDR PGMATSYHQL GNVAQERGDL DQADDWYRKS
LTIKEKLGNR PGMASSYHQL GIVAQDRGAL DQADDWYRKS LTIFEELGNR PGMARSYHQL
GIVAYLRGDL DQADGWYRKS LAIKEELGSI GQGVTLALRG LLAEEQDRIN EAFGYAIRAV
ALFEEFPHRS SGSAPAQLAR LTSQHGDTAL RDAWTATTGN QIPADVFNWI AEQDKGATDE
//