ID A0A0D2WPK4_CAPO3 Unreviewed; 1046 AA.
AC A0A0D2WPK4;
DT 29-APR-2015, integrated into UniProtKB/TrEMBL.
DT 29-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=NOD3 protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CAOG_008759 {ECO:0000313|EMBL:KJE93360.1};
OS Capsaspora owczarzaki (strain ATCC 30864).
OC Eukaryota; Filasterea; Capsaspora.
OX NCBI_TaxID=595528 {ECO:0000313|EMBL:KJE93360.1, ECO:0000313|Proteomes:UP000008743};
RN [1] {ECO:0000313|Proteomes:UP000008743}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=ATCC 30864 {ECO:0000313|Proteomes:UP000008743};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Capsaspora owczarzaki ATCC 30864.";
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KE346365; KJE93360.1; -; Genomic_DNA.
DR RefSeq; XP_011270389.1; XM_011272087.1.
DR AlphaFoldDB; A0A0D2WPK4; -.
DR EnsemblProtists; KJE93360; KJE93360; CAOG_008759.
DR GeneID; 23302156; -.
DR InParanoid; A0A0D2WPK4; -.
DR OrthoDB; 2732841at2759; -.
DR PhylomeDB; A0A0D2WPK4; -.
DR Proteomes; UP000008743; Unassembled WGS sequence.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 8.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR032675; LRR_dom_sf.
DR PANTHER; PTHR24111:SF0; LEUCINE-RICH REPEAT PROTEIN (LRRP); 1.
DR PANTHER; PTHR24111; LEUCINE-RICH REPEAT-CONTAINING PROTEIN 34; 1.
DR Pfam; PF13516; LRR_6; 12.
DR SMART; SM00368; LRR_RI; 21.
DR SUPFAM; SSF52047; RNI-like; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008743}.
FT REGION 1..59
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 82..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..35
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..55
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..128
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 139..156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..186
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..201
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1046 AA; 110195 MW; 8F74181DD1FBBE56 CRC64;
MIKKAVPSAA PAPTPTPMRA TRATKPKTSG SLGSTDAAED DRVVTPMEVD PPKPKRAAAR
AANAAAAATA ATPAVQVVTT TIAAPRGRTS KAAAPVPASS ISEPLSKTAQ KKATAPTNQG
SQTVSYHLRA RKPIQVAVAR TGKTSSTSSS GVPTTIRSKR RAAEMAGENT STSKKNKKKA
RQTDADDGDY EDCDDDTAGD GDDNQDSKKL SREEAIGEAL LQNTTLTTLD LGDCVIDFKN
ARMQPIAAAL VQNKTLTTLR TRNYIDAVIA DALTQSTTLT ELSGYLDANG ARSMAKALTQ
NATLTTLYLH SGKFDSAESN PIAAILKQNS TLSTLVLYGP RIGDFGAQAI GKVLKQNTTL
TALRLICNDI GSAGAQAIGE ALKTNFALTT LVLNNNDIGE AGARAIADAL VCNKTLTTLS
MYWCSIEDAG VEAIVHALEK NTTLTSLNLK NNSSGSGAQA AIPRMLQVNT TLTELDLSQN
GLKGVGFQAI AEALVQNTSL TTLLLARNFL ELADAQAIAA ALKSNTTLTT LDLGESWFGD
AGAQAIADAL RQNKTLTTLQ IGTAGLQAIS GALTQNNTLT TLNLSMNPID DVGANNIAEM
LKSNTSLTTL DQLCFKVREI AEALKQNTTL TTLDLSSNDR GLIAFNPIGA VGAHAIAEAL
KQNRTLTTLR LNNNAIGTAG VKPIAEALKM NAALTTLELD GNSIGDAETQ AIAPALVQNT
TLTSLKLGNG VLGKAGAHSI ATVLKQNAKL TTLEVTARFV DSGVQMIAAA LKHNTTLTTF
KLRDMKRYDP EFEREMSEPL TTERLLPQVR NPHGFGLEKA LSRVSRLFLR ENIVDSAGVQ
VIQEVLSLIK PSKLEIRFGL DDFGAQVLAT SLKQNSWMTE LNQVGPIGAQ AIAEALMQNT
KLTILNLSST QLGDAGAEAI SKALRVNTTL TTLNAGAQAL AEELKQNVGL TSFDLSRNSI
RDSGANAMAA VISQNTTLTT LDLGKNHIGD AGAERLAEAL LRNTTLKVLS LWYNEMSEAG
QRVMAAARSQ NSALDSLNLS ENQPPN
//