ID A0A0L0FC97_9EUKA Unreviewed; 908 AA.
AC A0A0L0FC97;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=NodB homology domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=SARC_13061 {ECO:0000313|EMBL:KNC74390.1};
OS Sphaeroforma arctica JP610.
OC Eukaryota; Ichthyosporea; Ichthyophonida; Sphaeroforma.
OX NCBI_TaxID=667725 {ECO:0000313|EMBL:KNC74390.1, ECO:0000313|Proteomes:UP000054560};
RN [1] {ECO:0000313|EMBL:KNC74390.1, ECO:0000313|Proteomes:UP000054560}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP610 {ECO:0000313|EMBL:KNC74390.1,
RC ECO:0000313|Proteomes:UP000054560};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Young S.K., Zeng Q., Gargeya S., Alvarado L., Berlin A.,
RA Chapman S.B., Chen Z., Freedman E., Gellesch M., Goldberg J., Griggs A.,
RA Gujja S., Heilman E., Heiman D., Howarth C., Mehta T., Neiman D.,
RA Pearson M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C.,
RA Sykes S., White J., Yandava C., Burger G., Gray M.W., Holland P.W.H.,
RA King N., Lang F.B.F., Roger A.J., Ruiz-Trillo I., Haas B., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Sphaeroforma arctica JP610.";
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ244456; KNC74390.1; -; Genomic_DNA.
DR RefSeq; XP_014148292.1; XM_014292817.1.
DR AlphaFoldDB; A0A0L0FC97; -.
DR STRING; 667725.A0A0L0FC97; -.
DR EnsemblProtists; KNC74390; KNC74390; SARC_13061.
DR GeneID; 25913565; -.
DR eggNOG; KOG1217; Eukaryota.
DR OrthoDB; 1343935at2759; -.
DR Proteomes; UP000054560; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004099; F:chitin deacetylase activity; IEA:UniProt.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd19941; TIL; 1.
DR Gene3D; 3.20.20.370; Glycoside hydrolase/deacetylase; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR011330; Glyco_hydro/deAcase_b/a-brl.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR002509; NODB_dom.
DR PANTHER; PTHR24034; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24034:SF166; VACUOLAR-SORTING RECEPTOR 1; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF01522; Polysacc_deac_1; 1.
DR SMART; SM00181; EGF; 6.
DR SMART; SM00179; EGF_CA; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF88713; Glycoside hydrolase/deacetylase; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS51677; NODB; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000054560};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..908
FT /note="NodB homology domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005538634"
FT DOMAIN 63..283
FT /note="NodB homology"
FT /evidence="ECO:0000259|PROSITE:PS51677"
FT DOMAIN 362..403
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 437..475
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 476..517
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 707..743
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 372..389
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 446..463
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 465..474
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 507..516
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 733..742
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 908 AA; 97951 MW; 2D2082F50B03F7D6 CRC64;
MYCLHISSGL FALVSSFAAL TLAQPVGTQH PLTYTNNGYI GNTWIESFWT TNDNVYWRCT
RPGLIAITYD DGPAIELSSS DRTYKALDLH SRLGVPATFF VLGKYTDVDD PALQQKIHDV
LRRMVDEGHQ IAHHSVTHKR ITENPNFIAE LEEQNAWFAA NFVYGDILRS SNGVRYVRPP
YLDMDDLSAN ALGAAGYKVV QRSASSKDTT VGNTADIGSS HIWCDTESGY SCPTEYPAPL
NAEISVSQNS WIGLFHDRGA AFDSSITAEF MITRARQMGY RFVTVAECLG DSPLCSCPEN
AFCDENRDCF CNEGYTMISN QCVENADECL PELSCSPGSI RLGNKCVCED GFQMNADLTC
DDIDECLTDG ICSMTPGSEC INTPGFFTCA CPENTALLNG HCIMTGCSGC PPNAGNINGE
CMCADGYVKD GGGACVDIDE CALDACAALT NTECVNTDGS FVCQCQPFWL GPACIEYDYC
GMDPNACTGA GYSGECSNSG SGPVCTCLPE WTGDSCEISK GDCYATVQYP VGALNPDSGQ
EIAFASLGVG ACLMGLVVDV GTYRMNGQEF IDAPFEIYID NNDDETDGYT GSAVTGARWR
VKYHKVCEYP TASSTYTGCS TIANADVNVV DNGPAGVSLV IPWESFGFAN APVGQTWRIG
ARSSNLLSDT RAYLPDTRYT DAVPIAYTIA LPGTRRDTGE IMARAKGGKK CKEGTCSGRG
TCTQGERGAV CWCEDGFDGE GCEVYDVCSL GDNPNSACER VSGVSTECIS SNSGFTCAGL
PNEFRMDPNL DLALASDTVL KGDADLDTRT QATLWTALNC QQLVEHVVQS ILRQALVVEL
RSYLFTEIQI ANARCASDTI AFEKELGKRL KEAAKVDDQA DLWMQAIDCD TKGKCSGVKE
KENQKKNH
//