ID A0A0L0H5W2_SPIPD Unreviewed; 5053 AA.
AC A0A0L0H5W2;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE RecName: Full=Peptidase M23 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=SPPG_08216 {ECO:0000313|EMBL:KNC96311.1};
OS Spizellomyces punctatus (strain DAOM BR117).
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Chytridiomycetes; Spizellomycetales;
OC Spizellomycetaceae; Spizellomyces.
OX NCBI_TaxID=645134 {ECO:0000313|EMBL:KNC96311.1, ECO:0000313|Proteomes:UP000053201};
RN [1] {ECO:0000313|EMBL:KNC96311.1, ECO:0000313|Proteomes:UP000053201}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DAOM BR117 {ECO:0000313|EMBL:KNC96311.1,
RC ECO:0000313|Proteomes:UP000053201};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Engels R., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heiman D., Hepburn T., Howarth C.,
RA Jen D., Larson L., Lewis B., Mehta T., Park D., Pearson M., Roberts A.,
RA Saif S., Shenoy N., Sisk P., Stolte C., Sykes S., Thomson T., Walk T.,
RA White J., Yandava C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Spizellomyces punctatus strain DAOM BR117.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ257469; KNC96311.1; -; Genomic_DNA.
DR RefSeq; XP_016604351.1; XM_016756377.1.
DR STRING; 645134.A0A0L0H5W2; -.
DR GeneID; 27691392; -.
DR VEuPathDB; FungiDB:SPPG_08216; -.
DR eggNOG; KOG1217; Eukaryota.
DR InParanoid; A0A0L0H5W2; -.
DR OrthoDB; 2949229at2759; -.
DR Proteomes; UP000053201; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0005319; F:lipid transporter activity; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 5.
DR Gene3D; 2.10.10.10; Fibronectin, type II, collagen-binding; 1.
DR Gene3D; 2.10.25.10; Laminin; 5.
DR Gene3D; 1.25.10.20; Vitellinogen, superhelical; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR000562; FN_type2_dom.
DR InterPro; IPR036943; FN_type2_sf.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR013806; Kringle-like.
DR InterPro; IPR015819; Lipid_transp_b-sht_shell.
DR InterPro; IPR011030; Lipovitellin_superhlx_dom.
DR InterPro; IPR015816; Vitellinogen_b-sht_N.
DR InterPro; IPR001747; Vitellogenin_N.
DR PANTHER; PTHR16897; OS10G0105400 PROTEIN; 1.
DR PANTHER; PTHR16897:SF2; STRESS RESPONSE PROTEIN NST1; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF07645; EGF_CA; 3.
DR Pfam; PF00040; fn2; 1.
DR Pfam; PF01347; Vitellogenin_N; 1.
DR SMART; SM00181; EGF; 6.
DR SMART; SM00179; EGF_CA; 4.
DR SMART; SM00059; FN2; 1.
DR SMART; SM00060; FN3; 5.
DR SMART; SM00638; LPD_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 1.
DR SUPFAM; SSF57440; Kringle-like; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 3.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 5.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS51092; FN2_2; 1.
DR PROSITE; PS50853; FN3; 1.
DR PROSITE; PS51211; VITELLOGENIN; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000053201};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 43..823
FT /note="Vitellogenin"
FT /evidence="ECO:0000259|PROSITE:PS51211"
FT DOMAIN 2734..2831
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 3728..3771
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3782..3821
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3822..3864
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3990..4022
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 4216..4259
FT /note="Fibronectin type-II"
FT /evidence="ECO:0000259|PROSITE:PS51092"
FT DOMAIN 4304..4343
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 1214..1241
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 3994..4004
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 4012..4021
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 5053 AA; 543492 MW; AD999AA731925E16 CRC64;
MCSTQTREPR KRPIIISQRP VIKLLLLLAF WACSASHVYA AHFQIGKSYQ YSISTQVESI
SDMSKTILNS GPELSVKPAH KNGNAEMKLD AKFTIKPYDT DEANRTLCQF VFIGHPRIIV
TAPLPEGGIK RSETPTDYDF NLHWFGFAVS DSGIVEEVFS DPDELRGVLN IKKAIANLFS
AKLYDEEKKL RSADFNTTET DMSGRHDAAY SVGTSTATSN LRFTKRMLTR RDEADDAETA
EQDRFTHDHE KIIEKHPETH EIHSVVITDH VSSEGSAGVR GNFGGVEKRS TEDAHLLMSA
TGSTALKLLH TRSTKDAHTI ATPANIVRIS LSTEIAAKTV PLETVMNIVA ESMECYDSSV
LHSRKHHTRG EKAQCFARAR KALLSLDQNQ AAIAVQKLLS QYVHTDVAYV AVDLAAEVCT
AFTNVGETIL AETFGARSRR DVPEETLATA LQGISRCAKP TKVVAEIVEE LTKYSTFNGS
YQEETENVQQ HAALALGLMA QKSFEKNDTA LGEQLVSTLH EMVAEVDHSQ YHVLNSDRAK
SKRDFDDEVI DVSHAAAYHG TLLLALGNAA HNKSIPVLLS HLKRSEVTSN HFPEIVRASA
IRSLGSYHGS DIEDILILAA TEGDDGVRSA AHAAYKKHKR SISFEEVIEG AEELKHILQS
DEDRFERVMN GTTDMGLSRR ALRARGLVSM SLTSTSIELN METPKYFWEQ RYGPSVVGVR
ALVNFKNKIN LFLSVLRTQL TIDINNSAQA FLYFDIGGYT EATIFDAQLK LYAKVGYDLD
MIKNFKLSDL ANIKSAFLRG IDKVTSPFTK TYQDALNAFD VLVSKFTAME HLIANFPSQG
AFGLADFSFG LNNVTVPSDP ADESDMLNII NFFGYLEAQV KASAKQLNSK VVDGIDDVLA
NVAQGFGDVS AGVDNVLNCP AQADNTIVHG IEGIQSGLGQ VEKVLETLKS LLSVEALEGM
LPDVEHEFLD QLLALAASKP ILLDFVQPSV QLYNTTSESI QAAQRNIAEA YRNFKLTMGK
LSATHNELKK YLGSVFGPKF DQPFPNRVAA DNNGAPMAFP SAMLDKKYNG SFLDVEGREE
IVAPLDGTLS RADDGALILS VTDGSLSNYN IIINRVTPLV DLNVVVGKTK VKRGDSIAAP
KGSEIHLSIQ DKRDPNVWLD PGKYLPRRIP TILSGFNPNP NYYLLTILGR SYVPQTPIIG
KQALNQQDLA DALDAKKEAD KSGKSKTKPK KSSKARRALE FDEPDSHAVS LARRASFSGF
PVNVPNVCID DEFENAQELC AGGMLPEFRK SINIFHAEKT ILVGGMVPVT FELDFDAILG
IQAGMRVCLM GMTLEPTVIP AFAISVSGSL SLDIFIASVG LQVTGIVADT HVPITVHFPL
SNFPIGTCID INVEIVPLAI EIAIIVTIDF FFFDITWRVT IVRFALSTIK IRVLDTCPAP
GALTISDASG TLADRTPPII DQLLAQQVVG LTPVNPLLFT RFAAHDGESG IAEISVGVGW
GPGDTSIIPR STVPRDQGES LTFPGPTDKE WFDEIDLYVT ITAKNQQGLE TTASTKVLWD
LSPPIINIWE TDLTQPSEFE FSIGENETWV VQRKNLRAFT TEVDLQRNGT ALQPLVDKLS
FAYSIKDATP LREILCAVGT SKQEGALNDT VPWMKLDANA TMGRLTFPGL TLEHGKTYYL
AIQATNTLGY VSSVQSKGTM VDVTPPDLGS LYFGQKFRRD WEATNAQSSV FFNFNGFIDN
ETDIAAWNFA IGPNDIDADL IPVDVDSWMP YNGWTNFIVP GPGGADIAMS LSGYELPEGN
HTICIQATNW VGLKTLRCRN GYVVDATPPT GGISIAPSTE PGSLGDIVLS FNYSDNLSGV
KYINVGLGDG YTPRYTGYIT LDPSATPGIQ NYTITVDEKL NGKLLYGQLQ IADNAGNFFA
LSTERPIAVD FYPPTPGTVF DGATLWTQDD YIDSNQTLCC SWTSFVSAVS GIGRIDVSFG
TLPEQTDIMD WTEVGRWDSQ ACLPVTKVTV PQDSIVFANV RAWNDNGPDS KYSVASSRGT
IIDITGPTPF EAKILTGGNG IFQNQRLFVT LNWTSSSDDE SGLLKYTIQL LDSNTGAILY
PETIVDYALP TRTNAVLYGA PISQGQTFYA VVTAYNAAGT RNTAVATTGN VTVDETPPLL
KFLNYRGNSV YGTTTYIQTP ATFTIDWEWD EPESGLASNA LACWILDPDL VAVPGATANI
TSSGCTITAN PPLKEFNSYT VFVSAVNRAG LSTRVGFEIT IATLPPVLLA SGVGSVALGT
PSLRFSTNND ALRGWFSYQK TIVPITNIFV SFSDETGSTN YTNGFIAVEP SSTFIAYPMD
LAANQTYCMT VYARNVAGLE SVLHESGCVK IVPGAKPGTV WDGFMPNVET RIQIERDLVT
ATFNGFKHFF AGALYTWSMG TTPNSTDVVQ ETSAYLVYPS KNSSGFINYP SSALQPNVTY
FITVYGTFGT TGPDTFTEKV SSTSAGFRVA YEAPFEPTLQ LADGSSYVSN ASVNQIKVTC
AASTGLGSEG FDSLQVSIGR FDTDDVFPLT VSQFPATEQY SVAFLANISS IPYGTASFAK
CSAVDSITGT SSTVYSNHLV VDATPPVAPT DFGCSSAIVT PRLPFSCSWS AFVDPESAVA
NVTVSLGTTT GGVEILTPKT VVGNSYVFDP AFVYLETPSS QYFVTVTGTN SVGMSSSAYT
VLSIDRTPPD TKYDKILFLS EVEGINFTSS ASALPRNVID CHRGTNNIVA SWDGAFTDAE
SGIARYRVAV KEYQGQAFLL KWKDVGTATT ATLQLDVAPR PGSQIVIVVR AVNGAGLSAQ
ATSSPLGVVD NGPMPVSVYS TSASSGFQKD RNVLSAAWKF THPCPIAEYQ WWIKDNLTNA
VVWPVTSTTE TEIVAANLGL VPMRSYVVIV KAFNSFGMSN AVPAVSDAVT IVHSPAKAGK
VWDGPLPGIQ QSFQNDPHTV SASWEDFSSD TCFVVNYRYA VGTDYLISDG AANILIWTDA
GLTRNFTAWI STELKPYTTY YVTVQATSCT GDIVTGVSPG FVIGLRDPPT VGEVWIDNGL
GITTATSQVS TDTVNIQWRG ITSVWSPLAI DVALSSSADS SNSQSYIVNF THVDPNAVMF
NFTGLSLNVT EIGGNRSEYY AFVRATDLNA QGTITKSLGF IVNNTQPAPL VIAFSGQAEN
ATESWQAASG SMTINLGDFG GDIQDIQYAV MLEPPAENST GQLRRRDSAE DPVSMALDNP
DLLTVPFSSV GSDLNTTSSI DIAAPLTESG THRILIKAVN NAGLTSYSAS VPLHVDVTPP
ILGTVKQGYD PSVNMAYTAS NNSVALTWAE SVGESPCQPN VDTFTSGASA RWSISSSLTT
DFGLDDARIV FSPSCATFAS NRLTLVAATD SADEHLTRGC EVVSNVSISG GSFSFRFKAS
PTYGAASSII LTDSNIPVVE RSFETQSNYY FNNSWPYNAI GIQILALDKP SVYMWAAKPG
DKSLRGAEVD FSQDPTSVYL TYSVTVREGT TQVTVTDDAG LVVGSSQLPG LFVGTAGIQG
LPKLAPLFRL WAVRKLASTA SMVFSSTTFP GPPNASYGCN HGDPFLDPDS GIDYYEIGIS
TVIEKYDVLN LTRINVENWT QTCNDATSDV PCVPSQWSQP LTSTDSELIR LGRLTNITLE
SYIYSRNWCD LSAEQGGCSP LASCTKREDV NFPVSNQTLA VANRYNLQYI CTCQPGYGGS
GYQFDCEDID ECGIAASYNV SMCATGATCR NIPGSYLCEC PPGYTGNPYS ATGELEQSGC
IDIDECASNP CGLAGRCENV PGSYRCNCNP GFTAMDQWTC VNVNECMTSN NCHSNATCVD
TYGGHECRCP PGTVGTGIGT DGCVLPDPKT GCAGAAVLTS SAPLNASISS KDIDEDPRTT
RMVNTPRLAR WYKITATGEP FVLSSSASQP SRMWLVAFPS CTASPVAIAK CMPAQSLQCA
ISIPSLKGEV YIAVLSAKAV TVQLSVSGLT GFTCSIACQN GGVCIGPNMC SCPPNWSGPT
CATAVRQMLP RQTINLNETC LGPFTVNGTT YSSCAPGVKP DVAPSAPGYS FQGCWSFGDL
SQVLHIALAT NNYTTILSQA TLAFPQTKSY RDISLDVCAS RCAEINLPFF VYTSATTTAP
AGCVCMPTLM SNSRLASGSL CFTRDKITKA PIYGLDALGA ATFRAELPIP WCPLVSNSSV
PSQLSRTDYC APRKTVTGET CVFPFYYQGK LYTECTYADD IRPWCYTNAL FSKRGYCVGI
DWCSGPTGPN PANGKCVSSL TSPTESYTTV CNDGFRRSGL ECVDIDECAE GINSCDFRTT
TCVNDIGSYH CVCNDGYVPS SSNMTCIKPA NATVPTAATP GSIAPAPLTT PASYYIALRA
VNRAGLGSVL WTRGLTIDQT PPVLGNLTFM RAGQPVGATS DGHLTLTWEA MDDGSDIAYY
EWALGTAVNR SDLLGWSFTQ DSQVTASLVN VSGLLEIYLT VRAVNNALLS TIDVTSILVD
TSPPNAAGAQ VSLANYGRVV QWSGFVPGRS RIYQYELAIG SVQGAIDIMP YKSVSNGSIL
STTIPADRMT VPVPVWVTVK GTNDAWLSAT VSIPTPRVVV GYDAYVLTRT AGSTTVTVDG
FDESTTTLQV VATNIPTNLG ITVLPVSTYT YSVVGNDSTT VISPPATLVL ALGWKYARYL
DTSALFMALT PSADVATLFG SSVVTATLLP DQDLVAAVVT PYIFARFKGS AQWTIVPTTY
DAALHTIDFD VAKPGQYALY NNWVSPAFLD LDQNGRADVL GYKTNTTDQL TTYSSWSSTF
STSNTWLTIA ASKTISSVGD YNQDGTFDYI TKDSVGAYAI VYRPGLQLQT LKGPDQSICP
DRALVGFADL DGNDILDTFW ATKAGGPIDG GSSITVCVMA DNDYDQSCQA NTFSASGTPV
GLGVWDSLGV LGFACTANIC RFSIQGLAVP NRASLFVPEQ ALTSISSSVT LLELSLEIHA
GQTWRASVQG DVDGDGYTDL LLQCKSTLTQ AGCGPAMNLR EALTIVYIRQ NGQVTTAMVT
NTGMDLAMST LIV
//