ID A0A1Y1IKE5_KLENI Unreviewed; 2778 AA.
AC A0A1Y1IKE5;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 22-FEB-2023, entry version 24.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN ORFNames=KFL_007350025 {ECO:0000313|EMBL:GAQ91153.1};
OS Klebsormidium nitens (Green alga) (Ulothrix nitens).
OC Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae;
OC Klebsormidiales; Klebsormidiaceae; Klebsormidium.
OX NCBI_TaxID=105231 {ECO:0000313|EMBL:GAQ91153.1, ECO:0000313|Proteomes:UP000054558};
RN [1] {ECO:0000313|EMBL:GAQ91153.1, ECO:0000313|Proteomes:UP000054558}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NIES-2285 {ECO:0000313|EMBL:GAQ91153.1,
RC ECO:0000313|Proteomes:UP000054558};
RX PubMed=24865297; DOI=10.1038/ncomms4978;
RA Hori K., Maruyama F., Fujisawa T., Togashi T., Yamamoto N., Seo M.,
RA Sato S., Yamada T., Mori H., Tajima N., Moriyama T., Ikeuchi M.,
RA Watanabe M., Wada H., Kobayashi K., Saito M., Masuda T.,
RA Sasaki-Sekimoto Y., Mashiguchi K., Awai K., Shimojima M., Masuda S.,
RA Iwai M., Nobusawa T., Narise T., Kondo S., Saito H., Sato R., Murakawa M.,
RA Ihara Y., Oshima-Yamada Y., Ohtaka K., Satoh M., Sonobe K., Ishii M.,
RA Ohtani R., Kanamori-Sato M., Honoki R., Miyazaki D., Mochizuki H.,
RA Umetsu J., Higashi K., Shibata D., Kamiya Y., Sato N., Nakamura Y.,
RA Tabata S., Ida S., Kurokawa K., Ohta H.;
RT "Klebsormidium flaccidum genome reveals primary factors for plant
RT terrestrial adaptation.";
RL Nat. Commun. 5:3978-3978(2014).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF237684; GAQ91153.1; -; Genomic_DNA.
DR STRING; 105231.A0A1Y1IKE5; -.
DR OrthoDB; 2910701at2759; -.
DR Proteomes; UP000054558; Unassembled WGS sequence.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR GO; GO:0016540; P:protein autoprocessing; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00081; Hint; 1.
DR Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 2.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR001767; Hedgehog_Hint.
DR InterPro; IPR003587; Hint_dom_N.
DR InterPro; IPR036844; Hint_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR006141; Intein_N.
DR InterPro; IPR000800; Notch_dom.
DR InterPro; IPR016201; PSI.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR PANTHER; PTHR46706; PROTEIN QUA-1-RELATED; 1.
DR PANTHER; PTHR46706:SF12; PROTEIN QUA-1-RELATED; 1.
DR Pfam; PF01079; Hint; 1.
DR Pfam; PF00066; Notch; 1.
DR Pfam; PF19030; TSP1_ADAMTS; 2.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00306; HintN; 1.
DR SMART; SM00004; NL; 1.
DR SMART; SM00423; PSI; 2.
DR SMART; SM00209; TSP1; 3.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 2.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50817; INTEIN_N_TER; 1.
DR PROSITE; PS50092; TSP1; 3.
PE 4: Predicted;
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000054558};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..35
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 36..2778
FT /note="EGF-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012778991"
FT DOMAIN 1459..1493
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1614..1646
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 1533..1614
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1769..2072
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2213..2272
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2437..2463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1534..1564
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1574..1610
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1769..1802
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1816..1845
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1852..1930
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1931..1954
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1955..1975
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1976..2010
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2011..2025
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2026..2040
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2217..2272
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1483..1492
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1618..1628
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1636..1645
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2778 AA; 291573 MW; D9220E705C839CB0 CRC64;
MNGHPKCLFP PKFLFRVSLL VLCLRLSASH WSALAFTEDQ GDWDDETLEH HSKSFDLDAH
LAKNEHGHPG VVLEDAEEQN ETGGHHRYEA MHMVDDSGAL LHIKYNVRTV AHLVNLAQHA
DVISSVRCLG GQLHLSVSNR AHLPGWRVGT TLVGGLEWNC TDQEGAPAPI YRKLIQTVDG
VEEGTVILET QERELHHCFR GAKVSFRYTP SSASSPAKGL TRARRKLLGL VKVVGQVYDG
VEHTITKVGD GAATAIDVMT QLLANGKYEW HGDSNWRIFA FNFDETEQRA LLKRDQLNPA
SAASNDNAKL RASVECVNCF ATLEAGVSFD MVLKTQQAVV PVYLDQMKVV ISGDFQMNAD
FNAQFDHIAG RNSWTKPIGP NKVLSTLTIM AGDVPIVVTP SIQLKAQANV AASVRGGASF
GVDYRQRMQF GKEFRAAWGE FRPVQPDDFA PLNFHEEPKL SFQGEATVEG LLIPEITLKL
YDALPIIIAP MPYLGADFRA NSDSNAADCP FSFRTYAGFN LSLGLNNLNQ IAFVSLLVSR
SVPPFRTGNS FYFSLDQHLF ARKLEIPISG KKINVGGSIV PFRKQVKLIG KHFLPCNFCS
GCVPLLGAEP VYKWVTGGWG TCTSSGLWKR DVTCQADGQP GFAMGDAHCS KAGAKPATTG
TCTVPGCPST CRSSSLGDGT CEAACNLPEC NFDGGDCAST DPCKALKDCP VCLTGGASCG
WCASSGTCLK SQSACSSALP SDWWTNRCAL EEQQIAFTRP TSLNTLRAGQ GFTVQWSGGP
LGGNVVLRYR FDNSADVFSG FGIPLASIPN TGSFYWAIDG GLPTSRAFEL LLASDEDLGN
FAFSDFFAVY GGLDMSGYVW TTGDFGPCSK ECGGGVRTRT TSCVNALNGT TVDASLCNPG
TRPSVSASCN AQACVQCPNV PLCQSGSGYS CSACSCKSNG DGTFFCGMVA TSLRYGTQTT
FGCDTRGNGY QECCRKQGLY CDDGCTNKAQ WVVISEMPCS ASCGGGTQWR KFACKGYVMW
NGRRDDRTCE DFYCGPDPST SVTNTYYECN TQPCRTYAWQ VGEWQKCTQD CGGGSHVRDL
ACKASDGSFA DVSLCDSDAR PPTQEACNVD PCLNVDPFIS QPASLDVWTA GQTYNVTWTG
GIQYGRVKLR MSRSAASPST VQLASIDPLD DGPQISVPID ALPNSGSFTV QIPNKIRSGL
LSLQLQSSDA NGTKTALTTG PVIIRGLANY TVLVTTVSRA GSSSTLPSAT SVTVVGTFGA
SFPFLIAAVP SGQSYYDQRL QILDLGAIFQ CEVTGAPNWV GSVQVLAQGS ELAIAHFGFD
APSLGQTILR SDCAQILDCH ACAETAGCGW CDTDSACRPG DDQGPYVGVC SQPTDAAMLP
YAWATVAATC PDPCSLSNAA TCRDCALRAG CGCLLAQQCN ESSTCPAAPG PRARVPAAQI
TLTSETSVLL DGSRSSKCAT CFGVDCGPNG ECITTVNAAA CRCKAGYSGA RCELPPSPCL
NVSSPGQLSC APSGNSFVLL CANSYVGKDC TPATPAPTPG PSPGPSSSPQ PSPSSTPGPS
PGASSNPQPS LSASPVPSTS PVPSPSPAPS RSPSPKASVS PSPSKSPSPS PSRSPFVCSY
NCNNRGTCLA NNKCSCQFGW ESFNCNYNNL GWNCFCECPN SERACYNAPG CKYCIDNDYN
GGCIPLREDR GQCTRARRKL LGAVGDPPRA SCYAWEQVSG PASVLSNATL PAARASGLVA
GNYTFKLSVT DNFGRVAETA VQVSVLGNSA TASPSQSPVP SPSPVASSSA SPSSSPSPSR
FLSPSSSPLP RPSNSPAASP SSSATPSSSS SPFASVPSNP FASPSSSPFA SPPRNPFASS
SGSPVASPSI SPVISPSSSP VASPSSSPVA SPSPSTIASA SPDPIASPSS SPYMRPSDSP
STSAGQSEGP SPSPSPSPVA SSPPGSPSPS ASSSPQPSPS SSAAPQTSPS TSPSPVESPS
LSPVPSPTSS SAPSPSPSAS PSPVESPSLS PVPSPSSSSS VSSPSPSPSP STPSSPVESP
SPSPRASVRP SRSPKASPSP SPSPSQSPSP SAAAAVAASI NVGVSLTSLE DANLIQNDDI
ALSLAEANGL DPGSVVISGI TFAVNHSCVL GNVSPDDWTD WGGREAFIDG VSYALNVDPS
SVSIDGFSES GRRRGLLQTP GLRVLFSILC TDGSQAKEVA QAVGDALLAA KSALPSSPPL
GPSPSKRPQP PPPPPQTPPP EPPFFDPPLE SFANPPPAPP PPPPPAPPLP VKKGGCFPAD
AFVTLADGGW KRMADVHLGD PVLVFDPSSG ELFVQPVYWF THHLNQTATY VSLEADAGVS
ISGTPNHYIL VSDPRSTLSI QQATIKPFAD VKPGDVIWTS KGPGRQTSLA PSTVVSVTSS
VLPGCTAPKL TLAAAGVITR AVLRGRSGLG RKRVKMALSK GTSKKHSRTG QQGAIRKEGK
EDRSPGAYAL RTFCLDMQKS RRRRFGLPGV TSFQNYKDAD YIGNYEMALD GLQECTSDRV
GYYQPSADAC ANLCLSFSSC IGFSFWTSNY WPDLCCSLQQ APSHGLMRSL GTNYYLRLAA
VLVPEAVSPS SQHLRLPLSH NFFPVALSLE LAFSFLLTKS SKLSVPIADA FSFFDPKPIF
LSISLSFAFD LRISVLIRQS LSIHVLISVT FDKPLPLPLP FALAFFLGKS VKVCISFLDS
LPLTFSYDKP ISSSVDTSIA ISLSISLPFF VNEPLSIDDL VSVCFSQPLS FAEPLAVPFN
IRITIALRFP TTGPKLGP
//