Database: UniProt
Entry: A0A1Y1IKE5_KLENI
ID   A0A1Y1IKE5_KLENI        Unreviewed;      2778 AA.
AC   A0A1Y1IKE5;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   22-FEB-2023, entry version 24.
DE   RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN   ORFNames=KFL_007350025 {ECO:0000313|EMBL:GAQ91153.1};
OS   Klebsormidium nitens (Green alga) (Ulothrix nitens).
OC   Eukaryota; Viridiplantae; Streptophyta; Klebsormidiophyceae;
OC   Klebsormidiales; Klebsormidiaceae; Klebsormidium.
OX   NCBI_TaxID=105231 {ECO:0000313|EMBL:GAQ91153.1, ECO:0000313|Proteomes:UP000054558};
RN   [1] {ECO:0000313|EMBL:GAQ91153.1, ECO:0000313|Proteomes:UP000054558}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=NIES-2285 {ECO:0000313|EMBL:GAQ91153.1,
RC   ECO:0000313|Proteomes:UP000054558};
RX   PubMed=24865297; DOI=10.1038/ncomms4978;
RA   Hori K., Maruyama F., Fujisawa T., Togashi T., Yamamoto N., Seo M.,
RA   Sato S., Yamada T., Mori H., Tajima N., Moriyama T., Ikeuchi M.,
RA   Watanabe M., Wada H., Kobayashi K., Saito M., Masuda T.,
RA   Sasaki-Sekimoto Y., Mashiguchi K., Awai K., Shimojima M., Masuda S.,
RA   Iwai M., Nobusawa T., Narise T., Kondo S., Saito H., Sato R., Murakawa M.,
RA   Ihara Y., Oshima-Yamada Y., Ohtaka K., Satoh M., Sonobe K., Ishii M.,
RA   Ohtani R., Kanamori-Sato M., Honoki R., Miyazaki D., Mochizuki H.,
RA   Umetsu J., Higashi K., Shibata D., Kamiya Y., Sato N., Nakamura Y.,
RA   Tabata S., Ida S., Kurokawa K., Ohta H.;
RT   "Klebsormidium flaccidum genome reveals primary factors for plant
RT   terrestrial adaptation.";
RL   Nat. Commun. 5:3978-3978(2014).
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DF237684; GAQ91153.1; -; Genomic_DNA.
DR   STRING; 105231.A0A1Y1IKE5; -.
DR   OrthoDB; 2910701at2759; -.
DR   Proteomes; UP000054558; Unassembled WGS sequence.
DR   GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR   GO; GO:0016539; P:intein-mediated protein splicing; IEA:InterPro.
DR   GO; GO:0016540; P:protein autoprocessing; IEA:InterPro.
DR   CDD; cd00054; EGF_CA; 1.
DR   CDD; cd00081; Hint; 1.
DR   Gene3D; 2.170.16.10; Hedgehog/Intein (Hint) domain; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   Gene3D; 2.10.25.10; Laminin; 1.
DR   Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 2.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR   InterPro; IPR001767; Hedgehog_Hint.
DR   InterPro; IPR003587; Hint_dom_N.
DR   InterPro; IPR036844; Hint_dom_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR006141; Intein_N.
DR   InterPro; IPR000800; Notch_dom.
DR   InterPro; IPR016201; PSI.
DR   InterPro; IPR000884; TSP1_rpt.
DR   InterPro; IPR036383; TSP1_rpt_sf.
DR   PANTHER; PTHR46706; PROTEIN QUA-1-RELATED; 1.
DR   PANTHER; PTHR46706:SF12; PROTEIN QUA-1-RELATED; 1.
DR   Pfam; PF01079; Hint; 1.
DR   Pfam; PF00066; Notch; 1.
DR   Pfam; PF19030; TSP1_ADAMTS; 2.
DR   SMART; SM00181; EGF; 2.
DR   SMART; SM00306; HintN; 1.
DR   SMART; SM00004; NL; 1.
DR   SMART; SM00423; PSI; 2.
DR   SMART; SM00209; TSP1; 3.
DR   SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR   SUPFAM; SSF51294; Hedgehog/intein (Hint) domain; 1.
DR   SUPFAM; SSF82895; TSP-1 type 1 repeat; 2.
DR   PROSITE; PS00022; EGF_1; 1.
DR   PROSITE; PS01186; EGF_2; 2.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS50817; INTEIN_N_TER; 1.
DR   PROSITE; PS50092; TSP1; 3.
PE   4: Predicted;
KW   Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Reference proteome {ECO:0000313|Proteomes:UP000054558};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..35
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           36..2778
FT                   /note="EGF-like domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012778991"
FT   DOMAIN          1459..1493
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          1614..1646
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   REGION          1533..1614
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1769..2072
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2213..2272
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2437..2463
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1534..1564
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1574..1610
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1769..1802
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1816..1845
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1852..1930
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1931..1954
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1955..1975
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1976..2010
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2011..2025
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2026..2040
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2217..2272
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        1483..1492
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        1618..1628
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        1636..1645
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   2778 AA;  291573 MW;  D9220E705C839CB0 CRC64;
     MNGHPKCLFP PKFLFRVSLL VLCLRLSASH WSALAFTEDQ GDWDDETLEH HSKSFDLDAH
     LAKNEHGHPG VVLEDAEEQN ETGGHHRYEA MHMVDDSGAL LHIKYNVRTV AHLVNLAQHA
     DVISSVRCLG GQLHLSVSNR AHLPGWRVGT TLVGGLEWNC TDQEGAPAPI YRKLIQTVDG
     VEEGTVILET QERELHHCFR GAKVSFRYTP SSASSPAKGL TRARRKLLGL VKVVGQVYDG
     VEHTITKVGD GAATAIDVMT QLLANGKYEW HGDSNWRIFA FNFDETEQRA LLKRDQLNPA
     SAASNDNAKL RASVECVNCF ATLEAGVSFD MVLKTQQAVV PVYLDQMKVV ISGDFQMNAD
     FNAQFDHIAG RNSWTKPIGP NKVLSTLTIM AGDVPIVVTP SIQLKAQANV AASVRGGASF
     GVDYRQRMQF GKEFRAAWGE FRPVQPDDFA PLNFHEEPKL SFQGEATVEG LLIPEITLKL
     YDALPIIIAP MPYLGADFRA NSDSNAADCP FSFRTYAGFN LSLGLNNLNQ IAFVSLLVSR
     SVPPFRTGNS FYFSLDQHLF ARKLEIPISG KKINVGGSIV PFRKQVKLIG KHFLPCNFCS
     GCVPLLGAEP VYKWVTGGWG TCTSSGLWKR DVTCQADGQP GFAMGDAHCS KAGAKPATTG
     TCTVPGCPST CRSSSLGDGT CEAACNLPEC NFDGGDCAST DPCKALKDCP VCLTGGASCG
     WCASSGTCLK SQSACSSALP SDWWTNRCAL EEQQIAFTRP TSLNTLRAGQ GFTVQWSGGP
     LGGNVVLRYR FDNSADVFSG FGIPLASIPN TGSFYWAIDG GLPTSRAFEL LLASDEDLGN
     FAFSDFFAVY GGLDMSGYVW TTGDFGPCSK ECGGGVRTRT TSCVNALNGT TVDASLCNPG
     TRPSVSASCN AQACVQCPNV PLCQSGSGYS CSACSCKSNG DGTFFCGMVA TSLRYGTQTT
     FGCDTRGNGY QECCRKQGLY CDDGCTNKAQ WVVISEMPCS ASCGGGTQWR KFACKGYVMW
     NGRRDDRTCE DFYCGPDPST SVTNTYYECN TQPCRTYAWQ VGEWQKCTQD CGGGSHVRDL
     ACKASDGSFA DVSLCDSDAR PPTQEACNVD PCLNVDPFIS QPASLDVWTA GQTYNVTWTG
     GIQYGRVKLR MSRSAASPST VQLASIDPLD DGPQISVPID ALPNSGSFTV QIPNKIRSGL
     LSLQLQSSDA NGTKTALTTG PVIIRGLANY TVLVTTVSRA GSSSTLPSAT SVTVVGTFGA
     SFPFLIAAVP SGQSYYDQRL QILDLGAIFQ CEVTGAPNWV GSVQVLAQGS ELAIAHFGFD
     APSLGQTILR SDCAQILDCH ACAETAGCGW CDTDSACRPG DDQGPYVGVC SQPTDAAMLP
     YAWATVAATC PDPCSLSNAA TCRDCALRAG CGCLLAQQCN ESSTCPAAPG PRARVPAAQI
     TLTSETSVLL DGSRSSKCAT CFGVDCGPNG ECITTVNAAA CRCKAGYSGA RCELPPSPCL
     NVSSPGQLSC APSGNSFVLL CANSYVGKDC TPATPAPTPG PSPGPSSSPQ PSPSSTPGPS
     PGASSNPQPS LSASPVPSTS PVPSPSPAPS RSPSPKASVS PSPSKSPSPS PSRSPFVCSY
     NCNNRGTCLA NNKCSCQFGW ESFNCNYNNL GWNCFCECPN SERACYNAPG CKYCIDNDYN
     GGCIPLREDR GQCTRARRKL LGAVGDPPRA SCYAWEQVSG PASVLSNATL PAARASGLVA
     GNYTFKLSVT DNFGRVAETA VQVSVLGNSA TASPSQSPVP SPSPVASSSA SPSSSPSPSR
     FLSPSSSPLP RPSNSPAASP SSSATPSSSS SPFASVPSNP FASPSSSPFA SPPRNPFASS
     SGSPVASPSI SPVISPSSSP VASPSSSPVA SPSPSTIASA SPDPIASPSS SPYMRPSDSP
     STSAGQSEGP SPSPSPSPVA SSPPGSPSPS ASSSPQPSPS SSAAPQTSPS TSPSPVESPS
     LSPVPSPTSS SAPSPSPSAS PSPVESPSLS PVPSPSSSSS VSSPSPSPSP STPSSPVESP
     SPSPRASVRP SRSPKASPSP SPSPSQSPSP SAAAAVAASI NVGVSLTSLE DANLIQNDDI
     ALSLAEANGL DPGSVVISGI TFAVNHSCVL GNVSPDDWTD WGGREAFIDG VSYALNVDPS
     SVSIDGFSES GRRRGLLQTP GLRVLFSILC TDGSQAKEVA QAVGDALLAA KSALPSSPPL
     GPSPSKRPQP PPPPPQTPPP EPPFFDPPLE SFANPPPAPP PPPPPAPPLP VKKGGCFPAD
     AFVTLADGGW KRMADVHLGD PVLVFDPSSG ELFVQPVYWF THHLNQTATY VSLEADAGVS
     ISGTPNHYIL VSDPRSTLSI QQATIKPFAD VKPGDVIWTS KGPGRQTSLA PSTVVSVTSS
     VLPGCTAPKL TLAAAGVITR AVLRGRSGLG RKRVKMALSK GTSKKHSRTG QQGAIRKEGK
     EDRSPGAYAL RTFCLDMQKS RRRRFGLPGV TSFQNYKDAD YIGNYEMALD GLQECTSDRV
     GYYQPSADAC ANLCLSFSSC IGFSFWTSNY WPDLCCSLQQ APSHGLMRSL GTNYYLRLAA
     VLVPEAVSPS SQHLRLPLSH NFFPVALSLE LAFSFLLTKS SKLSVPIADA FSFFDPKPIF
     LSISLSFAFD LRISVLIRQS LSIHVLISVT FDKPLPLPLP FALAFFLGKS VKVCISFLDS
     LPLTFSYDKP ISSSVDTSIA ISLSISLPFF VNEPLSIDDL VSVCFSQPLS FAEPLAVPFN
     IRITIALRFP TTGPKLGP
//