ID A0A150GY18_GONPE Unreviewed; 1245 AA.
AC A0A150GY18;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE RecName: Full=EGF-like domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=GPECTOR_4g751 {ECO:0000313|EMBL:KXZ54684.1};
OS Gonium pectorale (Green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Chlorophyceae;
OC CS clade; Chlamydomonadales; Volvocaceae; Gonium.
OX NCBI_TaxID=33097 {ECO:0000313|EMBL:KXZ54684.1, ECO:0000313|Proteomes:UP000075714};
RN [1] {ECO:0000313|Proteomes:UP000075714}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NIES-2863 {ECO:0000313|Proteomes:UP000075714};
RX PubMed=27102219; DOI=10.1038/ncomms11370;
RA Hanschen E.R., Marriage T.N., Ferris P.J., Hamaji T., Toyoda A.,
RA Fujiyama A., Neme R., Noguchi H., Minakuchi Y., Suzuki M.,
RA Kawai-Toyooka H., Smith D.R., Sparks H., Anderson J., Bakaric R., Luria V.,
RA Karger A., Kirschner M.W., Durand P.M., Michod R.E., Nozaki H., Olson B.J.;
RT "The Gonium pectorale genome demonstrates co-option of cell cycle
RT regulation during the evolution of multicellularity.";
RL Nat. Commun. 7:11370-11370(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KXZ54684.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSYV01000005; KXZ54684.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A150GY18; -.
DR OrthoDB; 324080at2759; -.
DR Proteomes; UP000075714; Unassembled WGS sequence.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR003609; Pan_app.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF14295; PAN_4; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000075714}.
FT DOMAIN 18..54
FT /note="Apple"
FT /evidence="ECO:0000259|Pfam:PF14295"
FT DOMAIN 96..130
FT /note="Apple"
FT /evidence="ECO:0000259|Pfam:PF14295"
FT DOMAIN 168..201
FT /note="Apple"
FT /evidence="ECO:0000259|Pfam:PF14295"
FT DOMAIN 631..761
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|Pfam:PF00754"
FT REGION 1028..1068
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1080..1104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1035..1068
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1245 AA; 130114 MW; BE403ABCE8E77127 CRC64;
MATLSNHSCP AAAGYTLYAG KDIARFDIAS VSSVADAATK CNANPSCNSF NWNLALGLGY
TKTVVYSAAQ AASYADICFY AKGCSDIAGF TLYTDIDIPG SDIVKVTTTA SAAAAQCLAN
SACKSVLWAP IFSPPYGFTK SIGYSANASK AYKVCCSDIA GFTLYTDIDI PGSDIVKVTT
TASAAAAQCL ANSACKSVLW APIFSPPYGF TKSIGYSANA SKAYKGLCFY EKTVANVTLV
SKSYLASYTN VAIELDASGK PTQAAVMAFV ESLKANITNT WGVNASNINI TSLIINGVDV
TSMVTTTTTG RRRSLFLGYA IPTRPNDYFI DFDELSVIGG TLNYNGFNFG PGFQLFGTQG
FSNTPASTSI IPDDSHDIDT QNPVVLGVPD DGCITVPAGF TGIDMWVNFA CDSGKCVAGA
TPACHVCDVK GDGSVNNVIR AYNVENCPLD AATGDARIVA DWTYSRTPAG IIGVSTAAAI
TAAAITAAAI TAAAITAATV NSPAKPAPAV STAVPTAAAK PAAAVSTAVP SPANPAPAGS
TARRDALLLM DWYCQCNKWS ISIGPKTCIA LDICRAVPCG GAPNKCIRDA TKSLGFYCQC
GDGFTEKVVN GITRCYGNVA RGAIAYGNSV LNNWYPNLAI DGKRNTFFGS NNNDGRYPGQ
LGLDLRVPHY IDGFKIVLRQ DMCAGHIGMM LQVNNEPSRS WVSYGGTVWT QVTTFLYDRP
NKDLGPADGK PATFTPVTGR WVNLTNQNPS WVNNVVFIAE LELYGYSMGD ANNLVSLPWC
SYKPCGGGTC NPSTSIGYWC KCWDYHYKIN QGTLLETCVK IPECNAPSVQ AECAAKGMVC
INLPGNYTCG NAPPASPPPP LPPGLAQVTA NYEAGFSNVA LTIDGQVSDA AVAAFVNDFK
TRVAATWKIP FNKVVIKSIT INGVTKSLQR RRAAAESADS AATSSDEIIE IQDSLRHPLR
ALQDMGLTHI VQVGQPQVTT SELLHEHHTM EAYLLGLHGV HHADLRNLAS THRVLAGGDA
ASMDFSVTKE LSKPPKAPKA PPPPKAPKAS PPPKAPLPPA APPPPGPILW AVSATASAGH
NDASASNAVG APKKALTKRT ECRPAKPTEA WIPAVSRGER FIEAGFDGKA PSGSQVASVA
LYLLNSGTLA NPVTAVTVNL QLAGQAQART FTVFTANTTA LNLACPGITY FPVAAASLRP
ALTARQYSGA TVVSARISFN ETAVGKPTML PQVIGVGLAA AAAAA
//