ID A0AAD3E0G4_9CHLO Unreviewed; 794 AA.
AC A0AAD3E0G4;
DT 29-MAY-2024, integrated into UniProtKB/TrEMBL.
DT 29-MAY-2024, sequence version 1.
DT 28-JAN-2026, entry version 8.
DE RecName: Full=EGF-like domain-containing protein {ECO:0000259|PROSITE:PS50026};
GN ORFNames=Agub_g12134 {ECO:0000313|EMBL:GFR49988.1};
OS Astrephomene gubernaculifera.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Chlorophyceae;
OC CS clade; Chlamydomonadales; Astrephomenaceae; Astrephomene.
OX NCBI_TaxID=47775 {ECO:0000313|EMBL:GFR49988.1, ECO:0000313|Proteomes:UP001054857};
RN [1] {ECO:0000313|EMBL:GFR49988.1, ECO:0000313|Proteomes:UP001054857}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NIES-4017 {ECO:0000313|EMBL:GFR49988.1,
RC ECO:0000313|Proteomes:UP001054857};
RX PubMed=34811380; DOI=10.1038/s41598-021-01521-x;
RA Yamashita S., Yamamoto K., Matsuzaki R., Suzuki S., Yamaguchi H.,
RA Hirooka S., Minakuchi Y., Miyagishima S., Kawachi M., Toyoda A., Nozaki H.;
RT "Genome sequencing of the multicellular alga Astrephomene provides insights
RT into convergent evolution of germ-soma differentiation.";
RL Sci. Rep. 11:22231-22231(2021).
CC -!- SUBCELLULAR LOCATION: Golgi apparatus membrane
CC {ECO:0000256|ARBA:ARBA00004323}; Single-pass type II membrane protein
CC {ECO:0000256|ARBA:ARBA00004323}.
CC -!- SIMILARITY: Belongs to the glycosyltransferase 47 family.
CC {ECO:0000256|ARBA:ARBA00010271}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GFR49988.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BMAR01000034; GFR49988.1; -; Genomic_DNA.
DR AlphaFoldDB; A0AAD3E0G4; -.
DR Proteomes; UP001054857; Unassembled WGS sequence.
DR GO; GO:0000139; C:Golgi membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016757; F:glycosyltransferase activity; IEA:InterPro.
DR FunFam; 2.10.25.10:FF:000001; Tenascin C; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR InterPro; IPR000742; EGF.
DR InterPro; IPR004263; Exostosin.
DR InterPro; IPR040911; Exostosin_GT47.
DR PANTHER; PTHR11062; EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED; 1.
DR PANTHER; PTHR11062:SF268; FAMILY PROTEIN, PUTATIVE, EXPRESSED-RELATED; 1.
DR Pfam; PF23106; EGF_Teneurin; 1.
DR Pfam; PF03016; Exostosin_GT47; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Golgi apparatus {ECO:0000256|ARBA:ARBA00023034};
KW Reference proteome {ECO:0000313|Proteomes:UP001054857};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..794
FT /note="EGF-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5042195186"
FT DOMAIN 117..150
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 121..131
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 140..149
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 794 AA; 89409 MW; 39330EB95115642D CRC64;
MVAERILFRL LPLVALLGLA LRLRSSVSPA VSHHGLQETS HFGQMVGVSG VGRSLLQDDE
NDKHSEDEVA LLVMKLNRSS LLPLPVLSEQ RCAPTKGVWC AQFHGQTPAP RYLPKYYNKP
CPNNCSGVGV CHAEYGMCFC PAGYGGPDCA APRKRPCAHM GTDKRDAGWH NLTTWSHTRC
AGVCDDDIAM CYCPPDTKYG RKEAPPGSPL GSPPLQAGRP LFMCTPGTDS EGRKVEWGGT
PFPDMFGPAG WCNADKPNFT CPCRLDGLHG PLCDVVSEQV CPNQCSGRGR CNQGFCACQQ
GWYGVDCALR REGVAAEEQG LELLTKQPWL PAVRTHIVAT EDPPLTPVRR RPYVYVYDMK
PEYGSDLLQY RIEGSHCVYR TFKTDNTPDW VGYNAYSTEP VLHELFLASE HRTLDPEEAD
FFFVPVNVGC MLDVYGWNEV PRWPKDVHGP RSHGLSLMQR EAQRWLNATF PWFARRGGRD
HIWFNPHDEG ACYVWKDVWP GVMLSHWGRT DFPHLSHTAY GQDNYSMSLQ HPEHPGDWLD
HTSRTHPCFD PKKDLVIPAF KPPRHYHKSP YLGAPPPAAG RDIFAFFMGD LRMQPGRDPD
CIYSRCIRQT LYNLSISNHW KEAHAIWYGE RKDVAGSGED YSELLARSTF CFVLPGDGWS
PRLEDAVLHG CIPVIIMDDV QVVFESIIDV EQFSVRILQR DIPNVVDILK RIPPTTVKTM
QEKLHKVWHR FRFLGLRMAR AEARKLLEAH QERAGGSAPR HPGAEYSIHA QDDAFDTLMQ
WLYSRIPSVH SGSG
//