ID D2VYI4_NAEGR Unreviewed; 500 AA.
AC D2VYI4;
DT 02-MAR-2010, integrated into UniProtKB/TrEMBL.
DT 02-MAR-2010, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EFC38116.1};
GN ORFNames=NAEGRDRAFT_74132 {ECO:0000313|EMBL:EFC38116.1};
OS Naegleria gruberi (Amoeba).
OC Eukaryota; Discoba; Heterolobosea; Tetramitia; Eutetramitia;
OC Vahlkampfiidae; Naegleria.
OX NCBI_TaxID=5762 {ECO:0000313|Proteomes:UP000006671};
RN [1] {ECO:0000313|EMBL:EFC38116.1, ECO:0000313|Proteomes:UP000006671}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NEG-M {ECO:0000313|EMBL:EFC38116.1,
RC ECO:0000313|Proteomes:UP000006671};
RX PubMed=20211133; DOI=10.1016/j.cell.2010.01.032;
RA Fritz-Laylin L.K., Prochnik S.E., Ginger M.L., Dacks J.B., Carpenter M.L.,
RA Field M.C., Kuo A., Paredez A., Chapman J., Pham J., Shu S., Neupane R.,
RA Cipriano M., Mancuso J., Tu H., Salamov A., Lindquist E., Shapiro H.,
RA Lucas S., Grigoriev I.V., Cande W.Z., Fulton C., Rokhsar D.S., Dawson S.C.;
RT "The genome of Naegleria gruberi illuminates early eukaryotic
RT versatility.";
RL Cell 140:631-642(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG738911; EFC38116.1; -; Genomic_DNA.
DR RefSeq; XP_002670860.1; XM_002670814.1.
DR AlphaFoldDB; D2VYI4; -.
DR STRING; 5762.D2VYI4; -.
DR EnsemblProtists; EFC38116; EFC38116; NAEGRDRAFT_74132.
DR GeneID; 8858011; -.
DR VEuPathDB; AmoebaDB:NAEGRDRAFT_74132; -.
DR eggNOG; KOG1225; Eukaryota.
DR InParanoid; D2VYI4; -.
DR OrthoDB; 5475408at2759; -.
DR Proteomes; UP000006671; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.90.10.10; Bulb-type lectin domain; 2.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR001480; Bulb-type_lectin_dom.
DR InterPro; IPR036426; Bulb-type_lectin_dom_sf.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR013111; EGF_extracell.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF07974; EGF_2; 3.
DR SMART; SM00108; B_lectin; 1.
DR SMART; SM00181; EGF; 6.
DR SUPFAM; SSF51110; alpha-D-mannose-specific plant lectins; 2.
DR PROSITE; PS50927; BULB_LECTIN; 1.
DR PROSITE; PS00022; EGF_1; 4.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000006671};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..500
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003038469"
FT DOMAIN 72..185
FT /note="Bulb-type lectin"
FT /evidence="ECO:0000259|PROSITE:PS50927"
FT DOMAIN 201..238
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 364..396
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 228..237
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 386..395
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 500 AA; 53837 MW; 330E8B9B8C093920 CRC64;
MQHQAPSTLF LYFVFAFVVQ TCFHNHQQQL SVLAVNTCYG YSATDLNSCG GHGFCAGSNN
CYCVDSTSGN TLYSSGGTNN LYQSNSIYSS NGIYQLKQQV DGNLVLYKPG LVAYWASDTN
GQGTAPYNLV LSLDGTLEIR DQGGVLKTIS SACTSGTAPF RLVMQSDGNL VLYGAQKEVC
WTANQYFGNS LTYTWSGSQC NEYLCYGLYG SSACSGHGSC NARDSCSCYS GYSGSTCSNY
YCNSILYSQV GTVCNGRGTC TGPNTCSCQT GYSGTFCELY YCNGLAPSDP NVCSGFGTCS
SPNTCSNCQA GRYGSECQYY DCGGIRFDQP NVCSRVGSCD SKNNCTCPEL YFGDNCENYM
CNGIINNSSM VCSGHGSCSS PENCTCQEGY YGEDCELFEC SGILKNETNV CTGFGNCTAF
DTCQCDEQHT GKFCEINICN GVLSNNPNIC SSHGQCHSFN NCTCDENRIV KFMIVMVLLG
IPQLSVQDED HVQPIIIVVV
//