ID E1F9H9_GIAIA Unreviewed; 964 AA.
AC E1F9H9;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=High cysteine membrane EGF-like protein {ECO:0000313|EMBL:EFO60886.1};
GN ORFNames=GLP15_4020 {ECO:0000313|EMBL:EFO60886.1};
OS Giardia intestinalis (strain P15) (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=658858 {ECO:0000313|EMBL:EFO60886.1, ECO:0000313|Proteomes:UP000008974};
RN [1] {ECO:0000313|EMBL:EFO60886.1, ECO:0000313|Proteomes:UP000008974}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=P15 {ECO:0000313|EMBL:EFO60886.1,
RC ECO:0000313|Proteomes:UP000008974};
RX PubMed=20929575; DOI=10.1186/1471-2164-11-543;
RA Jerlstrom-Hultqvist J., Franzen O., Ankarklev J., Xu F., Nohynkova E.,
RA Andersson J.O., Svard S.G., Andersson B.;
RT "Genome analysis and comparative genomics of a Giardia intestinalis
RT assemblage E isolate.";
RL BMC Genomics 11:543-543(2010).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFO60886.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACVC01000566; EFO60886.1; -; Genomic_DNA.
DR AlphaFoldDB; E1F9H9; -.
DR EnsemblProtists; EFO60886; EFO60886; GLP15_4020.
DR VEuPathDB; GiardiaDB:GLP15_4020; -.
DR OMA; HTCQCPA; -.
DR OrthoDB; 5471657at2759; -.
DR Proteomes; UP000008974; Unassembled WGS sequence.
DR Gene3D; 2.90.20.10; Plasmodium vivax P25 domain; 1.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR013111; EGF_extracell.
DR InterPro; IPR002049; LE_dom.
DR Pfam; PF07974; EGF_2; 1.
DR SMART; SM00181; EGF; 9.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS50026; EGF_3; 1.
DR PROSITE; PS01248; EGF_LAM_1; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076}.
FT DOMAIN 573..608
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 598..607
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 964 AA; 101900 MW; 30B3A3B6812CD433 CRC64;
MWYDYCGGIG DCIKKDGQTY ACDCGPSATW DENLKTCVTS ACKLDKRLAG PEAPEYCAAS
SDSGLRCTVG RDATWQCECT GNYSNYNKTC ILAYKNASPA TQLARGLCGG PGAGYLNDYG
SCVCNSGFLK IGDMCYSYDC LPVGVTAATD ATRLSPNPHV CSGKGVCAYN QLTGRYGCEC
NGGLEAFGGY CTRPECAGKV MHNGELKYVE CKVYDGSVGS CTQTSDKSAY ICTCASSYES
VNGICVHSRC MLDGKYCNGD VLASCVKGDD SLYGCVCSEG YDLSEERNEY GNKAKCVPSK
CMYRASANDP AIECNELGTC SDSGSGSLLK NKQCTCNGEA KPHTLRDANG ELRDTCILDI
CITSKDGEVP VICGGSGRCG PRGCVCNLGT QLFENSCVGI NCFINTTDSN GKVTESVCGG
ENIGVCTKIS SHGDRRDYAC RCKQKVSAYR EVDGFCLPPS CIFTIEAPNA QATDTMCGGS
HFGTCVINTN QPENSYCNCK DRIDVVKITT GQCMKRDCVS NALPGTTYQS IECYGHGKCK
TSNSIDYACE CDPDYKTVKG VIGTYLCIPQ VCVVSETDAA MVCSGRGTCL VDEKRCNCHA
GYAGNQCGEC APDYKKHDNV CYPNSCPQDD NCSADSSSAG SCQLVNNRFL CVCADSSFVV
DSTTKKCRKS RCVWTDPYDN MEKTCYGMGT CNDNGEDTGK CSCNSGTDLV GTNICVYSQC
ISDGGDPKAI CKGRGTCVEA TAGKGICRCD SSKYRTDKKT GQCFAKGCFG AHESILSEVC
DGGGTCSEDT KRCNCNVDGF QSLDGQNGCV HSNCISSDKK LCSGFGACEK TGSTYGCLCA
SYYTLVDKDC IPTNCLNKTT VCNGGGSCTG TGASASCSCN QGWAPLNSLC YPSACVSDGA
LYGGNGDCQL SDGGSCTCRS GYETVSGKLC ISSQFLVYST LSKCTSLKNI SAPQRSNEAN
PERC
//