ID A8BST4_GIAIC Unreviewed; 677 AA.
AC A8BST4;
DT 13-NOV-2007, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2007, sequence version 1.
DT 27-MAR-2024, entry version 77.
DE SubName: Full=High cysteine membrane protein {ECO:0000313|EMBL:KAE8305536.1};
DE SubName: Full=High cysteine protein {ECO:0000313|EMBL:EDO77394.1};
GN ORFNames=GL50803_0094003 {ECO:0000313|EMBL:KAE8305536.1},
GN GL50803_94003 {ECO:0000313|EMBL:EDO77394.1};
OS Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=184922 {ECO:0000313|EMBL:EDO77394.1};
RN [1] {ECO:0000313|EMBL:EDO77394.1, ECO:0000313|Proteomes:UP000001548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548},
RC and WB C6 {ECO:0000313|EMBL:EDO77394.1};
RX PubMed=17901334; DOI=10.1126/science.1143837;
RA Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA Svard S.G., Sogin M.L.;
RT "Genomic minimalism in the early diverging intestinal parasite Giardia
RT lamblia.";
RL Science 317:1921-1926(2007).
RN [2] {ECO:0000313|EMBL:KAE8305536.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=WB C6 {ECO:0000313|EMBL:KAE8305536.1};
RA Xu F., Jex A., Svard S.G.;
RT "New Giardia intestinalis WB genome in near-complete chromosomes.";
RL Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO77394.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACB02000041; EDO77394.1; -; Genomic_DNA.
DR EMBL; AACB03000001; KAE8305536.1; -; Genomic_DNA.
DR RefSeq; XP_001705068.1; XM_001705016.1.
DR AlphaFoldDB; A8BST4; -.
DR STRING; 184922.A8BST4; -.
DR EnsemblProtists; EDO77394; EDO77394; GL50803_94003.
DR GeneID; 5697941; -.
DR KEGG; gla:GL50803_0094003; -.
DR VEuPathDB; GiardiaDB:GL50803_94003; -.
DR HOGENOM; CLU_406255_0_0_1; -.
DR InParanoid; A8BST4; -.
DR OMA; NYWWTYP; -.
DR Proteomes; UP000001548; Chromosome 5.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 2.90.20.10; Plasmodium vivax P25 domain; 1.
DR Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 1.
DR InterPro; IPR000742; EGF-like_dom.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00008; EGF; 1.
DR SMART; SM00181; EGF; 12.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR PROSITE; PS00022; EGF_1; 6.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 7.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000001548};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..677
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5036279119"
FT TRANSMEM 614..638
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 15..49
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 67..109
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 169..206
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 207..248
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 262..299
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 338..378
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 469..509
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 39..48
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 99..108
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 196..205
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 238..247
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 289..298
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 368..377
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 480..497
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 677 AA; 71718 MW; D331A0EADB9242FF CRC64;
MLPNLPAWVL IAVVLASGCT SYKDCSSIGR CNDNGKCECP FGVQGSSCEI NRCGVSLKEG
TDPTLLDDPE ANIPYTYCSS NGLCRRSASD TGSEFSCTCL SSFSGAICDV HVGQEDHENC
RTFNGSHETV CNERGICTTG GCRCLTGFSG PKCENVYCGA QADYNYIDEY DNPKWILCNA
GGTCVETEEG EYQCQCRAHF TGVFCQEFNC VNDSSCLNGG TCFFTTSPYI SSTSHCKCLE
GFSGLDCGLN SCGVEQDYQT GDIRLCSGSG VCIVGSKLEN EMPIYQCECR VGHYGSACET
FNCDISPNAC NGGVCVIETT GDMVCKLCPK FFYGNDCAVN PCGIDNAGVA CSGYGTCVED
GESAKCNCRP GRMGDVCSLR DCLTPGNECL NGGVCIDASN ALQLGLLKQA EYDSISLLTH
RKGRDSRSAG STVCMCPKGY TGEFCTECDT MTELATQVFS NGTKASVCAH ESCFYNGLVC
SNRGTCSYDE SSDSFGCVCE AGYLLLGAQC WHPSCPIEVR GERNLPCGIG GQCLETSLGS
DEWQCSCANT YRLDSSTGLC WPSACFGASS KSEKVCGGGG VCDLSTYTGT HVCNCKDGFK
KSSDGITCVP SSSVIALAVI VPILLVLTVG GLCIYFLACR GKRREPAKQA SSKGTSYIRM
NDGDAIRTIM GGGHQTL
//