GenomeNet

Database: UniProt
Entry: A8BST4_GIAIC
ID   A8BST4_GIAIC            Unreviewed;       677 AA.
AC   A8BST4;
DT   13-NOV-2007, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2007, sequence version 1.
DT   27-MAR-2024, entry version 77.
DE   SubName: Full=High cysteine membrane protein {ECO:0000313|EMBL:KAE8305536.1};
DE   SubName: Full=High cysteine protein {ECO:0000313|EMBL:EDO77394.1};
GN   ORFNames=GL50803_0094003 {ECO:0000313|EMBL:KAE8305536.1},
GN   GL50803_94003 {ECO:0000313|EMBL:EDO77394.1};
OS   Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC   Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX   NCBI_TaxID=184922 {ECO:0000313|EMBL:EDO77394.1};
RN   [1] {ECO:0000313|EMBL:EDO77394.1, ECO:0000313|Proteomes:UP000001548}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548},
RC   and WB C6 {ECO:0000313|EMBL:EDO77394.1};
RX   PubMed=17901334; DOI=10.1126/science.1143837;
RA   Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA   Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA   Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA   Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA   Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA   Svard S.G., Sogin M.L.;
RT   "Genomic minimalism in the early diverging intestinal parasite Giardia
RT   lamblia.";
RL   Science 317:1921-1926(2007).
RN   [2] {ECO:0000313|EMBL:KAE8305536.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=WB C6 {ECO:0000313|EMBL:KAE8305536.1};
RA   Xu F., Jex A., Svard S.G.;
RT   "New Giardia intestinalis WB genome in near-complete chromosomes.";
RL   Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EDO77394.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACB02000041; EDO77394.1; -; Genomic_DNA.
DR   EMBL; AACB03000001; KAE8305536.1; -; Genomic_DNA.
DR   RefSeq; XP_001705068.1; XM_001705016.1.
DR   AlphaFoldDB; A8BST4; -.
DR   STRING; 184922.A8BST4; -.
DR   EnsemblProtists; EDO77394; EDO77394; GL50803_94003.
DR   GeneID; 5697941; -.
DR   KEGG; gla:GL50803_0094003; -.
DR   VEuPathDB; GiardiaDB:GL50803_94003; -.
DR   HOGENOM; CLU_406255_0_0_1; -.
DR   InParanoid; A8BST4; -.
DR   OMA; NYWWTYP; -.
DR   Proteomes; UP000001548; Chromosome 5.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   Gene3D; 2.10.25.10; Laminin; 3.
DR   Gene3D; 2.90.20.10; Plasmodium vivax P25 domain; 1.
DR   Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 1.
DR   InterPro; IPR000742; EGF-like_dom.
DR   PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00008; EGF; 1.
DR   SMART; SM00181; EGF; 12.
DR   SUPFAM; SSF57196; EGF/Laminin; 2.
DR   PROSITE; PS00022; EGF_1; 6.
DR   PROSITE; PS01186; EGF_2; 2.
DR   PROSITE; PS50026; EGF_3; 7.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001548};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..677
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5036279119"
FT   TRANSMEM        614..638
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          15..49
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          67..109
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          169..206
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          207..248
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          262..299
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          338..378
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          469..509
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DISULFID        39..48
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        99..108
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        196..205
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        238..247
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        289..298
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        368..377
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        480..497
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   677 AA;  71718 MW;  D331A0EADB9242FF CRC64;
     MLPNLPAWVL IAVVLASGCT SYKDCSSIGR CNDNGKCECP FGVQGSSCEI NRCGVSLKEG
     TDPTLLDDPE ANIPYTYCSS NGLCRRSASD TGSEFSCTCL SSFSGAICDV HVGQEDHENC
     RTFNGSHETV CNERGICTTG GCRCLTGFSG PKCENVYCGA QADYNYIDEY DNPKWILCNA
     GGTCVETEEG EYQCQCRAHF TGVFCQEFNC VNDSSCLNGG TCFFTTSPYI SSTSHCKCLE
     GFSGLDCGLN SCGVEQDYQT GDIRLCSGSG VCIVGSKLEN EMPIYQCECR VGHYGSACET
     FNCDISPNAC NGGVCVIETT GDMVCKLCPK FFYGNDCAVN PCGIDNAGVA CSGYGTCVED
     GESAKCNCRP GRMGDVCSLR DCLTPGNECL NGGVCIDASN ALQLGLLKQA EYDSISLLTH
     RKGRDSRSAG STVCMCPKGY TGEFCTECDT MTELATQVFS NGTKASVCAH ESCFYNGLVC
     SNRGTCSYDE SSDSFGCVCE AGYLLLGAQC WHPSCPIEVR GERNLPCGIG GQCLETSLGS
     DEWQCSCANT YRLDSSTGLC WPSACFGASS KSEKVCGGGG VCDLSTYTGT HVCNCKDGFK
     KSSDGITCVP SSSVIALAVI VPILLVLTVG GLCIYFLACR GKRREPAKQA SSKGTSYIRM
     NDGDAIRTIM GGGHQTL
//