ID A8BPT8_GIAIC Unreviewed; 843 AA.
AC A8BPT8;
DT 13-NOV-2007, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2007, sequence version 1.
DT 24-JAN-2024, entry version 67.
DE SubName: Full=Ankyrin repeat protein 1 {ECO:0000313|EMBL:KAE8304351.1};
DE SubName: Full=Protein 21.1 {ECO:0000313|EMBL:EDO77997.1};
GN ORFNames=GL50803_0088369 {ECO:0000313|EMBL:KAE8304351.1},
GN GL50803_88369 {ECO:0000313|EMBL:EDO77997.1};
OS Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=184922 {ECO:0000313|EMBL:EDO77997.1};
RN [1] {ECO:0000313|EMBL:EDO77997.1, ECO:0000313|Proteomes:UP000001548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548},
RC and WB C6 {ECO:0000313|EMBL:EDO77997.1};
RX PubMed=17901334; DOI=10.1126/science.1143837;
RA Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA Svard S.G., Sogin M.L.;
RT "Genomic minimalism in the early diverging intestinal parasite Giardia
RT lamblia.";
RL Science 317:1921-1926(2007).
RN [2] {ECO:0000313|EMBL:KAE8304351.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=WB C6 {ECO:0000313|EMBL:KAE8304351.1};
RA Xu F., Jex A., Svard S.G.;
RT "New Giardia intestinalis WB genome in near-complete chromosomes.";
RL Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO77997.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACB02000031; EDO77997.1; -; Genomic_DNA.
DR EMBL; AACB03000002; KAE8304351.1; -; Genomic_DNA.
DR RefSeq; XP_001705671.1; XM_001705619.1.
DR AlphaFoldDB; A8BPT8; -.
DR STRING; 184922.A8BPT8; -.
DR EnsemblProtists; EDO77997; EDO77997; GL50803_88369.
DR GeneID; 5698556; -.
DR KEGG; gla:GL50803_0088369; -.
DR VEuPathDB; GiardiaDB:GL50803_88369; -.
DR HOGENOM; CLU_337855_0_0_1; -.
DR InParanoid; A8BPT8; -.
DR OMA; TLACTEP; -.
DR Proteomes; UP000001548; Chromosome 4.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR InterPro; IPR047162; ANKS3.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR PANTHER; PTHR24184:SF11; ANK_REP_REGION DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24184; SI:CH211-189E2.2; 1.
DR Pfam; PF12796; Ank_2; 2.
DR SMART; SM00248; ANK; 7.
DR SUPFAM; SSF48403; Ankyrin repeat; 2.
DR PROSITE; PS50297; ANK_REP_REGION; 1.
DR PROSITE; PS50088; ANK_REPEAT; 2.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Reference proteome {ECO:0000313|Proteomes:UP000001548}.
FT REPEAT 37..69
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 68..100
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
SQ SEQUENCE 843 AA; 92039 MW; 9090B3BD11247B01 CRC64;
MESAGLLIDT WFKCAFVGDK DRMSSLVDKC AGRRNDSGET ALMVATRGNQ ADVVSFLLQR
EATLLNNDGE CAMHMAVKHG YLEIVQLLMP FEGHIYTREG NTSLMLAINA ANTTLCSYLL
RYPTQWYDKR GFSVLAHAVS LGNHALAAMI GHEIPIPKDV LTACTEIALR RGDTVMCSTL
KEIAAYQAEH VDSFFQPGNS SMSVSNSVVH TPSSNVKVIP LSPADVVAPR TRSTTGPCID
ATSIVKDEVD ASVLSANYVS KKPEYKKNEI RPMNESRHLG HPTFPPTLAC TEPKKISIDE
YLEYSQSISR SKAVVDRSEY STSSAQDFDH LTISCKASTA HEQEIQRAVD VIRDAHDLVV
DALTKVRESP DNHSRAANPL DEELGLDADQ PTIFAATVKK LSAANTILEQ STNILQQSLQ
LDRESVALSQ AIETYKDKTV DEMNSLICSA VEFTEEKVKV AEEHLGRSSS VVADRELLDS
RSVDVEGIET RLVEQAKDQQ YAVPYMPALV SSDTQLVASS EPLVHRDGAD KSLILANSNT
NTYITPLPTT TTKKAEDLRE VYDSNIAAND AGDVPPHADP HGILSTDPIF ESIQRRSAEE
YCPPNVPTTE RESRGGHTDF STQSFNGMGK ELTSMILSGQ FSHGTVEKHV LQAGIQRPSV
NGKILESSYK RVIPEYTGNY SFTSQMVNCA TGRFTELMDA VVRNDIICVK AMLPYQGCLQ
DENGTTALML AAERGLDALV CVLVGTEAGM TDIYGETALM RAAKNNRPST CELLLGEAGL
RTSEGHPNKE GLTALCFACM YGSIDCVRVL LGKEAKLGYR PTISPDALSS SAEIRHLVQL
YSE
//