ID A8BY44_GIAIC Unreviewed; 619 AA.
AC A8BY44;
DT 13-NOV-2007, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2007, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=Ankyrin repeat protein 1 {ECO:0000313|EMBL:KAE8301298.1};
DE SubName: Full=Protein 21.1 {ECO:0000313|EMBL:EDO76537.1};
GN ORFNames=GL50803_0014745 {ECO:0000313|EMBL:KAE8301298.1},
GN GL50803_14745 {ECO:0000313|EMBL:EDO76537.1};
OS Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=184922 {ECO:0000313|EMBL:EDO76537.1};
RN [1] {ECO:0000313|EMBL:EDO76537.1, ECO:0000313|Proteomes:UP000001548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548},
RC and WB C6 {ECO:0000313|EMBL:EDO76537.1};
RX PubMed=17901334; DOI=10.1126/science.1143837;
RA Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA Svard S.G., Sogin M.L.;
RT "Genomic minimalism in the early diverging intestinal parasite Giardia
RT lamblia.";
RL Science 317:1921-1926(2007).
RN [2] {ECO:0000313|EMBL:KAE8301298.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=WB C6 {ECO:0000313|EMBL:KAE8301298.1};
RA Xu F., Jex A., Svard S.G.;
RT "New Giardia intestinalis WB genome in near-complete chromosomes.";
RL Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO76537.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACB02000063; EDO76537.1; -; Genomic_DNA.
DR EMBL; AACB03000005; KAE8301298.1; -; Genomic_DNA.
DR RefSeq; XP_001704211.1; XM_001704159.1.
DR AlphaFoldDB; A8BY44; -.
DR SMR; A8BY44; -.
DR STRING; 184922.A8BY44; -.
DR EnsemblProtists; EDO76537; EDO76537; GL50803_14745.
DR GeneID; 5697081; -.
DR KEGG; gla:GL50803_0014745; -.
DR VEuPathDB; GiardiaDB:GL50803_14745; -.
DR HOGENOM; CLU_441762_0_0_1; -.
DR InParanoid; A8BY44; -.
DR OMA; KSIRMKM; -.
DR Proteomes; UP000001548; Chromosome 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR PANTHER; PTHR24184:SF11; ANK_REP_REGION DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24184; SI:CH211-189E2.2; 1.
DR Pfam; PF12796; Ank_2; 2.
DR SMART; SM00248; ANK; 4.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000001548};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REGION 496..563
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 312..393
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 535..554
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 619 AA; 67582 MW; 26DFC7A9567960E8 CRC64;
MIANVQDWFV AIASGNEDDV REFMSEYVGS RDETGDTALI IAARMGEAGI VRLLSTTQEV
GLINKEGCTA LIAAAMSNKP ETCEILVTLE KHIPLRDGRD ALMLAAFMAN YEAVSVLVRH
MALVEDENQM NALDYAVVGG SLNVVRAIVE AQDSIDDKLE YAIFLATEPI REDILEYLKQ
FKGASPSDHR QADASARSPV GTQSLSKLEE ELRQVLEERD TIADELTTLN LHLGALFTSL
GALRKKRLAI STTGSTFSST LTTGNNSTLL AAQPQITSEQ SQLQDITTLP EAINELELLL
TVPFGAGSAD IYEEIEDQIK QLSRAVAELQ DLNAAKDEEI AELKRALECD IDGNVTEAIA
QKDAQIEALQ AKVTDQEAKL DSYALELEAA KRSENSVPAL KAELKLKDKV ISGLLSTIQK
QEKSFSAIRS AFQQKEVETQ KYRTAIGCSE STVKNLITAN DYERARFSAR NSLVINSPVR
LSGAWGSMLS DGATPAMDRP AMAPASIRQA SAAGEPPEEA AKEPNPASIA SISRLCAIRS
PTSGQRSSTR VRSTRMTMSE EERAELRKSI RMKMTENKML KEALINASPS DTEARVLREE
IDRLKDLLRA SSPTGEKRD
//