ID V6U1B0_GIAIN Unreviewed; 1044 AA.
AC V6U1B0;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Ankyrin repeat protein {ECO:0000313|EMBL:ESU44634.1};
GN ORFNames=GSB_150990 {ECO:0000313|EMBL:ESU44634.1};
OS Giardia intestinalis (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=5741 {ECO:0000313|EMBL:ESU44634.1, ECO:0000313|Proteomes:UP000018040};
RN [1] {ECO:0000313|Proteomes:UP000018040}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=GS {ECO:0000313|Proteomes:UP000018040};
RA Adam R., Dahlstrom E., Martens C., Bruno D., Barbian K., Porcella S.F.,
RA Nash T.;
RT "Genome sequencing of Giardia lamblia Genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of Genotypes A1 and E (WB and
RT Pig).";
RL Submitted (FEB-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESU44634.1, ECO:0000313|Proteomes:UP000018040}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GS {ECO:0000313|EMBL:ESU44634.1,
RC ECO:0000313|Proteomes:UP000018040};
RX PubMed=24307482; DOI=10.1093/gbe/evt197;
RA Adam R.D., Dahlstrom E.W., Martens C.A., Bruno D.P., Barbian K.D.,
RA Ricklefs S.M., Hernandez M.M., Narla N.P., Patel R.B., Porcella S.F.,
RA Nash T.E.;
RT "Genome sequencing of Giardia lamblia genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of genotypes A1 and E (WB and
RT Pig).";
RL Genome Biol. Evol. 5:2498-2511(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESU44634.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHHH01000019; ESU44634.1; -; Genomic_DNA.
DR AlphaFoldDB; V6U1B0; -.
DR EnsemblProtists; ESU44634; ESU44634; GSB_150990.
DR VEuPathDB; GiardiaDB:DHA2_151294; -.
DR VEuPathDB; GiardiaDB:GL50581_4368; -.
DR VEuPathDB; GiardiaDB:GL50803_0017117; -.
DR VEuPathDB; GiardiaDB:QR46_0274; -.
DR Proteomes; UP000018040; Unassembled WGS sequence.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 3.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR PANTHER; PTHR24184:SF11; ANK_REP_REGION DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24184; SI:CH211-189E2.2; 1.
DR Pfam; PF12796; Ank_2; 4.
DR SMART; SM00248; ANK; 8.
DR SUPFAM; SSF48403; Ankyrin repeat; 2.
DR PROSITE; PS50297; ANK_REP_REGION; 1.
DR PROSITE; PS50088; ANK_REPEAT; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 985..1005
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REGION 798..823
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 231..272
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 807..823
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1044 AA; 114576 MW; 6D0A4D5D15B59E30 CRC64;
MQISSVDDWF TAIERGDWAT VEPNVDVFHG SRNMMHETGL MCAARKDLVD VVKVLIEHEA
KLRDHNGYTA LMFACEHDAA GAAKLLLSAE KHIFLFDGRT ALHIACESNA NACVELLAKD
LGAVRDKHGR SSLFSAAEVG NTTAISMLLL VTSFSKKEIQ QAKSILTSLG IPSDELLFDD
HTALEPDGCV DISDPHQLQC NQLLSAQHIS EGVAPDEGSN MGDVVEIVDP VVQLKAELKR
KAAVIEQLND EVEAMRKQLD DARQLVQQKN AEERTQNIAG DVSNSMSLYK FLSLMPAVTR
ISPISELSNV LVAVPTSSLP QKLVTTDVMW TADVQHLLDA KARELSMLYD VLTILSGLFL
EVDENLAKDL LLLCHSSRTT STTHSLYDGN ARLDGVLNCL PDTERFAESL ASLPPPLLKG
IKEKTKAIGK ISSILMKRVQ SALWREQELA AQVEYLEEER SGLLDKLVEA RAVGSAGHFG
AKVELTADEA IQASGDSGLL DGVRALLIDH KVLLPCSQWD ELHLEHSTTT LKGTASTNTS
MIGVDQSMSG TTFLTCDPGA TSFTNSYTTA ARQTVQKSYA AYVRNLEMNI AKVTAQLAIA
HHKLQYNDST VDPNLLSVTF GRPGMSGQRQ KVATSDRATS PVAEFEYKGL MLENRRLNSI
IDEMKLTLNE TPLESNETET EYFERQLRRL SSLPQSQVTG KGILVGPQTI DYLRKKLLSD
MQPVQRTHTH GVPFIGSMNP DTARLHTSYT KLAQPVDAIL DDLLNSNRPL LEASTKAELA
RKDYLHLTAG ILPDTASTEH PSISFDDDHS PSTTMSGGME TSKRSATLTP LMVAIYSKSL
PDVERYLSYA GQARLDGTTA LMLAAELGFT EAVKVLKNRE SRFVRDDGKT ALKIAKDAGY
NEIVALLSQR SDDIHDSFTE ISDTCLDLHE AVRNFQIDET RRLAISLAKT RDHKGRTALM
VAASIGNSAA VETLVHLEGG LQDVHGTTAL MLAAEQGHLD CVRLLSPHEK EIHDNLGYDA
LFYMARSKVC IPPEILRKMQ EYLL
//