ID V6U2N3_GIAIN Unreviewed; 1183 AA.
AC V6U2N3;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Ankyrin repeat protein {ECO:0000313|EMBL:ESU43515.1};
GN ORFNames=GSB_151929 {ECO:0000313|EMBL:ESU43515.1};
OS Giardia intestinalis (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=5741 {ECO:0000313|EMBL:ESU43515.1, ECO:0000313|Proteomes:UP000018040};
RN [1] {ECO:0000313|Proteomes:UP000018040}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=GS {ECO:0000313|Proteomes:UP000018040};
RA Adam R., Dahlstrom E., Martens C., Bruno D., Barbian K., Porcella S.F.,
RA Nash T.;
RT "Genome sequencing of Giardia lamblia Genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of Genotypes A1 and E (WB and
RT Pig).";
RL Submitted (FEB-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESU43515.1, ECO:0000313|Proteomes:UP000018040}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GS {ECO:0000313|EMBL:ESU43515.1,
RC ECO:0000313|Proteomes:UP000018040};
RX PubMed=24307482; DOI=10.1093/gbe/evt197;
RA Adam R.D., Dahlstrom E.W., Martens C.A., Bruno D.P., Barbian K.D.,
RA Ricklefs S.M., Hernandez M.M., Narla N.P., Patel R.B., Porcella S.F.,
RA Nash T.E.;
RT "Genome sequencing of Giardia lamblia genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of genotypes A1 and E (WB and
RT Pig).";
RL Genome Biol. Evol. 5:2498-2511(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESU43515.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHHH01000046; ESU43515.1; -; Genomic_DNA.
DR AlphaFoldDB; V6U2N3; -.
DR EnsemblProtists; ESU43515; ESU43515; GSB_151929.
DR VEuPathDB; GiardiaDB:DHA2_151310; -.
DR VEuPathDB; GiardiaDB:GL50581_4396; -.
DR VEuPathDB; GiardiaDB:GL50803_0016736; -.
DR VEuPathDB; GiardiaDB:QR46_0303; -.
DR Proteomes; UP000018040; Unassembled WGS sequence.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR PANTHER; PTHR24120:SF4; ANKYRIN REPEAT AND SOCS BOX PROTEIN 12; 1.
DR PANTHER; PTHR24120; GH07239P; 1.
DR Pfam; PF12796; Ank_2; 2.
DR SMART; SM00248; ANK; 7.
DR SUPFAM; SSF48403; Ankyrin repeat; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils}.
FT REGION 256..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 935..962
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 369..396
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 862..889
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 256..304
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1183 AA; 130777 MW; B371474E5AFA8A3C CRC64;
MSGDDKLRKW FRAIQSGDLN EVERLLPECA GLCGDFGETG LQYAVQMKNP TLVQLLIDYE
VGIANNEGCT ALIMAALNNS ADLCSILAPL EADIILPDRR DAYMIAAQVG SYDALVALRS
YFSLVTDSHD LNALDYASMT GNLRCVDEIV THYKPSEQER VYSRALAENE GHHAVVQYFD
TGNLDHQACE SSRLTDGPAL TTEFTMQSSM LPAQGVQSPR IPLNNLTYSN VTVERQKSIS
DSAIDADFDE TIADLVDEVP KRDAPEQSPT RRKATKDKGS IKAPKAEKKQ KQKHIDDSKL
VHTSPLDIHK FSNGSPDGFS SLQSQIASKD SYPYELLRTN QRSKSGMCGS RVGRKGGLTQ
SSSAILPLLE RKDTEIRNLQ SQLEALTLRR STLMTDSFAM VDACTSCCAE GTSFCEIVFD
SLLNSKDEFI ERLLDLCIPL FPSSATKRID VCVGTDFAST LLFNEAPAQG AIQNNAACEK
VEEYSYESLL KAKDLEIDRL QKILFDIDIK LQPQSKREAK LCRELEAKIL KEVELIGIID
EMTQEIKGLK DKVVLHKKVL ADIQQQGFIS DRRTERDIRL EDLFDQTLRS CDVVREIKQQ
MIAFKAEFSG YIAQTELLAR TLFNEGGAND LNKSLKRSAS CSKINPLNGT GRIHVPNPPN
PMSVTLHPPT RSSSFATLQP LKPPLKPTDY SGLLQNKDSC IDRLVALLYV SPNTLYQQPA
LFSHPNTIEE KEVARERHEE ILAQKDQIIN DLRRELETLF KSISAETVNY KKESLTHVTK
PPDMVIGDLL SGSDGEEDST EDQKYYMEAL AKLELEHAAM QMEIDTMAKT IAERDNAIMF
FADHYTKKTN KSAIATETAK VITALELEIT ALKSQLQATK QEITLLRNEA KAAPRSRSPT
VIGIASARSS EDAPAALGQT ADPLCPIIIP SAPPRRSSLA QQLPQSDESS KDIRASSPSR
TASLLRQTIT RAKSRTASRT MQLKSQYAER TGLTKLMQAV IDGDARAIGM HLSLLCLVTD
DGTSALMLAA IHNRLFAIEY LIHGEAGLVD NNGKTALIHA LEKGHIRIAE ILTPYECPDV
TNVDITHTGS RTTELMTAVL QGDLARAWAL LPMQHGIRDK NGKTALILAI ELRRPAFVRI
LLPLEHTLCL EDGSSPLDAI MSLKGSDTSI CEIQRIAAEY FGF
//