ID A8BHF4_GIAIC Unreviewed; 870 AA.
AC A8BHF4;
DT 13-NOV-2007, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2007, sequence version 1.
DT 13-SEP-2023, entry version 54.
DE SubName: Full=KH domain-containing protein {ECO:0000313|EMBL:KAE8301651.1};
GN ORFNames=GL50803_009485 {ECO:0000313|EMBL:KAE8301651.1}, GL50803_9485
GN {ECO:0000313|EMBL:EDO79421.1};
OS Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=184922 {ECO:0000313|EMBL:EDO79421.1};
RN [1] {ECO:0000313|EMBL:EDO79421.1, ECO:0000313|Proteomes:UP000001548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548},
RC and WB C6 {ECO:0000313|EMBL:EDO79421.1};
RX PubMed=17901334; DOI=10.1126/science.1143837;
RA Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA Svard S.G., Sogin M.L.;
RT "Genomic minimalism in the early diverging intestinal parasite Giardia
RT lamblia.";
RL Science 317:1921-1926(2007).
RN [2] {ECO:0000313|EMBL:KAE8301651.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=WB C6 {ECO:0000313|EMBL:KAE8301651.1};
RA Xu F., Jex A., Svard S.G.;
RT "New Giardia intestinalis WB genome in near-complete chromosomes.";
RL Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO79421.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACB02000017; EDO79421.1; -; Genomic_DNA.
DR EMBL; AACB03000005; KAE8301651.1; -; Genomic_DNA.
DR RefSeq; XP_001707095.1; XM_001707043.1.
DR AlphaFoldDB; A8BHF4; -.
DR EnsemblProtists; EDO79421; EDO79421; GL50803_9485.
DR GeneID; 5699990; -.
DR KEGG; gla:GL50803_009485; -.
DR VEuPathDB; GiardiaDB:GL50803_9485; -.
DR HOGENOM; CLU_329953_0_0_1; -.
DR InParanoid; A8BHF4; -.
DR OMA; FVPVMET; -.
DR Proteomes; UP000001548; Chromosome 1.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0010468; P:regulation of gene expression; IBA:GO_Central.
DR CDD; cd00105; KH-I; 1.
DR Gene3D; 3.30.1370.10; K Homology domain, type 1; 1.
DR InterPro; IPR004087; KH_dom.
DR InterPro; IPR004088; KH_dom_type_1.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR Pfam; PF00013; KH_1; 1.
DR SMART; SM00322; KH; 2.
DR SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 2.
DR PROSITE; PS50084; KH_TYPE_1; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001548};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00117}.
FT DOMAIN 20..90
FT /note="K Homology"
FT /evidence="ECO:0000259|SMART:SM00322"
FT DOMAIN 178..256
FT /note="K Homology"
FT /evidence="ECO:0000259|SMART:SM00322"
SQ SEQUENCE 870 AA; 95820 MW; CBF0A611A73E0C11 CRC64;
MPGDPSKTYV QRWIQDALPP LQTEKIAIEV DNLGYMVGTK GSRIGLIRQN TGAAIHYMED
PPGSKRFYAV IKGTTKQLAD AKAAIYADVL DYHRWVKRGK GSLSDALGNR MYPYSLVLRL
PPGGSRFYSG PGYQSFLELS IRKDIFINVS QQDELTVRAA STEALMEAVQ KVTDILNTFV
SLAAPIPNWT VGRVIGRNGE ALAKVSDIIQ IRTNSRLICF ISNTHKAEGL VSYFVVLADK
RDAVTLAMQM VFSRISSITI EAGLFVPVME TENVIMLPID STSLCSYGLK APLAKLQSSS
GSHSRKYSSI SMDGSFSLPP EMLSLLPTSS PPTHARHFST VEGDVVPCHA VQTTTAHRRL
NSSIPYSVSN STQSDGQQSF LFQLSFPNSS VSNTTDQALK CNFCCAKRRS LLPCSDFLRT
PYTVESLRES TVAVDRRYLV MHVPEHCDNM QEFWQSSLLA ALATVGREFF FIEDPTTAVD
FVKSAFDFTV SCSPGRCYYA SRTDELFDTI YDISEGLSSR RVRLCFDSGM DDCDYTKLFA
SSKTHYAWYT KQLMDVSISD ALPNGLAQHL SAISGVGGMD KHYSMWHSLN FDCYVQPPGR
KSALEHDFSM SLSDEDIPQL QDAADEPAVS VSGLGISYIL QSKTGYDILV KITLVPKSLT
PLVSATDETD VDALCRGDAR LILLRKYIKN LKTISSNTAY TSPITTTKCY EVQSEPSQFG
EALPVKTHVI TLNLPDVPLD VATHIDPTPL SLEDLATGVG YSTAKGLLPP SQDSKFPMPS
TVLIQDHPNA TLNRVVFAGL GDIVATSSYL QELPHVAKPK LVVDFRPFAG YRWRRKGLIN
KVVSFTGLVS ILPSIVSRVK EVLKTLPDQE
//