ID T1FPJ4_HELRO Unreviewed; 1736 AA.
AC T1FPJ4;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=K Homology domain-containing protein {ECO:0000259|SMART:SM00322};
GN Name=20210741 {ECO:0000313|EnsemblMetazoa:HelroP188005};
GN ORFNames=HELRODRAFT_188005 {ECO:0000313|EMBL:ESO12839.1};
OS Helobdella robusta (Californian leech).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Clitellata;
OC Hirudinea; Rhynchobdellida; Glossiphoniidae; Helobdella.
OX NCBI_TaxID=6412 {ECO:0000313|EnsemblMetazoa:HelroP188005, ECO:0000313|Proteomes:UP000015101};
RN [1] {ECO:0000313|Proteomes:UP000015101}
RP NUCLEOTIDE SEQUENCE.
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESO12839.1, ECO:0000313|Proteomes:UP000015101}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:HelroP188005}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQM01000249; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB095811; ESO12839.1; -; Genomic_DNA.
DR RefSeq; XP_009009559.1; XM_009011311.1.
DR STRING; 6412.T1FPJ4; -.
DR EnsemblMetazoa; HelroT188005; HelroP188005; HelroG188005.
DR GeneID; 20210741; -.
DR KEGG; hro:HELRODRAFT_188005; -.
DR CTD; 20210741; -.
DR eggNOG; KOG4369; Eukaryota.
DR HOGENOM; CLU_239829_0_0_1; -.
DR InParanoid; T1FPJ4; -.
DR OrthoDB; 5480610at2759; -.
DR Proteomes; UP000015101; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0045087; P:innate immune response; IBA:GO_Central.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 1.
DR Gene3D; 3.30.1370.10; K Homology domain, type 1; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR004087; KH_dom.
DR InterPro; IPR004088; KH_dom_type_1.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR PANTHER; PTHR23206:SF7; ANKYRIN REPEAT AND KH DOMAIN-CONTAINING PROTEIN 1 ISOFORM X1; 1.
DR PANTHER; PTHR23206; MASK PROTEIN; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF00013; KH_1; 1.
DR SMART; SM00248; ANK; 3.
DR SMART; SM00322; KH; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 1.
DR PROSITE; PS50297; ANK_REP_REGION; 2.
DR PROSITE; PS50088; ANK_REPEAT; 2.
DR PROSITE; PS50084; KH_TYPE_1; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000015101};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00117}.
FT REPEAT 305..337
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 338..370
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 786..855
FT /note="K Homology"
FT /evidence="ECO:0000259|SMART:SM00322"
FT REGION 33..56
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 459..598
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1195..1215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1663..1689
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 39..55
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 482..518
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 522..541
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 546..563
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..593
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1670..1689
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1736 AA; 188760 MW; 7C2395558547F323 CRC64;
MFGDLHWKVK VYKDLKFTKF NGKIVKLDQN ITNYTSSSDN SDDDDDEEED EDYEETECSV
KHGMKALYVK QYRNLQPCSL VSAGHPNQKW KHEHKSGGML ALKKQRLKHQ KKPFSSSHQE
LRFNDGHHLK HFITSSLSSA NIPKTKADKK HSRSNTIATS IISASSSLSS SSVPLCGHNH
LDHDEETCGM CGCLKAMKTM EEVDRSSQLE CQTCPPISTA GRQHPGDDKC CRLHHHQTVC
SLHRSIKTMN NNSLQQFSQH SLQQHHQLTS DSSNGCNLAN CVDQCGSNSD CEPIKSQLEA
QTVGNLETAL TLSCSRGHVE LVCLLLSKGS NIEHRDKKGF TPLLHAANFG HSKIVEILVG
NGADVEAETE RTKDTALSLA CASSEKVVKY LVSQVSQLPS DAECGKLLKS FSEIDKPELK
NSCMKCLEII KKAKEAQAMK AYENATILLN ELQQEKKLEA KKRAAREKKR EKRKQKKKDR
FASGSTTPNS SNLNNNGNTA RDSSVDVKTA NSSASPGVSD EDKPEPSSRL TASDEDVAGK
ENNPKCLIAS KSSATTNSSV DTNTIDNKKR KSNRIDRGHQ SSDVKESTPV PNHKDQVKVR
IASNTATSGL PLQESKVQLN QCPEKSRKDS IPLLKTKLSG IGDLDDFALL PSTLHTNIVA
APVLLASLGT QKMKNDSKSA KLPLDNLKET SYTHKSGVLA TTKVTPAGIS TTTLVATAQK
PLLSTIPSQP APVHVTYKPA LALNNVSTNA KSFVKKKITP GSSIVPSSNS VLLKKDDSFL
LNKDPQRKTR KVDVPARAVS RVIGRAGCNI NAIREFSGAK IDLDKLKTCD DAVVTIRGTN
ESVNKAGELI LALVRESDKD IDQLITAFKQ QQLSGGVSLL GSSSSSNSVE VSTTNVWKTG
NSLLMKNSIT IPSCVSIAST VRESVSLIPP NNVWESRKMQ SSLNNEASKT TNNAWLKSTN
NKETNSNYGG SRPTVISTLN MSGSNVWDNS MADGIGKAPG SGLSVPEDKP EPITTFPIGA
WKSVSKSSDI RSTNSFAHLN DSVPNVQSPS NHPSNAKEDF SWVKAQDAPT TTFSETIIRT
TAAVSMTQST LMPAVSVANT SSSSTSDSTS LSSTSSSVQQ FLSYQANPQS PLLHSYSGKL
PSSIAPISAV QEPAAEYSPF GDYGSSFLNN SVVTTSSSSI VSSNLIASHS DVTIDKSKAP
GYRPPSFQNR SQSPTREQLP VFQNNVNVPG FVSACNSLNS DKYSDSNRYV PNDNESVPSS
SAYHYLLPSE QQHSIKTTNV ESATISVAAA IDNNSSYQSI ITPMVNSTLN PNAPKFNSKP
TSISPQMFYK SNLPSMYSSG MKYNLQPHLD PSNVPKMHQQ QQQNIPINSS NDLQMFPNVN
PGYSNIDGSN LSYQGSHQTS MPVEYSQRRF SFSEGIAASN SNKSVVKSNE VNTTSSIYQS
PSIHQHSFTP YTTTTLSYQQ QRQQSYSVDH YMRRFPGQLQ PIGTERNSKK QLVFSAISTS
ASTLSSSSAS PLSLASVSIS SPLSSLVTAV QPSVTTIIPL VPKSNLMVAG PVGPTVADHL
ANNMVAPPWN YHPGMSRSST NTAQQLNNNT AMLNIPSSTY LYNNVYNNNV GSSAAAAAAA
AAASTRPAQV FNHHGPAYSP NLNNLPSNQY VIPNGSTLPS AYHGRLTPQL HPQQHYQQQQ
QLQQPQQQQN SFFYHHSSSY PRFPHEMEGW QMGVHENHGD DLQAWNLWNS VDNSRK
//