ID A0A0D2WKR3_CAPO3 Unreviewed; 840 AA.
AC A0A0D2WKR3;
DT 29-APR-2015, integrated into UniProtKB/TrEMBL.
DT 29-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=PUM-HD domain-containing protein {ECO:0000259|PROSITE:PS50303};
GN ORFNames=CAOG_002161 {ECO:0000313|EMBL:KJE90935.1};
OS Capsaspora owczarzaki (strain ATCC 30864).
OC Eukaryota; Filasterea; Capsaspora.
OX NCBI_TaxID=595528 {ECO:0000313|EMBL:KJE90935.1, ECO:0000313|Proteomes:UP000008743};
RN [1] {ECO:0000313|Proteomes:UP000008743}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=ATCC 30864 {ECO:0000313|Proteomes:UP000008743};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Capsaspora owczarzaki ATCC 30864.";
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KE346362; KJE90935.1; -; Genomic_DNA.
DR RefSeq; XP_004348911.2; XM_004348861.2.
DR AlphaFoldDB; A0A0D2WKR3; -.
DR STRING; 595528.A0A0D2WKR3; -.
DR EnsemblProtists; KJE90935; KJE90935; CAOG_002161.
DR GeneID; 14900220; -.
DR eggNOG; KOG2050; Eukaryota.
DR InParanoid; A0A0D2WKR3; -.
DR OrthoDB; 5488678at2759; -.
DR PhylomeDB; A0A0D2WKR3; -.
DR Proteomes; UP000008743; Unassembled WGS sequence.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR012959; CPL_dom.
DR InterPro; IPR033133; PUM-HD.
DR InterPro; IPR040059; PUM3.
DR InterPro; IPR001313; Pumilio_RNA-bd_rpt.
DR PANTHER; PTHR13389:SF0; PUMILIO HOMOLOG 3; 1.
DR PANTHER; PTHR13389; UNCHARACTERIZED; 1.
DR Pfam; PF08144; CPL; 1.
DR Pfam; PF00806; PUF; 2.
DR SMART; SM00025; Pumilio; 5.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS50302; PUM; 2.
DR PROSITE; PS50303; PUM_HD; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008743};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884}.
FT DOMAIN 193..655
FT /note="PUM-HD"
FT /evidence="ECO:0000259|PROSITE:PS50303"
FT REPEAT 257..292
FT /note="Pumilio"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00317"
FT REPEAT 479..515
FT /note="Pumilio"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00317"
FT REGION 1..152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 694..840
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..28
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..44
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..82
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 84..100
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..141
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 840 AA; 89842 MW; 63AD8787C00D014A CRC64;
MSKSTFKRSN DDAAQELAAA RKKHLNEHQS ASTSAGSGSH QSKHAAGGQR KPAAGTKPPF
NKSSASSSDK KFDSDKKFAH KLGGANKSNN NNNYNGKSNG SSFEDKKKPG TEYGAGKKLH
DQSKGKGKAP STAHDDDTEN GSESADAAAD GEQAKPGKLF NANHRFASNH KEAKLLRIER
KAHTAPHFEL IQRAKRIWET LRQKRMPSAE RQALADELMG IVTGHMNDII FKHDAVRVIQ
CCIKFGNEAQ RDMVFQELKS HLLDILRSKY GKFVVSKMLK YGSAEHRNHI INVMTKMTRT
LIRHRDAADL VDTAYSLYAN AAQRAALVAD FYGPQFLLFK TSPQTTLKEV IDSHPELESV
MLTHLKQTLT GCLDKGTIGF SLVHRALLDL YLHGQAPSLK DMSSSLNEAV IDIVHTREGA
HVSVLVIRDA NAKDRKTIIK SMKSLVKKIG LDEHGFAVLL ALFDMVDDTV LVSKAVLSEI
QESLPEFVES KYGFRVLQYL LAPRSKRYIP EHVLAVLAEG DGNPNSKKDT DVRREELRRA
ILPSLVAYAT EHVEALSRDP HKLPLIQELV SSSSATELAP FYAALVAECL KDGEQSLIQH
ANGHRMVRQL ILSERVSDEA PSFASQLFSQ LSGQQDALLE SNHGLFVASA FVRSPDAAVA
ASAKTAFNKH MKALKAKSTL AGAKVLIESL EGKAAPASAE PTRAAAPATP AAAAAKQPAK
KAAESTPVAA SKPTAAAVST PAAAAVATPK GKGKTPAKAA AAAAPAPMEI DDEPVATPAP
ATARKAAAAA AATPARAAPA AAMPSTPTNG PSPPGVRTRH QTSVKAARTM AAAGAKTPAR
//