ID F2TW96_SALR5 Unreviewed; 603 AA.
AC F2TW96;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE SubName: Full=TOE1 protein {ECO:0000313|EMBL:EGD72342.1};
GN ORFNames=PTSG_00363 {ECO:0000313|EMBL:EGD72342.1};
OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN [1] {ECO:0000313|Proteomes:UP000007799}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "Annotation of Salpingoeca rosetta.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the CAF1 family.
CC {ECO:0000256|ARBA:ARBA00008372}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL832955; EGD72342.1; -; Genomic_DNA.
DR RefSeq; XP_004998912.1; XM_004998855.1.
DR AlphaFoldDB; F2TW96; -.
DR STRING; 946362.F2TW96; -.
DR EnsemblProtists; EGD72342; EGD72342; PTSG_00363.
DR GeneID; 16067555; -.
DR KEGG; sre:PTSG_00363; -.
DR eggNOG; KOG1990; Eukaryota.
DR InParanoid; F2TW96; -.
DR OMA; KCKLENS; -.
DR OrthoDB; 2879493at2759; -.
DR Proteomes; UP000007799; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR Gene3D; 6.10.250.3220; -; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR006941; RNase_CAF1.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000571; Znf_CCCH.
DR InterPro; IPR036855; Znf_CCCH_sf.
DR PANTHER; PTHR15092; POLY A -SPECIFIC RIBONUCLEASE/TARGET OF EGR1, MEMBER 1; 1.
DR PANTHER; PTHR15092:SF37; TARGET OF EGR1 PROTEIN 1; 1.
DR Pfam; PF04857; CAF1; 1.
DR SMART; SM00356; ZnF_C3H1; 1.
DR SUPFAM; SSF90229; CCCH zinc finger; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50103; ZF_C3H1; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|PROSITE-
KW ProRule:PRU00723}; Reference proteome {ECO:0000313|Proteomes:UP000007799};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PROSITE-ProRule:PRU00723};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00723}.
FT DOMAIN 314..342
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT ZN_FING 314..342
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT REGION 135..174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 352..494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 554..603
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 382..402
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..428
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..477
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 554..583
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 603 AA; 65569 MW; 287890A77AB9E1A8 CRC64;
MGDKDAVSGD SSSSSSVVVV DVNRDNFGPL WACLHVALSQ ASFVALDCEL SGLGARGAMR
ARSIQQRFEL MRTTAQSRAL LSLGIACFKE LPQGQGYQVQ TFDILLISTR EYVVEPGAIA
FLDRHGFDFN KQARSGLQYT PGHHKRRNSQ RRKASNGATA ATAATAARGG KGTPPPSIHD
LFRVLVTSKC PVVLHNGLLD LVFLYEAMYT ELPPQLDMFL ADVSELLPAI YDTKVLAEYA
VREEASYLEY LFHRCLAQRG PQLPLRPAPF MAAMVRRGTW VRELPIRVVP EEEEGEQEER
EGTCDDGNGA GAGRRLRAIC MQFAAHGHCR AGKTCPHSHD VVRVVQEKVL QPAPTAASKR
RQRRKRAKLE REAKQQAGDG GDDDDYDNDD DDDDDDDDGD GGGDGTSGGA EEVMHTPRKG
EERGVYHANS KAQVDGQPGS EQETTSAKAK RSHVQSLTST TPGNSGTPSS IRKAANATGG
GGQSGGRDSR ALASSQGHRA GYDAFMTGFV FACYKRQLND SSLAETRNKL YLSGKPRPLC
VHKSAYASFS PNHLDIKQQQ QQQQQQQQQQ QQQQQQQQQG SETGAKKDRS KPQAGPVTAS
ASA
//