ID F2UGC3_SALR5 Unreviewed; 1114 AA.
AC F2UGC3;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=HTH La-type RNA-binding domain-containing protein {ECO:0000259|PROSITE:PS50961};
GN ORFNames=PTSG_07792 {ECO:0000313|EMBL:EGD75673.1};
OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN [1] {ECO:0000313|Proteomes:UP000007799}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "Annotation of Salpingoeca rosetta.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL832973; EGD75673.1; -; Genomic_DNA.
DR RefSeq; XP_004991594.1; XM_004991537.1.
DR AlphaFoldDB; F2UGC3; -.
DR STRING; 946362.F2UGC3; -.
DR EnsemblProtists; EGD75673; EGD75673; PTSG_07792.
DR GeneID; 16072154; -.
DR KEGG; sre:PTSG_07792; -.
DR eggNOG; KOG2242; Eukaryota.
DR eggNOG; KOG4213; Eukaryota.
DR InParanoid; F2UGC3; -.
DR OrthoDB; 5402316at2759; -.
DR Proteomes; UP000007799; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006396; P:RNA processing; IEA:InterPro.
DR CDD; cd07323; LAM; 1.
DR CDD; cd12884; SPRY_hnRNP; 1.
DR Gene3D; 2.60.120.920; -; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR043136; B30.2/SPRY_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR006630; La_HTH.
DR InterPro; IPR002344; Lupus_La.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR003877; SPRY_dom.
DR InterPro; IPR035778; SPRY_hnRNP_U.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR12381:SF56; B30.2_SPRY DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR12381; HETEROGENEOUS NUCLEAR RIBONUCLEOPROTEIN U FAMILY MEMBER; 1.
DR Pfam; PF13671; AAA_33; 1.
DR Pfam; PF05383; La; 1.
DR Pfam; PF00622; SPRY; 1.
DR PRINTS; PR00302; LUPUSLA.
DR SMART; SM00715; LA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS50961; HTH_LA; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000007799};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00332}.
FT DOMAIN 872..962
FT /note="HTH La-type RNA-binding"
FT /evidence="ECO:0000259|PROSITE:PS50961"
FT REGION 1..363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 769..875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 958..1114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 736..767
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 1..20
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 28..113
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 132..157
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 195..228
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..264
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 273..290
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 333..353
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 790..806
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 861..875
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 971..985
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1024..1114
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1114 AA; 119496 MW; 87DD6360B4F10B11 CRC64;
MADEQEHQAK RPRLEQEDEP GAVDVAASAT ATTATTTATT ETVVSQEAAT AATTVAETTT
AQVSTTTTTT TTEVAATAPA EPATTATTTT TVAGAPATQA PPVQEQPTAS TAAAPEAMAS
RDAAPANKPP TTEAAPGSTN GNGAAPVVAQ VPAATQNQEP SPPLPPPAQA EQAKVPDAAS
TTRAAPAGQE APVDANVTAP TNVTAPTNVT APAPTKQQQQ QQQPQEQPLA HAAPPAPPTT
ASGAESAAAA PPPPPLPSSS APPAPTSAPT ETAADVKKEE VEDVSAQLEG IHPDRLKAIL
GQEGGSGGSA ARGRGRGRGR GGAMGAPASA PRGGDDHKSD QIPQRKGKPM RDPEPINLPD
TWAKFSPTHK HLHAFIGNHE SSKGKFVEPI PGRGFRWLWT GARLSLGIFG GGTRYHYKTL
VKDLPANSVK AGPVDVRVGL SFERANLLVG DDTFGWAYCS SGSKIHNGER VKYGTPYHTN
DEIECRLDVV TTTCPLVPSV NREGQPPLLV ISFYKNGDFQ GVAYQLDAST VPAAVFGHVA
LRNCRVLAIN EVNHPDPSVG FLVGLPGSGK TEWLSSFLQA QQSTGTDNFY VLSTEAIIQE
LAHNDPDLPE YNYRESWLRY CRTSSAMFRH LIPLAAQERR FFIVDQTNTS RFARQRKMQP
FKDSEIPFKI RAMLFLRTAE DIAGTLQAKE NAGKLIPKAA VLHMMKNFIP PDASEGFLEV
TCLPNEDIKP AVGQSYQHFT EQHKELIEQY EKEQEEMRAM RRARREKILL GRTRGGGGRG
GRGRGRGRGR GRGFGGDRDR DRGRGRGRGR GRGRGGPMSR GRGGDFWRSW SRPCRSGGRG
GRGGDRGGGG RGGRGGGRGG GRGRRPPEPE RFATPQERDA AIVKQVEFYF GDENLPQDKF
LQDKLRLGND WLDVGVIFDF PKVQAKTTSM DTFLAALRTS RLIRLSDDGR RMQRAVPLPA
ALQQRAMETR EQNRGFSERP RGRGRGRGRG GGRGRGGGRG GGRGGPRGAP RGGSRGGRGF
GRGRGGHGPE HRSRSRDRDR GRGGYERSSR GRDRSSGSVD RRRSADRRSM DRGYDRRRPS
TADGSYRRSS AGRDGELPPK MRRTDSSRDV RGRY
//