ID F1QMN6_DANRE Unreviewed; 3476 AA.
AC F1QMN6; A0A8M1RN36;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 79.
DE SubName: Full=Msx2-interacting protein isoform X1 {ECO:0000313|RefSeq:XP_003201252.2};
DE SubName: Full=Spen family transcriptional repressor {ECO:0000313|Ensembl:ENSDARP00000098999};
GN Name=spen {ECO:0000313|Ensembl:ENSDARP00000098999,
GN ECO:0000313|RefSeq:XP_003201252.2,
GN ECO:0000313|ZFIN:ZDB-GENE-050309-70};
OS Danio rerio (Zebrafish) (Brachydanio rerio).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Danionidae; Danioninae; Danio.
OX NCBI_TaxID=7955 {ECO:0000313|Ensembl:ENSDARP00000098999};
RN [1] {ECO:0000313|Ensembl:ENSDARP00000098999}
RP IDENTIFICATION.
RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000098999};
RG Ensembl;
RL Submitted (JUL-2011) to UniProtKB.
RN [2] {ECO:0000313|Ensembl:ENSDARP00000098999, ECO:0000313|Proteomes:UP000000437}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tuebingen {ECO:0000313|Ensembl:ENSDARP00000098999,
RC ECO:0000313|Proteomes:UP000000437};
RX PubMed=23594743; DOI=10.1038/nature12111;
RG Genome Reference Consortium Zebrafish;
RA Howe K., Clark M.D., Torroja C.F., Torrance J., Berthelot C., Muffato M.,
RA Collins J.E., Humphray S., McLaren K., Matthews L., McLaren S., Sealy I.,
RA Caccamo M., Churcher C., Scott C., Barrett J.C., Koch R., Rauch G.J.,
RA White S., Chow W., Kilian B., Quintais L.T., Guerra-Assuncao J.A., Zhou Y.,
RA Gu Y., Yen J., Vogel J.H., Eyre T., Redmond S., Banerjee R., Chi J., Fu B.,
RA Langley E., Maguire S.F., Laird G.K., Lloyd D., Kenyon E., Donaldson S.,
RA Sehra H., Almeida-King J., Loveland J., Trevanion S., Jones M., Quail M.,
RA Willey D., Hunt A., Burton J., Sims S., McLay K., Plumb B., Davis J.,
RA Clee C., Oliver K., Clark R., Riddle C., Elliot D., Eliott D.,
RA Threadgold G., Harden G., Ware D., Begum S., Mortimore B., Mortimer B.,
RA Kerry G., Heath P., Phillimore B., Tracey A., Corby N., Dunn M.,
RA Johnson C., Wood J., Clark S., Pelan S., Griffiths G., Smith M.,
RA Glithero R., Howden P., Barker N., Lloyd C., Stevens C., Harley J.,
RA Holt K., Panagiotidis G., Lovell J., Beasley H., Henderson C., Gordon D.,
RA Auger K., Wright D., Collins J., Raisen C., Dyer L., Leung K.,
RA Robertson L., Ambridge K., Leongamornlert D., McGuire S., Gilderthorp R.,
RA Griffiths C., Manthravadi D., Nichol S., Barker G., Whitehead S., Kay M.,
RA Brown J., Murnane C., Gray E., Humphries M., Sycamore N., Barker D.,
RA Saunders D., Wallis J., Babbage A., Hammond S., Mashreghi-Mohammadi M.,
RA Barr L., Martin S., Wray P., Ellington A., Matthews N., Ellwood M.,
RA Woodmansey R., Clark G., Cooper J., Cooper J., Tromans A., Grafham D.,
RA Skuce C., Pandian R., Andrews R., Harrison E., Kimberley A., Garnett J.,
RA Fosker N., Hall R., Garner P., Kelly D., Bird C., Palmer S., Gehring I.,
RA Berger A., Dooley C.M., Ersan-Urun Z., Eser C., Geiger H., Geisler M.,
RA Karotki L., Kirn A., Konantz J., Konantz M., Oberlander M.,
RA Rudolph-Geiger S., Teucke M., Lanz C., Raddatz G., Osoegawa K., Zhu B.,
RA Rapp A., Widaa S., Langford C., Yang F., Schuster S.C., Carter N.P.,
RA Harrow J., Ning Z., Herrero J., Searle S.M., Enright A., Geisler R.,
RA Plasterk R.H., Lee C., Westerfield M., de Jong P.J., Zon L.I.,
RA Postlethwait J.H., Nusslein-Volhard C., Hubbard T.J., Roest Crollius H.,
RA Rogers J., Stemple D.L.;
RT "The zebrafish reference genome sequence and its relationship to the human
RT genome.";
RL Nature 496:498-503(2013).
RN [3] {ECO:0000313|RefSeq:XP_003201252.2}
RP IDENTIFICATION.
RC STRAIN=Tuebingen {ECO:0000313|RefSeq:XP_003201252.2};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the RRM Spen family.
CC {ECO:0000256|ARBA:ARBA00005387}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX957326; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; CU207252; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_003201252.2; XM_003201204.5.
DR STRING; 7955.ENSDARP00000098999; -.
DR PaxDb; 7955-ENSDARP00000098999; -.
DR Ensembl; ENSDART00000109248; ENSDARP00000098999; ENSDARG00000074245.
DR Ensembl; ENSDART00000109248.4; ENSDARP00000098999.3; ENSDARG00000074245.4.
DR GeneID; 503940; -.
DR KEGG; dre:503940; -.
DR AGR; ZFIN:ZDB-GENE-050309-70; -.
DR CTD; 23013; -.
DR ZFIN; ZDB-GENE-050309-70; spen.
DR eggNOG; KOG0112; Eukaryota.
DR HOGENOM; CLU_224781_0_0_1; -.
DR OMA; RSQYEFQ; -.
DR OrthoDB; 26044at2759; -.
DR TreeFam; TF315637; -.
DR Reactome; R-DRE-9013422; RHOBTB1 GTPase cycle.
DR Proteomes; UP000000437; Chromosome 23.
DR Bgee; ENSDARG00000074245; Expressed in retina and 20 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003729; F:mRNA binding; IBA:GO_Central.
DR GO; GO:0060047; P:heart contraction; IMP:ZFIN.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd12348; RRM1_SHARP; 1.
DR CDD; cd12349; RRM2_SHARP; 1.
DR CDD; cd12350; RRM3_SHARP; 1.
DR CDD; cd12351; RRM4_SHARP; 1.
DR CDD; cd21543; SPOC_SHARP; 1.
DR Gene3D; 2.40.290.10; -; 1.
DR Gene3D; 3.30.70.330; -; 4.
DR InterPro; IPR049095; MINT_MID.
DR InterPro; IPR049093; MINT_RID.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR034172; SHARP_RRM1.
DR InterPro; IPR034173; SHARP_RRM2.
DR InterPro; IPR034174; SHARP_RRM3.
DR InterPro; IPR034175; SHARP_RRM4.
DR InterPro; IPR016194; SPOC-like_C_dom_sf.
DR InterPro; IPR012921; SPOC_C.
DR InterPro; IPR010912; SPOC_met.
DR PANTHER; PTHR23189:SF48; MSX2-INTERACTING PROTEIN; 1.
DR PANTHER; PTHR23189; RNA RECOGNITION MOTIF-CONTAINING; 1.
DR Pfam; PF20809; MINT_MID; 1.
DR Pfam; PF20810; MINT_RID; 1.
DR Pfam; PF00076; RRM_1; 3.
DR Pfam; PF07744; SPOC; 1.
DR SMART; SM00360; RRM; 4.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 2.
DR SUPFAM; SSF100939; SPOC domain-like; 1.
DR PROSITE; PS50102; RRM; 4.
DR PROSITE; PS50917; SPOC; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000000437};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 6..81
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 383..461
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 486..561
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 565..637
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 3310..3476
FT /note="SPOC"
FT /evidence="ECO:0000259|PROSITE:PS50917"
FT REGION 103..201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 235..254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 275..384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 748..897
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 958..983
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1152..1246
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1291..1335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1390..1425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1492..1538
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1559..1598
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1756..2025
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2328..2361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2425..2489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2504..2531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2586..2613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2750..2776
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2795..2827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2869..2917
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..178
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 180..201
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..368
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 748..775
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 776..794
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 822..879
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 961..975
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1189..1212
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1221..1246
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1313..1327
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1513..1538
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1559..1573
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1574..1594
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1783..1814
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1840..1857
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1858..1880
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1881..1919
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1926..1940
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1955..1985
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1986..2022
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2443..2489
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2598..2612
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3476 AA; 385779 MW; 869FAC7299FCC81C CRC64;
MVRETRHLWV GNLPENVREE KIIEHFKRYG RVESVKVLPK RGSEGGVAAF VDFVDIKSAQ
KAHNSINKMG DRDLRTDYNE PGTIPSAARG LDDSLSIATR GRDVSGFTRG AGGPVYGPSV
SLHSREGRFE RRLDGTSDSR ERSYDHSAYG HHERSSSNST SFDRQRHYES DYYRDSRDRA
LSGTSGSASS ASGSIGGSSS AGSIGVAGNV AGSSGAGGST AAGVGSGGST PGGIVYYGSR
SRSPSRFETT ETRYEPRARE AFTLASVVHR DLFREERGRR GERTYHHSRS RSPHSSQSHN
PSPQRLQSQA ARPARSRSGS GSRSRSSSSD SVSSTSSSGS GSSDSSSTSS DRSPARSVQS
TAVPAPSTQP LPSLDKDEPR KSFGIKVQNL PVRSTDTSLK DGLFHEFKKH GKVTSVQIHG
ASEERYGLVF FRQQEDQEKA LSASKGKLFF GMQIDVTAWH GPETESENEF RPLDERIDEF
HPKATRTLFI GNLEKTTTYN DLLNIFQRFG EIVDIDIKKV NGSPQYAFLQ YCDIASVCKA
IKKMDGEYLG NNRLKLGFGK SMPTTCVWLD GLSSSITEQY LTRHFCRYGH VVKVVFDRLK
GMALILYNNI EYAQAAVKET KGWKIGGNKI KVDFANQESQ MAFYRSMQGS GQDIRDFYEI
ISERRDERRT PYHEFTAERA YYENVRTPTS YTEDPRRKYP ARSREFFSEW DPYQGDYYDP
RYYDDSREYR DFRDPYEQDI RKYSYLQRER ERERERFETD RERDHGRRTI EHQQSPTHSR
RPASPTASPS LSERPPSDSE HHVYSRSSER SGSCSSLSPP RFEKPDKIRL ERHNRNDKLE
KEKSLFETER GNGGEKERRA GRKEKGEKDR TERQKLKKLK LASPTVPSPE AELELDRDAS
PEANITLRGK VSKSLIKDKD YSGKGKLDLL PCVVQLTRVK EKEGKLIDSI LCEKQKTKVG
SDPVLSPTTP SSGDHKSMPF RKDTQARDFF KHGKHLKEKG LASQVEVVDK EEKVKNKKYF
KSDLAFESSS SVDADRKAAR KRRFEETSAK ADNLRRVSQE EDEGKLRRII DEPLLKDTDY
DKKLLRKEAH KRERKIKPER MVTVSTTIEE LDNVMPVGPS LDLQARLGEP AEDAIDPLDS
LDQKIGAFGA NRQPSFSLAV SDDGSVDMDF SREQEQQHLT SYHTLSSRQE RVSESKESLL
SDIDHSQSCR KQMEQNRRLQ QQMLECDKSD KTESTPSTDA EEFEHRSIVH EVVKPLQDVT
GSSPTSKQKK LGGFEFDFGK REQNYEMFRL RNDEPERGLA SHPGTPLAEE ERNASQLLDK
DPDLPVTMDK NSSHLDVSKY NTSNAVHQGL SPHAEILKVK PSVTKEEFSW ESNIRQGTLR
EMSFPTSIVK RESIRKRPER ELEPGEVQSD SDEDESRHYS LKPISFKREH EERLSDVKYS
ESLEKNKFYE FALDKTITPD TKALLERAKS LSSSREENWF FLNHDSKFKS LQNNIDKEKS
EPTPRPIPSW YLKKKKSRSD SDGKLDEKKQ DAKPDEQERQ DLLASRFLHS SIFEQDSRRL
QHLERKNTDP DVRIGRDSVT SNSQGEQPGT GGSDLTQEPR VLFHSRFLEL QQRDKNQQLP
ISEKLSLTDQ MEKTLDEEFR PSPNLSDISQ ESLASHSLTP TISPVSQSKI SEMSVHHEKP
VLHTTLPNIH NPAVVNVRSE NAPIDYPSLI SLKDENNTIP LSTPSPEPMI KEDVIVEEPE
EKLRDSRKTV HAAVEHEIEI KPPTPGASLG NVEPECDALQ ASSPLYPNPS KDQETTELSE
TTTEPANSSL VVSPLEVPDT EIEQVEQVLP PRKQPKSKKV KNVSSAQTPI TQVVNEKPAT
RKSERIDKEK LKRSPRGDST KLTADSRNSA KSPVQVLDSE QGLELNTNQG RTRQRRNVRS
VYATPPEEES PQQQGKEVTE PPRSTLKRSR GRPPKTRRRV DDMSPVKGDQ IKNSEAEDTE
NKESTSSGEI TKSTEAWRSP RSQKGQSSPI RSGQSKKTLK LDKVSGTSAC VQLDREAVDV
PPALDRKAES KETLMQVGHL SKDTKQNSIS TREKELVAEE ISTDDRVEIT PVEKILDKEV
KTSRSTRNTK TMSDNKAVNV VKLTMDAVKE AVHSGDDITV CFEGSLKANV PQSVIEQSDI
QDYHKKENVN SLDEKDETLS EMEPPTDPVA ALLARQMELE RAVENISRFT DDQHPVPYKE
PRTEPPTLTA PVVVQPAEDT EVGKPANPAS ETELAEAINS ITAEDISGDT DGFSASTTYT
TLLPQPETLD LPVPNEVVET ESDLTVKTIA QSEQDNVMVP VSKSSKCKTD TSLKESPLTE
VAKRGGRARP KTAKKSRGRK VSLNRKLDIA EDVVLEPEST TVKLPESIPE EIQTVNPKAA
TSAAAAAVVT VAAACKHEAA STVILNTPKE AEQPAVDQPE PQESAFHSGN NSPSYLRTQQ
SSPERGASAL TSPTTRLNSP ASSNIPPEWN TRTEEKVMLP KPQGNVALSI PGAGGPPANP
PMPPDTKASD INASSSTLRK ILMEPKYVSA SNSNSVTSMQ FTTTLADPRM SDNENSVEAV
LPLKTSLPED RPSPIAQLVP RTTPPQPPPL QQCETPQILK EKLAITSTAT SVISRIPMPF
DFEDTPRISL SNRNSGMSLP KQKYRTGLNE NNRYHGHNTS EDGGSVGRPV VEGTHCNTGS
STGLRVNTSE GVVVLSYSGQ KTEGPQRIIA KISQIPPASA VDIEFQQSIT KSQIKQEPSS
HPSTPKGSQT PTGYSHTGVV LAGQSINAQP VISSIRQESP GSEKSETSYQ QGSSLKSFHQ
SPSHPQLLRY GQSITPQHHM KKNGETESFS MKADIKPDIK ATQTFNVKPV LSPRHPSFSG
NHILSPSAPH ERAVPQPKQD SHSPRASNHS MSPFPKVCPP NSPVVLGPAG PMSQYVSNIH
HAEQSVIMPP HSVTQSVPMS HLSQGDVRAS TPTLSGIGYG LRSENLLSPR SAPPQRSTTP
QPAVIRDVIL KSHAGSSVGG HVVETSNDDA QNLPQGLRRS SAPQLQQESM VIQSEFKNLH
HRALRLDQYA RLMQQHLTDH PGVVESRQTR TAEVVQSSSH ISSASSKASS IVKNAPQIVR
DGQKTVELKM SPSPHSDNRI TGHPPGSVMV SPQGVQLNYP GNGNTMNEYY KEMRGFHPQY
PGHSVIGINL ANRGIPVSQV SQIDHSQRHK VPSLSSSESV GSFSESKLEG SHIRHSGTMD
LSHISRVQSE AGSPSYISPV TITPKLELAI TLQKGPQGPV SNKMPLPSLA SSQMRSDFKL
DHTGLRSVDM VQLLTKYPII WQGHLALKND TAAVQLHFVS GNNVLAHRSL PPPEGGAFLR
IAQRMRLEAS QLEGVARRMT AENEYCLLLA LPCGLDQEDV HNQTHALKTG FITYLQAKQA
AGIINVPNPG SNQPAYVVQI FPPCEFSESH LSHLAPDLLN SISSISPHLM IVIASV
//