ID G3X163_SARHA Unreviewed; 1137 AA.
AC G3X163;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=RNA binding motif protein 6 {ECO:0000313|Ensembl:ENSSHAP00000021418.1};
GN Name=RBM6 {ECO:0000313|Ensembl:ENSSHAP00000021418.1};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000021418.1, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000021418.1, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000021418.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_012398959.1; XM_012543505.1.
DR RefSeq; XP_012398960.1; XM_012543506.1.
DR AlphaFoldDB; G3X163; -.
DR STRING; 9305.ENSSHAP00000021418; -.
DR Ensembl; ENSSHAT00000021591.2; ENSSHAP00000021418.1; ENSSHAG00000018151.2.
DR GeneID; 100924023; -.
DR KEGG; shr:100924023; -.
DR CTD; 10180; -.
DR eggNOG; KOG0154; Eukaryota.
DR GeneTree; ENSGT00940000157976; -.
DR HOGENOM; CLU_291499_0_0_1; -.
DR InParanoid; G3X163; -.
DR OMA; RNIPHAD; -.
DR OrthoDB; 298711at2759; -.
DR TreeFam; TF315789; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:Ensembl.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd12563; RRM2_RBM6; 1.
DR Gene3D; 3.30.70.330; -; 2.
DR Gene3D; 2.160.20.80; E3 ubiquitin-protein ligase SopA; 1.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR041591; OCRE.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR034125; RBM6_RRM2.
DR InterPro; IPR000504; RRM_dom.
DR PANTHER; PTHR13948; RNA-BINDING PROTEIN; 1.
DR PANTHER; PTHR13948:SF22; RNA-BINDING PROTEIN 6; 1.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF17780; OCRE; 1.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00360; RRM; 2.
DR SUPFAM; SSF141571; Pentapeptide repeat-like; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 2.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50102; RRM; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 469..549
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 1065..1111
FT /note="G-patch"
FT /evidence="ECO:0000259|PROSITE:PS50174"
FT REGION 1..401
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 587..670
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 757..803
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 832..962
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1002..1122
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..135
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..234
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 242..285
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 307..322
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 337..351
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..604
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 609..670
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 758..787
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 788..803
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 848..900
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1002..1061
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1137 AA; 130149 MW; 8F89E3BDB2992AF4 CRC64;
MWGDPRPANR TGPFRGSQEE RFAPGWNRDY PPPLKSHAQE RHSGNFPGRD SLPFDFQGHT
GPPNPPFANV EDHSFSYGAR DGPHTDYRVG EGPGRDFMGG DFPSDFQNRD SPQLDFRGRE
MHPGDFRDRE GPPMDYRGGD GTSMDYRGRE ASHMNYRERE AHAVEFRGRD GPPPDFRGRG
TYDLDFRGRD GSHSDFRGRD ISDLDFRGRD QSHSDFRNRD MPDLDFRGKD GSQMDFRGRG
SGTADLDFRD RDTPPSDFRG RHRSRTDQDF RGREMAPHME FTDREMPPMD PNILDFIQPS
TQDRERSGIN VNKREESAHD LSGTERSPFG IQKGEFQHSE TRAREGDSHG LGLENESPLD
FRNNQRPLQD QDKAPQIFAN KQPLPGGEQQ RSESGLAFKE EEGLDFLGRQ DTDYRNMEYR
DVDHRVPGNQ MFDYSHNKSF PEGKIVKDSR QDLQDQDYRT GPTEEKPSRL IRLGGVPENA
TKEDILNAFR TSDGIPVKDL QLKDYSSGYD YGYVCVEFSL LEEAIGCMEA NQGTLKICDK
EVTLEYSPSP DFWHCKRCKV STVGYRSSCS YCKFPREEVK VQQELATYPE SQKSPAQPAV
LPEKQQTHPQ GPPDKEPETK KREEGRERRL GQEQRRDSER YPSRREGHDS STRRDSEKEQ
WPGESRQDGE SKTIMLKRIY RSTPPEVIVE VLEPYVRLST ANVRIIKNRT GPMGHTYGFI
DLDSHSEALR VVKILQNLDP PFSIDGKMVA VNLATGKRRN DSGDHSDNMH YNQGKKYFRE
RKGGSRNSDW SSDSNRHGQQ SSSDCYIYDS VTGYYYDPLA GTYYDPKTQQ EIYIPQDPGS
PGAENKGKKH NSQGKPNEKK DASKRDSREK KARSTPTKEI SSEGKAPTED VFKKPLPPTV
KKEESPPPPK VVNPLIGLLG EYGGDSDYEE EEEEEKPPPA QTRPAQPPQR EESSKKENEE
DKLTDWNKLA CLLCRRQFPN KEVLLKHQQL SDLHKQNLEI HRKIKQSEQE LAYLERRERE
GRYREKGSDR KEKFHFSDSP DRKRMKYSRE TDSDRKSSNK AGIDSNSKGS YVLQSPGWKK
GAGLGYGQSG LASAEETEGW TRGPGVGSQG KPSKRQSNET YRDAVRRVMF ARYKELE
//