ID A0A091M9H6_CARIC Unreviewed; 1984 AA.
AC A0A091M9H6;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Histone-lysine N-methyltransferase SETD1B {ECO:0000313|EMBL:KFP68468.1};
DE Flags: Fragment;
GN ORFNames=N322_05215 {ECO:0000313|EMBL:KFP68468.1};
OS Cariama cristata (Red-legged seriema).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Cariamiformes; Cariamidae; Cariama.
OX NCBI_TaxID=54380 {ECO:0000313|EMBL:KFP68468.1, ECO:0000313|Proteomes:UP000054116};
RN [1] {ECO:0000313|EMBL:KFP68468.1, ECO:0000313|Proteomes:UP000054116}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N322 {ECO:0000313|EMBL:KFP68468.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK526891; KFP68468.1; -; Genomic_DNA.
DR Proteomes; UP000054116; Unassembled WGS sequence.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0140999; F:histone H3K4 trimethyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd12549; RRM_Set1B; 1.
DR CDD; cd19169; SET_SETD1; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR024657; COMPASS_Set1_N-SET.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR034468; Set1B_RRM.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR037841; SET_SETD1A/B.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1B; 1.
DR Pfam; PF11764; N-SET; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM01291; N-SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000313|EMBL:KFP68468.1}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000054116};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}; S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:KFP68468.1}.
FT DOMAIN 86..174
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 1845..1962
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1968..1984
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 224..303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 337..404
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 420..624
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 656..696
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 921..1141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1217..1244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1257..1302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1347..1416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1533..1563
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1644..1681
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1790..1816
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 224..294
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 337..368
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..390
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 434..463
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 469..494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 495..539
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 553..580
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 600..614
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 953..983
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 984..1007
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1021..1064
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1079..1118
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1119..1141
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1220..1242
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1271..1300
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1367..1416
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1534..1549
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1644..1664
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFP68468.1"
FT NON_TER 1984
FT /evidence="ECO:0000313|EMBL:KFP68468.1"
SQ SEQUENCE 1984 AA; 220334 MW; F2F733664EC0AE83 CRC64;
WINGMENSTQ ASTSVEKRNH HWRSYKLIID PALKKGQHKL YRYDGQHFSM PNSGIAPVDC
VRDPRIGRIW TKTKELELSV PKFKIDEFYV GPVPPKQVTF AKLNDNIREN FLTDMCKKYG
EVEEVEILYN PKNKKHLGIA KVIFATVKGA REAVQHLHNT SVMGNIIHVE LDTKGETRMR
FYELLVNGLY TPQTLPVGTE QDVSPTVNET LQLTDSLKRL KDSNLSSAGS SVTPNSSTPF
SHDTAYSSCR QDTPNSYSQF TPQSQGTPHT PRLGTPFSQD SSYSSRQTTP VYHFGQDSGY
KPRRHETKFT DAYNRRPGHH YVHNSAGVYR GTEHQFSTYK SHQQEPVQFS HTPPLSHSSS
SSYKSAFSPY QAPAVFPQSD EQQFPQTSRE TEYRRPAPPP TEMVVESSAA TNIDFVPVKE
KQEEPPPLPD SNSVPEPSTA SFSQTPERSE TPGTPTMESE MQHNSLDSRI EMLLKEQRTK
LPFLNEHDSD NEIRMEGSPI SSSSSQLSPI PMYGSNSQPG YRGQTPSSRP SSTGLEDISP
TPLPDSDDDE PIPGTASLSQ NSRGTSEASM TPIDQLSRTS KVETSDAKEM VPGDQTPTSE
KMDEGQHSSG EDMEISDDET NPAPITSAEC AKTIVVNSSV SNTAVMAPSI PPMPPPGFPP
LPPPPPPQPG FPMPPPLPPP PPPTHPAVTV PPPPLPAPPG VPPPHILPPL PPFHPGMFPV
MQVDMISVLG NQWGGMPMSF QMQTQMLSRM MQAQNTYQYP PFMTGRMQFV NLPPYRPFSM
GAALGRGQQW PPLPKFDPSV PPPGYEPKKE DPHKATVDGV LLVIVKELKA IMKRDLNRKM
VEVVAFRAFD DWWDKKERLA KASLTPVKSG GELEEKPKPK DRITSCLLEN WNKGEGLGYE
GIGLGIGLRG AIRLPSFKVK RKEPPEAASA GDQKRIRPST SVDDEDEESE RDRDASDATS
DLSKKDAEAV GLRRRPARPL ELDSEGEEGD ETSGKEEESS SEKEEEQEEE GGLVKAAPGK
DEEEDEDDED EDDDEEDDED EEEEEEETGI DTSDKEEEQD SEEDVASPSS SKAEVESSDE
SEDSSEFESS SDSDDEDEDD DDEDEEEEEE EEVAEDQDRE AMVAEAQHEP AGHELPDDER
ETILELYPVD DYMDATGLGL AEPALAVKDE GMEEIKAGEG ECGEVPEASI AAETLKHLVM
ERDQETKLAL SPICRPEEES EPATMLEPEV QEHPKPESQD EEEALCLSTP VGVFGEPLPN
KSSFFSKSDD SSLEARVKTK LPSVAEEDDR LPRTPGREVM IHSETDTLLL PAHKVPSSML
PLPSTPGKEE SLVPPEKFPE QLMVTKTSVE EEIPRTPGRD ILAKSSHPLA KSQSTDTVPA
TPGSDAPLTG SSLTLSSPQV PGSPFSYPSQ SPGINSGIPR TPGRDFNFTP TFPEAGATIS
CLLSGKKQSE DELDEKPFKE PLGASLTISM NSVPSPIPFA SPPQADFHTD MGLPPDEPIP
VAALPCMHGD GRMPIEECKA EVKSVLLSPE VPTGASILPP PPPPSVLPKR RPGRPKRSPP
SVLSLDVYSG KTIEPPPVPV ALVESAVGKE LLSGHPDAFY GLKDPEAVTL DFRNDGFHEK
IAAETVAEKL PFKELENQWN EDFKEEEAHV KPKRQWRRQK KTPEDLPAIP SPEYSPPRPQ
FRPRSEFEEM TILYDIWNGG IDEEDIKFMC ITYDRLLQQD NGMDWLNDTL WVFHPYILSH
WLSAFSTPKK KKRDDGMREH VTGCARSEGY YKIDKKDKLK YLNNSRAFAE EPPADTQGMS
IPAQPHASTR AGSERRSEQR RLLSSFTGSC DSDLLKFNQL KFRKKKLKFC KSHIHDWGLF
AMEPIAADEM VIEYVGQNIR QVIADMREKR YEDEGIGSSY MFRVDHDTII DATKCGNFAR
FINHSCNPNC YAKVITVESQ KKIVIYSKQH INVNEEITYD YKFPIEDVKI PCLCGSENCR
GTLN
//