ID A0A3M0IYS7_HIRRU Unreviewed; 1038 AA.
AC A0A3M0IYS7;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE RecName: Full=SAM domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=DUI87_31703 {ECO:0000313|EMBL:RMB91893.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMB91893.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMB91893.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMB91893.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMB91893.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMB91893.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000232; RMB91893.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0IYS7; -.
DR STRING; 333673.A0A3M0IYS7; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006325; P:chromatin organization; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd20135; MBT_L3MBTL3_rpt2; 1.
DR CDD; cd20139; MBT_L3MBTL4_rpt3; 1.
DR CDD; cd12594; RRM1_SRSF4; 1.
DR CDD; cd12764; RRM2_SRSF4; 1.
DR CDD; cd09582; SAM_Scm-like-3MBT3_4; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 3.30.70.330; -; 2.
DR Gene3D; 4.10.320.30; -; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR002515; Znf_C2H2C.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF130; SAM DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF02820; MBT; 3.
DR Pfam; PF00076; RRM_1; 2.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF01530; zf-C2HC; 1.
DR SMART; SM00561; MBT; 3.
DR SMART; SM00360; RRM; 2.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 3.
DR PROSITE; PS51079; MBT; 3.
DR PROSITE; PS50102; RRM; 2.
DR PROSITE; PS50105; SAM_DOMAIN; 1.
DR PROSITE; PS51802; ZF_CCHHC; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000269221};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU01143}.
FT DOMAIN 2..72
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 104..177
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REPEAT 553..594
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 602..701
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 710..805
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 971..1035
FT /note="SAM"
FT /evidence="ECO:0000259|PROSITE:PS50105"
FT REGION 72..97
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 169..472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 486..550
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 860..899
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 182..240
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..344
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..382
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 383..421
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 432..446
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 486..501
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1038 AA; 118107 MW; DE9585401535913A CRC64;
MPRVYIGRLS YQARERDVER FFKGYGKILE VDLKNGYGFV EFDDLRDADD AVYELNGKDL
CGERVIVEHA RGPRRDSSYG SGRSGYGYRR SGRDKYGPPT RTEYRLIVEN LSSRCSWQDL
KDYMRQAGEV TYADAHKGRK NEGVIEFKSY SDMKRALEKL DGTEVNGRKI RLVEDRPGSR
RRRSYSRSRS HSRSRSRSRH SRKSRSRSPS SSRSKSRSRS RSVSRSRSKS RSRSKSRSRG
QKERSRTPSK EDKSRSRSRS AEKSRNKSKD KSESILHNSD EKTKSRSRSK EKSRSTSGSK
ERGEARESVR SRSKEKSRSK DREKSISKAR SRSKSRDESR SRSHSKDKRK SRKRSRDDSR
SRSRSRSKSE KSKRRSKRDS KGSSKRRKDS HERSRSGSKG KEQLKSESDK KEAKGEGEGA
VSRSRSRSIS KSKPNVKSDS RSRSKSVSKP RSRSKSRPWN LQQEAAAPEV VEGSELDGML
QMIAETGNNG NLTTSSGCQV TEAGRPVRRL RRKRRLPLDS EDEEENAYED EEKNKSNNMK
SRRNTKPIKQ GPGYRMRLHF DGYPECYDFW ANADSSDIHP VGWCEKTNHK LLPPKGFKEG
EFNWTSYLKN CKAQAAPKSL FKTLSSPVTP SGFRLGMKLE AVDKKNPSLV CVATITDMVE
NRLLIHFDNW DESYDYWCET SSPYIRPVGY CQETGIPLTT PPGHKDSKAF SWEKYLEETN
SQAAPARAFK LRPPHGFQVN TKLEAVDRRN PILIRVATIV DKDNHRVKIH FDGWDHHYDF
WVDADSPDIH PVGWCDKTGH ALQVPLGAED PEGAVGQACP TPGCQGIGHI RGPRYGTHYT
LVGCPYSDVN LSRENLLQDR LSGERPSAGN SMQKARRVET PGPLLRAGES SQGDSSQSRL
GCSHAYIKFQ LVKQESNGKG PDLDLQQALH QSIFMPSLAS NPTHRLHLFW EQHCRLLPEV
SGLTAKQVAK WTVEEVVSFI QRLPGCKEQA SVFREEQIDG EAFLLLKQND IVKILSIKLG
PALKIYNAIL MFKSAEDN
//