ID A0A3Q2IAM4_HORSE Unreviewed; 936 AA.
AC A0A3Q2IAM4;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 2.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Scaffold attachment factor B2 {ECO:0000313|Ensembl:ENSECAP00000045115.2};
GN Name=SAFB2 {ECO:0000313|Ensembl:ENSECAP00000045115.2,
GN ECO:0000313|VGNC:VGNC:22668};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000045115.2, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000045115.2, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000045115.2,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000045115.2}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000045115.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3Q2IAM4; -.
DR Ensembl; ENSECAT00000062270.2; ENSECAP00000045115.2; ENSECAG00000018978.4.
DR VGNC; VGNC:22668; SAFB2.
DR GeneTree; ENSGT00940000161482; -.
DR Proteomes; UP000002281; Chromosome 7.
DR Bgee; ENSECAG00000018978; Expressed in retina and 23 other cell types or tissues.
DR ExpressionAtlas; A0A3Q2IAM4; baseline.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd12679; RRM_SAFB1_SAFB2; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 1.10.720.30; SAP domain; 1.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR034781; SAFB1_2_RBD.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR036361; SAP_dom_sf.
DR PANTHER; PTHR15683; SCAFFOLD ATTACHMENT FACTOR B-RELATED; 1.
DR PANTHER; PTHR15683:SF4; SCAFFOLD ATTACHMENT FACTOR B2; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF02037; SAP; 1.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00513; SAP; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF68906; SAP domain; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50800; SAP; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}.
FT DOMAIN 30..64
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT DOMAIN 404..482
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REGION 85..126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 222..404
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 521..646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 687..865
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 892..936
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 85..99
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 100..121
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..261
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 377..399
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 521..553
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..646
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 687..822
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 842..858
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 909..923
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 936 AA; 104649 MW; 3887F5FEF36BA59C CRC64;
MAETLAGSGD LGAGAAAVGL GASEAGTRRL SELRVIDLRA ELKKRNLDTG GNKSVLMERL
KRAVKEEGQD PEEIAVALEA TSKKLAKRGV KGQKTEEEGT EDNGLEEDSR DGQEDTEAGL
EGLPDMDMVD VSVLGEADAE SSSTAGLGAD GILESLCDSK GYVAAQLREL PAQLTGHAVD
GDGFENTLDA SPMDFKVPPD VEEPLSEPEN EKILDILGET CKSEPVKEEG PELEQPFAQD
TSSVGPDRKL AEEEDLFGSG HPEEGALDVA GESPGQAQAS QADSLLAVVK REPAEEPGAG
ARTDCEPVGL EQRAEQSRGA CEPAGACSEE AAEAPPEASS PEPGDSHEDG PKLAFEACNE
VPPAPKESSA SEGADQKMSS FKEEKDIKPI IKDEKGRAGS ASGKNLWVSG LSSTTRATDL
KNLFSKYGKV VGAKVVTNAR SPGARCYGFV TMSTSDEATK CISHLHRTEL HGRMISVEKA
KNEPAGKKLS DRKECEVKKE KLTSADRYHP VEVKVEKTVI KKEEKIDKKE EKRPEDIKKE
EKDEDELKPG PTDRSRVTKS GSRGMERTVV MDKSKGEPVI SVKTTSRSKE RSSKSQDRKS
ESKEKRDILS FDKIKEQRER ERQRQREREI RETERRRERE QREREQRLEA LHERKEKARL
QRERLQLECQ RQRLERERLE RERLERERMR VERERRKEQE RIQREREELR RQQEQLRYEQ
ERRPALRRPY DDGRREDPYW PEGKRLAVED RYRPDLPRPD HRFHDCDHRD RGQYQDHVAD
RREGPRAVMG ERDGQHYDDR HSHGGPPERH GRDSRDGWGG YGSDKRMSEA RGLPPPPRGG
RDWTEHCQRL DEHPERTWPG TVDAGTAGRE HARWQGAAAL YRAGTRRLTR CPGAPWKAED
WPARTAAAAS PTHTPTRTPT SPAATDRRAA APASVP
//