ID A0A1S4ARM4_TOBAC Unreviewed; 718 AA.
AC A0A1S4ARM4;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Pre-mRNA-processing protein 40A-like isoform X2 {ECO:0000313|RefSeq:XP_016479334.1};
GN Name=LOC107800638 {ECO:0000313|RefSeq:XP_016479334.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016479334.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016479334.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016479334.1; XM_016623848.1.
DR AlphaFoldDB; A0A1S4ARM4; -.
DR GeneID; 107800638; -.
DR OrthoDB; 25674at2759; -.
DR Proteomes; UP000084051; Unplaced.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR PANTHER; PTHR11864:SF36; PRE-MRNA-PROCESSING PROTEIN 40A-LIKE ISOFORM X1; 1.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR Pfam; PF01846; FF; 4.
DR SMART; SM00441; FF; 5.
DR SUPFAM; SSF81698; FF domain; 5.
DR PROSITE; PS51676; FF; 4.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000084051}.
FT DOMAIN 181..235
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 248..303
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 316..370
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 388..451
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 27..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 148..172
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 563..718
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 225..252
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 287..317
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 361..388
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 30..56
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 563..632
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 639..673
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 681..718
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 718 AA; 82869 MW; 8F3E2445F7C00310 CRC64;
MPDEVKLARQ SMKVDLVKGL GKERDSISHA SDFGSISGVK TSPLSANGSP VSAQGAMSSP
IAVAPVSNLP TIVASESSSL SGNISSLTIG AVEMQNSLEP ASPAVATSEK NGTAVTLENS
VATPVTSSEF PSAQDSVVYE DGVSLENTEE VKKDATVSET GSATPSEEKT VEPGPLVYES
KAEAKSAFKI LLESANIGSD WTWDQAMRAI INDRRYGALK SLGERKQAFN EYLSHKKKLE
AEERRIKQKK AREDFRIMLE ECKELAPSTR WSKAISIFEH DERFKAVERA KDREDLFEDY
MEELEKKERA RALEEQKRNR VEYLEFLKSC DFIKASSQWR KVQDRLEDDE RCSCLEKIDR
LEIFQEYIRD LEREEEEHRK LRMDEMRKAE RKNRDEFRKL MEEHVAAGIL NAKTNWRDYC
IKVKDLPAYL AVSSNTSGPK AKDLFQDVFD ELEKQIGLTS TWTLEDFKVA ISKDISSPPI
SDTNLKFVFE ELLERARERE EKEAKRRKRL ADEFYELLHT SKEITASSKW EDCKSIFGDR
IMGEESFLLE IFDKFISELK EKAKEKERKR QEDKARKEKE RKDREKKKEK HRRDKDRGDK
SRKERERTKK DGTDSEKADT YSFEEIKRLG SDRDKKHRKR HMSSFDDNEN EKDHSRNSYR
HDNDHKKSKQ VDQHVWSSEV NSEGQHKKQK RDHRSGSLRD GDNEDHKDGE FGEDGEVR
//