ID A0A0D2S3C8_GOSRA Unreviewed; 994 AA.
AC A0A0D2S3C8;
DT 29-APR-2015, integrated into UniProtKB/TrEMBL.
DT 29-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Pre-mRNA-processing protein 40A {ECO:0008006|Google:ProtNLM};
GN ORFNames=B456_012G147100 {ECO:0000313|EMBL:KJB77622.1};
OS Gossypium raimondii (New World cotton).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=29730 {ECO:0000313|EMBL:KJB77622.1, ECO:0000313|Proteomes:UP000032304};
RN [1] {ECO:0000313|EMBL:KJB77622.1, ECO:0000313|Proteomes:UP000032304}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23257886; DOI=10.1038/nature11798;
RA Paterson A.H., Wendel J.F., Gundlach H., Guo H., Jenkins J., Jin D.,
RA Llewellyn D., Showmaker K.C., Shu S., Udall J., Yoo M.J., Byers R.,
RA Chen W., Doron-Faigenboim A., Duke M.V., Gong L., Grimwood J., Grover C.,
RA Grupp K., Hu G., Lee T.H., Li J., Lin L., Liu T., Marler B.S., Page J.T.,
RA Roberts A.W., Romanel E., Sanders W.S., Szadkowski E., Tan X., Tang H.,
RA Xu C., Wang J., Wang Z., Zhang D., Zhang L., Ashrafi H., Bedon F.,
RA Bowers J.E., Brubaker C.L., Chee P.W., Das S., Gingle A.R., Haigler C.H.,
RA Harker D., Hoffmann L.V., Hovav R., Jones D.C., Lemke C., Mansoor S.,
RA ur Rahman M., Rainville L.N., Rambani A., Reddy U.K., Rong J.K.,
RA Saranga Y., Scheffler B.E., Scheffler J.A., Stelly D.M., Triplett B.A.,
RA Van Deynze A., Vaslin M.F., Waghmare V.N., Walford S.A., Wright R.J.,
RA Zaki E.A., Zhang T., Dennis E.S., Mayer K.F., Peterson D.G., Rokhsar D.S.,
RA Wang X., Schmutz J.;
RT "Repeated polyploidization of Gossypium genomes and the evolution of
RT spinnable cotton fibres.";
RL Nature 492:423-427(2012).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001751; KJB77621.1; -; Genomic_DNA.
DR EMBL; CM001751; KJB77622.1; -; Genomic_DNA.
DR EMBL; CM001751; KJB77623.1; -; Genomic_DNA.
DR EMBL; CM001751; KJB77625.1; -; Genomic_DNA.
DR RefSeq; XP_012459512.1; XM_012604058.1.
DR RefSeq; XP_012459513.1; XM_012604059.1.
DR RefSeq; XP_012459514.1; XM_012604060.1.
DR RefSeq; XP_012459515.1; XM_012604061.1.
DR RefSeq; XP_012459516.1; XM_012604062.1.
DR RefSeq; XP_012459517.1; XM_012604063.1.
DR AlphaFoldDB; A0A0D2S3C8; -.
DR EnsemblPlants; KJB77621; KJB77621; B456_012G147100.
DR EnsemblPlants; KJB77622; KJB77622; B456_012G147100.
DR EnsemblPlants; KJB77623; KJB77623; B456_012G147100.
DR EnsemblPlants; KJB77625; KJB77625; B456_012G147100.
DR GeneID; 105780006; -.
DR Gramene; KJB77621; KJB77621; B456_012G147100.
DR Gramene; KJB77622; KJB77622; B456_012G147100.
DR Gramene; KJB77623; KJB77623; B456_012G147100.
DR Gramene; KJB77625; KJB77625; B456_012G147100.
DR KEGG; gra:105780006; -.
DR OMA; RTSDINM; -.
DR OrthoDB; 25674at2759; -.
DR Proteomes; UP000032304; Chromosome 12.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR039726; Prp40-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR11864:SF33; PRE-MRNA-PROCESSING PROTEIN 40B; 1.
DR PANTHER; PTHR11864; PRE-MRNA-PROCESSING PROTEIN PRP40; 1.
DR Pfam; PF01846; FF; 4.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 5.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 5.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 5.
DR PROSITE; PS01159; WW_DOMAIN_1; 2.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000032304}.
FT DOMAIN 193..226
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 234..267
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 440..494
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 507..562
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 575..629
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 647..710
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 782..837
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 1..35
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 283..324
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 841..994
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 546..576
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 623..652
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 701..728
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 7..33
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 841..912
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 919..939
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 949..963
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 994 AA; 112952 MW; 2F1CD6472552E29C CRC64;
MDPSQNFPPP MSGQFRPVVP SQQPQQFVSV PPQQFQPVGR GVTVMNAGFP SQTPQPQIPQ
VMQQLPARPG QPGHILPPPP SVSLPAAQPN LHVNPGASVP QPNIQAPNNY FPGVPASHLS
SSYTFAPSSY GQVTVSHNAM AQYQPMAQLQ APNVPVGGQV GIHVSQSTSV TSAQQIGEQP
SASTATIPPK PTEEALTDWI EHTSANGRRY YYNKKTRQSS WEKPLELMTP IERADASTNW
KEYTSPDGRK YYYNKVTNLS TWSLPEELKL AREQVEMASA KGPLSDVSSH IPAPVPPASK
AQSGADTPST IIQGASSSPV PVAPVPSSSK IESVVVSGSD LPVATSSTVT NVDVVQIVED
TITPSVAVSE SSEVSLSVAD AATTLMNNIS KVSSLDMVSS EGVSTQNADE TVKDVVVSEK
INNALEEKAI DQESLTYASK QEAKDAFKAL LESANVGSDW TWDQAMRVII NDKRYGALRT
LGERKQAFNE FLGQKKKQDA EERRIKQKKA REEYKKMLEE CLELTSSTRW SKAVTMFEND
ERYKAVEREK DRKDFFENYI DELRQKERVK AQEQRKQNVM EYRRFLESCD FIKANSQWRK
VQDRLETDER CSRLDKIDRL EIFQEYIRDL EKEEEEQRRI QKEELRKTER KNRDEFRKLV
EGHVAAGTLT AKTHWRDYCM MVKDSPPFLA VASNTSGPTP KDLFEDVAEE LQKQYDDDKA
RVKDAVKLRK ICLASTWTLE DLKAAILEDI SSPPISDVNL KLIFEELLER VKEKEEKEAK
KRKRLADDFF DLLHSMKEIT SSSAWEDCKH LLESSQEFSS IGDEDICKGI FDEYVKQLKG
DAKEKERRRK EDKAKKEKER DERERRKEKH GRDKERGYER EKEEHLREEP SEAHGDISEV
HDENENKRSG KEDSKKHRKR HQSSVDNSNE TEKDRTKTHR HSSDRRKSKK HASTPESDNE
SRHKRHKRDH RNGSRRNLDP EELEDGEFGE RESQ
//