ID A0A1U8KZB3_GOSHI Unreviewed; 886 AA.
AC A0A1U8KZB3;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Pre-mRNA-processing protein 40C-like isoform X1 {ECO:0000313|RefSeq:XP_016707727.1};
GN Name=LOC107922278 {ECO:0000313|RefSeq:XP_016707727.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016707727.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016707727.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016707727.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016707727.1; XM_016852238.1.
DR AlphaFoldDB; A0A1U8KZB3; -.
DR STRING; 3635.A0A1U8KZB3; -.
DR PaxDb; 3635-A0A1U8KZB3; -.
DR GeneID; 107922278; -.
DR KEGG; ghi:107922278; -.
DR OrthoDB; 2882240at2759; -.
DR Proteomes; UP000189702; Chromosome 25.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0070063; F:RNA polymerase binding; IBA:GO_Central.
DR GO; GO:0003712; F:transcription coregulator activity; IBA:GO_Central.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 1.10.10.440; FF domain; 5.
DR InterPro; IPR002713; FF_domain.
DR InterPro; IPR036517; FF_domain_sf.
DR InterPro; IPR045148; TCRG1-like.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR15377; TRANSCRIPTION ELONGATION REGULATOR 1; 1.
DR PANTHER; PTHR15377:SF3; WW DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01846; FF; 4.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00441; FF; 4.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF81698; FF domain; 5.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS51676; FF; 4.
DR PROSITE; PS01159; WW_DOMAIN_1; 2.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT DOMAIN 266..293
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 318..345
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 482..536
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 549..604
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 617..671
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT DOMAIN 720..777
FT /note="FF"
FT /evidence="ECO:0000259|PROSITE:PS51676"
FT REGION 1..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 442..481
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 680..707
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 742..764
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 851..886
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..462
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 743..764
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 851..867
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 886 AA; 97873 MW; 42BAC4939EA2FFE1 CRC64;
MQPPLPVPQG ALSSSASFSF TPNPQLVQNA QIQPSKSDML ATGTQAMAAS SPSTVSQSGP
LPVHNSSEFT MNASTTPSFA PVTSRMPTTP PFPMSSGSSG TSGTLGHPVS VPSIQMITAS
AAVDSPSSAV PGPGAPVSLN PAVQQQVYPP YTSLPSMVSS PQGYWMQHPP MGGFPRPPFV
PYPTVYPGPF PSTSSGMPLP APSSDSQPPG FRPLGMSPFA PSAAALANQS LAILTGFPPQ
GIDNRKLVHD VTTKVESAGN EQSDVWTAHK TDTGVVYYYN ALTGESTYEK PAGFKGEPDQ
VTVQPTPVSV EQLAGTDWAL VTTNDGKKYY YNSKTKISSW QIPNEVTELR KKQDSEVSKE
NAVSVPNIDV VAEKGSTPIS LSAPAVNTGG RDAMPLRTSV VPGSSSALDL IKKKLQDPGV
PSSSPVPVMP VTATHELNGL RAVDVKGLQS ESNKDKLKDA NGDGSISDSS SDSEDADSGP
SKEECIMQFK EMLKERGVAP FSKWEKELPK IVFDPRFKAI PSHSARRSLF EHYVKTRAEE
ERKEKRAAQK AAIEGFKQLL DEASEDIGHD TNYQTFKRKW GSDPRFEALD RKDRELLLNE
RVLLLKRAAE EKARAIRAAA ASSFKSMLKE KGDINVNSRW SRVKDSLRDD PRYKCVKHED
REVLFNEYIS ELKAIEEKAE RKDKVKKEEE EKLKEREREL RKRKEREEQE MERVRLKVRR
KEAVASFQAL LVETIKDSQA SWTESKPKLE KDPQGRAANP DLDSSDMEKL FREHIKMLFE
RCVNDFRALL AKVITQDAAA QETEGGKTAL NSWSTAKRLL KPDPRYNKMP RKEREALWRR
YAEDMLRKQK LALDQEEEKH TDVKGRSSGG DFGRYSSGTR RTHERR
//