Database: UniProt
Entry: W9SIY4_9ROSA
ID   W9SIY4_9ROSA            Unreviewed;       961 AA.
AC   W9SIY4;
DT   14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT   14-MAY-2014, sequence version 1.
DT   24-JAN-2024, entry version 39.
DE   SubName: Full=Protein NLP8 {ECO:0000313|EMBL:EXC33984.1};
GN   ORFNames=L484_007540 {ECO:0000313|EMBL:EXC33984.1};
OS   Morus notabilis.
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Rosales; Moraceae; Moreae; Morus.
OX   NCBI_TaxID=981085 {ECO:0000313|EMBL:EXC33984.1, ECO:0000313|Proteomes:UP000030645};
RN   [1] {ECO:0000313|Proteomes:UP000030645}
RP   NUCLEOTIDE SEQUENCE.
RA   He N., Zhao S.;
RT   "Draft Genome Sequence of a Mulberry Tree, Morus notabilis C.K. Schneid.";
RL   Submitted (JAN-2013) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KE346346; EXC33984.1; -; Genomic_DNA.
DR   RefSeq; XP_010112531.1; XM_010114229.1.
DR   AlphaFoldDB; W9SIY4; -.
DR   STRING; 981085.W9SIY4; -.
DR   eggNOG; ENOG502QQ6H; Eukaryota.
DR   Proteomes; UP000030645; Unassembled WGS sequence.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   InterPro; IPR045012; NLP.
DR   InterPro; IPR000270; PB1_dom.
DR   InterPro; IPR003035; RWP-RK_dom.
DR   PANTHER; PTHR32002; PROTEIN NLP8; 1.
DR   PANTHER; PTHR32002:SF41; PROTEIN NLP8; 1.
DR   Pfam; PF00564; PB1; 1.
DR   Pfam; PF02042; RWP-RK; 1.
DR   SMART; SM00666; PB1; 1.
DR   SUPFAM; SSF54277; CAD & PB1 domains; 1.
DR   PROSITE; PS51745; PB1; 1.
DR   PROSITE; PS51519; RWP_RK; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000030645}.
FT   DOMAIN          582..672
FT                   /note="RWP-RK"
FT                   /evidence="ECO:0000259|PROSITE:PS51519"
FT   DOMAIN          861..943
FT                   /note="PB1"
FT                   /evidence="ECO:0000259|PROSITE:PS51745"
FT   REGION          562..601
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          815..855
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        563..578
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   961 AA;  105464 MW;  7EF62CC3BB644E66 CRC64;
     MEHPFSSKEK EKESEYWPLS RAQVENFPSF DGGARSVVQE DVFTNFSDLL NFDSYAGWCN
     SPAVTDQASA TYGLSSLPSV AYAALDAPNF IEQSVGALPG TEVGGNLGRS SFNFGDKIVF
     QPADTQFEVS AHSNAANDSV AKQTNASVQG NSQIDAVNTY RPTRCSLDEK MLRALSVVKE
     SSGGGILAQV WVPVKRGDQL FLSTSEQPYL LDHMLAGYRE VSRMYTFGAE GNSGRVLGLP
     GRVFVSKVPE WTSNVCYYQK NEYLRSEHAF SHQVRGSMAL PVFEPDPTMP CCAVLELVTT
     KEKSNFDKEM EIVCNALQAV NLRTNAHPRL VPQCLSNDQK DALAEIIDVL RAVCHAHRLP
     LALTWIPCCY TEGADGEYVR VRVREGKLSA NEKCILCIEE TACYVNDRVM QGFAHSCMEH
     HLEEGQGLAG KALQSNLPFF LPDVKTYDIN EFPLVHHARK FGLNAAVAIR LRSTYTGDCD
     YILEFFLPVN MKGASEQQLL LNNLSGTMQR ICKNLRTVSD TEIVGAGSND AFQKDVVSNL
     PSLSRESSQM VLSDSDLNSV DELPSKVSKR RNKGFEGDGV REQGMSGSRR QTEKKRSTSE
     KNVSLSVLQQ YFSGSLKDAA KSIGVCPTTL KRICRQHGIS RWPSRKINKG VEGGLKFDPT
     TGGLVAAGSI AQEFDTRKGL FFTEKTQSLQ SSDPISAIKS EEDDCTGGAM VNPNSVEIRM
     SNIDTQTNSA QESKVIAVDA GSERASYDTM SGPFLEKASF GFYHAKEVRT LNQRKINSKF
     ENSDCHHVFR DSVCLDAGDE MDTVGDGANE LIEHNQPASS SMTDSSNGSG SMLHGSSSSS
     QSFENPKHPK GKTSCVDSSS KIVVKATYKE DTVRFKFDAS AGCLQLYEEV AKRFKLQTGT
     FQLKYLDDEE EWVMLVSDMD LQECLEILDD VGTRSVKFQV RDMPCAVGSS GSSNCFLAGG
     S
//