ID A0A2P5A462_PARAD Unreviewed; 927 AA.
AC A0A2P5A462;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=NIN-like transcription factor {ECO:0000313|EMBL:PON31336.1};
GN Name=PanNLP1 {ECO:0000313|EMBL:PON31336.1};
GN ORFNames=PanWU01x14_370700 {ECO:0000313|EMBL:PON31336.1};
OS Parasponia andersonii (Sponia andersonii).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Rosales; Cannabaceae; Parasponia.
OX NCBI_TaxID=3476 {ECO:0000313|EMBL:PON31336.1, ECO:0000313|Proteomes:UP000237105};
RN [1] {ECO:0000313|Proteomes:UP000237105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. WU1-14 {ECO:0000313|Proteomes:UP000237105};
RA Van Velzen R., Holmer R., Bu F., Rutten L., Van Zeijl A., Liu W.,
RA Santuari L., Cao Q., Sharma T., Shen D., Roswanjaya Y., Wardhani T.,
RA Kalhor M.S., Jansen J., Van den Hoogen J., Gungor B., Hartog M.,
RA Hontelez J., Verver J., Yang W.-C., Schijlen E., Repin R., Schilthuizen M.,
RA Schranz E., Heidstra R., Miyata K., Fedorova E., Kohlen W., Bisseling T.,
RA Smit S., Geurts R.;
RT "Parallel loss of symbiosis genes in relatives of nitrogen-fixing non-
RT legume Parasponia.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PON31336.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JXTB01001115; PON31336.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P5A462; -.
DR STRING; 3476.A0A2P5A462; -.
DR OrthoDB; 603551at2759; -.
DR Proteomes; UP000237105; Unassembled WGS sequence.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR CDD; cd06407; PB1_NLP; 1.
DR InterPro; IPR045012; NLP.
DR InterPro; IPR000270; PB1_dom.
DR InterPro; IPR034891; PB1_NLP.
DR InterPro; IPR003035; RWP-RK_dom.
DR PANTHER; PTHR32002:SF44; PROTEIN NLP5; 1.
DR PANTHER; PTHR32002; PROTEIN NLP8; 1.
DR Pfam; PF00564; PB1; 1.
DR Pfam; PF02042; RWP-RK; 1.
DR SMART; SM00666; PB1; 1.
DR SUPFAM; SSF54277; CAD & PB1 domains; 1.
DR PROSITE; PS51745; PB1; 1.
DR PROSITE; PS51519; RWP_RK; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000237105}.
FT DOMAIN 582..663
FT /note="RWP-RK"
FT /evidence="ECO:0000259|PROSITE:PS51519"
FT DOMAIN 830..913
FT /note="PB1"
FT /evidence="ECO:0000259|PROSITE:PS51745"
FT REGION 54..84
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 517..557
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 682..707
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 722..748
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 530..552
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 682..697
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 927 AA; 103779 MW; A290E55C8E369D7E CRC64;
MEDGVLSPAT ILGAPADYPM DLDFMDELFL EGCWLETRDG SEFLNQNPPS SNPLFDPLFW
PTLEPDGESN ANPSPKSNQE ERHRSLFVES QGKSPLHTLP PTRATTDVVK YSGVSEAHIT
EGSSELSRRW WIGPKANPGP SSSVMERLWR ALMYIKDVIR DKDILVQIWV PVHKEGRRVL
TTRDLPFALD DSGSKLARYR DISVKYQFSA EEDSKDLVLG LPGRVFSGKV PEWTPDVRFF
RNDEYPRLIH AQQIDVRGTL ALPIFELDSR TCLGVVEIVM TTQKIKYRPE LESVCKALEA
VDLKSSEVLS TQNVYNEYYQ AAIPEIQQVL RSACDTHRLP LAQTWVPCIY QGKEGCRHSD
ENYGQCVSTV DHACYALDPQ VQSFHEACSE HHLFRGQGVV GLAFMTNQPC FSADITSYTK
TEYPLSHHAR MFRLQAAVAI RLRSIHASAA DFVLEFFLPV DCKDPDEQKK MLTSLSLIIQ
QCCQSLRVIT DKELEEESCS RVDEVVVPSN LRPARNTCFT EAPQNDTDLS LFPEEKKPRE
ISDGRLSKLS DNQRDSSLKP SVECVEECST VGEGSFSSVG VGKTGERRRA KAEKTITLQV
LRQYFAGSLK DAAKSIGVCS TTLKRICRQH GIKRWPSRKI KKVGHSLQKL QLVIDSVQGA
SGAFQIDSFY TNFPELASPN VSGTSPFSTS KLNDHPLPSN MQPGDGGIFS VQAATAAATS
KSSSSCSQSS SSSHCCSSRS QLHPQTWNNV TSSDDLIAGE NSGGGDDVVL KRVRSEAGLN
ACSEDDRKLL PRSQSHKSLM KHHKTDKWFP PSSAKNNNNG ARIPQQQGEF QRVKVTYGED
KTRFRMQNNW GFIDLQQEVG RRFGIQEMVK FTLKYLDDDS EWVLLTCDAD LEECFEVYRS
SQNATIKLSL QPSRHLYRGC LRGNDPL
//