ID A0A2K1XBJ6_POPTR Unreviewed; 1939 AA.
AC A0A2K1XBJ6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE RecName: Full=S1 motif domain-containing protein {ECO:0000259|PROSITE:PS50126};
GN ORFNames=POPTR_016G062600 {ECO:0000313|EMBL:PNS98149.1};
OS Populus trichocarpa (Western balsam poplar) (Populus balsamifera subsp.
OS trichocarpa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus.
OX NCBI_TaxID=3694 {ECO:0000313|EMBL:PNS98149.1, ECO:0000313|Proteomes:UP000006729};
RN [1] {ECO:0000313|EMBL:PNS98149.1, ECO:0000313|Proteomes:UP000006729}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nisqually {ECO:0000313|Proteomes:UP000006729};
RX PubMed=16973872; DOI=10.1126/science.1128691;
RA Tuskan G.A., Difazio S., Jansson S., Bohlmann J., Grigoriev I.,
RA Hellsten U., Putnam N., Ralph S., Rombauts S., Salamov A., Schein J.,
RA Sterck L., Aerts A., Bhalerao R.R., Bhalerao R.P., Blaudez D., Boerjan W.,
RA Brun A., Brunner A., Busov V., Campbell M., Carlson J., Chalot M.,
RA Chapman J., Chen G.L., Cooper D., Coutinho P.M., Couturier J., Covert S.,
RA Cronk Q., Cunningham R., Davis J., Degroeve S., Dejardin A.,
RA Depamphilis C., Detter J., Dirks B., Dubchak I., Duplessis S., Ehlting J.,
RA Ellis B., Gendler K., Goodstein D., Gribskov M., Grimwood J., Groover A.,
RA Gunter L., Hamberger B., Heinze B., Helariutta Y., Henrissat B.,
RA Holligan D., Holt R., Huang W., Islam-Faridi N., Jones S.,
RA Jones-Rhoades M., Jorgensen R., Joshi C., Kangasjarvi J., Karlsson J.,
RA Kelleher C., Kirkpatrick R., Kirst M., Kohler A., Kalluri U., Larimer F.,
RA Leebens-Mack J., Leple J.C., Locascio P., Lou Y., Lucas S., Martin F.,
RA Montanini B., Napoli C., Nelson D.R., Nelson C., Nieminen K., Nilsson O.,
RA Pereda V., Peter G., Philippe R., Pilate G., Poliakov A., Razumovskaya J.,
RA Richardson P., Rinaldi C., Ritland K., Rouze P., Ryaboy D., Schmutz J.,
RA Schrader J., Segerman B., Shin H., Siddiqui A., Sterky F., Terry A.,
RA Tsai C.J., Uberbacher E., Unneberg P., Vahala J., Wall K., Wessler S.,
RA Yang G., Yin T., Douglas C., Marra M., Sandberg G., Van de Peer Y.,
RA Rokhsar D.;
RT "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray).";
RL Science 313:1596-1604(2006).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM009305; PNS98149.1; -; Genomic_DNA.
DR STRING; 3694.A0A2K1XBJ6; -.
DR InParanoid; A0A2K1XBJ6; -.
DR OMA; GQYLRAY; -.
DR OrthoDB; 167902at2759; -.
DR Proteomes; UP000006729; Chromosome 16.
DR ExpressionAtlas; A0A2K1XBJ6; baseline and differential.
DR GO; GO:0005730; C:nucleolus; IBA:GO_Central.
DR GO; GO:0032040; C:small-subunit processome; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IBA:GO_Central.
DR GO; GO:0006364; P:rRNA processing; IEA:UniProtKB-KW.
DR CDD; cd05703; S1_Rrp5_repeat_hs12_sc9; 1.
DR CDD; cd05693; S1_Rrp5_repeat_hs1_sc1; 1.
DR CDD; cd05694; S1_Rrp5_repeat_hs2_sc2; 1.
DR CDD; cd05695; S1_Rrp5_repeat_hs3; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 11.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR003107; HAT.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR045209; Rrp5.
DR InterPro; IPR048059; Rrp5_S1_rpt_hs1_sc1.
DR InterPro; IPR003029; S1_domain.
DR InterPro; IPR008847; Suf.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23270; PROGRAMMED CELL DEATH PROTEIN 11 PRE-RRNA PROCESSING PROTEIN RRP5; 1.
DR PANTHER; PTHR23270:SF10; PROTEIN RRP5 HOMOLOG; 1.
DR Pfam; PF00575; S1; 5.
DR Pfam; PF05843; Suf; 1.
DR SMART; SM00386; HAT; 6.
DR SMART; SM00316; S1; 15.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 12.
DR SUPFAM; SSF48452; TPR-like; 2.
DR PROSITE; PS50126; S1; 14.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000006729};
KW rRNA processing {ECO:0000256|ARBA:ARBA00022552}.
FT DOMAIN 141..223
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 239..305
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 328..398
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 414..474
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 505..572
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 592..661
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 676..748
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 768..837
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 881..945
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1071..1146
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1170..1241
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1276..1350
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1384..1453
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT DOMAIN 1474..1543
FT /note="S1 motif"
FT /evidence="ECO:0000259|PROSITE:PS50126"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 55..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1543..1572
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1644..1673
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..28
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 59..93
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1543..1561
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1939 AA; 216876 MW; B0001B10D861F22A CRC64;
MTAPSKKKQQ NPKQNDGPKF NKASQKQFKA KQTKNKKSFS NDAVVKDASI ALQLEDDVPD
FPRGGKSSLS QREREEIRAQ VDEEFEGEER RLNKKNKKGK KFQNKSSQLS GDDLGSLFGD
VLTGKLPRFA NKITMKNISP GMKLWGVVTE VNEKDLVISL PGGLRGLVRS VDAVDPVLTD
QIEDGEGSLP RVFHVGQLVS CIVLKLDDDK NDNKKRKIWL SLRLSLLHNG FSLDAVKEGM
VLTAYVKSIE DHGFILHFGL SSFMGFLPKN SQAESRDSEV KTGQFLQGIV TKIDKTRKVV
YLSSDPDTVS KCVTKDLKGI SIDLLIPGMM VDARVQSTLE NGIMLSFLTY FTGTVDMFHL
QNTFPTSNWK DDYAKNKKVS ARILFIDPST RAVGLTLNQH LVHNNSPPSS VKVGDIYDIA
KVVRVDKGMG LLLEIPSTPL PTPAFVNVSD VAEDEVRKLE KKFKEGSNVR VRILGYRHLE
GLATGILKAS AFEGSVFTHS DVKPGMATRA KIIAVDSFGA IVQFPGGVKA LCPLRHMSEF
EIVKPRKKFK VGAELFFRVL GCKSKRITVT HKKTLVKSKL PILSSYSDAT DGLITHGWIT
KIEKPGCFVH FYNGVQGFAP RSELGLEPGS DAISTYQVGQ VVKCRVISSI AASRRINLSF
IMKPLRFSEE DGIKMGSVVT GVIDKVTASS VIVYVNAKDY LKGTIATEHL SDHHEHAALM
KSVLKPGYEF DQLLVLDIES NNLALSAKYS LIKSASQLPS DLSQIRPQSI VHGYICNMIE
TGCFVRFLGN LTAFSPRSKA MDDQRSQLSE AFYIGQSVRS NILDVNNETS RITVSLKQSC
CSSTDACFLQ EYFLSENKIA DLQSSDSKGR DLKWVEGFHI GSTIEGKIQE SKEFGVVVSF
EKHNDVFGFV SHHQLGGAMV KAGANVRAAV LDVAKTERLV DLSLKLEFLD KSRDKSSNSL
THKKKRKGEM SKDLEVHQTV NAVVEIVKEN YLVLSIPEHN YAIGYASVSD YNTQKISQKQ
FLNGQSVSAT VMALPTPSTA GRLLLLLKSI SEVTETSSSK KAKRKSSCNV GSLVQAEITE
IKPLEMRLKF GIGFRGRIHI TEVNDTCLLE NPFSNFRVGQ TVSARIIAKA GQSDNKKSQL
WDLSIKPKML EDSCMIEDKL VPKEYEFSSG QHVSGYVYKV DGEWAWLTIS RHLKAKLFVL
DSACEPSELQ EFQKRFYVGK AVTGHVLNYN KEKASLRLAL HPFAASQTLV DGGAPIMDDL
QGNAPWDNVT AHIREGDIVG GRISKILPGV GGLLVQLGPH IHGRVHFTEL QDSWVPDPLS
AYKEGQFVKS KVLEISHPVK GTIHIDLSLR LSLNGMLGQN SAEFSNNQDA PSKHVDKIED
LQPDMVVQGY VKNVSSKGCF ISLSRKLDAK ILLSNLSEGY IDDPEKEFPI GKLLTGRVLS
VEHLSKRIEV TLKKSGVSNA SKSENSDLSR LHVGEIISGR IKRVESYGLF IALDHTNLVG
LCHVSQLLDH IGNIESKYKA GEKVTAKILK VDEERRRISL GMKNLDVRDD MNSSKEESDE
EKSENESMDD SNAQIKIIPE SSLLGIHNID VECQNERSIL AQAESRASIP PLEVALDDTE
HSHPDDVLLQ NQGHIDEADT MVKKNKQEKK KPKKLSEQEI SAAEERRLEE DEPRTADEFE
MVIRSSPNNS FLWIAYMRFM LSLADIEKAR SIAERALNTI NIREEDEKLN IWVAYFNLEN
EYGNPPEDAV KKVFQRALQY CDPKKVHLAL LKMYKKTNQN KLAEELLDKM IKKFKHSCKF
WLKRVKWLLK QKQDGVQSVV QRALLCLPRH KHIKFISQTA IREFKCGVAD RGRTLFEEIL
REYPKRTDLW SVYLDQEIKL GDVDVIRSLF ERAISLSLPP KKMKFLFKKY LEYEKSYGDE
KQIESVKQKA MEYVQNTLA
//