ID I1MYQ8_SOYBN Unreviewed; 2735 AA.
AC I1MYQ8;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 2.
DT 27-MAR-2024, entry version 55.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KRG97561.1, ECO:0000313|EnsemblPlants:KRG97561};
GN ORFNames=GLYMA_18G016000 {ECO:0000313|EMBL:KRG97561.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:KRG97561};
RN [1] {ECO:0000313|EMBL:KRG97561.1, ECO:0000313|EnsemblPlants:KRG97561}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRG97561};
RC TISSUE=Callus {ECO:0000313|EMBL:KRG97561.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRG97561}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRG97561};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRG97561.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRG97561.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000851; KRG97561.1; -; Genomic_DNA.
DR SMR; I1MYQ8; -.
DR STRING; 3847.I1MYQ8; -.
DR PaxDb; 3847-GLYMA18G01940-1; -.
DR EnsemblPlants; KRG97561; KRG97561; GLYMA_18G016000.
DR Gramene; KRG97561; KRG97561; GLYMA_18G016000.
DR eggNOG; KOG1823; Eukaryota.
DR HOGENOM; CLU_000327_0_0_1; -.
DR InParanoid; I1MYQ8; -.
DR OMA; EGLMAMF; -.
DR Proteomes; UP000008827; Chromosome 18.
DR GO; GO:0030686; C:90S preribosome; IBA:GO_Central.
DR GO; GO:0005730; C:nucleolus; IBA:GO_Central.
DR GO; GO:0032040; C:small-subunit processome; IBA:GO_Central.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 2.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008827}.
FT DOMAIN 937..1552
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 1790..2006
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
SQ SEQUENCE 2735 AA; 310334 MW; 88E60A485B1816A6 CRC64;
MLREWTKIKT RKKTNPYSLR FLWRHLCCNC NTNTSTEQNM ATASQARAVK SLNKSPGGRR
FVFKSFSDRV DEIDINVYRS LDKVKAEPSE GSSFFRDCLI EWRELNTAED FISLYEEIMP
YTQTLPLVLL HKESLISKLL SRLHIKARLS LEPILRLIAA LSRDLLEEFV PLLPRIVDSL
VSLLESGGDR EPDIIEQIFM SWSYIMMYLQ KYLVRNPSEV LKVTSKLRYY PKEYVQQFMA
EAMSFVLRNA PDEQLKRGIR RVIDDAVKKP SLCRESGVEA LVFNIMKGHS SRFHSKAERV
LQLLTSEAIY PIGDKADQDS MIILKIVKSV FKKLCEKMES KELDLVWNCI YKEVNECLNT
GNSRHLRHIL SVLVSAIKVQ NGQKVSDYKP MLELVLLLVQ TFIKPCGVID SQEDIYLVVD
KILKLMLAIL KGLCNCNTSM ISECAFKWAP IFESPPIFKS ASSSLLRFIR ELLQENLCLL
HFRRNVISAM NDLMEISEEE VIHLLRSFCE KMQLDKQNSD FVDGTSEEAP LTRICSRLQE
IICCWKGKIN DIAHADVLCQ IDEGVLALLW GAVSCYAHMC IVGANPSLMV ELVDAVDNFL
TVKSDCIGDM SKKAWESIIG AALSSFNRLY SNSNHGADET GKFLSLAKRY KSSPQVLFAV
AGYLEFKHGS LLEDAVYRIY HPELEEKTAD AVATFSDNLH HSDKEIRIST LKILCHYKPL
GWENSSVDQP VAKKRKTEVS PTLNVECTEN NALLLLLSIE TTPISISSSR SIQLFISKIQ
MELSAGRIPN VYVPLVLNGL FGILNNRFSY LWNPVLECIA VLISLHFLRV WDSLVAYLER
CQTIFDTPSN LHGSVNGALF DQPAGLVDCF KLFVYHASDS TPSVTILALL LQALQKIPTV
IEPRSRQFIP LFLKFLGYPD LVSVGLFDSH ACKGKEWKAI LKEWLNLLKL MKNPKSFYCG
QFLKDVLQHR LLEENDTEIQ MRVLDCLLIW KDDYILPYVE HLRNLISSKN LREELTTWSL
SRESEIIEEC HRAYLVPLVI RLLMPRVRKL KGLASRKKAS ICHRKSILSF IAGLDVVELP
LFFALLIKPL QIVKKTDGPA NLFWTSDKVS IDEFQADALL EYFTLDNIAN LSWKKKYGFL
HVIEDIIGVF DELHIRPFLD LLVGCVVRLL ESCTSSLHAN LNGLPSDQHN CSTSSNSLGE
DSVPTNQTQI NGTLNQLKDM RSLCLKIISL VLNKYEDHEF SSDLWDRFFS AVKPLVDKFK
QEAASSEKPS SLLSCFLAMS ANNKLVALLY RKESLVPDIF SIISVNSASE AVIYCVLKFV
ENLLSLDNEF NDEDNSAQRV LLSNIKVLMD SMCCLFGSDN AIKRKLIKSP GETVIRILEF
LPKYISEAEL AKQFVDILLL FLENKTQNSD VRVEALQVIQ NIIPILGHGS TAKILSAVSP
LYISAELDMR LRICDLLDAL VASDASLLSV AKLLRQLNAT STLGWLDHDA ILNAYGIINT
DFFRSVQVEH ALLILSHCVH DMSSEETTFM FSAYSSLLSF VDFSAHILCQ EGNSEEQLSV
MRNTDSCWTK SCIQRTAKKF LLKHMADAMD GSLSVIKGWI KLLHQMVLKL PEVSNLKSLM
VLCNEDGEVN FFDNITDSVI RKRVKALSWF RNVISVNKFS EFITEKVFMR LFFNMLYDEK
EGKAEHMKNA CIETIASVSG QMGWKSYYAL LIRCFWGASR SPDKQKLFIR LICSILDKFH
FSEVPHNKEP KESLGGVSDM DITDTDVNKE IQTCLYKVVL PKIQKLLNSD SEKVNVNISL
AALKLLKLLP GDVMDLYLPT IVHRISNFLK SHLESIRDEA RSALATCLKE LGLEYLQFIL
KVLQSTLRRG YELHVLGYTL NFILSKCLSS PVAGKIDYCL EDLLSVIEND ILGDVAEQKE
VEKIASKMKE TRRKKSFESL KLVAQNVTFK SYALKLLAPV TAHLKKHITP NVKGKLENML
QHIATGIESN PSVDQTDLFI FVYGIIEDGL NDEIGWHENK LLKLEGKDSR INAKRISTGH
VVANGLLCSH LITVFGLRIF HKRMKSMKQD VKDENTLSLL DPFVKLLCDG LCSKYEDILS
TSLGCLAILV KLPLPSLQQH AERVKAALLD IAHGSVNSIS PLMQSCLTLL TVLLRNTKIS
LTSDQISLLI HLPIFLDLEK NPSLVALSLL KGIVSRKMVV PEIYDLVTTV AELMVTSQME
PVRKKCSKIL LQFLLDYRLS EKRLQQHLDF LLSNLRYEHS TGRESVLEMI HAIIVKFPRS
VLDEQSHILF VHLVACLAND NDNIVRSMSG AAIKKLISSV SPNSLKSILE YALSWYLGGK
QQLWGAAAQV LGLLIEVKKK GFQEHINCIL PVTKHILHSA VDAVTNRQEG FSAESAIPLW
KEAYYSLVML EKMINQFRDL CFAKYLETFQ DIWEAISEML LHPHSWIRNR SVRLVALYFA
RATDVSRETN GSSLRSYFIM SPSRLFLIAT SLCCQLKMPF INDADSSLMT QNIVFAICGV
HSLMGQNACI DPPAFWSTLE QQEKDRFLKA FDLLDSRKGR SMFMSSSFSS IYEDNNQLNV
DNAQRALVSL LLRKMGKIAL QMDVIQMGIV FNSFGNIMAQ ISQDDCQHYA HVILLPLYKV
CEGFAGKVVT DNVKKLAEDT CKKLENILGT QNFVQVYNLI RKNLKLKRNK RRQEEKLMAV
INPMRNAKRK LRITAKNRAN KKRKITTIKM GRWMR
//