ID A0A0B2PM57_GLYSO Unreviewed; 1033 AA.
AC A0A0B2PM57;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=DUF3741 domain-containing protein {ECO:0000259|Pfam:PF14383};
GN ORFNames=D0Y65_013300 {ECO:0000313|EMBL:RZC14212.1}, glysoja_040671
GN {ECO:0000313|EMBL:KHN10516.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:KHN10516.1};
RN [1] {ECO:0000313|EMBL:KHN10516.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Root {ECO:0000313|EMBL:KHN10516.1};
RA Lam H.-M., Qi X., Li M.-W., Liu X., Xie M., Ni M., Xu X.;
RT "Identification of a novel salt tolerance gene in wild soybean by whole-
RT genome sequencing.";
RL Submitted (JUL-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:RZC14212.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZC14212.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN664312; KHN10516.1; -; Genomic_DNA.
DR EMBL; QZWG01000005; RZC14212.1; -; Genomic_DNA.
DR OrthoDB; 473317at2759; -.
DR Proteomes; UP000289340; Chromosome 5.
DR InterPro; IPR032795; DUF3741-assoc.
DR PANTHER; PTHR34282:SF1; EXPRESSED PROTEIN; 1.
DR PANTHER; PTHR34282; OS01G0228800 PROTEIN-RELATED; 1.
DR Pfam; PF14383; VARLMGL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000289340}.
FT DOMAIN 322..339
FT /note="DUF3741"
FT /evidence="ECO:0000259|Pfam:PF14383"
FT REGION 269..323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 343..378
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 432..508
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 574..594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 637..661
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 269..320
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 451..479
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 482..508
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 576..594
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1033 AA; 116616 MW; 9AC9317289F7F058 CRC64;
MAKRSDFAQK LLDDLRLRKE RMASSQTQRS NQSHHLPIDA YAYTKQTYRG SRHTKATEIV
SSKTGEMLNS SSRSYRSVNN GQVSNQMVTY GKGQSSRQMG DMSLALAFAF ENGGKLRRND
SIMGFLHQIK RGTLEFSMSE RQLASTSNYP MQISEISKGA QKLNQILRAC SNGLNMDSYS
IQFAKELLQG AIDLEESLRM LVDLQNNSQF MITSQKKNRI TLLEEDNDDD NDTGMEMQLA
QPTFSFNKHT AENIQQFGKA IFMQRPITLT SSKEGRNSNN ENKNVKRQVS QKRSTKSSSD
IKNVNAISEG KNQTASNPEK GRIPNVIAKL MGLDILPDKV EKESKRAMLQ KREGTSPKHA
AKGSTKKTEL KSKETDNLMP MKNQKVIEAF KVPATQGKEM IFGANKKLLV EKTSSEVAVR
NGIIALKGFD KPSIKADKPT KSSPQKNLTR ESQKDVQEIG RKQNHPNNNN REQKGTRKGR
ANDPIPNNQP EQVCERSQVN SLTQEDKEID GNTVQCEKRH TNTHVMNNEK KPWNNVGVQK
SYVLSKNGPH EEKHRREQKL QLKEEHMLMM RPQGGSEMAS KNSPKSPHQL INPQKKQLSM
NQVTLFKKSS GEKNVASMKS EGLLTNHHDL VRDEASNATN ENVKESIHRK SGQISSPRDQ
EFELAKRNGI KTLMDEKHVN KLASKKIKNT RKQKVGMPGK IDQVLTGRNG AKLITKQGKQ
QIPTPDKFEV LNEAERERVS MLRETDAHII NSNEPVSVAV TEPLDMRHQP CKEAELPPTL
SSSVGGELQS QQELVAIVPN DLYCQDVQSL QDEAVPVAAD EGSVTGEVAL HKTNGLDEER
LCVNNSNLNI SEKSIQQPLT ESENCLKWIL VMSQLFVNTA EALFKLNIPF NVLQGGGREN
QDEGSKLILD CGYEVMKRKG IRQELKVHSY SRISMGSMNI ISLDDLVRQL NEDMEKLKLY
GRKKSCQADD VEDYQSKMLE HDVYDRDPDM NCMWDLGWND ETVAFIEKYD VIRDTEKHIL
SVLLDEITVD FCM
//