ID C5WVA8_SORBI Unreviewed; 2212 AA.
AC C5WVA8;
DT 01-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2016, sequence version 2.
DT 24-JAN-2024, entry version 55.
DE RecName: Full=Virilizer N-terminal domain-containing protein {ECO:0000259|Pfam:PF15912};
GN ORFNames=SORBI_3001G191700 {ECO:0000313|EMBL:EER93923.2};
OS Sorghum bicolor (Sorghum) (Sorghum vulgare).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; PACMAD clade;
OC Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum.
OX NCBI_TaxID=4558 {ECO:0000313|EMBL:EER93923.2, ECO:0000313|Proteomes:UP000000768};
RN [1] {ECO:0000313|EMBL:EER93923.2, ECO:0000313|Proteomes:UP000000768}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768};
RX PubMed=19189423; DOI=10.1038/nature07723;
RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J.,
RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J.,
RA Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., Chapman J.,
RA Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., Maher C.A., Martis M.,
RA Narechania A., Otillar R.P., Penning B.W., Salamov A.A., Wang Y., Zhang L.,
RA Carpita N.C., Freeling M., Gingle A.R., Hash C.T., Keller B., Klein P.,
RA Kresovich S., McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman,
RA Ware D., Westhoff P., Mayer K.F., Messing J., Rokhsar D.S.;
RT "The Sorghum bicolor genome and the diversification of grasses.";
RL Nature 457:551-556(2009).
RN [2] {ECO:0000313|Proteomes:UP000000768}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768};
RX PubMed=29161754; DOI=10.1111/tpj.13781;
RA McCormick R.F., Truong S.K., Sreedasyam A., Jenkins J., Shu S., Sims D.,
RA Kennedy M., Amirebrahimi M., Weers B.D., McKinley B., Mattison A.,
RA Morishige D.T., Grimwood J., Schmutz J., Mullet J.E.;
RT "The Sorghum bicolor reference genome: improved assembly, gene annotations,
RT a transcriptome atlas, and signatures of genome organization.";
RL Plant J. 93:338-354(2018).
CC -!- SIMILARITY: Belongs to the vir family. {ECO:0000256|ARBA:ARBA00008371}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000760; EER93923.2; -; Genomic_DNA.
DR STRING; 4558.C5WVA8; -.
DR EnsemblPlants; EER93923; EER93923; SORBI_3001G191700.
DR Gramene; EER93923; EER93923; SORBI_3001G191700.
DR eggNOG; KOG4822; Eukaryota.
DR HOGENOM; CLU_006767_9_0_1; -.
DR InParanoid; C5WVA8; -.
DR OMA; WIGFAID; -.
DR OrthoDB; 549513at2759; -.
DR Proteomes; UP000000768; Chromosome 1.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0036396; C:RNA N6-methyladenosine methyltransferase complex; IBA:GO_Central.
DR GO; GO:0080009; P:mRNA methylation; IBA:GO_Central.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR InterPro; IPR031801; VIR_N.
DR InterPro; IPR026736; Virilizer.
DR PANTHER; PTHR23185:SF0; PROTEIN VIRILIZER HOMOLOG; 1.
DR PANTHER; PTHR23185; UNCHARACTERIZED; 1.
DR Pfam; PF15912; VIR_N; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000000768}.
FT DOMAIN 8..121
FT /note="Virilizer N-terminal"
FT /evidence="ECO:0000259|Pfam:PF15912"
FT REGION 1547..1649
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1662..1917
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2053..2088
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2134..2165
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1571..1595
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1626..1644
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1673..1689
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1744..1766
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1773..1802
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1827..1843
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1876..1917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2067..2084
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2212 AA; 243535 MW; C9BBE2CEFC51CA8E CRC64;
MGRPEPVVLF AQTILHSQLD EYVDEVLFSE PVVITACEFL EQNASPSTPN ISLIGATSPP
SFALEVFVHC DGESRFRRLC HPFLYSHSSS NVLEVEAIVT NHLVLRGTYR SLTLVIYGNT
AEDLGQFNIE LGLDHSVANV VSSPSEGKLE DLPPALLSSK LSFEESLSSL KPLSFHATDV
DLSIEAKKVL HLALKMYQMS DVENLIPNLR SAVLSAISKY VTASTNHILH TSSQDSANSF
TKSDFDSQEI NNILAEAGNE LSEIWKNVHA VTDSNLFNDN GFTIGGDEDL PTTKILIELF
NQCFPYYKNF SLLDLQCPSQ NKWLVLSLSL VLLLCSSKES CFYFVDTGGM EQIINLLCWK
TPKSAATTLL LLGIVEHATR NGFGCEAFLG WWPQTEHSSI RVASSNGYCS LLKLLLEKER
HDIASLATYV LQRLRFYEIL SKYESVVVKV ISNLQADKVS TDGVPFLISA SVELAEMLKL
IICCGPIEDP SPVATARRLF KSEHLEGLLS YKATIDLISS SKYSFLQYDT DPYLLSLIQE
RSFFPLSAAL LSSPILHSAS GPAAEILMGI ASSIESLILS LLFCRSGLSF LLSQPEATEL
IVLSLQDAEN MNKAECITLR QAFVLLSKGF FCRPKEVGMI TELHLKVGSA ANRILSVPPN
SDELLWVLWE LCAISRSDSG RQALLALGYF PEAISVLLRS LSSYKDLDSV MAKNGGSPLG
LAIFHSAAEI LEVLVADSTA SSLRSWIGFA VDLHKALHSS SPGSNRKDAP TRLLEWIDAG
VVYQRNGARG LLRYSAILAS GGDAHLSSGN VLVSDSMDVE NVVADSNSSS DGLVIDNLLG
KLVADKYFDG VALCSTSVVQ LTTAFRILAF ISDDKAVASS LFEEGAITVI YIVLMNCKSM
LERISNSYDY LVDEGAELSS TTELLLDRTH EQAIVDLMIP SLVLLINLLH ILRETKEQYR
NKKLLSSLLQ LHREVSPRLA ACAADLSFMF PTFAIGFGVV CHLITSALAC WPLYNWAPGL
FHCLLENIEA TNASVPLGPK AAISLLCLLG DLFPDEGIWL WKVELPSLSA IRSLSTGTVL
GPQVEKDVNW YLHPEHVAIL LVRLMPQLDR LARIIDNFAT SALMVIQDML RVFIVRVASE
KIECAVVLLR PIFIWLDDKV DKTSLSEREI FKVHQLLQFT VKLSEHPTGK VLLWRMEFTR
ILRKLLQNCS RSSFSDDNQT FGRAPSKNDL MLKWRIPLFK SIACVFSIDT SNNEKAVIEE
SLNEKSVHEC SSVMQHLVMF CQVLPVGREM LACSLAFKEL AASYTCRSAV TLILSQIHTS
NKDVLEKDES DPNHNLPTLD GWNCFSSLFN CWKKLAKYIG SNQPTDYLVE TIYSLTLGAI
TLSQYGENLE GLLILRYLFG LPSDPSGSLE SSGESPSEIE LFMKTSEEKI CQSFENSTTV
DGKTLLHKLL NSITLLRSIL ENSGQSADSV QMVIQEGTDS LSEIAHSVVM TADLMPSLAN
VSVKDESPFL FSNVWKVIVD SEEPLDCQEG EFAKRLVWEL PDSSLDRQLT PGQSARRKLA
LGESASRRVR DNQLPEPTGQ FSRGLNTTNA SSGHTRRDTF RQRKPNTSRP PSMHVDDYVA
RERNIDGASS ASNIVNSTPR GTLSGRPPSI HVDEFMARQR ERQNPVPAPT GDAPQPKSQT
ASLDGSLRTK PENLRQPKTD LDDDQEIEIV FDEESGSDDK LPFPQPDDSL QSPPVIIGEN
SPGPVIETEN QENERIPFSQ RATSLPKDDE SPGVDISSQT AMLSEPNNSL ELKYSVSSPG
KNSFRDHAEK SNYPSIGVSG RSSVQADHQH LSRRHEKRSP RKYSETSLSS GSHGHEHRHS
NNHPPLPPMP PPISSVPMQN TDSANRQSSS FSARDRPTPS LSGYPTQSFD SSMPSAFTGL
QGQTQYMLAG AGGSSANDLP NAEAKLLWNT FPVNRIPLET FSSGLSARPM PPLTPYSAVA
TQHAPMSSSS PATLYNQGSV VQPSPTASII SDSNLAMNSN LLPSFASQFL MGRPSMPTPF
FGTPLQQVQF SSGLPQNISN SQPSVSSVQP RPPPPPPPPQ QPHPSQTLQQ LGAIQLPHQD
QQLPYPQSAI LPQVPLQFPN QLPIPQLQLY HQSQQESGQT LRQVGEQSQL QNQGMQADSF
SQQQQDSGIN LNQFFSSPEA IQSLLSDREK LCQLLEQNPK LMQMLQDRIG QL
//