ID A0A372Q880_9GLOM Unreviewed; 507 AA.
AC A0A372Q880;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Homeodomain-like protein {ECO:0000313|EMBL:RGB23942.1};
GN ORFNames=C1646_773781 {ECO:0000313|EMBL:RGB23942.1};
OS Rhizophagus sp. MUCL 43196.
OC Eukaryota; Fungi; Fungi incertae sedis; Mucoromycota; Glomeromycotina;
OC Glomeromycetes; Glomerales; Glomeraceae; Rhizophagus.
OX NCBI_TaxID=1803374 {ECO:0000313|EMBL:RGB23942.1, ECO:0000313|Proteomes:UP000263633};
RN [1] {ECO:0000313|EMBL:RGB23942.1, ECO:0000313|Proteomes:UP000263633}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MUCL 43196 {ECO:0000313|EMBL:RGB23942.1,
RC ECO:0000313|Proteomes:UP000263633};
RA Morin E., San Clemente H., Chen E.C.H., De La Providencia I., Hainaut M.,
RA Kuo A., Kohler A., Murat C., Tang N., Roy S., Loubradou J., Henrissat B.,
RA Grigoriev I.V., Corradi N., Roux C., Martin F.M.;
RT "Comparative genomics reveals the genomic features of Rhizophagus
RT irregularis, R. cerebriforme, R. diaphanum and Gigaspora rosea, and their
RT symbiotic lifestyle signature.";
RL Submitted (JUN-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RGB23942.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QKKE01000901; RGB23942.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A372Q880; -.
DR STRING; 1803374.A0A372Q880; -.
DR InParanoid; A0A372Q880; -.
DR Proteomes; UP000263633; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd00167; SANT; 3.
DR Gene3D; 1.10.10.60; Homeodomain-like; 3.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR46621; SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4; 1.
DR PANTHER; PTHR46621:SF1; SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51294; HTH_MYB; 3.
DR PROSITE; PS50090; MYB_LIKE; 3.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000313|EMBL:RGB23942.1};
KW Homeobox {ECO:0000313|EMBL:RGB23942.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000263633}.
FT DOMAIN 238..286
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 238..282
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 283..344
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 288..344
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 345..399
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 345..395
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 348..399
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 415..443
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 471..493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 507 AA; 57499 MW; B5FC1177AC0FC855 CRC64;
MENLQKNLLI EEQFLLDHVL PAQPQSDDTH MFDDELLANY LSNNDNNKES IGNSGLQPFQ
QHQDLAEIGN SIIHIPYYFE NNNVYSRDNT NMLLNHYDQQ ENKNMLLQNN SGINFDQRTQ
TRFSVSTDNQ NGWPVALSSH ISPQISSVNE MYPHHNMGLS QMVLESPFPS PDLLTMSTLP
VSTPSYSLIS FGDPQTFNYQ FNPRRSQTFP IYETDYQSRI TSAGSNNTLS QKPPVYTKWT
EEEDELLRAA ISIYGPHKWS LIAAHVPNRT PMQCSTRWLG ALNPTIHKGR WTPEEDAALK
EAVSEYVDLL DSDGHPQPIP WNKIASRIPH RTGIQCQARW SEALDPSVRK GKWSPEEDEV
LKEGVRRYGR CWIRIAELIE GRTQRQCRTR WVQIKNKQAK IERDAIAAKV TTDATLSTDD
ESNDIMTPPR TAPPTPAQPN AQGQSLHLLR SMNHISAVPT IVPQVTTVKN QSPVMSPTGT
SVTEESCVTT PENSSPYFEK PLYAHSY
//