ID A0A2T9YR66_9FUNG Unreviewed; 882 AA.
AC A0A2T9YR66;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:PVU94822.1};
GN ORFNames=BB561_002246 {ECO:0000313|EMBL:PVU94822.1};
OS Smittium simulii.
OC Eukaryota; Fungi; Fungi incertae sedis; Zoopagomycota; Kickxellomycotina;
OC Harpellomycetes; Harpellales; Legeriomycetaceae; Smittium.
OX NCBI_TaxID=133385 {ECO:0000313|EMBL:PVU94822.1, ECO:0000313|Proteomes:UP000245383};
RN [1] {ECO:0000313|EMBL:PVU94822.1, ECO:0000313|Proteomes:UP000245383}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SWE-8-4 {ECO:0000313|EMBL:PVU94822.1,
RC ECO:0000313|Proteomes:UP000245383};
RX PubMed=29764946;
RA Wang Y., Stata M., Wang W., Stajich J.E., White M.M., Moncalvo J.M.;
RT "Comparative Genomics Reveals the Core Gene Toolbox for the Fungus-Insect
RT Symbiosis.";
RL MBio 9:e00636-e00618(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PVU94822.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MBFR01000073; PVU94822.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2T9YR66; -.
DR STRING; 133385.A0A2T9YR66; -.
DR Proteomes; UP000245383; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR CDD; cd00167; SANT; 3.
DR Gene3D; 1.10.10.60; Homeodomain-like; 3.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR PANTHER; PTHR46621; SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4; 1.
DR PANTHER; PTHR46621:SF1; SNRNA-ACTIVATING PROTEIN COMPLEX SUBUNIT 4; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51294; HTH_MYB; 3.
DR PROSITE; PS50090; MYB_LIKE; 3.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000245383}.
FT DOMAIN 159..208
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 164..212
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 213..267
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 213..263
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 264..314
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 269..318
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 312..340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 546..573
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 313..327
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 882 AA; 100739 MW; 9A2611071B1DAF20 CRC64;
MVVNSSINTN SAPNQFPSSA SSSLLPKDYE SFHNNGSQGI FSSATYLNPP FSYPDIQPIT
ELEKISNSLI KADIIAKKVS RGDKIKNYRT AKYRSASKNS SPPAQLQTDK SVIELAENAK
ENARQLILSL GAESKVLPDY KQYIKDVEDS LKIIDKNMTF PKIRAMWNSS EDHLLTLGVS
LYGANTESWP KIAVLVPGRT NKACRKRWFH SLDPTLHKGS WSVEEDNLLR LWVSKHPGQW
SKIAKRIQGR TDDQCAKRWR ESLDPNISRA KWSPEEDARL LEKYNEYGAQ WQKIAVFFQG
RPGLHCRNRW RKIQRRSNQD PLKSDPGLDK PDSNNFQPEQ FFSKPINTSL EQNHIQPSTY
NPPLIPSNLS EQNLMNHLDQ KNKKIFRDSP VSTLSYPSFK IHELDTINNL SPNLSNHNHN
NSFTKNTFIQ NNTFNLNKNK DKRSFSNPFS ADPNIGSFSI QLQNTNINDN NFDSNANTNF
KRLKQNSPPD LFLSQVNIQQ LATPKFNLID SSNPEIHPDC MTLFGSNQNS LNYNYNDLNI
QRDTLHASNI SKKQPRQKLS KKNKKSDYNT PTPHQKEWLY SSAIKPYGCA ALPGICDSSF
YDSLELLEHL KAAHNIFNAD LDTETDHEAM NNSKWNHIFR CGISGCSSLY KNVRSLENHI
YNSNKSLHYQ NYIKELNSNQ NLNNPIIFNN CENHMNLLED VNSIVNNTLD SSNCNLGPTD
CLLQTNNFSN VPPEHFNQNA NCMRSLNVNS ENQIFSNYYL KGHDFIKNNS DQKSELPNAR
PFTKSFFDNS IDSYTELNSP IINSKVNSSP TKINYNTNTN LFSNASSNSS MINLNSRFFD
NHDINKTPQF DFSSYETKFS HVHQDDTKLI QTACGMFNNQ NN
//