ID F7VRQ8_SORMK Unreviewed; 1323 AA.
AC F7VRQ8;
DT 21-SEP-2011, integrated into UniProtKB/TrEMBL.
DT 21-SEP-2011, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE SubName: Full=WGS project CABT00000000 data, contig 2.5 {ECO:0000313|EMBL:CCC08194.1};
GN ORFNames=SMAC_01743 {ECO:0000313|EMBL:CCC08194.1};
OS Sordaria macrospora (strain ATCC MYA-333 / DSM 997 / K(L3346) / K-hell).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Sordariaceae; Sordaria.
OX NCBI_TaxID=771870 {ECO:0000313|EMBL:CCC08194.1, ECO:0000313|Proteomes:UP000001881};
RN [1] {ECO:0000313|EMBL:CCC08194.1, ECO:0000313|Proteomes:UP000001881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-333 / DSM 997 / K(L3346) / K-hell
RC {ECO:0000313|Proteomes:UP000001881};
RC TISSUE=Mycelium {ECO:0000313|EMBL:CCC08194.1};
RX PubMed=20386741; DOI=10.1371/journal.pgen.1000891;
RA Nowrousian M., Stajich J., Chu M., Engh I., Espagne E., Halliday K.,
RA Kamerewerd J., Kempken F., Knab B., Kuo H.C., Osiewacz H.D., Poeggeler S.,
RA Read N., Seiler S., Smith K., Zickler D., Kueck U., Freitag M.;
RT "De novo assembly of a 40 Mb eukaryotic genome from short sequence reads:
RT Sordaria macrospora, a model organism for fungal morphogenesis.";
RL PLoS Genet. 6:E1000891-E1000891(2010).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCC08194.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CABT02000005; CCC08194.1; -; Genomic_DNA.
DR RefSeq; XP_003348721.1; XM_003348673.1.
DR STRING; 771870.F7VRQ8; -.
DR GeneID; 10806185; -.
DR KEGG; smp:SMAC_01743; -.
DR VEuPathDB; FungiDB:SMAC_01743; -.
DR eggNOG; KOG0773; Eukaryota.
DR HOGENOM; CLU_008497_2_0_1; -.
DR InParanoid; F7VRQ8; -.
DR OMA; TIHNYSR; -.
DR OrthoDB; 450547at2759; -.
DR Proteomes; UP000001881; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR006600; HTH_CenpB_DNA-bd_dom.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR11850:SF102; HOMEOBOX PROTEIN HOMOTHORAX; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03221; HTH_Tnp_Tc5; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00355; ZnF_C2H2; 3.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51253; HTH_CENPB; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000001881};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 155..218
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 369..397
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 745..819
FT /note="HTH CENPB-type"
FT /evidence="ECO:0000259|PROSITE:PS51253"
FT DNA_BIND 157..219
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 128..159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 179..236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 334..361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 128..156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 180..195
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..354
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1323 AA; 145015 MW; CE741DEAF28C1686 CRC64;
MPTIDEMEAF INWDGIDPAA MPGARHPNDL DLALENVDDD DFAGWALQHY EHSNPLGIGK
TTDAIPSLVS QSGDPIVAFE DSFDMPPSPC THCQTNGYQC KRIHEGSYKG YCTSCVALSR
VCNFGLSDQP TLPRSPGSTT SIEEQEQPST AAAVTPATKV NGRFSRESIK ILKSWLSTHH
KHPYPNDEEK EMLQKQTGLN KTQITGWLAN ARRRRGKTMG APRSISPGVR SLSNNMDIPQ
RRPQLELMNP LQRWQVSPPE HEPASVTAIA RAVTASATTL SSGSPHSNCN NFTDDGSSRS
LCAASSASSF NTSISSGLSF ASAYSYGTHD SLGSYGSSMN RGRRRRRRKA APVPSGKKRN
SLSAPLKTFQ CTFCTETFRT KHDWQRHEKS LHLSLERWVC APEGPRASNP GNGLVSCVFC
GEANPDEAHI ESHNYSICQE KTQEERTFYR KDHLRQHLKL VHDVRFVNWS MEQWKATTPE
IRSRCGFCGI VMDTWSIRVD HLAGHFKAGQ TMADWKGDWG FDTPVLEMVE NAIPPYLIHD
DRNSPCPYTA TQGPTETARN AYELIKNELM CYLANERDVK GRAPTDEELQ VEACRIIYAA
DVQTDQSISA IPSWLRDLLL SSEPLAMQAR MAPIRSANES RQSVLCINGK SNIFEDDPME
RELHEYVKAR RLLGLTAMDG ELQYEACNII GRMEESSSHP SEHVANFLLR LIYGSASWLA
EFRQRAVLPR SEDVGDEARR STDPSKIDST IHNYSRLERE LAEFLHMQRS MGLEPTDIDL
QNKARLIIYE CDDSWNQTAA DNSDWLTAFK QRHVSPEASA VALATCEPLT LSSVSQNRCT
PLMDMKTWMN SSCFGGPRNT SLGSIGTPTF DTTAGVDGHI GKPNSAVKIG PYFFNDANCY
RRLARELGAY VASVTSPSSP DCHIPSDEEL QRQARWILYQ DDDPWNQTAA DNAEWLRRFK
RDVGILTDAS LPGLPECTQW SAAQGGSGFE PPYLFPNPSA QVTTVEADIP IKMKEAKRMF
VAERQTANKY ARGFKTRWQR PAVVFCSREL EKGLVEFVTS RVMGNSDGGG GGGGGGGIRG
MFPSDEAIRV KAREIQKMNT TSADDAVLLE MFKTMMRERL GLTSALGSAS ASLGSAQDFS
GMGGSGMGMG LIQSSSVDTS VVPSPTTAAG FSTGTTASPD LRMGMDFSMD LGLNMGGISS
SMSNMGTMDM ANLTDWSSSS LNMMGLGTSM ADSCTNMGMG MGMGMNLDMP STTMSTSMPT
NTDMGLLGVT GGVAAVGGGG QDMMMMFKEN SYEMDDLLQA QSSFGFSSSN DDDLAVMGGI
GQL
//