ID A0A0B2WWS6_METAS Unreviewed; 1344 AA.
AC A0A0B2WWS6;
DT 04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT 04-MAR-2015, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Thermotolerance protein {ECO:0000313|EMBL:KHN97877.1};
GN ORFNames=MAM_04266 {ECO:0000313|EMBL:KHN97877.1};
OS Metarhizium album (strain ARSEF 1941).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Clavicipitaceae; Metarhizium.
OX NCBI_TaxID=1081103 {ECO:0000313|EMBL:KHN97877.1, ECO:0000313|Proteomes:UP000030816};
RN [1] {ECO:0000313|EMBL:KHN97877.1, ECO:0000313|Proteomes:UP000030816}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ARSEF 1941 {ECO:0000313|EMBL:KHN97877.1,
RC ECO:0000313|Proteomes:UP000030816};
RX PubMed=25368161; DOI=10.1073/pnas.1412662111;
RA Hu X., Xiao G., Zheng P., Shang Y., Su Y., Zhang X., Liu X., Zhan S.,
RA St Leger R.J., Wang C.;
RT "Trajectory and genomic determinants of fungal-pathogen speciation and host
RT adaptation.";
RL Proc. Natl. Acad. Sci. U.S.A. 111:16796-16801(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KHN97877.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZHE01000009; KHN97877.1; -; Genomic_DNA.
DR STRING; 1081103.A0A0B2WWS6; -.
DR HOGENOM; CLU_003539_0_0_1; -.
DR OrthoDB; 2087067at2759; -.
DR Proteomes; UP000030816; Unassembled WGS sequence.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR PANTHER; PTHR10644:SF21; CLEAVAGE_POLYADENYLATION SPECIFICITY FACTOR A SUBUNIT N-TERMINAL DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50993; Peptidase/esterase 'gauge' domain; 1.
DR SUPFAM; SSF101908; Putative isomerase YbhE; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000030816}.
FT DOMAIN 156..692
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
FT REGION 480..519
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 480..496
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1344 AA; 149167 MW; 2DF2AFC6DB65F2EA CRC64;
MAFQTSIRRD GEWVTETVNF QAALKAYATP KSASRPHPEP PSCGILSRTI VDSPMIHQIL
PVRLRSEAHN DIAFVGLWQD HFVQISELRR DGQVHEVVRK SDFGSRIRRA AVLGDSPQHG
LDDDDLADMV KSEDKEMLVR GSFATEPHSR HRLPPQLLVL VLESGDTMYM FLRERQDSTL
EFVIHKRESP RNLSYFGYHL SVDPSSRYMA AASPDGVLVI YELESMVALS EKYRLQGFVD
PIKSIRIRVI QGVVHKLEFL YPRPEDDYHI ILLLIVTRKE RRSAEPVSRM LTYEWEVGDN
LKEVFAEEKT GNRLPKEHRM PSLLIPLRFN TAFFTVSQPD IGIVKNCLSG SPVFEVLDTD
TPERTPLHHG IGNPLWTAWS RPFRRKKYFE KTDIIYLARE DGAIIHVEID APELVPSVTT
VGCLNTNINT AFTIAYDIFS DVLIIGGDSG PGGIWKLAPR TDLEQVSVLP NWSPVVDMAT
SSGRPLKASS TSEHRNANSR SPERKNLPSR PDSLFSASGR GIRGNLTQWR WGIQGRIGLD
IETGEPIRRS WGLTMNGPEG NGLYGLLALP NSSTLLHFSA DFNQVDAVGA DSTAFDLTSR
TLHAYQAQSG LIVQITESSI ALILDSQASL HALPSILGIT GIRAENAFGA DDLLVLSTHN
DGNFQLHILQ VEGMNIRAVN SWGIAGEATC VSLFKIAGNY CVISGSVIDS TSWVSAYALD
GTAVIAEAVD RRTADASARE IPNEAYLFEP LTSICIVRET TDSADFVAGT RCGHVLNFRI
LDQTSKRVTW NSEAMGVAPV DVFPTRGEFG GDVAAMACCD NNLTMMSNFS PSALKFQNKN
FIWLTDSNDA SMPSPAVHSV FGLGVSLSGH SGHMSLMILA GSRLLFAEVW PHFSLIPRSL
PLNGTPTRVI FSQAWNCLVA AILQDGKSTL AFIDPDSGMH IASACDKDRN PLQFIWGLGH
SDDRIYGLGE WLYVKDGKTF AFLLVTTKEG RLLIVSVNKL ESRPGRGGAG RLQYWTRWKK
MLAKPIYSIV GDNDGIIYCV DRTIHWDVLD LTEKKLRPVK EYEVDSPVTS LRVFNGKIFA
LTTMHSLEVI DHRAGEDNTM SLIHTDAISR ITIHMTDIGT DGGASDVVGR WPITLLSDHR
GGLAGVWIPW GQRDKEFQTI FEATLPTSIR RFTHARSRPL WVGSETRNRY GVVASSEEGS
EVFGVSLDGS LRHFTLVNLE LWRILSLMQT LALRRRLFNV MGGSPSDSDS MVSDDDNDIE
PHVHPKLMHI DGDLLSRCLE NRLLEKIIGT ADGLDLFCEY LDALDGGRWT DRFRDGIGSS
EGEREAAYFN LGYDILGYIL APVL
//