ID A0A1Y2UEK6_9PEZI Unreviewed; 1234 AA.
AC A0A1Y2UEK6;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Clr5 domain-containing protein {ECO:0000259|Pfam:PF14420};
GN ORFNames=M434DRAFT_17771 {ECO:0000313|EMBL:OTA80359.1};
OS Hypoxylon sp. CO27-5.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Xylariomycetidae; Xylariales; Hypoxylaceae; Hypoxylon.
OX NCBI_TaxID=1001938 {ECO:0000313|EMBL:OTA80359.1, ECO:0000313|Proteomes:UP000194361};
RN [1] {ECO:0000313|EMBL:OTA80359.1, ECO:0000313|Proteomes:UP000194361}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CO27-5 {ECO:0000313|EMBL:OTA80359.1,
RC ECO:0000313|Proteomes:UP000194361};
RX PubMed=28078400; DOI=10.1007/s00253-017-8091-1;
RA Wu W., Davis R.W., Tran-Gyamfi M.B., Kuo A., LaButti K., Mihaltcheva S.,
RA Hundley H., Chovatia M., Lindquist E., Barry K., Grigoriev I.V.,
RA Henrissat B., Gladden J.M.;
RT "Characterization of four endophytic fungi as potential consolidated
RT bioprocessing hosts for conversion of lignocellulose into advanced
RT biofuels.";
RL Appl. Microbiol. Biotechnol. 101:2603-2618(2017).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KZ112673; OTA80359.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2UEK6; -.
DR STRING; 1001938.A0A1Y2UEK6; -.
DR Proteomes; UP000194361; Unassembled WGS sequence.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 3.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR025676; Clr5_dom.
DR PANTHER; PTHR46224; ANKYRIN REPEAT FAMILY PROTEIN; 1.
DR PANTHER; PTHR46224:SF6; IQ MOTIF AND ANKYRIN REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF14420; Clr5; 1.
DR SMART; SM00248; ANK; 8.
DR SUPFAM; SSF48403; Ankyrin repeat; 3.
DR PROSITE; PS50297; ANK_REP_REGION; 1.
DR PROSITE; PS50088; ANK_REPEAT; 2.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000194361};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 12..66
FT /note="Clr5"
FT /evidence="ECO:0000259|Pfam:PF14420"
FT REPEAT 476..508
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 1092..1124
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REGION 110..140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 110..132
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1234 AA; 137011 MW; 19EFEC0FD68CD895 CRC64;
MPSSSTTRRI SPSEWKIHKD RILELYIHQK LALLGDDGVI ETMKKEGFTA TKSQYETQLR
VWEIRKYRTS KEWVGIISEI QQNQRRGGIN DIHPLDRVMP PSRIERACRR HAPKTHQITE
SRPRDGDRVL STPTESRHTS VGRIVPETVE MPPVESLGGA IDISALGGLD LFNFDEANPP
GDDFAYSGFG AQDSDGIGFE TSLLSSDDGG FGGTIDFSLG LAMGAVETSH IQSLAPVQRH
ITDLTLNTAA TPIAYTFSSD PFISNPTNST RRKQLTFPRS HWIQSLTSTE IAKMVLNSIS
SASILNSTYR SVFSTSRYHI IGQFITDVNK STFLIMRRKF PKWKQPPNNV LSPQVCNNLL
SEEIFVGEEQ VRFDILSNDD AVECRFYTRL IRSMTNACSG LEGIPAFSVL RFLNRNHNIQ
SKFFQFLRSN SKYAAKSLAE STFQAAIEAD DVNVAELLIE LRLVDANETV CFYGAERRTP
LEKAAIEQSL GILRLLLNRN VDINKSYAKK SHCVGKTHLC ELRYSEMPIG ETYSGGLECL
IGSNYDTGLT LNENFLGLVD DFMKKGATID MEVIFPMLRR FVDPRLARSI ISGFVSQTPS
RSVLPIDHLP EIVKIFDEED AIIMIESSIA KSPPDYGNHR HTEFSRCPFN SALEEAVRRR
SLKLVKILLP YSSPLHKAFE IAIKDKNQDI IDMILNEGRD SEDIASVGTL VAALTSGHQE
LIRFLEENGA FSRSIGNQLG KAITAAIEAG NLIFANKLLD CNPDFDGHTL EDTLRIAVAY
GYDDFACKLL AAGADAVSRG GLGPSSALLV AMEKRKPQVV RAILEASNLN WLFSNAIHLM
DRDDSCLALE AAIEYGDDTI LNDTLKACSL VDYLRHIEGA LELSFKRGGL DLFWKIIKLR
RHDPYCLSAA LKFAVGREDI LLLDELFNLG ASSDDEGALE KAVLGHPSMI EPLLKQFRQG
HPRGRSRYGI SALSTALERY PECAEGLNML LAFKLIDVNG IDYTIGGAFT NRNFRIATKG
KYRRQVTDDD IELVKKLLDA GWDPNIITSG RDSDAYYAQT TFLDAIETGS TKMVLLLLQY
GAQVNEPASF GLKRTPLQKA VEVGSLEIVR LLLEKDADVN APPAFRGGAT ALQLAAVEEH
GARLDIPPPP GKNGRWPLEG AAENGRLDMI QLIWYANNRR LDKEQCRKAM RLAEYNGHIG
CRDLIADLMS AVPAVSQGFE GNDDIDAFVN FDSC
//