ID A0A1Y1X694_9FUNG Unreviewed; 755 AA.
AC A0A1Y1X694;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=Muskelin N-terminal domain-containing protein {ECO:0000259|Pfam:PF06588};
GN ORFNames=BCR32DRAFT_279807 {ECO:0000313|EMBL:ORX81330.1};
OS Anaeromyces robustus.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Anaeromyces.
OX NCBI_TaxID=1754192 {ECO:0000313|EMBL:ORX81330.1, ECO:0000313|Proteomes:UP000193944};
RN [1] {ECO:0000313|EMBL:ORX81330.1, ECO:0000313|Proteomes:UP000193944}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S4 {ECO:0000313|EMBL:ORX81330.1,
RC ECO:0000313|Proteomes:UP000193944};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ORX81330.1, ECO:0000313|Proteomes:UP000193944}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S4 {ECO:0000313|EMBL:ORX81330.1,
RC ECO:0000313|Proteomes:UP000193944};
RG DOE Joint Genome Institute;
RA Mondo S.J., Dannebaum R.O., Kuo R.C., Labutti K., Haridas S., Kuo A.,
RA Salamov A., Ahrendt S.R., Lipzen A., Sullivan W., Andreopoulos W.B.,
RA Clum A., Lindquist E., Daum C., Ramamoorthy G.K., Gryganskyi A., Culley D.,
RA Magnuson J.K., James T.Y., O'Malley M.A., Stajich J.E., Spatafora J.W.,
RA Visel A., Grigoriev I.V.;
RT "Pervasive Adenine N6-methylation of Active Genes in Fungi.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORX81330.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFG01000121; ORX81330.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y1X694; -.
DR STRING; 1754192.A0A1Y1X694; -.
DR OrthoDB; 2714048at2759; -.
DR Proteomes; UP000193944; Unassembled WGS sequence.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 2.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006594; LisH.
DR InterPro; IPR010565; Muskelin_N.
DR PANTHER; PTHR15526; MUSKELIN; 1.
DR PANTHER; PTHR15526:SF5; MUSKELIN; 1.
DR Pfam; PF13415; Kelch_3; 1.
DR Pfam; PF06588; Muskelin_N; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR PROSITE; PS50896; LISH; 1.
PE 4: Predicted;
KW Kelch repeat {ECO:0000256|ARBA:ARBA00022441};
KW Reference proteome {ECO:0000313|Proteomes:UP000193944};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 31..223
FT /note="Muskelin N-terminal"
FT /evidence="ECO:0000259|Pfam:PF06588"
SQ SEQUENCE 755 AA; 88661 MW; 7411FDAED1EEBAB2 CRC64;
MLPPIQLNGF NNNTAVKYNN VSHDLSIIPS EKLKYNIHSW SSYSQPYHPK NIMVNKPQDQ
SSRWSSSSNN HMQFVTIKLD KMSIVKTITF GKYHKVHVCN LREFRVYGGL TPNNMIELLH
SGLINDCKTE TFALKFKTND IVFPCLYIKI VPLVAWGTNF NFSIWYVELR GHTQQQIVEK
AYHEYINYRE NEVIRLCLKH FRQRCFLDTF NSLQQLTNIK LEDQLLTNLH TNLVINGDFD
TTEEILYRAS ERNLFDDYIR DCQCKPVWRR INATDKNGNS PYMRGGHQMC IDSELGKIYM
FGGWDGTKNL ADFWEYDENL EEWTCLSLDT SKDGGPTPRS CHRIAYDSIN KQIYVLGYFI
ETPDISANKE SDFWKYDIAS RRWIKLSSNT AAKGGPSLIC DHQMLIEPKS QMIYIFGGRN
QSKNNGEDNY SGLYSYSIKE DKWRLLRSDT NQPEYSVQLK SRIEHAMLIN PETNELYIFA
GKRYKDFTER KNNYLSDFYI YRIDDDSVIE VSKNYTMQGG PDGGFTQRAT MDVELGEIYM
LSGLLNEKNS NVDTIKNILW VYNIKKNKWT KIYQNTNYGS EYNNRISEIE PCTRFASQLV
YDTKRKVHYL FGGNPGEVID QSIRLNDFWE LKLVRPNNED ILRSAKFHIR KQKYKEICNT
GEYLQALKYL QNNVSEVVNH NDENESKEFR ELTKFLFNIK QNPNNNPKTN NIQEKNDMLN
DVYQERTELY EFLLKYFPNS MKEPLENLVD LVPIS
//