ID A0A1Y2EA66_9FUNG Unreviewed; 750 AA.
AC A0A1Y2EA66;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 13-SEP-2023, entry version 16.
DE RecName: Full=Carbohydrate-binding domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=LY90DRAFT_639740 {ECO:0000313|EMBL:ORY68136.1};
OS Neocallimastix californiae.
OC Eukaryota; Fungi; Fungi incertae sedis; Chytridiomycota;
OC Chytridiomycota incertae sedis; Neocallimastigomycetes; Neocallimastigales;
OC Neocallimastigaceae; Neocallimastix.
OX NCBI_TaxID=1754190 {ECO:0000313|EMBL:ORY68136.1, ECO:0000313|Proteomes:UP000193920};
RN [1] {ECO:0000313|EMBL:ORY68136.1, ECO:0000313|Proteomes:UP000193920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G1 {ECO:0000313|EMBL:ORY68136.1,
RC ECO:0000313|Proteomes:UP000193920};
RG DOE Joint Genome Institute;
RA Haitjema C.H., Gilmore S.P., Henske J.K., Solomon K.V., De Groot R.,
RA Kuo A., Mondo S.J., Salamov A.A., Labutti K., Zhao Z., Chiniquy J.,
RA Barry K., Brewer H.M., Purvine S.O., Wright A.T., Boxma B., Van Alen T.,
RA Hackstein J.H., Baker S.E., Grigoriev I.V., O'Malley M.A.;
RT "A Parts List for Fungal Cellulosomes Revealed by Comparative Genomics.";
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ORY68136.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCOG01000046; ORY68136.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y2EA66; -.
DR STRING; 1754190.A0A1Y2EA66; -.
DR Proteomes; UP000193920; Unassembled WGS sequence.
DR InterPro; IPR025584; Cthe_2159.
DR Pfam; PF14262; Cthe_2159; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000193920};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..750
FT /note="Carbohydrate-binding domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012756556"
FT REGION 486..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 533..568
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 730..750
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 533..563
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 750 AA; 79989 MW; F8F4B471A2071BD8 CRC64;
MKINNYIVLL VIFVFSSVVK ADDEEETTSL DITGDEEELE LIPGPVTCEY KKKDLNESYD
VDSDISINCN GDSSTCTSNH DGVTISEGVV EITTAGTYIV GGNLEGQLRI AATKDDFIHL
VLNNANITSF DGPAIFGSKA DKVTITLVGE NLLNDPANYT IVDEDDEPDA CLFIDSDLAI
NGSGNLTVIG NYKDAIRCKK DLRIVNGTIN VQSAVDKGIK VKNSLCIKDG NIKVNSTDTG
IKVTRDDDAE KGYIVIDGGN VVVSSGNDGI HAETHLTIND GFIDIQKSGE GLEGQMIDIL
GGEIHIISSD DGINASKVGS SNEDDMGPMG GPMGGPMDDM QGMNTMNRTE SMRGPMGGSM
GGSMGGPMRD MQGMNSMNRT ESMGPGGMSP PNGMPGGMPG GKSSEVDSSV YINIVGGKLY
ITSKGNDVDG IDSNGVLYIG GDAEVYVSVE GGDIYGNMAA LDAEGTNAIV PGSTVFATGG
GKNNIGGRNN MGSNKNNGIN GNDNFNNRPG FNETENNMND FFYKNKNGKD FIDENTTDSI
DNEDSKDNEN KKDSSEEQNN IENKSENEII ISNAVNEENT KKPEKKIVKK PKTIIIKKCK
PKNISNVKSQ APPQQQFEGM DDNKNMNFTM PDNMNGHNGN MNFTMHDNMN GGMGGGMGSE
SGIVLQPYIQ TSIDTQEAGT EISIIDNEND KTIISYTPEI SYASILVSTP KLVEGKEYTI
IAGDFTQTIT ASSPSSSEEI EPPSITSHSN
//