ID Q2GP57_CHAGB Unreviewed; 490 AA.
AC Q2GP57;
DT 21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2006, sequence version 1.
DT 24-JAN-2024, entry version 56.
DE RecName: Full=GYF domain-containing protein {ECO:0000259|PROSITE:PS50829};
GN ORFNames=CHGG_10247 {ECO:0000313|EMBL:EAQ83843.1};
OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
OS NRRL 1970) (Soil fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Chaetomium.
OX NCBI_TaxID=306901 {ECO:0000313|EMBL:EAQ83843.1, ECO:0000313|Proteomes:UP000001056};
RN [1] {ECO:0000313|Proteomes:UP000001056}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970
RC {ECO:0000313|Proteomes:UP000001056};
RX PubMed=25720678; DOI=10.1128/genomeA.00021-15;
RA Cuomo C.A., Untereiner W.A., Ma L.-J., Grabherr M., Birren B.W.;
RT "Draft genome sequence of the cellulolytic fungus Chaetomium globosum.";
RL Genome Announc. 3:E0002115-E0002115(2015).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408035; EAQ83843.1; -; Genomic_DNA.
DR RefSeq; XP_001228174.1; XM_001228173.1.
DR AlphaFoldDB; Q2GP57; -.
DR STRING; 306901.Q2GP57; -.
DR GeneID; 4396478; -.
DR VEuPathDB; FungiDB:CHGG_10247; -.
DR eggNOG; KOG2950; Eukaryota.
DR HOGENOM; CLU_024456_1_0_1; -.
DR InParanoid; Q2GP57; -.
DR OMA; VRKCGEN; -.
DR OrthoDB; 1468947at2759; -.
DR Proteomes; UP000001056; Unassembled WGS sequence.
DR GO; GO:0005682; C:U5 snRNP; IEA:InterPro.
DR Gene3D; 3.30.1490.40; -; 1.
DR InterPro; IPR039905; CD2BP2/Lin1.
DR InterPro; IPR003169; GYF.
DR InterPro; IPR035445; GYF-like_dom_sf.
DR PANTHER; PTHR13138:SF3; CD2 ANTIGEN CYTOPLASMIC TAIL-BINDING PROTEIN 2; 1.
DR PANTHER; PTHR13138; PROTEIN LIN1; 1.
DR Pfam; PF02213; GYF; 1.
DR SUPFAM; SSF55277; GYF domain; 1.
DR PROSITE; PS50829; GYF; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001056}.
FT DOMAIN 434..490
FT /note="GYF"
FT /evidence="ECO:0000259|PROSITE:PS50829"
FT REGION 1..204
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 324..371
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 413..433
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 14..38
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..114
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..175
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..204
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..345
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 357..371
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 490 AA; 53719 MW; 99FF3152E798D7B4 CRC64;
MSSRFSAARP KRAGEAFARA HHGEERDERD EGGPTSKKVK FDVRNPSALA PSARDDEDEN
DNVLDADVIA ASGRATKRGA VNIDGYDSDS ENETFNTRAE ARGKKGKEAE DVDLAEVMDN
YNSKAGAGGG DEEDDEVDMF GDADADDNLA TGGGKSGKKD KQVRFLADKE IEGQETTSKS
GGTVRIDGNP DNDEEDDDDD DDEEVVALAI AEEGVDEEVG LGGLKKHAPK IDAFNMREEQ
EDGAFDEAGN FVRKAADADA VHDRWLEGIS KKEMKKAAAA HDKREAELRK QQRENDSLLT
GDLFKELIIR LEPGETALDA LARLRKSQTN KNKSKKIPKW KQKKAKAKNN GGEGGDAMDV
DSEKGPEDPK QAKIKDAINA IADAADKLMQ RDYPNIYDRE RERLIREYRN ETGEAWVEPP
EPDETEGGGG IGEDKLWEFR WTDGRDDAAK QGPFDGPTMK AWQDAGYFGE GVEFRLAGGE
GGWTRVATFV
//