ID Q2HGV6_CHAGB Unreviewed; 886 AA.
AC Q2HGV6;
DT 21-MAR-2006, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2006, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE RecName: Full=NIMA interactive protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CHGG_00548 {ECO:0000313|EMBL:EAQ92313.1};
OS Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 /
OS NRRL 1970) (Soil fungus).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Chaetomium.
OX NCBI_TaxID=306901 {ECO:0000313|EMBL:EAQ92313.1, ECO:0000313|Proteomes:UP000001056};
RN [1] {ECO:0000313|Proteomes:UP000001056}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970
RC {ECO:0000313|Proteomes:UP000001056};
RX PubMed=25720678; DOI=10.1128/genomeA.00021-15;
RA Cuomo C.A., Untereiner W.A., Ma L.-J., Grabherr M., Birren B.W.;
RT "Draft genome sequence of the cellulolytic fungus Chaetomium globosum.";
RL Genome Announc. 3:E0002115-E0002115(2015).
CC -!- SIMILARITY: Belongs to the ADIP family.
CC {ECO:0000256|ARBA:ARBA00009291}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH408029; EAQ92313.1; -; Genomic_DNA.
DR RefSeq; XP_001219769.1; XM_001219768.1.
DR AlphaFoldDB; Q2HGV6; -.
DR STRING; 306901.Q2HGV6; -.
DR GeneID; 4387901; -.
DR VEuPathDB; FungiDB:CHGG_00548; -.
DR eggNOG; ENOG502RZ1J; Eukaryota.
DR HOGENOM; CLU_010128_0_0_1; -.
DR InParanoid; Q2HGV6; -.
DR OMA; GRPICDE; -.
DR OrthoDB; 2726628at2759; -.
DR Proteomes; UP000001056; Unassembled WGS sequence.
DR InterPro; IPR021622; Afadin/alpha-actinin-bd.
DR Pfam; PF11559; ADIP; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000001056}.
FT REGION 372..402
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 419..716
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 728..870
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 65..127
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 381..402
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..478
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 537..555
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 556..607
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 618..636
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 638..666
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..688
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 754..805
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 886 AA; 96267 MW; 5B01990B653865EE CRC64;
MIESENLRTA SLYINNQLLS RGLLRDGNGI DFAQPGDTDT AVAETMSRIM AVRDAEHRES
LSATLRTLRA ESLRQANDIQ RLQEKYSDAQ RKVGLSEAAE AALRTNLKTA EGTVHRLKEE
AARTKTLVAQ TRASCANEVR KRDRQIEGLK KAVTEAGRAR GAGRSTGITS INVVGDTGSE
AEHGELMVDD DAEQSHDLRT ETNAFLAELA KGLSEENEGL LQLVRTTTEH LKEMSGWDVA
NGNMMENGDG HAMVLAAHPD DMSLEVNAVL DHLRTILTNP SFVPIEEVVV REDEIHRLRD
GWEKMETRWK EAVHLIDGWR KRMQASGRPV NVEELKMGLR LSPVKVRNVE ETAHGYALRL
AAVLEDEEVD ASQLLDDEGP LDLVPRAEAG EDDSDTSSMF DHDVDVDDLD VEEPNVEILQ
QSVMLPSPPL PLPPQLTPLR DSYTAANRGD RPHNRKPPKG FPTISEEKNR DFEADEPPLP
PPHADQLQQS PNKKPTLSLR TVPPEEAAKV ELPPSAHVST PDASVGSMFA KAADERTSKT
SAASARAQTR PAPTTKRRGE SPRKNDPAKK ETTTRREAIR KEAVRKEPVK RETAKKKESE
EVKTTTKPPP KSRPTRPPIT RGRSDTRPDD KPGHVKPQRA TSATSATSVR SAGTTKSANR
SVNKSANAEL PPPPPPHASV PSPAPSPSRS PKRVNSRLPL PRPGCNNVLP APQQSPLNMA
TITAKLAASE RDADAARVRA KLKAARLGKG ISLPPPGTTS TASTTPTEPE PDRNTGALST
RSETDSAGGT ASASASTSDG APSSTTTRDE DADELGFSPP RKAQQPLPQQ KQQKQGGGGG
GEAELKKSPV RKRERRTSKA TSRRRSTLNS WELDALIQGG NVTVER
//