ID A0A2Z6RZN2_9GLOM Unreviewed; 2698 AA.
AC A0A2Z6RZN2;
DT 10-OCT-2018, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 1.
DT 24-JAN-2024, entry version 11.
DE SubName: Full=ARM repeat-containing protein {ECO:0000313|EMBL:GES91305.1};
GN ORFNames=RCL2_001813600 {ECO:0000313|EMBL:GES91305.1}, RclHR1_00540045
GN {ECO:0000313|EMBL:GBC03935.1};
OS Rhizophagus clarus.
OC Eukaryota; Fungi; Fungi incertae sedis; Mucoromycota; Glomeromycotina;
OC Glomeromycetes; Glomerales; Glomeraceae; Rhizophagus.
OX NCBI_TaxID=94130 {ECO:0000313|EMBL:GBC03935.1, ECO:0000313|Proteomes:UP000247702};
RN [1] {ECO:0000313|EMBL:GBC03935.1, ECO:0000313|Proteomes:UP000247702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HR1 {ECO:0000313|EMBL:GBC03935.1,
RC ECO:0000313|Proteomes:UP000247702};
RA Kobayashi Y.;
RT "The genome of Rhizophagus clarus HR1 reveals common genetic basis of
RT auxotrophy among arbuscular mycorrhizal fungi.";
RL Submitted (NOV-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:GES91305.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HR1 {ECO:0000313|EMBL:GES91305.1};
RA Maeda T., Kobayashi Y., Nakagawa T., Ezawa T., Yamaguchi K., Bino T.,
RA Nishimoto Y., Shigenobu S., Kawaguchi M.;
RT "Conservation and host-specific expression of non-tandemly repeated
RT heterogenous ribosome RNA gene in arbuscular mycorrhizal fungi.";
RL Submitted (OCT-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the GCN1 family.
CC {ECO:0000256|ARBA:ARBA00007366}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBC03935.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BEXD01003915; GBC03935.1; -; Genomic_DNA.
DR EMBL; BLAL01000199; GES91305.1; -; Genomic_DNA.
DR STRING; 94130.A0A2Z6RZN2; -.
DR Proteomes; UP000247702; Unassembled WGS sequence.
DR Proteomes; UP000615446; Unassembled WGS sequence.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 7.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR022716; Gcn1_N.
DR InterPro; IPR000357; HEAT.
DR InterPro; IPR021133; HEAT_type_2.
DR InterPro; IPR034085; TOG.
DR PANTHER; PTHR23346:SF7; EIF-2-ALPHA KINASE ACTIVATOR GCN1; 1.
DR PANTHER; PTHR23346; TRANSLATIONAL ACTIVATOR GCN1-RELATED; 1.
DR Pfam; PF12074; Gcn1_N; 1.
DR Pfam; PF02985; HEAT; 1.
DR Pfam; PF13513; HEAT_EZ; 1.
DR SMART; SM01349; TOG; 1.
DR SUPFAM; SSF48371; ARM repeat; 4.
DR PROSITE; PS50077; HEAT_REPEAT; 3.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000247702};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1375..1604
FT /note="TOG"
FT /evidence="ECO:0000259|SMART:SM01349"
FT REPEAT 1546..1584
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT REPEAT 1665..1703
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT REPEAT 2011..2048
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
SQ SEQUENCE 2698 AA; 300226 MW; 54D3A04C49C1C817 CRC64;
MDDPVETVDI GLADLSWAEF IKSRALYSFT SSSMTERLKF LNHSLIPRIK AGGLSDKTLG
SLLHLALLTY PRYNDRPSRS AMINVLKELN NWNSYQFLVS FVPLIVKETD KLNQKSPDGT
TYTTSSVDRF VLLTWVNLLI TFVLSEKSTL DSSYWRDLVN VQAILLHSLL SENEKRKNIK
RSALIDVRRT IRNNASHIPS YISYLTSSVK TCNPAYKNAV LLGTVIDCSL RLRKGIGKSY
VENAKEDIKQ YYITGIISSK TSVPKISADS FHDFINRMIT EEDFRTKITP VLEKQLLRAP
EILLNVLISL FKSLSFDVSN VFKMQSLLNH IQSTNKTIQM DAAELLIILI EKSSKEEAII
NIVRDILQTL TGGKVSNPEH RIILLQALAR VHQSPDVVKI VIDGLIPIIS KENNESSLSK
AIEILGLHFG FLLRSGHDMS FVEKIKKLAA DGLSSSKVGT RKAWAIAIGR MVWEVNESPT
NSLKNFVYMS LVQLISTLGK IQSNPLTFTG GPIEGYITIA IAIGKAKNWN DDIIDDLIGA
KKFTQNILVT SPKPSFLLWD KIYSKLNTYE EGFWFIRALE GVLIDKDETL LQNNDAQIFL
STAFIYLITS SNHRIRREAY NSLKRCTQNS AEAVGKIFRK GLSQWVLNLE KNVKDSTAVL
SSGSAHSDPI VNAYRLSQVL TAITTFSKEE DGYKVETSLV ELIILAHHPQ IVSPNNKYNW
ISLIQRAHVD PGKLVENYSG RLRQIIQEKI EIDNEQKNFY KAAMSAIATI TFISPETIIP
LFLTQCHRDL DASLFDNIGE NDIEIWKTEE GTLFIDVLKR NKKNIVENRN RKDYQTEKWE
RELRESIAKK KGDSSIKTPK LTKEEQAAVN AQLAKESEIR KNMEIIRFKL TRGLDIVDAL
VEGNPEAVEE HIVEFMRMLN DVVQKGGPLV GDKAVNTYLR INKCTSEQIV NIRDFIGLAT
LRVMNVQEVL ERWLHEPLDN LVTRVLYRLR FITERSLLPP SSFAYCFPLI YQVVRKGGIK
NDSEGEKDVM EQIALGLDII YFHSSIGSSP LLPRTEMILI LLQIIKEYPK LNKLARTALV
SLCETIGDTA EKKEIDALLN GLLSSEPFVR HASLQALDYL DLTEIDFSRE LWVACHDEDE
NYAKLASVLW EDNGMDVEPC YALELLPLIT HDEKYVRVAA SKAIGNAAKH FIDSMTDTLN
LIYDHYKEKT TSIMPEYDDF HMVIPESLDL KKKDPWEARV GLSLALSALA PCMRTSDLIP
FFEFLINDEA VGDVHAQVRQ KMLEAGLTVI NEHGAKNITL LLPVLENYLD KPAVPTETHD
RIRESVVIWL GALARHLNST DARIPVVIDK LLDTLKTPSE SVQVAVAECL PPLIKLMKDI
TPNLIEGLLN RLFRSEKYAE RRGAAFGLAG VVKGTGISVL KDCNVMTSLK EGVDNKKDYK
YRQGALFAFE TLSQSLGRLF EPYIIQILPL LLVCFGDTHP DVREATSDAA RVIMSKISGH
CVKLILPSLL AGLDDRQWRT KKGSVELLGS MAFCAPKQLS ISLPTIVPRL TNVLTDSHTN
VQSAANEALL RFGEVINNPE IQALVPILLK SLSDPDKHTN AALDSLLETA FVHYIDAPSL
ALVMPILERG LKERGTEIKK KSSQIVGNMA SLTDVRDITP YLPRLLPGLK EVLVDPVPDA
RATAAKALGT MVEKLGEDKF PTLVNELVHT LKSDTSGVDR QGAAQGLSEV LAGLGLERLD
GLLPEIINNT SSSKSYVREG FISLLIYLPA TFGLRFQPYL GRIIPPILSG LADESEYVRD
ASLRAGQMIV ANYATKAVDL LLPELERGLF DDNWRIRQSS VQLMGDLLYR ITGISGKSEI
EGDEDDETGG TEASRKALLQ ILGKERRDRL LAALYIIRQD ISGIVRQASI HVWKTIVSNT
PRTVKEILPI MMGMIIRSLA SPSYERRQVA GRTLGELVRK LGESILSEII PILEEGLESQ
EADMRQGVCI GLSEVMSTAG KFQVIDYIDS IIPAVRKALV DESSDVREAA AQAFDTLQQN
VGDKAIDEIL PTLLNSLQFG EESIYALEAL KEIMSVRANV VFPVLIPTLI QTPINAFNAR
ALGSLVTVAG PALNKRLAQI LSALMISLGT ETDEHTISEL NETKKALLLS IDSVDGLQTL
MTVLFEAIKN DDPSYRASAC DMLTTFCNES KMDYSRYVTE WIRVLILLLD DRQQNVVKAA
WNALTAIIKP LKKDELEQLV IPVRRAVKNV GVPDMDLRGF CLPRGISPIL PIFLQGLMYG
TAEIREQSAL GIGDLIQRTS DEALKPFVTQ ITGPLIRIIG DRYPPQVKAA ILYTLSLLLT
KVPAHLKPFI PQLQRTFIKS LSDPSSALVR SRAASALGIL ISLQTRVDPL IIELVTGIKT
SEPNVRETML IALENVVKKA GSGMNDTSKK SVIKVIVDGL SDTSDGMIVG SSRLFGSLCK
HISPQEAMPI ISTHILAGGS PSQSSLLAIN AILAESSRMF YDLGNTHEVV DLIVKGSASN
LPNIADTATI AAGKMLLIDL YHEESTVACL VEALIVNIRE PATLLGETKR LALTVFRAVG
KKFPQILEPH LSIIIPPMMV DVRDRVVPVK LAAELALVYV LQLLKDETML QQYLTTVDAM
TAKSITDFHK RVLTKLVTQE QQKMEQIQVG ETNDMDEEEQ EIWAVGNIGT FASNVDQE
//