ID A0A061GT06_THECC Unreviewed; 1108 AA.
AC A0A061GT06;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=ARM repeat superfamily protein isoform 1 {ECO:0000313|EMBL:EOY32287.1};
GN ORFNames=TCM_040022 {ECO:0000313|EMBL:EOY32287.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY32287.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY32287.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY32287.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GT06; -.
DR EnsemblPlants; EOY32287; EOY32287; TCM_040022.
DR Gramene; EOY32287; EOY32287; TCM_040022.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006606; P:protein import into nucleus; IEA:InterPro.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000357; HEAT.
DR InterPro; IPR021133; HEAT_type_2.
DR InterPro; IPR040122; Importin_beta.
DR InterPro; IPR041653; Importin_rep_4.
DR InterPro; IPR041389; Importin_rep_6.
DR InterPro; IPR034085; TOG.
DR PANTHER; PTHR10527; IMPORTIN BETA; 1.
DR PANTHER; PTHR10527:SF5; IMPORTIN-5 ISOFORM X1; 1.
DR Pfam; PF02985; HEAT; 1.
DR Pfam; PF13513; HEAT_EZ; 2.
DR Pfam; PF18808; Importin_rep_4; 1.
DR Pfam; PF18829; Importin_rep_6; 1.
DR SMART; SM01349; TOG; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS50077; HEAT_REPEAT; 2.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 362..599
FT /note="TOG"
FT /evidence="ECO:0000259|SMART:SM01349"
FT REPEAT 413..451
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
FT REPEAT 918..950
FT /note="HEAT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00103"
SQ SEQUENCE 1108 AA; 123020 MW; 68526B6635068FFD CRC64;
MDQQSQLAVI LGPDPAPFET LISHLMSSSN EQRSHAEVLF NLCKQSDPDA LCLRLAHLLQ
VCAQPETRAM AAILLRKLLT RDDSYIWPRL NISTQSSLKS VLLAQIQVEN TKTLSKKLCD
TVAELASSIL PENGWPELLP FMFQCVSSDS PRLQESAFLI FAQLSQYIGD VLTPFIKDLH
AVFLRCLSES SNADVKIAAL NAVINFIQCL TSLSDRDRFQ DLLPAMMRTL TEALNNGNEA
TAQEALELLI ELAGTEPRFL RRQLVDVVGS MLQIAEAESL EEGTRHLAIE FVITLAEARE
RAPGMMRKLP QFISRLFAIL MGMLLDIEDD PAWYTAETED EDAGETSNYS VGQECLDRLA
ISLGGNTIVP VASEQLPAYL AASEWQKHHA ALIALAQIAE GCAKVMIKNL EQVVSMVLNS
FHDSHPRVRW AAINAIGQLS TDLGPDLQNQ YHQRVLPALA AAMDDFQNPR VQAHAASAVL
NFSENCTPEI LTPYLDGIVS KLLVLLQNGK QMVQEGALTA LASVADSSQE HFQKYYDAVM
PYLKTILVNA TDKSNRMLRA KSMECISLVG MAVGKEKFRD DAKQVMEVLM SLQGSQMETD
DPTTSYMLQA WARLCKCLGQ DFLPYMRVVM PPLLQSAQLK PDVTITSADS DNDIEDSDDE
SMETITLGDK RIGIKTSVLE EKATACNMLC CYADELKEGF FPWIDQVAPT LVPLLKFYFH
EEVRKAAVSA MPELLRSAKL AVEKGMAQGR NETYVKQLSD FIIPALVEAL HKEPDTEICA
SMLDALNECL QITGPLLDEG QVRSIVDEIK QVITASASRK RERAERAKAE DFDAEEGEFV
KEENEQEEEV FDQVGEILGT LIKTFKASFL PFFDELSSYL TPMWGKDKTA EERRIAICIF
DDIAEQCREA ALKYYETYLP FILEACNDEN PDVRQAAVYG LGVCAEFGGP VFKPLVGEAL
SRLNVVIRHP NALQPENVMA YDNAVSALGK ICLFHRDRID AAQVVPAWLN CLPIKGDLIE
AKVVHEQLCS MVERSDNEVL GPNHQYLPKI VAVFAEVLCG KDLATEQTAS RMVNLLRQLQ
QTLPPATLAS TWSSLQPQQQ LALQSILS
//