ID A0A061FG29_THECC Unreviewed; 645 AA.
AC A0A061FG29;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=MuDR family transposase isoform 1 {ECO:0000313|EMBL:EOY16275.1};
GN ORFNames=TCM_035107 {ECO:0000313|EMBL:EOY16275.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY16275.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY16275.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001886; EOY16275.1; -; Genomic_DNA.
DR EMBL; CM001886; EOY16276.1; -; Genomic_DNA.
DR EMBL; CM001886; EOY16277.1; -; Genomic_DNA.
DR RefSeq; XP_007019050.1; XM_007018988.2.
DR RefSeq; XP_007019051.1; XM_007018989.2.
DR AlphaFoldDB; A0A061FG29; -.
DR SMR; A0A061FG29; -.
DR STRING; 3641.A0A061FG29; -.
DR EnsemblPlants; EOY16275; EOY16275; TCM_035107.
DR EnsemblPlants; EOY16276; EOY16276; TCM_035107.
DR EnsemblPlants; EOY16277; EOY16277; TCM_035107.
DR EnsemblPlants; Tc08v2_t010520.1; Tc08v2_p010520.1; Tc08v2_g010520.
DR EnsemblPlants; Tc08v2_t010520.2; Tc08v2_p010520.2; Tc08v2_g010520.
DR GeneID; 18592328; -.
DR Gramene; EOY16275; EOY16275; TCM_035107.
DR Gramene; EOY16276; EOY16276; TCM_035107.
DR Gramene; EOY16277; EOY16277; TCM_035107.
DR Gramene; Tc08v2_t010520.1; Tc08v2_p010520.1; Tc08v2_g010520.
DR Gramene; Tc08v2_t010520.2; Tc08v2_p010520.2; Tc08v2_g010520.
DR KEGG; tcc:18592328; -.
DR eggNOG; ENOG502R0RS; Eukaryota.
DR HOGENOM; CLU_006767_8_2_1; -.
DR InParanoid; A0A061FG29; -.
DR OMA; HGFCLRF; -.
DR OrthoDB; 592672at2759; -.
DR Proteomes; UP000026915; Chromosome 8.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR InterPro; IPR018289; MULE_transposase_dom.
DR InterPro; IPR004332; Transposase_MuDR.
DR InterPro; IPR006564; Znf_PMZ.
DR InterPro; IPR007527; Znf_SWIM.
DR PANTHER; PTHR31973; POLYPROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR31973:SF93; SWIM-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF03108; DBD_Tnp_Mut; 1.
DR Pfam; PF10551; MULE; 1.
DR Pfam; PF04434; SWIM; 1.
DR SMART; SM00575; ZnF_PMZ; 1.
DR PROSITE; PS50966; ZF_SWIM; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00325};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00325}.
FT DOMAIN 505..546
FT /note="SWIM-type"
FT /evidence="ECO:0000259|PROSITE:PS50966"
SQ SEQUENCE 645 AA; 73824 MW; 4F64AA21F92A017A CRC64;
MLINYTAFPW KQFLFCFLHG GSTLNHIRLL LPSFFVIMAD HDHALVVADT SHSLVEHTLA
DTSRALVEQT LVIGQEFPDV ETCRRTLKDI AIALHFDLRI VKSDRSRFIA KCSKEGCPWR
VHVAKCPGVP TFSIRTLHGE HTCEGVRNLH HQQASVGWVA RSVEARVRDN PQYKPKEILQ
DIRDQHGVAV SYMQAWRGKE RSMAALHGTF EEGYRLLPAY CEQIRKTNPG SVASVFATGQ
ENCFQRLFIS YRASIYGFIN ACRPLLELDK ADLKGKYLGT LLCAAAVDAD DALFPLAIAI
VDLESDENWM WFMSELRKLL GVNTENMPRL TILSERRQSI VDAVETHFPS AFHGFCLRYV
SENFRDTFKN TKLVNIFWNA VYALTTVEFE SKISEMVEIS QDVIQWFQHF PPQLWAVAYF
EGVRYGHFSL GVTELLYNWA LECHELPVVQ MMEHIRHQLT SWFNNRREMG MRWTSSLVPS
AEKRILEAIA DARCYQVLRA NEIEFEIVST ERTNIVDIRS RVCSCRRWQL YGLPCAHAAA
ALISCGQNAH LFAEPCFTVA SYRETYSQMI NPIPDKSTWK EQGEGAEGGA AKLDITIRPP
KYRRPPGRPK KKVLRVENLK RPKRVVQCGR CHLLGHSQKK CTMPI
//