ID A0A061GJV9_THECC Unreviewed; 2146 AA.
AC A0A061GJV9;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 54.
DE SubName: Full=Helicases,ATP-dependent helicases,nucleic acid binding,ATP binding,DNA-directed DNA polymerases,DNA binding {ECO:0000313|EMBL:EOY29653.1};
GN ORFNames=TCM_037134 {ECO:0000313|EMBL:EOY29653.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY29653.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY29653.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY29653.1; -; Genomic_DNA.
DR STRING; 3641.A0A061GJV9; -.
DR EnsemblPlants; EOY29653; EOY29653; TCM_037134.
DR Gramene; EOY29653; EOY29653; TCM_037134.
DR eggNOG; KOG0950; Eukaryota.
DR HOGENOM; CLU_000818_2_0_1; -.
DR InParanoid; A0A061GJV9; -.
DR OMA; HNMCQQF; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IBA:GO_Central.
DR GO; GO:0004386; F:helicase activity; IEA:UniProtKB-KW.
DR GO; GO:0006261; P:DNA-templated DNA replication; IEA:InterPro.
DR GO; GO:0006302; P:double-strand break repair; IBA:GO_Central.
DR CDD; cd18026; DEXHc_POLQ-like; 1.
DR CDD; cd08638; DNA_pol_A_theta; 1.
DR CDD; cd18795; SF2_C_Ski2; 1.
DR Gene3D; 1.10.3380.20; -; 1.
DR Gene3D; 3.30.70.370; -; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR011545; DEAD/DEAH_box_helicase_dom.
DR InterPro; IPR001098; DNA-dir_DNA_pol_A_palm_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR002298; DNA_polymerase_A.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR046931; HTH_61.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR048960; POLQ-like_helical.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR10133; DNA POLYMERASE I; 1.
DR PANTHER; PTHR10133:SF62; DNA POLYMERASE THETA; 1.
DR Pfam; PF00270; DEAD; 1.
DR Pfam; PF00476; DNA_pol_A; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF20470; HTH_61; 1.
DR Pfam; PF21099; POLQ_helical; 1.
DR PRINTS; PR00868; DNAPOLI.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SMART; SM00482; POLAc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF158702; Sec63 N-terminal domain-like; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Helicase {ECO:0000313|EMBL:EOY29653.1};
KW Hydrolase {ECO:0000313|EMBL:EOY29653.1};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 523..715
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 761..953
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT REGION 19..49
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 443..463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..49
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2146 AA; 236967 MW; 38422A2ED323DC93 CRC64;
MASGSPRARI DQFFASKKRK TQSPCLKTGR FEKNAKTTVE GSPSAKGTLD NYLRTSQENE
IVQPSCTILG QDPVKRSLAS EIDKSSENEN EQSFLLAEVK SRSCEAFQVT HNGIYLGSSE
EGNFDFGDPA EVAAQGRENS ELKQFATDFL SLYCSEVQPH VGSPSQLKAN DQKRHGSLSM
LSEEDKRFKK RYLISHQLQT EVETACSTKK NLKNETGEFI LNLSREVNTG SNLIELQASL
RKCTTTTKPV LNTTECSTPG SSIVKACAYR TPQSMRGSSM FSPGEAFWNE AIEIADGLFS
RSDALSAQVA EETNNPKSQY EINNTYNLGN KIVDDKSKEM PDECESRVKL KGVGTSLESA
VKQKKEMDKE VSLLPVKHLD FSFDDKILDG SIPHIIQRDS KVTECGIVNH KGPSIVNTLT
DHDELQTIEE VQGEQQERAS VHVVPKKEDN LSSQDNNSIT STSAANKANK SIGECNETTT
PLSFVALKDR LSLSSWLPLE ICRTYKKKGI SELYPWQVDC LQVDGVLQRK NLVYCASTSA
GKSFVAEILM LRRVILTGKV ALLVLPYVSI CAEKAEHLEV LLEPLGKQVR SYYGNQGGGT
LPKDTSVAVC TIEKANSLIN RLLEEGRLSE VGIIVIDELH MVGDQSRGYL LELLLTKLRY
AAGEGMSESS SGESSGSSSG KADPAHGLQI VGMSATMPNV EAVADWLQVS ETYKAALYQT
DFRPVPLEEF IKVGNTIYDK NLDIVRTIPK VVDLGGKDPD HIVELCNEVV QEGHSVLIFC
SSRKGCESTA KHVSKFLKKF SVNVHGDNCE FVDISSAIDA LRRCPAGLDP ILEETLPSGV
AYHHAGLTVE EREIIEACYR RGFVRVLTAT STLAAGVNLP ARRVIFRQPR VGRDFIDATR
YKQMAGRAGR TGIDTKGESM LICKPEEIKR IKGLLNESCP PLQSCLSEDK NGMTHAILEV
VAGGMVQTAS DINRYVKCTL LNSTKPFQDV VKSAQDSLRW LCHRKFLEWN DETKLYGTTP
LGRAAFGSSL CPEESLIVLD DLSRAREGFV LASDLHLVYL VTPINVEVEP DWELYYERFM
ELSALEQSVG YRVGVAEPFL MRMAHGAPIH ISNGLRDGWK RLRGKFENHL GISNNTKLSD
EQTLRVCKRF YVALILSRLV QEAPVGEVCE AFRVAKGMVQ ALQENAGRFA SMVSVFCERL
GWHDLEGLVA KFQNRVSFGV RAEIVELTTI PYVKGSRARA LYKAGLRTPL AIAEASIPDI
FKALFESSSW AAQDLSLESS AQRRMQLGVA KKIKNGARKI VLIKAEEARI AAFSAFKSLG
YSVPQFSRPL VLNGSPGEQE AAITSVGDDS PGSVVWVEQI EHVLAKPLAE RSKNLENVSL
ANEGLIVTKT SADNLVASAE VNLATTLQCN LGMENPGVSV EGPVTGDEVN AAIDRGRSIV
MATVCGYLDQ GVHDGLNEDL CVGNVDSACR KGPLNAVNTP GGIDSFLELW ETTAEFCFDI
HYNRRSEPNS VASFEVHGIA ICWENSPVYY VNLPKDLIWS DNRANNFLST CASSDKNDSL
PPQHFLEMAK LRWKRIVDIM GKSGVRKVSW NLKVQIQVLK SPAISVQRFG GMNLGVKDLG
LEIIDNSYLL FPPVLINDGI DMSIAAWVVW PDEERSSSPN LEKEVKKRLS SEAAAAANQS
AIYSNIMLVL IYQVNVLAEM ELWGIGINME GCLWARNVLG KKLRCLEKEA YKLAGMTFSL
YTAADIANVL YGHLKLPVPE GRNKGKQHPS TDKQCLDLLR DEHPIIPVIR EHRTLAKLLN
CTLGSICSLA RLSRSTHKYT LHGRWLQTST ATGRLSMEEP NLQCVEHMVE FSLSKDKNGS
DANVDHYKVN VRDFFVPTQD DWLLLTADYS QIELRLMAHF SKDSALIELL SKPQGDVFTM
MSAIWTGRAE DSVSSNERDQ TKRLIYGILY GMGADTLAEQ LNCTTDEAKE KIKSFKSSFP
GVASWLCEAI SSCRQKGYVE TLKGRKRFLS KIKFGNSKEK SKAQRQAVNS ICQGSAADII
KIAMIKVYSL IVEGVGRLDS GSSISTKFQM LKGRCRILLQ VHDELVLEVD PSVIKEAAWM
LRMSMESAVS LLGRFPLRVK LNVGKTWGSL EPFLADQGIE EAVSKS
//