GenomeNet

Database: UniProt
Entry: A0A0D2QDG5_GOSRA
ID   A0A0D2QDG5_GOSRA        Unreviewed;       843 AA.
AC   A0A0D2QDG5;
DT   29-APR-2015, integrated into UniProtKB/TrEMBL.
DT   29-APR-2015, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=B456_009G107500 {ECO:0000313|EMBL:KJB56132.1};
OS   Gossypium raimondii (New World cotton).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX   NCBI_TaxID=29730 {ECO:0000313|EMBL:KJB56132.1, ECO:0000313|Proteomes:UP000032304};
RN   [1] {ECO:0000313|EMBL:KJB56132.1, ECO:0000313|Proteomes:UP000032304}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23257886; DOI=10.1038/nature11798;
RA   Paterson A.H., Wendel J.F., Gundlach H., Guo H., Jenkins J., Jin D.,
RA   Llewellyn D., Showmaker K.C., Shu S., Udall J., Yoo M.J., Byers R.,
RA   Chen W., Doron-Faigenboim A., Duke M.V., Gong L., Grimwood J., Grover C.,
RA   Grupp K., Hu G., Lee T.H., Li J., Lin L., Liu T., Marler B.S., Page J.T.,
RA   Roberts A.W., Romanel E., Sanders W.S., Szadkowski E., Tan X., Tang H.,
RA   Xu C., Wang J., Wang Z., Zhang D., Zhang L., Ashrafi H., Bedon F.,
RA   Bowers J.E., Brubaker C.L., Chee P.W., Das S., Gingle A.R., Haigler C.H.,
RA   Harker D., Hoffmann L.V., Hovav R., Jones D.C., Lemke C., Mansoor S.,
RA   ur Rahman M., Rainville L.N., Rambani A., Reddy U.K., Rong J.K.,
RA   Saranga Y., Scheffler B.E., Scheffler J.A., Stelly D.M., Triplett B.A.,
RA   Van Deynze A., Vaslin M.F., Waghmare V.N., Walford S.A., Wright R.J.,
RA   Zaki E.A., Zhang T., Dennis E.S., Mayer K.F., Peterson D.G., Rokhsar D.S.,
RA   Wang X., Schmutz J.;
RT   "Repeated polyploidization of Gossypium genomes and the evolution of
RT   spinnable cotton fibres.";
RL   Nature 492:423-427(2012).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class III subfamily.
CC       {ECO:0000256|ARBA:ARBA00010338}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001748; KJB56132.1; -; Genomic_DNA.
DR   RefSeq; XP_012443272.1; XM_012587818.1.
DR   AlphaFoldDB; A0A0D2QDG5; -.
DR   STRING; 29730.A0A0D2QDG5; -.
DR   EnsemblPlants; KJB56132; KJB56132; B456_009G107500.
DR   GeneID; 105768079; -.
DR   Gramene; KJB56132; KJB56132; B456_009G107500.
DR   KEGG; gra:105768079; -.
DR   eggNOG; ENOG502QRJM; Eukaryota.
DR   OMA; IPLAHTV; -.
DR   OrthoDB; 454859at2759; -.
DR   Proteomes; UP000032304; Chromosome 9.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0008289; F:lipid binding; IEA:InterPro.
DR   GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR   CDD; cd14686; bZIP; 1.
DR   CDD; cd00086; homeodomain; 1.
DR   CDD; cd08875; START_ArGLABRA2_like; 1.
DR   Gene3D; 3.30.530.20; -; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR044830; HD-Zip_III.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR013978; MEKHLA.
DR   InterPro; IPR023393; START-like_dom_sf.
DR   InterPro; IPR002913; START_lipid-bd_dom.
DR   PANTHER; PTHR45950; HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-14; 1.
DR   PANTHER; PTHR45950:SF10; HOMEOBOX-LEUCINE ZIPPER PROTEIN REVOLUTA; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF08670; MEKHLA; 1.
DR   Pfam; PF01852; START; 1.
DR   SMART; SM00389; HOX; 1.
DR   SMART; SM00234; START; 1.
DR   SUPFAM; SSF55961; Bet v1-like; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS50848; START; 1.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW   Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Reference proteome {ECO:0000313|Proteomes:UP000032304}.
FT   DOMAIN          28..84
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          154..382
FT                   /note="START"
FT                   /evidence="ECO:0000259|PROSITE:PS50848"
FT   DNA_BIND        30..85
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   COILED          94..121
FT                   /evidence="ECO:0000256|SAM:Coils"
SQ   SEQUENCE   843 AA;  92307 MW;  EA63291E6C517289 CRC64;
     MAMALAQHRE SSSDSSINKH LDGGKYVRYT AEQVEALERV YAECPKPSSL RRQQLIRECP
     ILSNIEPKQI KVWFQNRRCR EKQKKESSRL QTVNRKLTAM NKLLMEENDR LQKQVSQMVC
     ENGYMKQQLH TVNTSAADAN CDSLGTTPQH SLRDTNSPAG LLSIAEETLA EFLSKATGTA
     VDWVQMPGMK PGPDSVGIFT ISQSCSGVAA RACGLVSLEP VKIAEILKDR PSWSRDCRNL
     EVFTMFPAGN GGTIELVYAQ TFAPTTLAPA RDFWTLRYTT TLENGSLVVC ERSLSGSGAG
     PSAAAAAQFV RAEVLPSGYL IRPCEGGGSI IHIVDHLNLE AWSVPEVLRP LYESSKVIAQ
     KMTIAALRYI RQIAQETSGE VVYSLGRQPA VLRTFSQRLS RGFNDAINGF NDDGWSIMNC
     DGTEDVIIAI NSIKSFSSTS NPANALSFLG GVLCAKASML LQNIAPAVLV RFLREHRSEW
     ADFNVDAYCA ASLKAGTNAY PGMRPTRFTG SQIIMPLGHT IEHEELLEVI RLEGHSFVQE
     DAFVSRDIHL LQICSGIDEN AVGACSELVF APIDEMFPDD APLLPSGFRV IPLDSKSSDT
     QDSLTTNRTL DLTSSLEVGP ATNHAAGDTS SCRNTRSVLT IAFQFPFESN LRDNVATMAR
     QYVRSVISSV QRVAMAISPS GLNPAVGSKL SPGSPEALTL AHWICRSYSY HLGAELLRSE
     SLGGDSILKN LWQHQDAILC CSLKSQPVFI FANQAGLDML ETTLVALQDI TLDKLFDESG
     RKALCSDFGK LMQQGYACLP AGICMSTMGR HVSYEQAFAW KVLEADESTV HCLAFSFVNW
     SFV
//