ID S9WIM4_CAMFR Unreviewed; 981 AA.
AC S9WIM4;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 24-JAN-2024, entry version 39.
DE SubName: Full=Zinc fingers and homeoboxes protein 3 {ECO:0000313|EMBL:EPY78337.1};
GN ORFNames=CB1_001108087 {ECO:0000313|EMBL:EPY78337.1};
OS Camelus ferus (Wild bactrian camel) (Camelus bactrianus ferus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX NCBI_TaxID=419612 {ECO:0000313|EMBL:EPY78337.1, ECO:0000313|Proteomes:UP000030684};
RN [1] {ECO:0000313|EMBL:EPY78337.1, ECO:0000313|Proteomes:UP000030684}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=bactrian camel {ECO:0000313|Proteomes:UP000030684};
RX PubMed=23149746;
RG Bactrian Camels Genome Sequencing and Analysis Consortium;
RA Jirimutu, Wang Z., Ding G., Chen G., Sun Y., Sun Z., Zhang H., Wang L.,
RA Hasi S., Zhang Y., Li J., Shi Y., Xu Z., He C., Yu S., Li S., Zhang W.,
RA Batmunkh M., Ts B., Narenbatu, Unierhu, Bat-Ireedui S., Gao H.,
RA Baysgalan B., Li Q., Jia Z., Turigenbayila, Subudenggerile, Narenmanduhu,
RA Wang Z., Wang J., Pan L., Chen Y., Ganerdene Y., Dabxilt, Erdemt, Altansha,
RA Altansukh, Liu T., Cao M., Aruuntsever, Bayart, Hosblig, He F., Zha-ti A.,
RA Zheng G., Qiu F., Sun Z., Zhao L., Zhao W., Liu B., Li C., Chen Y.,
RA Tang X., Guo C., Liu W., Ming L., Temuulen, Cui A., Li Y., Gao J., Li J.,
RA Wurentaodi, Niu S., Sun T., Zhai Z., Zhang M., Chen C., Baldan T.,
RA Bayaer T., Li Y., Meng H.;
RT "Genome sequences of wild and domestic bactrian camels.";
RL Nat. Commun. 3:1202-1202(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the ZHX family. {ECO:0000256|ARBA:ARBA00007440}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB017332; EPY78337.1; -; Genomic_DNA.
DR AlphaFoldDB; S9WIM4; -.
DR Proteomes; UP000030684; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 5.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR024578; Homez_homeobox_dom.
DR InterPro; IPR041057; ZHX_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR15467:SF6; ZINC FINGERS AND HOMEOBOXES PROTEIN 3; 1.
DR PANTHER; PTHR15467; ZINC-FINGERS AND HOMEOBOXES RELATED; 1.
DR Pfam; PF00046; Homeodomain; 3.
DR Pfam; PF11569; Homez; 1.
DR Pfam; PF18387; zf_C2H2_ZHX; 1.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 2.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 5.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000030684}.
FT DOMAIN 319..362
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 502..552
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 622..670
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 771..821
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 321..363
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 504..553
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 624..671
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 773..822
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 203..225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 596..618
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 665..718
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 891..956
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..66
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 596..611
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 665..681
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 916..930
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 981 AA; 106126 MW; 877CA3EBEBB36A98 CRC64;
MASKRKSTTP CMIPVKTVVL PDASAEAQPA EATREVPQQD MPPEAPATSS EATQSASSTD
GATLANGHRS TLDGYSYACK YCDFRSQDIT QFVGHLNSEH TDFNKDPTFV CTECSFLAKT
PEGLSLHNAK CHSGEASFVW HVAKPDNHVV VEQSVPESTS PPDALGEPNV EGTDGQAEII
ITKTPIMKIM KGKAEAKKIH TLKENVPNQP TGEALPNPSA GDAEVKEGDH AFVNGAVPVS
QASTSSAKPP HAANGPLIGT VPVLPAGIAQ FLSLQQPPPA HAQHHAHQPL PTAKSLPKVM
IPLSSIPTYN AAMDSNSFLK NSFHKFPYPT KAELCYLTVV TKYPEEQLKI WFTAQRLKQG
ISWSPEEIED ARKKMFNTVI QSVPQPTITV LNTPLVASAS NVQHLIQAAL PGHVVGQPEG
TAGGLLVTQP LMANGLQAPS SSLPLAVTSV PKPPTVAPIN TVCSNTTSAV KVVNAAQSLL
TACPSITSQA FLDASIYKNK KSHEQLSALK GSFCRNQFPG QSEVEHLTKV TGLSTREVRK
WFSDRRYHCR NLKGSRAGLP GEHGSILGDS VPEVPFPPSS KAPEVPCIPT AATLATPPSA
KRQSWHQTPD FTPTKYKERA PEQLRALESS FAQNPLPLDE ELDRLRTETK MTRREIDGWF
SERRKKVSAK EAKKVEEGAS HGEEEAAEDE GEEGSAGDLR VLGENGSPEV PSSHTLAERK
VSPIKINLKN LRVTEANGRS ELPGLGICEP EDDVPNKLAE QPPGKVSYKK TAQQRHLLRQ
LFVQTQWPSN QDYEAIMAQT GLPRPEVVRW FGDSRYALKN GQLKWYEDYK RGNFPPGLLV
IAPGNRELLQ DYYVTHKMLF EEDLQSLCDK TQMGAQQVKQ WFAEKMGEET RAVADTGSEG
QGPGEPAAVH KGTGDTYSEV SENSESWEPS APEARSEPFD TPSPQAGPQL GKEPVGAALS
PVGVGALAAG TKFSAQSIRC F
//