ID B7PJQ2_IXOSC Unreviewed; 2453 AA.
AC B7PJQ2;
DT 10-FEB-2009, integrated into UniProtKB/TrEMBL.
DT 10-FEB-2009, sequence version 1.
DT 27-MAR-2024, entry version 95.
DE RecName: Full=Zinc finger homeobox protein 4 {ECO:0008006|Google:ProtNLM};
GN ORFNames=IscW_ISCW017915 {ECO:0000313|EMBL:EEC06824.1};
OS Ixodes scapularis (Black-legged tick) (Deer tick).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Acari;
OC Parasitiformes; Ixodida; Ixodoidea; Ixodidae; Ixodinae; Ixodes.
OX NCBI_TaxID=6945;
RN [1] {ECO:0000313|EMBL:EEC06824.1, ECO:0000313|Proteomes:UP000001555}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wikel {ECO:0000313|Proteomes:UP000001555}, and Wikel colony
RC {ECO:0000313|EMBL:EEC06824.1};
RG Ixodes scapularis Genome Project Consortium;
RA Caler E., Hannick L.I., Bidwell S., Joardar V., Thiagarajan M., Amedeo P.,
RA Galinsky K.J., Schobel S., Inman J., Hostetler J., Miller J., Hammond M.,
RA Megy K., Lawson D., Kodira C., Sutton G., Meyer J., Hill C.A., Birren B.,
RA Nene V., Collins F., Alarcon-Chaidez F., Wikel S., Strausberg R.;
RT "Annotation of Ixodes scapularis.";
RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ISCW017915-PA}
RP IDENTIFICATION.
RC STRAIN=wikel {ECO:0000313|EnsemblMetazoa:ISCW017915-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABJB010433945; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; DS727689; EEC06824.1; -; Genomic_DNA.
DR RefSeq; XP_002408391.1; XM_002408347.1.
DR STRING; 6945.B7PJQ2; -.
DR PaxDb; 6945-B7PJQ2; -.
DR EnsemblMetazoa; ISCW017915-RA; ISCW017915-PA; ISCW017915.
DR KEGG; isc:IscW_ISCW017915; -.
DR VEuPathDB; VectorBase:ISCI017915; -.
DR VEuPathDB; VectorBase:ISCP_022630; -.
DR VEuPathDB; VectorBase:ISCW017915; -.
DR HOGENOM; CLU_000245_1_0_1; -.
DR InParanoid; B7PJQ2; -.
DR OMA; SVCNKFS; -.
DR Proteomes; UP000001555; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 6.
DR Gene3D; 1.10.10.60; Homeodomain-like; 4.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF3; ZINC FINGER PROTEIN 2; 1.
DR Pfam; PF00046; Homeodomain; 4.
DR Pfam; PF12874; zf-met; 1.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 19.
DR SMART; SM00451; ZnF_U1; 7.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 4.
DR SUPFAM; SSF46689; Homeodomain-like; 4.
DR PROSITE; PS00027; HOMEOBOX_1; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 12.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 9.
PE 1: Evidence at protein level;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Proteomics identification {ECO:0007829|PeptideAtlas:B7PJQ2};
KW Reference proteome {ECO:0000313|Proteomes:UP000001555};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 135..164
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 747..776
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 830..852
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 858..886
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 991..1020
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1035..1059
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1133..1162
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1330..1358
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1557..1617
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 1802..1862
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 1887..1916
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1954..2014
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2215..2275
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 1559..1618
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 1804..1863
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 1956..2015
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2217..2276
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 178..206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 307..328
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 696..737
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1276..1306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1415..1452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1653..1740
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1775..1807
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1925..1957
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2060..2091
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2117..2219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2366..2416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1535..1569
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 715..737
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1276..1300
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1419..1452
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1674..1688
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1689..1703
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1704..1740
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1781..1807
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2068..2091
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2142..2184
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2187..2219
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2382..2401
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2453 AA; 270808 MW; 060737370FF1E6D5 CRC64;
MDRPTSAGSS SSPGVSPKLI PGPQATFMNA CPSPVVSCPQ HPDGKTNGVE CPKCDTVLGS
SRSLGGHMTM MHSRNSCKTL KCPKCNWHYK YQETLEIHMK EKHPENELNC IYCLTSQPHP
RLARGETYTC GYKPYRCEVC NYSTTTKGNL SIHMQSDKHI NNIQELQNGN LQATPEQLLQ
QQQPPQQQQP PALPEAVKKP GPAAKPKATW RCDVCNYETN VARNLRIHMT SEKHTHNMMV
LQQNVKHMQQ LSALQQAQAL DPLFQFHPGL LLPCDGQLQP EAALADMAYN HALLMMASQQ
HQQQQALAAA AAGSPLASPT MDMEHPDPTL RLELSTDDTS QLFQCCVCGL FSVDSVEALG
AHFQQDRTRQ REDEVLMAVA GSYVCKLCSY KTNLRANFQL HCKTDKHLQR LQHVNHIKEG
GPRNDWKLKY VNVSNPVQVR CNACDYYTNS VHKLQLHAAS ARHDLAARLF LFLVAAEGAL
RGDAARHYQC TACGFHSRTR SGLVGHAHSL RHLQHARPKD EAADTWKDLF VVKEFAGNEH
VSFDVDLENK GTEAMEMDLP PPVLSLEDHE SVTKSGGPSS PSPMLKQALL ENAAKAEASA
SSQCSEEEKA NEGQACNMCS FSAKNQALLQ AHVVSQHSSQ QQAQQPKSVQ CPLCQEQFKE
LNKIESHLVE THNVTSEGMQ RLLALVDTSV WATATTSSKR SKSRASSHAA VHGGTEDSDS
ADADCHASDS ESKDKTASDD TLCAEDLTCS ACGKSFGVLE DLFSHQIDTG HLELKQTPRG
PGYLCWKKAC NQYFMTPHAV QVHFKEIHSK KTQQQQQHQL AVSERHVYKY RCNQCSLAFK
TLEKLQLHSQ YHVIRAATQC VLCGRSFRSV AALQKHVETS HIDMTPEELE QYKASLVNNP
LLTSGGGAVL DPQTTELLKK ESNREEDGVL DELMDTDDVA HLNDNDSGVE AMDTGLAALT
EGATFKEQFF EDYINSQAIA EDSYNDPTRK YKCHRCKVAF TRQSYLTSHN KTFLHRKEEK
LSYPMEKYLD PNRPYKCDIC KESFTQKNIL LVHYNSVSHL HKLKLSLKDG GVPMQTTASA
NTTALTITPS SNCSNNVGTV QTIVAPPPSP TSSQVLALSP TNNNNNDTEK KPFKCNICKV
AYTQCSTLDI HMRSVLHQTR ASKLHELAMT GQEALVQHQQ LYCVMSSATQ SASPMLVAPK
HPVHGHHNQH HAPQQLMPEG QLNSPQLQAA ATATQQLLRL RSPYPRTKPP MYKHLLEGFG
FDVVMQFNEF NQKRMRRDAE KREQILQHQQ QKKEDVEGPT SPKADAVQIK KEVLDSEPEK
ENTMPEINRS VCNLCQKEFS SIWVLKAHRE EVHHDVVPLD FVSKLADDFR TEYDKKNASQ
VSAEEGMLDL AGSQAAGGQN DPAAPGTAAA AAAANVASGG SGTPGQSPQL PTPASVTPPV
SLSVAPQVSQ QQQQPVSEAS AVTANQMAAQ LQFNQLLMSM GLGMGLPMGM NMPFAAAMNM
HPPLIPVVMP PHMDPLMSSA FNHPMMPGAM DPSFFAAQQK LLQQQHQQLA QAQQQAQQQK
RARTRISDEQ LRILRAYFDI NNSPTEEQLM EMSEKSGLPL KVIKHWFRNT LFKERQRNKD
SPYNFNNPPS TYLNLEEYEK TGEAKVIALS EKNNNAEESS TPVAAKPAPV SSAGKEAPSS
STTSSETVVK KEVIKEEARV EAANTDTQDS IASSNTTQNG EGALLTPKLE TPASNTSQHN
NSSLAKFEFL RSLTSRLSVD PPLDLASPVH DGMLLGTPPP LLERSSSTTP TPMTSTPTAG
SSGKRANRTR FTDYQIKVLQ EFFETNAYPK DDDLEYLSKL LNLSPRVIVV WFQNARQKAR
KVYENQPPSA PEDDGSGRFQ RTPGLNYQCK KCLQVFQRYY ELIKHQKTSC FKDENPLAVQ
LKAAATGSGT GGGSGSGGGG SGDDRSMDPM ADPPRDKRLR TTILPEQLDY LYQKYQMESN
PSRKMLENIA RDVGLKKRVV QVWFQNTRAR ERKGQFRAHQ QVIHKRCPFC RALFKARSAL
ESHLATRHAD LYSKGELNID SFPDGDADSN PGTPTSSASE ESKAISPSAG SDLVHNTMKK
YYEDSLKKYL DEMTGAPTDL SVKPKACKGE AGGGEAPLDL SKPLRLDSDR SGDHLSVSEK
SMDLRIGVSD DARSETHSES TDNMDGDDSF FESNPTSPLG MHGGGGQQPS RPVSGSGKRF
RTQMTTVQLK VMKSIFADYK TPSMAECEML GREIGLPKRV VQVWFQNARA KEKKSKLAFA
KTFGQEMEPP KPPEECSLCA VKYNHQFSNT SMQDHLFSKR HIDALRNHID NIKKMTDDSS
TDRTAAADHD KPPSLVQQLQ MLGQGLPPSF ALPSSSPKDA AAPPAKEDRK VVVDEPKKED
TSQAPQAPPQ APQAGDAGAL VPYLYAGLPG YYPGMPGAAA FIHPTLFSGR NKL
//