ID A0A087QPD1_APTFO Unreviewed; 3526 AA.
AC A0A087QPD1;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Zinc finger homeobox protein 4 {ECO:0000313|EMBL:KFM03085.1};
GN ORFNames=AS27_13115 {ECO:0000313|EMBL:KFM03085.1};
OS Aptenodytes forsteri (Emperor penguin).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Sphenisciformes; Spheniscidae;
OC Aptenodytes.
OX NCBI_TaxID=9233 {ECO:0000313|EMBL:KFM03085.1, ECO:0000313|Proteomes:UP000053286};
RN [1] {ECO:0000313|EMBL:KFM03085.1, ECO:0000313|Proteomes:UP000053286}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_AS27 {ECO:0000313|EMBL:KFM03085.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL225796; KFM03085.1; -; Genomic_DNA.
DR STRING; 9233.A0A087QPD1; -.
DR Proteomes; UP000053286; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 4.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 5.
DR Gene3D; 1.10.10.60; Homeodomain-like; 4.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003604; Matrin/U1-like-C_Znf_C2H2.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45891; ZINC FINGER HOMEOBOX PROTEIN; 1.
DR PANTHER; PTHR45891:SF2; ZINC FINGER HOMEOBOX PROTEIN 4; 1.
DR Pfam; PF00046; Homeodomain; 4.
DR Pfam; PF00096; zf-C2H2; 2.
DR SMART; SM00389; HOX; 4.
DR SMART; SM00355; ZnF_C2H2; 23.
DR SMART; SM00451; ZnF_U1; 7.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 5.
DR SUPFAM; SSF46689; Homeodomain-like; 4.
DR PROSITE; PS00027; HOMEOBOX_1; 2.
DR PROSITE; PS50071; HOMEOBOX_2; 4.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 11.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 7.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000053286};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 1346..1369
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1374..1402
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1490..1521
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1542..1571
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1884..1912
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2022..2082
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2119..2179
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2207..2236
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2498..2558
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 2570..2598
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2824..2884
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 2024..2083
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2121..2180
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2500..2559
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT DNA_BIND 2826..2885
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..54
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 423..479
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 522..606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1096..1127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1247..1336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1441..1474
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1575..1599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1625..1644
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1793..1840
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1931..2014
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2260..2376
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2451..2505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2638..2760
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2772..2827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3012..3126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3240..3296
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3402..3421
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3471..3493
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..26
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 34..54
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 433..453
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 542..558
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..603
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1109..1125
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1278..1315
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1316..1334
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1578..1599
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1793..1828
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1935..1962
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1976..2009
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2260..2286
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2295..2312
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2313..2376
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2451..2470
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2472..2505
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2638..2674
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2693..2728
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2746..2760
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2772..2797
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2798..2819
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3014..3053
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3054..3084
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3093..3126
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3240..3271
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3272..3296
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3404..3421
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3471..3486
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3526 AA; 389808 MW; BE7C0B79820FE1F8 CRC64;
METCDSPTIS RQENGQSTSK LCGTTQLDNE VPEKVAGMEP DRENSTTDDN LRTDERKSEI
LLGFSVENAA ATQVTSAKEI PCNECATSFP SLQKYMEHHC PNARLPVLKD DNESEISELE
DSDVENLTGE IVYQPDGSAY IIEDSKESGQ NAQTGANSKL FSTAMFLDSL TSAGEKNEQS
ASAPMSFYPQ IINTFHIASS LGKPFTADQA FPNTSALAGV GPVLHSFRVY DLRHKRDKDY
LTSDGSAKNS CVSKDVPNNV DLSKFDGCVS DGKRKPVLMC FLCKLSFGYI RSFVTHAVHD
HRMTLNEEEQ KLLSNKYVSA IIQGIGKDKE PLISFLEPKK STSVYPHFST TNLIGPDPTF
RGLWSAFHVE NGDSLQAGFA FLKGSASTAG SAEQPLGITQ MPKAEVNLGG LSSLVATTPI
TSVSLSHSSS ESNKLSESKD QENNCERQKE TNTLHPNGEF PIKSEPTEPV EEEDEDTYSN
ELDDDEVLGE LADSIGSKDF PLLNQSISPL SSSVLKFIEK GTSSSSATVS DDTDKTKQTA
AHRHSSNVTS NYSISGKDFA DASASKDSPT ALHPNETVRG DEDSSVTPHQ HSFTPSTPSA
GDGSPGSGIE CPKCDTVLGS SRSLGGHMTM MHSRNSCKTL KCPKCNWHYK YQQTLEAHMK
EKHPEPGGSC VYCKTGQPHP RLARGESYTC GYKPFRCEVC NYSTTTKGNL SIHMQSDKHL
NNVQNLQNGN GEQVYGHTAP PANAALSGCG TPSPSKPKQK PTWRCEVCDY ETNVARNLRI
HMTSEKHMHN MMLLQQNMKQ IQHNLHLGLA PAEAELYQYY LAQNIGLTGM KLENPADPQM
MINPFQLDPA TAAALAPGLG ELSPYISDPA LKLFQCAVCN KFTSDSLEAL SVHVSTERSL
PEEEWRAVIG DIYQCKLCNY NTQLKANFQL HCKTDKHMQK YQLVAHIKEG GKTNEWRLKC
IAIGNPVHLK CNACDYYTNS VDKLRLHTTN HRHEAALKLY KHLQKHESAV NPESCYYYCA
LCDYSTKVKL NLVQHVRSVK HQQTEGLRKL QLHQQGLAPE EDNLSEIFFV KDCPPNELVS
FLRSFLCLST EEQGEDTEGS AKSTSVAVAD DKDSSERDNS EGKKSCKDSV NTVVGAQQLL
LAKEEDGAAK KSKPPEDNKF CHEQFYQCPY CNYNSRDPNR IQMHVLSQHS MQPVICCPLC
QDVLSNKMHL QLHLTHLHSV SPDCVEKLLM TVPVPDVMMP NSMLLPATAS EKSERDTPAT
ITAEGSGKYP GESPVDEKST PGIDESKTGM EIKTEEQKPP KESTETPDWN KSSSKDIKTT
DSMTDQLNEQ QKKQQLSVSD RHVYKYRCNH CSLAFKTMQK LQIHSQYHAI RAATMCSLCQ
RSFRTFQALK KHLEAGHPEL NEAELQQLCA SLPVNGELWA ESESMAQDDH ALEQEIERDY
EMDQEGKASP VGSDSSSIPD DMGSEPKRTL PFRKGPNFTM EKFLDPSRPY KCTVCKESFT
QKNILLVHYN SVSHLHKLKK VLQEASSPVP QETNSSTDNK PYKCSICNVA YSQSSTLEIH
MRSVLHQTKA RAAKLEPSGN ISSGNSVAGN VNSPSQGMLE SMSLPAVNSK ETHLDAKELN
KKQASELISA QPTHHPPQSP AQLQMQLQHE LQQQAAFFQP QFLNPAFLPH FPMTPEALLQ
FQQPQFLFPF YIPGTEFSLS PDLGLPGSAT FGMPGMAGMT GSLLEDLKQQ IQTQHHVGQT
QLQILQQQAQ QYQSTQPQLQ SQKQQQQSSK LMKAEQNTLV STDCQLIKDM PSYKESEEIS
EKQEKPKQEF TNENEGLKEN KDMKKPKSSE SAIPPPRIAS GARGNAAKAL LENFGFELVI
QYNENRQKVQ KKSKTGEGEN TDKLECGTCS KLFSNILILK SHQEHVHGQF FPYGALEKFA
RQYREAYDKL YPISPSSPET PPPPPPPPPP PPPPPPPTPS QPSSAGAGKI QNTTPTPLQA
PPPTPPPPPP PPPPPPPPPP PPSAPPQVQL PLGLDPNFLR HSQFKRPRTR ITDDQLKILR
AYFDINNSPS EEQIQEMAEK SGLSQKVIKH WFRNTLFKER QRNKDSPYNF SNPPITVLED
IRIDPQPSAV EPYKSDASFS KRSSRTRFTD YQLRVLQDFF DTNAYPKDDE IEQLSTVLNL
PTRVIVVWFQ NARQKARKSY ENQAETKDNE KRELTNERYI RTSNMQYQCK KCSVVFPRIF
DLITHQKKQC YKDEDDDAQD ESQTEDSLDA SDQTVYKNCT VSSQNDSSKS LAVTAASSGS
GSSTPLIPSP KPEPEKASPK PESTEKPKQN ETISKQTDTT SQSAKPVQST PVTSSDSQPA
ASQPQQQKQS QIVGRPPSAS QTTPVPSSPL PISMTSLQNS LPPQLLQYQC DQCTVAFPTL
ELWQEHQHMH FLAAQNQFLH SQFLERPMDM PYMIFDPNNP LMTGQLLNSS LAQMPPQTGS
SHTTHPATVS GSLKRKLDDK EDNNCSEKEG GNSGEDQHRD KRLRTTITPE QLEILYEKYL
LDSNPTRKML DHIAREVGLK KRVVQVWFQN TRARERKGQF RAVGPAQSHK RCPFCRALFK
AKSALESHIR SRHWNEGKQA GYSLPPSPLI STEDGGESPQ KYIFFDYPSL SLAKTELSSE
NELASTVSTP VSKTAEMSPK NLLSPSSFKA ESSEDIENLN APPADSGYDQ NKTDFDETSS
INTAISDATT GDEGNNEMES TTGSSGDAKP ASPPKEPKPL VNDTLPKAAT TPTNENTDDK
FIFSLTSPSI HFSEKDGDHD QSYYITDDPD DNADRSETSS IADPSSPNPF GASNPFKSKN
SDRPGHKRFR TQMSNLQLKV LKACFSDYRT PTMQECEMLG NEIGLPKRVV QVWFQNARAK
EKKFKINIGK PFMINQTGPD GTKPECSLCG VKYSARLSIR DHIFSKQHIT KVRETVGSQL
DREKDYLAPT TVRQLMAQQE LDRIKKATDV LGLTVQQPGM MDSSSLHGIS LPAAYPGLPG
LPPVLLPGMN GPSSLPGFPQ SSNTLTSPGA GMLGFPTSAT SSPALSLSSA PSKPLLQTPP
PPPPPPPPPP PPPPPPPPPP PSSFLSGQQT EQQSKESEKK NTINKPNKVK KIKEEELEAN
KPEKHLKKEE KISSALSVLG KVVGEAHVDP TQLQALQNAI AGDPASFIGG QFLPYFIPGF
ASYFTPQLPG TVQGGYLPPV CGMESLFPYG PAVPQTIAGL SPGALLQQYQ QYQQNLQDSL
QKQQKQQQEQ QQKQVQAKSS KAENDQQQNS SDTSETKEDR SSATESTKEE PQLDSKSADF
SDTYIVPFVK YEFICRKCQM MFTDEDAAVN HQKSFCYFGQ PLIDPQETVL RVPVSKYQCL
ACDVAISGNE ALSQHLQSSL HKEKTIKQAM RNAKEHVRLL PHSVCSPNPN TTSTSQSAAS
SNNTYPHLSC FSMKSWPNIL FQASARKAAS SPSSPPSLSL PSTVTSSLCS TSGVQTSLPT
ESCSDESDSE LSQKLEDLDN SLEVKAKPAS GLDGNFNSIR MDMFSV
//