ID A0A4W3JM82_CALMI Unreviewed; 606 AA.
AC A0A4W3JM82;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 28-JAN-2026, entry version 31.
DE SubName: Full=Forkhead box P4 {ECO:0000313|Ensembl:ENSCMIP00000033205.1};
OS Callorhinchus milii (Ghost shark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Holocephali; Chimaeriformes; Callorhinchidae; Callorhinchus.
OX NCBI_TaxID=7868 {ECO:0000313|Ensembl:ENSCMIP00000033205.1, ECO:0000313|Proteomes:UP000314986};
RN [1] {ECO:0000313|Proteomes:UP000314986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=17185593; DOI=10.1126/science.1130708;
RA Venkatesh B., Kirkness E.F., Loh Y.H., Halpern A.L., Lee A.P., Johnson J.,
RA Dandona N., Viswanathan L.D., Tay A., Venter J.C., Strausberg R.L.,
RA Brenner S.;
RT "Ancient noncoding elements conserved in the human genome.";
RL Science 314:1892-1892(2006).
RN [2] {ECO:0000313|Proteomes:UP000314986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=17407382;
RA Venkatesh B., Kirkness E.F., Loh Y.H., Halpern A.L., Lee A.P., Johnson J.,
RA Dandona N., Viswanathan L.D., Tay A., Venter J.C., Strausberg R.L.,
RA Brenner S.;
RT "Survey sequencing and comparative analysis of the elephant shark
RT (Callorhinchus milii) genome.";
RL PLoS Biol. 5:E101-E101(2007).
RN [3] {ECO:0000313|Proteomes:UP000314986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24402279; DOI=10.1038/nature12826;
RG International Elephant Shark Genome Sequencing Consortium;
RA Venkatesh B., Lee A.P., Ravi V., Maurya A.K., Lian M.M., Swann J.B.,
RA Ohta Y., Flajnik M.F., Sutoh Y., Kasahara M., Hoon S., Gangu V., Roy S.W.,
RA Irimia M., Korzh V., Kondrychyn I., Lim Z.W., Tay B.H., Tohari S.,
RA Kong K.W., Ho S., Lorente-Galdos B., Quilez J., Marques-Bonet T.,
RA Raney B.J., Ingham P.W., Tay A., Hillier L.W., Minx P., Boehm T.,
RA Wilson R.K., Brenner S., Warren W.C.;
RT "Elephant shark genome provides unique insights into gnathostome
RT evolution.";
RL Nature 505:174-179(2014).
RN [4] {ECO:0000313|Ensembl:ENSCMIP00000033205.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [5] {ECO:0000313|Ensembl:ENSCMIP00000033205.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00089}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A4W3JM82; -.
DR STRING; 7868.ENSCMIP00000033205; -.
DR Ensembl; ENSCMIT00000033710.1; ENSCMIP00000033205.1; ENSCMIG00000014179.1.
DR GeneTree; ENSGT00940000158700; -.
DR InParanoid; A0A4W3JM82; -.
DR Proteomes; UP000314986; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; IEA:TreeGrafter.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:TreeGrafter.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-KW.
DR CDD; cd20067; FH_FOXP4; 1.
DR FunFam; 1.20.5.340:FF:000005; Forkhead box P1, isoform CRA_f; 1.
DR FunFam; 1.10.10.10:FF:000010; Forkhead box P2 isoform B; 1.
DR Gene3D; 1.20.5.340; -; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR047414; FH_FOXP4.
DR InterPro; IPR001766; Fork_head_dom.
DR InterPro; IPR050998; FOXP.
DR InterPro; IPR032354; FOXP-CC.
DR InterPro; IPR030456; TF_fork_head_CS_2.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR45796; FORKHEAD BOX P, ISOFORM C; 1.
DR PANTHER; PTHR45796:SF7; FORKHEAD BOX PROTEIN P4; 1.
DR Pfam; PF00250; Forkhead; 1.
DR Pfam; PF16159; FOXP-CC; 1.
DR PRINTS; PR00053; FORKHEAD.
DR SMART; SM00339; FH; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00658; FORK_HEAD_2; 1.
DR PROSITE; PS50039; FORK_HEAD_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00089}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00089}; Reference proteome {ECO:0000313|Proteomes:UP000314986};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 392..465
FT /note="Fork-head"
FT /evidence="ECO:0000259|PROSITE:PS50039"
FT DNA_BIND 392..465
FT /note="Fork-head"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00089"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 536..606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 16..27
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 542..554
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 595..606
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 606 AA; 65942 MW; 4B1C26CF92AC3E33 CRC64;
MMVESASEAI RSSAANQNGV SSLSSQPAAG GRDGGANGEI NGEVSPVDLL HLQQQQVQVS
HSLTLTLAQS LSHPQLVPVS VAMMTPQVIT PQQMQQILSP PQLQALLQQQ QAVMLQQVRS
SEGGSSVSGW GEGSVSGGLR RIRGTVGDRG VRNRQVSLWS WELGVTEVLF GFPAMCPTDL
QQLWKEVTSA QNLEDSLKSD GLDLSSTSPA STFLTCKISP SISHHSLLNG GSAMHTPKRE
SVSHEEATGT HQLYGHLSNE HALDDRSTAQ CRVQMQVVQQ LEIQLSKESE RLQAMMAHLH
MRPSDPKPFA QPLNLVSSVT LSKAEPFGDL LPHTPTSASS PATPIRQGPS VISSASLHSI
GSIRRRHADK YCLPISSELA QNHEFYKNAD VRPPFTYASL IRQAILETSD RQLTLNEIYN
WFTRMFAYFR RNTATWKNAV RHNLSLHKCF VRVENVKGAV WTVDELEYQK RRPPKMTGSP
TLVKNVITGL GYGAALNATY QAALAESSLP LLGSPGLISN TSTSGLMNVG HDDVSSTVEQ
VNSNGSSSPG LSPPQHGHQL HIKEEPAEME EEERLVSLTA PPVASVNPEM PDGRELEEEL
RAEDLE
//