ID A0A091JMJ2_EGRGA Unreviewed; 631 AA.
AC A0A091JMJ2;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE SubName: Full=Forkhead box protein P1 {ECO:0000313|EMBL:KFP21832.1};
DE Flags: Fragment;
GN ORFNames=Z169_04901 {ECO:0000313|EMBL:KFP21832.1};
OS Egretta garzetta (Little egret).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Pelecaniformes; Ardeidae; Egretta.
OX NCBI_TaxID=188379 {ECO:0000313|EMBL:KFP21832.1, ECO:0000313|Proteomes:UP000053119};
RN [1] {ECO:0000313|EMBL:KFP21832.1, ECO:0000313|Proteomes:UP000053119}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_Z169 {ECO:0000313|EMBL:KFP21832.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00089}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK502270; KFP21832.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091JMJ2; -.
DR STRING; 188379.A0A091JMJ2; -.
DR Proteomes; UP000053119; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd20065; FH_FOXP2; 1.
DR Gene3D; 1.20.5.340; -; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR047412; FH_FOXP1_P2.
DR InterPro; IPR001766; Fork_head_dom.
DR InterPro; IPR032354; FOXP-CC.
DR InterPro; IPR030456; TF_fork_head_CS_2.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR45796; FORKHEAD BOX P, ISOFORM C; 1.
DR PANTHER; PTHR45796:SF3; FORKHEAD BOX PROTEIN P1; 1.
DR Pfam; PF00250; Forkhead; 1.
DR Pfam; PF16159; FOXP-CC; 1.
DR PRINTS; PR00053; FORKHEAD.
DR SMART; SM00339; FH; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS00658; FORK_HEAD_2; 1.
DR PROSITE; PS50039; FORK_HEAD_3; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00089}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00089};
KW Reference proteome {ECO:0000313|Proteomes:UP000053119};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 419..492
FT /note="Fork-head"
FT /evidence="ECO:0000259|PROSITE:PS50039"
FT DNA_BIND 419..492
FT /note="Fork-head"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00089"
FT REGION 224..251
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 348..385
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 564..631
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 224..240
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..385
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..579
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFP21832.1"
FT NON_TER 631
FT /evidence="ECO:0000313|EMBL:KFP21832.1"
SQ SEQUENCE 631 AA; 70932 MW; 8C4F35137BB3ADB9 CRC64;
LSSQALQVAR QLLLQQQQQQ QQQQQQPEKI SGLKSPKRND KQPALQVPVS VAMMTPQVIT
PQQMQQILQQ QVLTPQQLQV LLQQQQALML QQQQLQEFYK KQQEQLQLQL LQQQHAGKQP
KEPQQQQVAT QQLAFQQQLL QMQQLQQQHL LSLQRQGLLT IQPGQPTLPL QPLAQGMIPT
ELQQLWKEVT SSHTAEEAAS NNHSSLDLST TCVSSSAPSK TSLIINPHAS TNGQLSVHTP
KRESLSHEEH SHSHPLYGHG VCKWPGCEAV CEDFQSFLKH LNSEHALDDR STAQCRVQMQ
VVQQLELQLA KDKERLQAMM THLHVKSTEP KATPQPLNLV SSVTLSKTAS EASPQSLPHT
PTTPTAPITP VTQGPSVITT TSMHNVGPIR RRYSDKYNVP ISSADIAQNQ EFYKNAEVRP
PFTYASLIRQ AILESPEKQL TLNEIYNWFT RMFAYFRRNA ATWKNAVRHN LSLHKCFVRV
ENVKGAVWTV DELEFQKRRP QKISGNPSLI KNIQTSHTYC TPLNAALQAS MAENSIPLYT
TASMGNPTLG NLANAMREEL NGAMEHTNSN GSDSSPGRSP MQAMHPVHVK EEPLDPDENE
GPLSLVTTAN HSPDFDHDRD YEDEPVNEDI E
//