ID A0A087RA23_APTFO Unreviewed; 395 AA.
AC A0A087RA23;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 22-FEB-2023, entry version 31.
DE SubName: Full=Homeobox-containing protein 1 {ECO:0000313|EMBL:KFM10327.1};
DE Flags: Fragment;
GN ORFNames=AS27_11777 {ECO:0000313|EMBL:KFM10327.1};
OS Aptenodytes forsteri (Emperor penguin).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Sphenisciformes; Spheniscidae;
OC Aptenodytes.
OX NCBI_TaxID=9233 {ECO:0000313|EMBL:KFM10327.1, ECO:0000313|Proteomes:UP000053286};
RN [1] {ECO:0000313|EMBL:KFM10327.1, ECO:0000313|Proteomes:UP000053286}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_AS27 {ECO:0000313|EMBL:KFM10327.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL226234; KFM10327.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A087RA23; -.
DR STRING; 9233.A0A087RA23; -.
DR Proteomes; UP000053286; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003691; F:double-stranded telomeric DNA binding; IEA:InterPro.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd00093; HTH_XRE; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR InterPro; IPR001387; Cro/C1-type_HTH.
DR InterPro; IPR040363; HMBOX1.
DR InterPro; IPR006899; HNF-1_N.
DR InterPro; IPR044869; HNF-1_POU.
DR InterPro; IPR044866; HNF_P1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR PANTHER; PTHR14618:SF0; HOMEOBOX-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR14618; HOMEODOX-CONTAINING PROTEIN 1 HMBOX1; 1.
DR Pfam; PF04814; HNF-1_N; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR PROSITE; PS51937; HNF_P1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51936; POU_4; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000053286};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 14..45
FT /note="HNF-p1"
FT /evidence="ECO:0000259|PROSITE:PS51937"
FT DOMAIN 142..238
FT /note="POU-specific atypical"
FT /evidence="ECO:0000259|PROSITE:PS51936"
FT DOMAIN 262..337
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 264..338
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 52..117
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 350..395
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..117
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..395
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFM10327.1"
FT NON_TER 395
FT /evidence="ECO:0000313|EMBL:KFM10327.1"
SQ SEQUENCE 395 AA; 44846 MW; 2DFA2774B2E89E69 CRC64;
FVFSLLETMS HYTDEPRFTI EQIDLLQRLR RTGMTRHEIL HALETLDRLD QEHSDKFGRR
SSYGGGSYSN STNNVPASSS TATASTQTQH SGMSPSPSNS YDTSPQPCTT NQNGRESNER
LSAFNGKMSP TRYPLANSLA QRSYSFEASE EDLDIDDKVE ELMRRDSSVI KEEIKAFLAN
RRISQAVVAQ VTGISQSRIS HWLLQQGSDL SEQKKRAFYR WYQLEKTNPG ATLSMRPAPI
PIEEPEWRQT PPPVTATSGT FRLRRGSRFT WRKECLAVME SYFNENQYPD EAKREEIANA
CNAVIQKPGK KLSDLERVTS LKVYNWFANR RKEIKRRANI EAAILESHGI DVQSPGGHSN
SDDVDGNDYS EQDTWQVRNG EEEGRCSEGG REAEK
//