ID A0A091HMA2_CALAN Unreviewed; 396 AA.
AC A0A091HMA2;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE SubName: Full=Homeobox protein Hox-B3 {ECO:0000313|EMBL:KFO96257.1};
GN ORFNames=N300_12414 {ECO:0000313|EMBL:KFO96257.1};
OS Calypte anna (Anna's hummingbird) (Archilochus anna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Caprimulgimorphae; Apodiformes;
OC Trochilidae; Calypte.
OX NCBI_TaxID=9244 {ECO:0000313|EMBL:KFO96257.1, ECO:0000313|Proteomes:UP000054308};
RN [1] {ECO:0000313|EMBL:KFO96257.1, ECO:0000313|Proteomes:UP000054308}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N300 {ECO:0000313|EMBL:KFO96257.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Sequence-specific transcription factor which is part of a
CC developmental regulatory system that provides cells with specific
CC positional identities on the anterior-posterior axis.
CC {ECO:0000256|ARBA:ARBA00003263}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the Antp homeobox family.
CC {ECO:0000256|ARBA:ARBA00009107}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL217525; KFO96257.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091HMA2; -.
DR STRING; 9244.A0A091HMA2; -.
DR Proteomes; UP000054308; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR025281; DUF4074.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001827; Homeobox_Antennapedia_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR45664:SF11; HOMEOBOX PROTEIN HOX-B3; 1.
DR PANTHER; PTHR45664; PROTEIN ZERKNUELLT 1-RELATED; 1.
DR Pfam; PF13293; DUF4074; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00032; ANTENNAPEDIA; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Reference proteome {ECO:0000313|Proteomes:UP000054308};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 159..215
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 161..216
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 18..114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 129..165
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 212..242
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..396
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..63
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 75..90
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..114
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 367..389
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 396 AA; 43049 MW; 3EFE494F927BCB3D CRC64;
MQKTTYYDSS TLFGGYSYGS ANGFGYEGPQ QPFQPSSHVE NDFQRSACSL QSLGNTTPHA
KSKDLNGSCM RPSLPQEHHP PPPISPPPNP ATNSTSSNSN NQSGSGKTVP SKANLSANAS
LTKQIFPWMK ESRQNSKQKT SSPSTAETCS GEKSPPGSSA SKRARTAYTS AQLVELEKEF
HFNRYLCRPR RVAMANIERQ IKIWFQNRRM KYKKDQKSKG MGSSSGGPSP TGSPPQPMQS
SAGFMNALHT MSSNYDAPSP PSFNKPHQNA YAMSTNYQNP IKGCPSQQKY ANTAPEYDPH
VLQGNGVAYG TPNMQGSPVY VGGNYVDSMP TSGPSLYGLN HLPHHQAANM DFNGPPQMPP
SQHHGPCETH PTYTDLSTHH ASSQGRIQEA PKLTHL
//