ID A0A3M0J770_HIRRU Unreviewed; 302 AA.
AC A0A3M0J770;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 22-FEB-2023, entry version 19.
DE RecName: Full=Homeobox protein {ECO:0000256|PIRNR:PIRNR000563};
GN ORFNames=DUI87_26840 {ECO:0000313|EMBL:RMB96774.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMB96774.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMB96774.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMB96774.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMB96774.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PIRNR:PIRNR000563, ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. Bicoid subfamily.
CC {ECO:0000256|ARBA:ARBA00006503, ECO:0000256|PIRNR:PIRNR000563}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMB96774.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000172; RMB96774.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0J770; -.
DR STRING; 333673.A0A3M0J770; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR016233; Homeobox_Pitx/unc30.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR45882:SF1; PITUITARY HOMEOBOX 1; 1.
DR PANTHER; PTHR45882; PITUITARY HOMEOBOX HOMOLOG PTX1; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR PIRSF; PIRSF000563; Homeobox_protein_Pitx/Unc30; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473,
KW ECO:0000256|PIRNR:PIRNR000563};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PIRNR:PIRNR000563};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR000563};
KW Reference proteome {ECO:0000313|Proteomes:UP000269221}.
FT DOMAIN 76..136
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 268..281
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 78..137
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..94
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 282..302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..42
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..76
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 302 AA; 33823 MW; BCA5430C59AFBFF4 CRC64;
MDSFKGGMNL ERLPESLRPQ PSHDMASSFH LQRSSEPRDP IDNSASESSD TEVPEKERSG
EQKNEDGAAD DPAKKKKQRR QRTHFTSQQL QELEATFQRN RYPDMSMREE IAVWTNLTEP
RVRVWFKNRR AKWRKRERNQ QMDLCKNGYV PQFSGLMQPY DDMYAGYPYN NWATKSLTPA
PLSTKSFTFF NSMSPLSSQS MFSAPSSIPS MNMPSSMGHS AVPGMANSGL NNINNISGSS
LNSAMSSPAC PYGPPGSPYS VYRDTCNSSL ASLRLKSKQH SSFGYSSLQS PGSSLNACQY
NS
//