ID A0A218VCG0_9PASE Unreviewed; 564 AA.
AC A0A218VCG0;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE SubName: Full=Homeobox protein ARX {ECO:0000313|EMBL:OWK63745.1};
GN Name=ARX {ECO:0000313|EMBL:OWK63745.1};
GN ORFNames=RLOC_00003709 {ECO:0000313|EMBL:OWK63745.1};
OS Lonchura striata domestica (Bengalese finch).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Passeroidea; Estrildidae;
OC Estrildinae; Lonchura.
OX NCBI_TaxID=299123 {ECO:0000313|EMBL:OWK63745.1, ECO:0000313|Proteomes:UP000197619};
RN [1] {ECO:0000313|EMBL:OWK63745.1, ECO:0000313|Proteomes:UP000197619}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=White83orange57 {ECO:0000313|EMBL:OWK63745.1};
RA Colquitt B.M., Brainard M.S.;
RT "Genome of assembly of the Bengalese finch, Lonchura striata domestica.";
RL Submitted (MAY-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OWK63745.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MUZQ01000009; OWK63745.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A218VCG0; -.
DR STRING; 299123.ENSLSDP00000008785; -.
DR Proteomes; UP000197619; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR24329; HOMEOBOX PROTEIN ARISTALESS; 1.
DR PANTHER; PTHR24329:SF337; HOMEOBOX PROTEIN ARX; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000197619}.
FT DOMAIN 273..333
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 532..545
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 275..334
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 48..272
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..144
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 178..211
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..255
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 564 AA; 59426 MW; 51F42F7C31D69359 CRC64;
MSGPYPEESC AERPECKSKS PTLLSSYCID SILGRRSPCK VRLLGTAPTL PAIAARPDTD
KAAQGSPKAS PPFEPAELHL PPKLRRLYGP GGGRLLPPRA AGPRGDRTEG RPEGGAGPVP
AAAPWDTLKI SQAPQVSISR SKSYRENAPF VAPPPARDEL GSPAAHVEER PGPAAPAAEE
EEEEEDEEML EEEEDEEEEE LEDDEEEELL GEDGGGLLKD ARRGATAAPG AEGGDLSPKE
ELLLHPEDGE GKDGEESVCL SAGSDSEEGL LKRKQRRYRT TFTSYQLEEL ERAFQKTHYP
DVFTREELAM RLDLTEARVQ VWFQNRRAKW RKREKAGAQT HPPGLPFPGP LSATHPLSPY
LDASPFPPHH PALDSAWTAA AAAAAAFPSL PPPPPGSAAL PPGGTPLGLG TFLGAAVFRH
PAFISPAFGR YRPRRARGSA GQCGAVRGTS LCPHASALGG KEATGAVRAG GPRRGGDGRL
CFPCRLFSTM SPLGSASSAA ALLRQPAPAA EGAVGSAGLG DPASAAADRR ASSIAALRLK
AKEHAAQLTQ LNILPGNSTG KEVC
//