ID W5NC56_LEPOC Unreviewed; 398 AA.
AC W5NC56;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Homeobox protein ARX-like {ECO:0000313|Ensembl:ENSLOCP00000018215.1};
OS Lepisosteus oculatus (Spotted gar).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC Lepisosteus.
OX NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000018215.1, ECO:0000313|Proteomes:UP000018468};
RN [1] {ECO:0000313|Proteomes:UP000018468}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT "The Draft Genome of Lepisosteus oculatus.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLOCP00000018215.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHAT01015656; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_006632904.1; XM_006632841.2.
DR AlphaFoldDB; W5NC56; -.
DR STRING; 7918.ENSLOCP00000018215; -.
DR Ensembl; ENSLOCT00000018247.1; ENSLOCP00000018215.1; ENSLOCG00000014793.1.
DR GeneID; 102684999; -.
DR KEGG; loc:102684999; -.
DR eggNOG; KOG0490; Eukaryota.
DR GeneTree; ENSGT00940000160633; -.
DR HOGENOM; CLU_692539_0_0_1; -.
DR InParanoid; W5NC56; -.
DR OMA; KDTSERH; -.
DR OrthoDB; 3042435at2759; -.
DR Proteomes; UP000018468; Linkage group LG7.
DR Bgee; ENSLOCG00000014793; Expressed in embryo and 2 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003654; OAR_dom.
DR PANTHER; PTHR24329; HOMEOBOX PROTEIN ARISTALESS; 1.
DR PANTHER; PTHR24329:SF340; HOMEOBOX PROTEIN ESX1; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000018468}.
FT DOMAIN 193..253
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 377..390
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 195..254
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 124..174
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..144
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 398 AA; 44447 MW; 34D607092D1A4CE1 CRC64;
MQVSQGTRAV YDGPPGKTDA AKGPLNFPLS SSHFIDSILS KASSAVGKNK GTHGDMVEAG
DLNGPESIKT MECPSDVKMT FLHQDHGREG RKHSGLHSES IIREMETLYS NYIKSHQMCY
SLNPRSHDGK LHSQPPSLQT GELTSPKDVD LKEMPGNLTQ ENLREETEKD TSERHRVMKC
ISPGRYSSSS LKRKQRRYRT TFTNFQLEEL ERAFRKSHYP DVFSREELAM RLDLTEARVQ
VWFQNRRAKW RKREKVGVLG SVPGLTMASP LGMYLDVPLT QTSALDPCWG STGVPPFGIP
QTAPVFSPTS LGNLGISTLT WASLFRHPLL NPHFNRFFTT MSPLVNTMGL IAKSPAQPFD
LSAIALNDPV TRERKTSSIA TLRLKAKEHS AQIPQLDT
//