ID A0A452RC91_URSAM Unreviewed; 310 AA.
AC A0A452RC91;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE SubName: Full=Paired box 4 {ECO:0000313|Ensembl:ENSUAMP00000016316.1};
GN Name=PAX4 {ECO:0000313|Ensembl:ENSUAMP00000016316.1};
OS Ursus americanus (American black bear) (Euarctos americanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ursus.
OX NCBI_TaxID=9643 {ECO:0000313|Ensembl:ENSUAMP00000016316.1, ECO:0000313|Proteomes:UP000291022};
RN [1] {ECO:0000313|Proteomes:UP000291022}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Korstanje R., Srivastava A., Sarsani V.K., Sheehan S.M., Seger R.L.,
RA Barter M.E., Lindqvist C., Brody L.C., Mullikin J.C.;
RT "De novo assembly and RNA-Seq shows season-dependent expression and editing
RT in black bear kidneys.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSUAMP00000016316.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family.
CC {ECO:0000256|ARBA:ARBA00005733}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A452RC91; -.
DR Ensembl; ENSUAMT00000018274.1; ENSUAMP00000016316.1; ENSUAMG00000012992.1.
DR GeneTree; ENSGT00940000161709; -.
DR Proteomes; UP000291022; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR001523; Paired_dom.
DR InterPro; IPR043565; PAX_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR45636:SF8; PAIRED BOX PROTEIN PAX-4; 1.
DR PANTHER; PTHR45636; PAIRED BOX PROTEIN PAX-6-RELATED-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00292; PAX; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00351; PAX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51057; PAIRED_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Paired box {ECO:0000256|ARBA:ARBA00022724};
KW Reference proteome {ECO:0000313|Proteomes:UP000291022};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 1..112
FT /note="Paired"
FT /evidence="ECO:0000259|PROSITE:PS51057"
FT DOMAIN 149..209
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 151..210
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 132..153
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 219..241
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 310 AA; 33658 MW; 75AD01EA0C386C3B CRC64;
MLNSAAWASK NTHPLPIMPS SVPASHLTQV SNGCVSKILA RYYRTGVLEP KGIGGSKPRL
ATPPVVARIA QLKGECPALF AWEIQRQLCA EGLCTQDKTP SVSSINRVLR ALQEDQRLPW
AQLKSPAVLA PVPHTPHSGS EAPRGPHPGT GHRNRTIFSP GQAEALEKEF QRGQYPDSVA
RGKLAAATSL PEDTVRVWFS NRRAKWRRQE KLKWEMQVPG TSQDLTLPSA SPGTTSAQRS
PCSVPTAVLP ALESLGPSCY QLCWGTAPDR CLSDTPPQAT CLLQLSSLDS VLLCHPCPSF
HCLHCQSWCP
//