ID W5PDU3_SHEEP Unreviewed; 400 AA.
AC W5PDU3;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE RecName: Full=Thyroid nuclear factor 1 {ECO:0000256|ARBA:ARBA00044338};
GN Name=NKX2-1 {ECO:0000313|Ensembl:ENSOARP00000008607.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000008607.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000008607.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000008607.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000008607.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the NK-2 homeobox family.
CC {ECO:0000256|ARBA:ARBA00005661}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01042474; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5PDU3; -.
DR SMR; W5PDU3; -.
DR STRING; 9940.ENSOARP00000008607; -.
DR PaxDb; 9940-ENSOARP00000008607; -.
DR Ensembl; ENSOART00000008732.1; ENSOARP00000008607.1; ENSOARG00000008021.1.
DR eggNOG; KOG0842; Eukaryota.
DR HOGENOM; CLU_052416_0_0_1; -.
DR OMA; PPYQETM; -.
DR Proteomes; UP000002356; Chromosome 18.
DR Bgee; ENSOARG00000008021; Expressed in thyroid gland and 2 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR020479; Homeobox_metazoa.
DR PANTHER; PTHR24340; HOMEOBOX PROTEIN NKX; 1.
DR PANTHER; PTHR24340:SF33; HOMEOBOX PROTEIN NKX-2.1; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00024; HOMEOBOX.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 189..249
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 191..250
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 249..323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 337..368
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 253..285
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 400 AA; 41734 MW; 90AC7551FA58BE39 CRC64;
MWSGGSGKAR GWEAAAGGRS GPGRLSRRRI MSMSPKHTTP FSVSDILSPL EESYKKVGME
GGGLGPLAAY RQGQAAPPAA MQQHAVGHHG AVTAAYHMTA AGVPQLSHSA VGGYCNGNLG
NMSELPPYQD TMRNSASGPG WYGANPDPRF PAISRFMGPS AGVNVAAMGS LTGIADTAKS
LAPLHAAAAP RRKRRVLFSQ AQVYELERRF KQQRYLSAPE REHLASMIHL TPTQVKIWFQ
NHRYKMKRQA KDKAAQQQLQ QDSGGGGGGA GCQQQQQQAQ QQSPRRVAVP VLVKDGKPCQ
AGAPTPGAAS LQGHAQQQAQ QQAQAAQAAA AAISVGSGGP GLGAHPGHQP GSAGQSPDLA
HHAASPAALQ GQVSSLPHLN SSGSDYGTMS CSTLLYGRTW
//