GenomeNet

Database: UniProt
Entry: W5PD79_SHEEP
LinkDB: W5PD79_SHEEP
Original site: W5PD79_SHEEP 
ID   W5PD79_SHEEP            Unreviewed;       634 AA.
AC   W5PD79;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   24-JAN-2024, entry version 61.
DE   RecName: Full=Basic helix-loop-helix ARNT-like protein 1 {ECO:0000256|ARBA:ARBA00041169};
DE   AltName: Full=Aryl hydrocarbon receptor nuclear translocator-like protein 1 {ECO:0000256|ARBA:ARBA00042144};
DE   AltName: Full=Brain and muscle ARNT-like 1 {ECO:0000256|ARBA:ARBA00041751};
GN   Name=ARNTL {ECO:0000313|Ensembl:ENSOARP00000008391.1};
OS   Ovis aries (Sheep).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Ovis.
OX   NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000008391.1, ECO:0000313|Proteomes:UP000002356};
RN   [1] {ECO:0000313|Ensembl:ENSOARP00000008391.1, ECO:0000313|Proteomes:UP000002356}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000008391.1,
RC   ECO:0000313|Proteomes:UP000002356};
RX   PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA   Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA   Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA   Wang W., Xun X.;
RT   "The sheep genome reference sequence: a work in progress.";
RL   Anim. Genet. 41:449-453(2010).
RN   [2] {ECO:0000313|Ensembl:ENSOARP00000008391.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JUL-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC       Nucleus, PML body {ECO:0000256|ARBA:ARBA00004322}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMGL01031899; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; W5PD79; -.
DR   SMR; W5PD79; -.
DR   STRING; 9940.ENSOARP00000008391; -.
DR   PaxDb; 9940-ENSOARP00000008391; -.
DR   Ensembl; ENSOART00000008513.1; ENSOARP00000008391.1; ENSOARG00000007809.1.
DR   eggNOG; KOG3561; Eukaryota.
DR   OMA; YHHEDIP; -.
DR   Proteomes; UP000002356; Chromosome 15.
DR   Bgee; ENSOARG00000007809; Expressed in major salivary gland and 52 other cell types or tissues.
DR   ExpressionAtlas; W5PD79; baseline.
DR   GO; GO:0005737; C:cytoplasm; IEA:InterPro.
DR   GO; GO:0005634; C:nucleus; IEA:InterPro.
DR   GO; GO:0005667; C:transcription regulator complex; IEA:InterPro.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR   CDD; cd11438; bHLH-PAS_ARNTL_PASD3; 1.
DR   CDD; cd00130; PAS; 2.
DR   Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR   Gene3D; 3.30.450.20; PAS domain; 2.
DR   InterPro; IPR011598; bHLH_dom.
DR   InterPro; IPR036638; HLH_DNA-bd_sf.
DR   InterPro; IPR001067; Nuc_translocat.
DR   InterPro; IPR001610; PAC.
DR   InterPro; IPR000014; PAS.
DR   InterPro; IPR035965; PAS-like_dom_sf.
DR   InterPro; IPR013767; PAS_fold.
DR   NCBIfam; TIGR00229; sensory_box; 1.
DR   PANTHER; PTHR23042:SF52; ARYL HYDROCARBON RECEPTOR NUCLEAR TRANSLOCATOR-LIKE PROTEIN 1; 1.
DR   PANTHER; PTHR23042; CIRCADIAN PROTEIN CLOCK/ARNT/BMAL/PAS; 1.
DR   Pfam; PF00010; HLH; 1.
DR   Pfam; PF00989; PAS; 1.
DR   Pfam; PF14598; PAS_11; 1.
DR   PRINTS; PR00785; NCTRNSLOCATR.
DR   SMART; SM00353; HLH; 1.
DR   SMART; SM00086; PAC; 1.
DR   SMART; SM00091; PAS; 2.
DR   SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR   SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR   PROSITE; PS50888; BHLH; 1.
DR   PROSITE; PS50112; PAS; 2.
PE   4: Predicted;
KW   Acetylation {ECO:0000256|ARBA:ARBA00022990};
KW   Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          79..132
FT                   /note="BHLH"
FT                   /evidence="ECO:0000259|PROSITE:PS50888"
FT   DOMAIN          151..216
FT                   /note="PAS"
FT                   /evidence="ECO:0000259|PROSITE:PS50112"
FT   DOMAIN          355..404
FT                   /note="PAS"
FT                   /evidence="ECO:0000259|PROSITE:PS50112"
FT   REGION          1..20
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          26..46
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          472..500
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          519..603
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        520..540
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        560..582
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   634 AA;  69618 MW;  8E69720D35344E7C CRC64;
     MADQRMDISS TISDFMSPGA TDLLSSPLGT GGVDCNRKRK GSSTDYQLNG FSLEESMDTD
     KDDPHGRLEY TEHQGRIKNA REAHSQIEKR RRDKMNSFID ELASLVPTCN AMSRKLDKLT
     VLRMAVQHMK TLREGATNPY TEANYKPTFL SDDELKHLIL RAADGFLFVV GCDRGKILFV
     SESVFKILNY SQNDLIGQSL FDYLHPKDIA KVKEQLSSSD TAPRERLIDA KTGLPVKTDI
     TPGPSRLCSG ARRSFFCRMK CNRPSVKVED KDFPSTCSKK KADRKSFCTI HSTGYLKSWP
     PTKMGLDEDN EPDNEGCNLS CLVAIGRLHS HMVPQPANGE IRVKSMEYVS RHAIDGKFVF
     VDQRATAILA YLPQELLGTS CYEYFHQDDI GHLAECHRQV LQTREKITTN CYKFKIKDGS
     FITLRSRWFS FMNPWTKEVE YIVSTNTVVL ANVLEGGDPS FPQLTASPHS MDSMLPSGEG
     GPKRIHPTVP GIPGGTRAGA GKIGRMIAEE IMEIHRIRGS SPSSCGSSPL NITSTPPPDA
     SSPGGKKILN GGTPDIPSSG LLPGQAQENP GYPYSDSSSI LGENPHIGID MIDNDQGSSS
     PSNDEAAMAV IMSLLEADAG LGGPVDFSDL PWPL
//
DBGET integrated database retrieval system