ID A0A452IVQ8_9SAUR Unreviewed; 455 AA.
AC A0A452IVQ8;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSGAGP00000031981.1};
OS Gopherus agassizii (Agassiz's desert tortoise).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Durocryptodira;
OC Testudinoidea; Testudinidae; Gopherus.
OX NCBI_TaxID=38772 {ECO:0000313|Ensembl:ENSGAGP00000031981.1, ECO:0000313|Proteomes:UP000291020};
RN [1] {ECO:0000313|Proteomes:UP000291020}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=28562605;
RA Tollis M., DeNardo D.F., Cornelius J.A., Dolby G.A., Edwards T.,
RA Henen B.T., Karl A.E., Murphy R.W., Kusumi K.;
RT "The Agassiz's desert tortoise genome provides a resource for the
RT conservation of a threatened species.";
RL PLoS ONE 12:e0177708-e0177708(2017).
RN [2] {ECO:0000313|Ensembl:ENSGAGP00000031981.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A452IVQ8; -.
DR STRING; 38772.ENSGAGP00000031981; -.
DR Ensembl; ENSGAGT00000036261.1; ENSGAGP00000031981.1; ENSGAGG00000022936.1.
DR Proteomes; UP000291020; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003691; F:double-stranded telomeric DNA binding; IEA:InterPro.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd00093; HTH_XRE; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR InterPro; IPR001387; Cro/C1-type_HTH.
DR InterPro; IPR040363; HMBOX1.
DR InterPro; IPR006899; HNF-1_N.
DR InterPro; IPR044869; HNF-1_POU.
DR InterPro; IPR044866; HNF_P1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR PANTHER; PTHR14618:SF0; HOMEOBOX-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR14618; HOMEODOX-CONTAINING PROTEIN 1 HMBOX1; 1.
DR Pfam; PF04814; HNF-1_N; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR PROSITE; PS51937; HNF_P1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51936; POU_4; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000291020};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 18..49
FT /note="HNF-p1"
FT /evidence="ECO:0000259|PROSITE:PS51937"
FT DOMAIN 146..242
FT /note="POU-specific atypical"
FT /evidence="ECO:0000259|PROSITE:PS51936"
FT DOMAIN 266..341
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 268..342
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 56..121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 68..121
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 379..418
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 455 AA; 51425 MW; 78F68F0665355EDF CRC64;
MLRTFPVVLL ETMSHYTDEP RFTIEQIDLL QRLRRTGMTR HEILHALETL DRLDQEHSDK
FGRRSSYGGG SYGNSTNNVP ASSSTATAST QTQHSGMSPS PSNSYDTSPQ PCTTNQNGRE
SNERLSAFNG KMSPTRYPLA NSLAQRSYSF EASEEDLDVD DKVEELMRRD SSVIKEEIKA
FLANRRISQA VVAQVTGISQ SRISHWLLQQ GSDLSEQKKR AFYRWYQLEK TNPGATLSMR
PAPIPVEEPE WRQTPPPVTA TSGTFRLRRG SRFTWRKECL AVMESYFNEN QYPDEAKREE
IANACNAVIQ KPGKKLSDLE RVTSLKVYNW FANRRKEIKR RANIEAAILE SHGIDVQSPG
GHSNSDDVDG NDYSEQDTWQ VRNGEEEGRC SEGGREAEKV EEDRRICSKQ DDSTSHSDHQ
DPISLAVEMA AVNHTILALA RQGTNEIKTE ALDDD
//