ID G3WVM9_SARHA Unreviewed; 784 AA.
AC G3WVM9;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Prospero homeobox 1 {ECO:0000313|Ensembl:ENSSHAP00000019484.2};
GN Name=PROX1 {ECO:0000313|Ensembl:ENSSHAP00000019484.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000019484.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000019484.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000019484.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WVM9; -.
DR STRING; 9305.ENSSHAP00000019484; -.
DR Ensembl; ENSSHAT00000019641.2; ENSSHAP00000019484.2; ENSSHAG00000030877.1.
DR eggNOG; KOG3779; Eukaryota.
DR GeneTree; ENSGT00940000154790; -.
DR HOGENOM; CLU_016051_0_0_1; -.
DR InParanoid; G3WVM9; -.
DR TreeFam; TF316638; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0050692; F:DNA binding domain binding; IEA:Ensembl.
DR GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; IEA:Ensembl.
DR GO; GO:0050693; F:LBD domain binding; IEA:Ensembl.
DR GO; GO:0016922; F:nuclear receptor binding; IEA:Ensembl.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:Ensembl.
DR GO; GO:0090425; P:acinar cell differentiation; IEA:Ensembl.
DR GO; GO:0060414; P:aorta smooth muscle tissue morphogenesis; IEA:Ensembl.
DR GO; GO:0055009; P:atrial cardiac muscle tissue morphogenesis; IEA:Ensembl.
DR GO; GO:0060837; P:blood vessel endothelial cell differentiation; IEA:Ensembl.
DR GO; GO:0061114; P:branching involved in pancreas morphogenesis; IEA:Ensembl.
DR GO; GO:0021707; P:cerebellar granule cell differentiation; IEA:Ensembl.
DR GO; GO:0007623; P:circadian rhythm; IEA:Ensembl.
DR GO; GO:0021542; P:dentate gyrus development; IEA:Ensembl.
DR GO; GO:0060059; P:embryonic retina morphogenesis in camera-type eye; IEA:Ensembl.
DR GO; GO:0060214; P:endocardium formation; IEA:Ensembl.
DR GO; GO:0010631; P:epithelial cell migration; IEA:Ensembl.
DR GO; GO:0002194; P:hepatocyte cell migration; IEA:Ensembl.
DR GO; GO:0070365; P:hepatocyte differentiation; IEA:Ensembl.
DR GO; GO:0072574; P:hepatocyte proliferation; IEA:Ensembl.
DR GO; GO:0048839; P:inner ear development; IEA:Ensembl.
DR GO; GO:0001822; P:kidney development; IEA:Ensembl.
DR GO; GO:0070309; P:lens fiber cell morphogenesis; IEA:Ensembl.
DR GO; GO:0046619; P:lens placode formation involved in camera-type eye formation; IEA:Ensembl.
DR GO; GO:0030324; P:lung development; IEA:Ensembl.
DR GO; GO:0001946; P:lymphangiogenesis; IEA:Ensembl.
DR GO; GO:0060838; P:lymphatic endothelial cell fate commitment; IEA:Ensembl.
DR GO; GO:0070858; P:negative regulation of bile acid biosynthetic process; IEA:Ensembl.
DR GO; GO:0007406; P:negative regulation of neuroblast proliferation; IEA:Ensembl.
DR GO; GO:0045071; P:negative regulation of viral genome replication; IEA:Ensembl.
DR GO; GO:0007405; P:neuroblast proliferation; IEA:Ensembl.
DR GO; GO:0048664; P:neuron fate determination; IEA:Ensembl.
DR GO; GO:0097150; P:neuronal stem cell population maintenance; IEA:Ensembl.
DR GO; GO:0045787; P:positive regulation of cell cycle; IEA:Ensembl.
DR GO; GO:1901978; P:positive regulation of cell cycle checkpoint; IEA:Ensembl.
DR GO; GO:0010595; P:positive regulation of endothelial cell migration; IEA:Ensembl.
DR GO; GO:0001938; P:positive regulation of endothelial cell proliferation; IEA:Ensembl.
DR GO; GO:2000979; P:positive regulation of forebrain neuron differentiation; IEA:Ensembl.
DR GO; GO:0060421; P:positive regulation of heart growth; IEA:Ensembl.
DR GO; GO:2000179; P:positive regulation of neural precursor cell proliferation; IEA:Ensembl.
DR GO; GO:0060298; P:positive regulation of sarcomere organization; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0042752; P:regulation of circadian rhythm; IEA:Ensembl.
DR GO; GO:0030240; P:skeletal muscle thin filament assembly; IEA:Ensembl.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0048845; P:venous blood vessel morphogenesis; IEA:Ensembl.
DR GO; GO:0055010; P:ventricular cardiac muscle tissue morphogenesis; IEA:Ensembl.
DR GO; GO:0055005; P:ventricular cardiac myofibril assembly; IEA:Ensembl.
DR GO; GO:0060412; P:ventricular septum morphogenesis; IEA:Ensembl.
DR Gene3D; 1.10.10.500; Homeo-prospero domain; 2.
DR InterPro; IPR023082; Homeo_prospero_dom.
DR InterPro; IPR037131; Homeo_prospero_dom_sf.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR039350; Prospero_homeodomain.
DR PANTHER; PTHR12198; HOMEOBOX PROTEIN PROSPERO/PROX-1/CEH-26; 1.
DR PANTHER; PTHR12198:SF6; PROSPERO HOMEOBOX PROTEIN 1; 1.
DR Pfam; PF05044; HPD; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51818; HOMEO_PROSPERO; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 576..782
FT /note="Prospero"
FT /evidence="ECO:0000259|PROSITE:PS51818"
FT REGION 103..149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 178..242
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 319..342
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 444..475
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 103..138
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 227..242
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 784 AA; 88729 MW; 800E3B8D90E2D931 CRC64;
MPDHDSTALL SRQTKRRRVD IGVKRTVGTA SAFFAKARAT FFSAMNPQGS EQDVEYSVVQ
HADGEKSNVL RKLLKRANSY EDAMMPFPGA TIISQLLKNN MNKNGGTEPS FQASGLSSTG
SEVHQEDICS NSSRDSPPEC LSPFGRPTMS QFDMDRLCDE HLRAKRARVE NIIRGMSHSP
SVALRGNENE REMAPQSVSP RESYRENKRK QKLPQQQQQS FQQLVSARKE QKREERRQLK
QQLEDMQKQL RQLQEKFYQI YDSTDSENDE DGNLSEDSMH SEILDVRAQD SVGRSDNEMC
ELDPGQFIDR ARALIREQEI AENKPKREGN KERDQGPNSL HSEGKHLAET LKQELNTAMS
QVVDTVVKVF SSKPSRQLPQ VFPPLQIPQA RFAVNGENHN FHTANQRLQC FGDVIIPNPL
DTFGTMQMPS STDQTEALPL VVRKNSSDQS ASGPPAGGHH QSLHQSPLST TTGFTTSSFR
HPFSLPLMAY PFQNPLGAPS ASFPGKDRAS PESLDLTRET ASLRTKMSSH HMNHHPCSPA
HPPSTAEGLS LSLIKSECGD LQDMSDISPF SGSAMQEGLS PNHLKKAKLM FFYTRYPSSN
MLKTYFSDVK HLSSRRIPKR FTNCIHRDHS RTTEMQPPAG GWKAAAILSQ QHYTTVDFNR
CITSQLIKWF SNFREFYYIQ MEKYARQAIN DGVTSTEELS ITRDCELYRA LNMHYNKAND
FEQVPERFLE VAQITLREFF NAIIAGKDVD PSWKKAIYKV ICKLDSEVPE IFKSPNCLQE
LLHE
//