ID V8NZ76_OPHHA Unreviewed; 517 AA.
AC V8NZ76;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Prospero homeobox protein 2 {ECO:0000313|EMBL:ETE67365.1};
DE Flags: Fragment;
GN Name=PROX2 {ECO:0000313|EMBL:ETE67365.1};
GN ORFNames=L345_06848 {ECO:0000313|EMBL:ETE67365.1};
OS Ophiophagus hannah (King cobra) (Naja hannah).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Lepidosauria; Squamata; Bifurcata; Unidentata; Episquamata; Toxicofera;
OC Serpentes; Colubroidea; Elapidae; Elapinae; Ophiophagus.
OX NCBI_TaxID=8665 {ECO:0000313|EMBL:ETE67365.1, ECO:0000313|Proteomes:UP000018936};
RN [1] {ECO:0000313|EMBL:ETE67365.1, ECO:0000313|Proteomes:UP000018936}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Blood {ECO:0000313|EMBL:ETE67365.1};
RX PubMed=24297900; DOI=10.1073/pnas.1314702110;
RA Vonk F.J., Casewell N.R., Henkel C.V., Heimberg A.M., Jansen H.J.,
RA McCleary R.J., Kerkkamp H.M., Vos R.A., Guerreiro I., Calvete J.J.,
RA Wuster W., Woods A.E., Logan J.M., Harrison R.A., Castoe T.A.,
RA de Koning A.P., Pollock D.D., Yandell M., Calderon D., Renjifo C.,
RA Currier R.B., Salgado D., Pla D., Sanz L., Hyder A.S., Ribeiro J.M.,
RA Arntzen J.W., van den Thillart G.E., Boetzer M., Pirovano W., Dirks R.P.,
RA Spaink H.P., Duboule D., McGlinn E., Kini R.M., Richardson M.K.;
RT "The king cobra genome reveals dynamic gene evolution and adaptation in the
RT snake venom system.";
RL Proc. Natl. Acad. Sci. U.S.A. 110:20651-20656(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETE67365.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZIM01001316; ETE67365.1; -; Genomic_DNA.
DR AlphaFoldDB; V8NZ76; -.
DR Proteomes; UP000018936; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0048468; P:cell development; IEA:UniProt.
DR GO; GO:0007399; P:nervous system development; IEA:UniProt.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 1.10.10.500; Homeo-prospero domain; 2.
DR InterPro; IPR023082; Homeo_prospero_dom.
DR InterPro; IPR037131; Homeo_prospero_dom_sf.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR039350; Prospero_homeodomain.
DR PANTHER; PTHR12198; HOMEOBOX PROTEIN PROSPERO/PROX-1/CEH-26; 1.
DR PANTHER; PTHR12198:SF5; PROSPERO HOMEOBOX PROTEIN 2; 1.
DR Pfam; PF05044; HPD; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51818; HOMEO_PROSPERO; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000313|EMBL:ETE67365.1};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000313|EMBL:ETE67365.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000018936};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 436..517
FT /note="Prospero"
FT /evidence="ECO:0000259|PROSITE:PS51818"
FT REGION 116..164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 128..143
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..164
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ETE67365.1"
FT NON_TER 517
FT /evidence="ECO:0000313|EMBL:ETE67365.1"
SQ SEQUENCE 517 AA; 58222 MW; D1D89A513239BECB CRC64;
MNVNSIANQY NYIPSSVCVD GQNGEFCTEG NYSFRPPYYE SIISHLLNQS ELNSELDISF
LLPCSQKTEH PSSKEAIAMS PLPACGFGNT NQFHNEQLQA KRARVENIIQ GMSIMPNPIV
PGTLGEQDHN FEKGKESSRQ NKRKQKLPQQ QSLPGISLTR PPRSSISAEE FLQVKKQLHA
LQHQLKQLGE RFLQHDEEND SDPSQESIER TMGLLKMYGA KIDTNSQVLC CDQHKDYLWK
STPRVRDLVL SEKEDTGIQN LEEKTLSEIL KQELTQVMTH AVDSVLKKIL PKSSSLPSHL
PNNSIGTALG RNIPSKWLPK ISSPKGPISL ITEKSPGFPI HSIQTKTERK PCQTPENYPL
ILTSEVQENG ILSQMLFCGQ KCHWESPPPR MVSPESLEVP WQPIKPSGMK QQYLPRLESL
ASLPSTNAMF AEMHAMEALT PGHLKKAKLM FFFSRYPTSS LLKAYFLDIQ VPDGFLDVAS
LTLQKFFSAV RAGRDLDPSW KKPIYKIISK LDSEIPD
//