ID H2PHY2_PONAB Unreviewed; 2718 AA.
AC H2PHY2; A0A2J8WN00;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 25-MAY-2022, sequence version 2.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=HIVEP zinc finger 1 {ECO:0000313|Ensembl:ENSPPYP00000018158.3};
DE SubName: Full=HIVEP1 isoform 4 {ECO:0000313|EMBL:PNJ71154.1};
GN Name=HIVEP1 {ECO:0000313|Ensembl:ENSPPYP00000018158.3};
GN ORFNames=CR201_G0009388 {ECO:0000313|EMBL:PNJ71154.1};
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000018158.3, ECO:0000313|Proteomes:UP000001595};
RN [1] {ECO:0000313|Ensembl:ENSPPYP00000018158.3, ECO:0000313|Proteomes:UP000001595}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wilson R.K., Mardis E.;
RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:PNJ71154.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Susie {ECO:0000313|EMBL:PNJ71154.1};
RA Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M.,
RA Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J.,
RA Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M.,
RA Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B.,
RA Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.;
RT "High-resolution comparative analysis of great ape genomes.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPPYP00000018158.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NDHI03003384; PNJ71154.1; -; Genomic_DNA.
DR STRING; 9601.ENSPPYP00000018158; -.
DR Ensembl; ENSPPYT00000018882.3; ENSPPYP00000018158.3; ENSPPYG00000016236.3.
DR eggNOG; KOG1721; Eukaryota.
DR GeneTree; ENSGT00940000158242; -.
DR HOGENOM; CLU_000719_2_1_1; -.
DR OMA; EGKQDCH; -.
DR OrthoDB; 3353821at2759; -.
DR TreeFam; TF331837; -.
DR Proteomes; UP000001595; Chromosome 6.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0005739; C:mitochondrion; IEA:Ensembl.
DR GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR GO; GO:0001227; F:DNA-binding transcription repressor activity, RNA polymerase II-specific; IEA:Ensembl.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:Ensembl.
DR GO; GO:0030509; P:BMP signaling pathway; IEA:Ensembl.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 4.
DR InterPro; IPR034729; ZF_CCHC_HIVEP.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR45944; SCHNURRI, ISOFORM F; 1.
DR PANTHER; PTHR45944:SF3; ZINC FINGER PROTEIN 40; 1.
DR Pfam; PF00096; zf-C2H2; 3.
DR SMART; SM00355; ZnF_C2H2; 5.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 3.
DR PROSITE; PS51811; ZF_CCHC_HIVEP; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 3.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 4.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001595};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 406..433
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 434..457
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 956..986
FT /note="CCHC HIVEP-type"
FT /evidence="ECO:0000259|PROSITE:PS51811"
FT DOMAIN 2088..2115
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 2116..2141
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 141..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 211..254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 335..373
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 484..512
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 573..613
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 645..729
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1021..1060
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1138..1169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1206..1282
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1384..1414
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1525..1548
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1871..1913
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2155..2228
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2266..2302
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2330..2381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2566..2718
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 141..159
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 160..182
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 232..254
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 590..613
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 645..723
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1026..1042
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1043..1060
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1239..1262
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1384..1412
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2160..2177
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2178..2199
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2200..2225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2356..2375
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2575..2606
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2627..2643
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2647..2686
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2718 AA; 296679 MW; 15148C7672A1FE75 CRC64;
MPRTKQIHPR NLRDKIEEAQ KELNGAEVSK KEILQAGVKG TSESLKGVKR KKIVAENHLK
KIPKSPLRNP LQAKHKQNTE ESSFALLHSA LESHKKQNYI PVKNGKQFTK QNGETPGIIA
EATKSDESVS PKKPLFLQHP SELRRWRSEG ADPAKFSDLD EQCDSSSLSS KTRTDNSECI
SSHCGTTSPS YTNTAFDVLL KAMEPELSTL SQKGSSCAIK TEKLRPNKAA RSPSKLKNSS
MDTPNQTSQE LVAESQSSCT SYTVHVSAAQ KNEQGAVQSA SHLYHQHEHF VPKSNQHNQQ
LPRCSGFTGS LTNLQNQENA KLEQVYNIAV TSSVGLTSPS SRSQVTPQNQ QMDSASPLSI
SPANSTQSPP MPIYNSTHVA SVVNQSVEQM CNLLLKDQKP KKQGKYICEY CNRACAKPSV
LLKHIRSHTG ERPYPCVTCG FSFKTKSNLY KHKKSHAHTI KLGLVLQPDA GGLFLSHESP
KALSIHSDVE DSGESEEEGA TEERQHDLGA MELQPVHIIK RMSNAETLLK SSFTPSNPEN
VVGDFLLQDK SAESQAVTEL PKVVVHHVTV SPLRTDSPKA VDPKPELSSA QKQKDLQVTN
VQPLSANMSQ GGISRLETNE KSHQKGDMNP LEGKQDSHIG TVHAQLQRQQ ATDYSQEQQG
KLLSPRSLGS TDSGYFSRSE SADQTVSPPT PFTRRLPSTE QDSGRSNGPS AALVTTSTPS
ALPTGEKALL LPGQMRPPLA TKTLEERISK LISDNEALVD DKQLDSVKPR RTSLSRRGSI
DSPKSYIFKD SFQFDLKPVG RRTSSSSDIP KSPFTPTEKS KQVFLLSVPS LDCLPITRSN
SMPTTGYSAV PANIIPPPHP LRGSQSFDDK IGAFYDDVFV PGPNTPVPQS GHPRTLVRQA
AIEDSSANES HVLGTGQSLD ESHQGCHAAG EATSARSKAL AQGPHIEKKK SHQGRGTMFE
CETCRNRYRK LENFENHKKF YCSELHGPKT KVAMREPEHS PVPGGPQPQI LHYRVAGSSG
IWEQTPQIRK RRKMKSVGDD DELQQNESGT SPKSSEGLQF QNALGCNPSL PKHNVTIRSD
QQHKNIQLQN SQIHLVARGP EQTMDPKLST IMEQQISSAA QDKIELQRHG TGISVIQHTN
SLSRPNSFDK PEPFERASPV SFQELNRTGK SGSLKVIGIS QEESHPSRDG SHPHQLALSD
ALRGELQESS RKSPSERHVL GQPSRLVRQH NIQVPEILVT EEPDRDLEAQ CHDQEKSEKF
SWPQRSETLS KLPTEKLPPK KKRLRLAEIE HSSTESSFDS TLSRSLSRES SLSHTSSFSA
SLDIEDVSKT EASPKIDFLN KAEFLMIPAG LNTLNVPGSH REMRRTASEQ INCTQTSMEV
SDLRSKSFDC GSITPPQTTP LTELQPPSSP SRVGVTGHVP LLERRRGPLV RQISLNIAPD
SHLSPVHPTS FQNIALPSVN AVPYQGPPLT STYLAEFSAN TLHSQTQVKD LQAETSNSSS
TNIFPVQQLC DINLLNQIHA PPSHQSTQLS LQVSTQGSKP DKNSVLSGSS KSENCFAPKY
QLHCQVFTSG PSCSSNLVHS LPNQVISDPV GTDHCVTSAA LPTKLIDSLS NSHPLLPPEL
RPLGSQVQKV PSSFMLPVCL QSNVPAYCFA TLTSLPQILV TQDLPNQPVC QTNHSVVPIS
EEQNSVPTLQ KGHQNALPNP EKEFLYENVF LKMGQNSSLS ESLPITQKIS VGRLSPQQES
SASSKRMLSP ANSLDIAMEK HQKRAKDENG AVCATDVRPL EPLSSRVNEA SKQKKPILVR
QVCTTEPLDG VMLEKDVFSQ PEISSEAVNL TNVLPADNSS TGCSKFVVIE PISELQEFEN
IKSSTSLTLT VRSSPVPSEN THISPLKCTD NNQERKSPGV KNQGDKVNIQ EQSQRPVTSL
SLFNIKDTQQ LAFPSLKTTT NFTWCYLLRQ KSLHLPQKDQ KTSAYTDWTV SSSNPNPLGL
PTKVALSLLN SKQNTGKSLY CQAITTHSKS DLLVYSSKWK SNLSKRALGN QKSTVVEFSN
KDASEINSEQ DKENSLIKSE PRRIKIFDGG YKSNEEYVYV RGRGRGKYIC EECGIRCKKP
SMLKKHIRTH TDVRPYHCTY CNFSFKTKGN LTKHMKSKAH SKKCVDLGVS VGLIDEQDTE
ESDEKQRFSY ERSGYDLEES DGPDEDDNEN EDDDEDSQAE SVLSATPSVT ASPQHLPSRS
SLQDPVSADE DVRITDCFSG VHTDPMDVLP RALLTKMTVL STAQSDYNRK TLSPGKARQR
AARDENDTIP SVDTSRSPCH QMSVDYPESE EILRSSVAGK AIAITQSPSS VRLPPAAAEH
SPQTAAGVPS VASPHPDPQE QKQQITLQPT PGLPSPHTHL FSHLPLHSQQ QSRTPYNMVP
VGGIHVVPAG LTYSTFVPLQ AGPVQLTIPA VSVVHRTLGT PGNTVTEVSG TTNPAGVAEL
SSVVPCIPIG QIRVPGLQNL STPGLQSLPS LSMETVNIVG LANTNMAPQV HPPGLALNAV
GLQVLTANPS SQSSPAPQAH IPGLQILNIA LPTLIPSVSQ VTVDAQGAPE MPASQSKACE
TQPKQTSVAS ANQVSRTESP QGSPKVQGEN AKKVLNPPAP AGDHARLDGL SKMDTEKAAS
ANHVKPKPEL TSVQGQPAST SQPLLKAHSE VFTKPSGQQT LSPDRQVPRP TALPRRQPTV
QFSDVSSDDD EDRLVIAT
//