ID A0A026WZT0_OOCBI Unreviewed; 881 AA.
AC A0A026WZT0;
DT 09-JUL-2014, integrated into UniProtKB/TrEMBL.
DT 09-JUL-2014, sequence version 1.
DT 13-SEP-2023, entry version 38.
DE SubName: Full=Upstream-binding protein {ECO:0000313|EMBL:EZA60619.1};
GN ORFNames=X777_14645 {ECO:0000313|EMBL:EZA60619.1};
OS Ooceraea biroi (Clonal raider ant) (Cerapachys biroi).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Formicoidea;
OC Formicidae; Dorylinae; Ooceraea.
OX NCBI_TaxID=2015173 {ECO:0000313|EMBL:EZA60619.1, ECO:0000313|Proteomes:UP000053097};
RN [1] {ECO:0000313|EMBL:EZA60619.1, ECO:0000313|Proteomes:UP000053097}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24508170; DOI=10.1016/j.cub.2014.01.018;
RA Oxley P.R., Ji L., Fetter-Pruneda I., McKenzie S.K., Li C., Hu H.,
RA Zhang G., Kronauer D.J.;
RT "The genome of the clonal raider ant Cerapachys biroi.";
RL Curr. Biol. 24:451-458(2014).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU01313}.
CC -!- SIMILARITY: Belongs to the grh/CP2 family. CP2 subfamily.
CC {ECO:0000256|ARBA:ARBA00010852}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK107077; EZA60619.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A026WZT0; -.
DR STRING; 2015173.A0A026WZT0; -.
DR EnsemblMetazoa; XM_026968264.1; XP_026824065.1; LOC105288150.
DR OMA; INHVYRQ; -.
DR Proteomes; UP000053097; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR007604; CP2.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR041418; SAM_3.
DR InterPro; IPR040167; TF_CP2-like.
DR PANTHER; PTHR11037:SF21; GEMINI, ISOFORM C; 1.
DR PANTHER; PTHR11037; TRANSCRIPTION FACTOR CP2; 1.
DR Pfam; PF04516; CP2; 1.
DR Pfam; PF18016; SAM_3; 1.
DR PROSITE; PS51968; GRH_CP2_DB; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU01313};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU01313};
KW Reference proteome {ECO:0000313|Proteomes:UP000053097}.
FT DOMAIN 324..553
FT /note="Grh/CP2 DB"
FT /evidence="ECO:0000259|PROSITE:PS51968"
FT REGION 46..101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 149..169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 495..519
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 73..93
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 155..169
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 495..517
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 881 AA; 97667 MW; 5EF3BB010B2FB728 CRC64;
MDGGAETLNG AMWFTNTSSP SYEQLSPVRP NRCDVVATTQ QSTIGYHSPQ NSLWQDKHNS
PKGSPLGGDQ MHLNGGSPQQ LSSSGSSPNY EQNNLKRRAE EPLQHITELE KKHIRRDGVL
IHRCSRNRMK WCEVDRSRVS LLQSRRYPPV TTKSRRRHVA QETKDRPTTM HHQLHVGYNG
AQGGSIGSNG SGQGWTTQVE ELAEHLAADF DGLSALATSD LATAVASYNM SEALLALPSL
TVFKQEAPSP ENQHNNLNHV SQRAINTASV NNGNVSSVEA DNNNNGQTAT TLHQLLYSSN
EEYPPTSTAT NHVSSSQQGN ELNEDCRFQY VLAAATSIAT KVNEETLTYL NQGQSYEIKL
KKLGDLSAYR GKILKSTIRI CFHERRLQYT EREQMLAWQR ARPGERLLEV DVPLSYGMVD
VYQPSPSNNS VDFMWDPTKE VGVYIKVNCI STEFTPKKHG GEKGVPFRIQ VETRLPGGPR
LHAASCQVKV FKLKGADRKH KQDRDKILRR PPHEQDKYQP SYDCTIPLES LSPSSPTQNG
GSPFVSTDAI ESVLDMRDYW DMRSCWSIRR PHYPFDRFVT WLLEHEVIVS GGPPHVQVVV
KFVHALLNVQ VKRGYVMRGG LVSLQIKKRD VKRLNGCKKI KLVVHLKEHS ASPQTRSTIG
GNSSPVVAPL ALSVSNSGIV PLVPASDAVK ENVGTAPALP SPAAILPDQP CIESDSAPCL
TELPPDANAT QTTAWLRASR FNAFESTFAS FSASDILRLS REDLIQICGV ADGIRLFNTL
HSKAPTPKLT LYFSLEGNGS LWRIAYLDNL TSSALTNKLL NTLNLPPDRL HSILFLGPQG
IHVLVTDDLV ANMKDESMYL VETIRDPGSE RYKLLIKSKN I
//