ID H3CCY6_TETNG Unreviewed; 1375 AA.
AC H3CCY6;
DT 18-APR-2012, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 1.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Capicua transcriptional repressor b {ECO:0000313|Ensembl:ENSTNIP00000006109.1};
OS Tetraodon nigroviridis (Spotted green pufferfish) (Chelonodon
OS nigroviridis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Tetraodontiformes; Tetradontoidea; Tetraodontidae; Tetraodon.
OX NCBI_TaxID=99883 {ECO:0000313|Ensembl:ENSTNIP00000006109.1, ECO:0000313|Proteomes:UP000007303};
RN [1] {ECO:0000313|Proteomes:UP000007303}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15496914; DOI=10.1038/nature03025;
RA Jaillon O., Aury J.-M., Brunet F., Petit J.-L., Stange-Thomann N.,
RA Mauceli E., Bouneau L., Fischer C., Ozouf-Costaz C., Bernot A., Nicaud S.,
RA Jaffe D., Fisher S., Lutfalla G., Dossat C., Segurens B., Dasilva C.,
RA Salanoubat M., Levy M., Boudet N., Castellano S., Anthouard V., Jubin C.,
RA Castelli V., Katinka M., Vacherie B., Biemont C., Skalli Z., Cattolico L.,
RA Poulain J., De Berardinis V., Cruaud C., Duprat S., Brottier P.,
RA Coutanceau J.-P., Gouzy J., Parra G., Lardier G., Chapple C.,
RA McKernan K.J., McEwan P., Bosak S., Kellis M., Volff J.-N., Guigo R.,
RA Zody M.C., Mesirov J., Lindblad-Toh K., Birren B., Nusbaum C., Kahn D.,
RA Robinson-Rechavi M., Laudet V., Schachter V., Quetier F., Saurin W.,
RA Scarpelli C., Wincker P., Lander E.S., Weissenbach J., Roest Crollius H.;
RT "Genome duplication in the teleost fish Tetraodon nigroviridis reveals the
RT early vertebrate proto-karyotype.";
RL Nature 431:946-957(2004).
RN [2] {ECO:0000313|Ensembl:ENSTNIP00000006109.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 99883.ENSTNIP00000006109; -.
DR Ensembl; ENSTNIT00000006257.1; ENSTNIP00000006109.1; ENSTNIG00000003521.1.
DR GeneTree; ENSGT00940000159960; -.
DR HOGENOM; CLU_003486_0_0_1; -.
DR InParanoid; H3CCY6; -.
DR OMA; VVCGRWA; -.
DR TreeFam; TF323412; -.
DR Proteomes; UP000007303; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0060319; P:primitive erythrocyte differentiation; IEA:Ensembl.
DR CDD; cd21990; HMG-box_CIC-like; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR PANTHER; PTHR13059; HMG-BOX TRANSCRIPTION FACTOR BBX; 1.
DR PANTHER; PTHR13059:SF13; PROTEIN CAPICUA HOMOLOG; 1.
DR Pfam; PF00505; HMG_box; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000007303};
KW Signal {ECO:0000256|SAM:SignalP};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..1375
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003582003"
FT DOMAIN 181..249
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 181..249
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 89..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 254..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 383..454
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 552..591
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 666..697
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1057..1115
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1208..1238
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1317..1375
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..133
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 383..399
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 407..424
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 429..451
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 560..586
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 673..689
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1075..1089
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1317..1340
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1375 AA; 147191 MW; 7CF721E3D3224028 CRC64;
MLGLKVLLLT DALSSAVWTN VEPRSVPVFP WHSLVPFLAP TQSDASSQLG EGPHPVNHPQ
AASLNFKHLW LKVFLCCFQS ATGAAALSQE PAEAPPAAER GPLLPPPPSS EEAPSEKEKG
EAERERPDSE TESDVDDPFL PGVVPEPPLS TSPVKRRTQS LSALPKDGDK NSPGKREKDH
IRRPMNAFMI FSKRHRALVH QRHPNQDNRT VSKILGEWWY ALGPKEKQKY HDLAFQVKEA
HFKAHPDWKW CNKERKKSSS EGRGVPGGKD IRERSMSEST GWVHSHIPKY NRNLHVFLFF
KITNSYFSKS LVGVSERNAG EARPRALSQS AMHSFERGDR GNTQALAELA QVGSPAGTRE
PRVFQTVLSF PFSLKMCGDG GSQFSSHTPT LSQSQRGVSE DMTSDEERMV ICEEEGDDDV
IEDPYPSSSI DLKCKERVTD SDSENGSGDE GERKGVFAPV ICSSASSSST QHASHGRSIS
LKGAFLGEDG GGVQAASFSL SSGQSLLSTS QAGVTLSSSV SSLGANPLLG IGTVRVASTV
VTNVMRPVIS TPLPIASKPR DGGTSSSPLP PERKSLTPQQ HQPQLLIGTG GGAAASGGGY
YSASSPNPVG SGPGGVVTNL VLGGALSAQP AVQLITPTPQ PQSSQQQTLS SNAAITQVQY
ILPTLPANSH PKSPPRHLSQ TTSIFNLPPA PPTHASLANG KQQVTSAVTG YTSSQTVGVY
MRQNACDSCT QLTVFVFCFF ISASVQTQSP VLQGKMLVPM ATVRTAPAPA QQFPIVTPAL
PVQNGSQTGS KIIQIAPMPV VQSPLPQGGA VHPSNAFPVT VGTAAVVAPG SAPSQTVLLP
PAPTRITYVQ STPGVPSTLP LVPTTTGSST TQQALPVAGS AYVPSALATL GFTAIAPPGQ
TLVQPLITGQ RCQRSDKYRK KGRNMCSLVF YLSSQIVTAV YPPSPSVTMA TGVVSMTAVP
PSVVHSVSGP ANAPPHILTK HTAAAIASWS GRGKSPAAPA ARWRPRAAQR CPYGPAVLHF
LSKHLVLQHF VSQLTAHAIS LYILYLKLFF RQHTRNSKVK PAPRQDLSEA AGSGHFSFDA
QSSPSTHLAE EPPSERAPES SGTADPAENA RHPLTSDDWC QKPACVNPAR PRSRRPSTRW
TRCCRRCISR SASRSCRSSD RRRFCLLPPC KAWPPRPAPS WGATGARGRT PQIWTRRLTN
RFLRRGRVGG ARAAARSPTH PRVPPNARET SFTFDRPGTE GEDILTDLEF DKVPYSSLRR
TLDQRRALVM QLFQEQGFFP SAQATAAFQT RYSDIFPTKL CLQLKIREVR QKIMQTATPS
DASGLGASDS SGSLPGLLGS QSGEGSGRGE LQDDEDEQGT AGSPEDPRDS QESSR
//