ID F6SHF6_CIOIN Unreviewed; 582 AA.
AC F6SHF6;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 18-APR-2012, sequence version 2.
DT 27-MAR-2024, entry version 80.
DE RecName: Full=Transcription factor protein {ECO:0008006|Google:ProtNLM};
OS Ciona intestinalis (Transparent sea squirt) (Ascidia intestinalis).
OC Eukaryota; Metazoa; Chordata; Tunicata; Ascidiacea; Phlebobranchia;
OC Cionidae; Ciona.
OX NCBI_TaxID=7719 {ECO:0000313|Ensembl:ENSCINP00000012175.3, ECO:0000313|Proteomes:UP000008144};
RN [1] {ECO:0000313|Proteomes:UP000008144}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=12481130; DOI=10.1126/science.1080049;
RA Dehal P., Satou Y., Campbell R.K., Chapman J., Degnan B., De Tomaso A.,
RA Davidson B., Di Gregorio A., Gelpke M., Goodstein D.M., Harafuji N.,
RA Hastings K.E., Ho I., Hotta K., Huang W., Kawashima T., Lemaire P.,
RA Martinez D., Meinertzhagen I.A., Necula S., Nonaka M., Putnam N., Rash S.,
RA Saiga H., Satake M., Terry A., Yamada L., Wang H.G., Awazu S., Azumi K.,
RA Boore J., Branno M., Chin-Bow S., DeSantis R., Doyle S., Francino P.,
RA Keys D.N., Haga S., Hayashi H., Hino K., Imai K.S., Inaba K., Kano S.,
RA Kobayashi K., Kobayashi M., Lee B.I., Makabe K.W., Manohar C., Matassi G.,
RA Medina M., Mochizuki Y., Mount S., Morishita T., Miura S., Nakayama A.,
RA Nishizaka S., Nomoto H., Ohta F., Oishi K., Rigoutsos I., Sano M.,
RA Sasaki A., Sasakura Y., Shoguchi E., Shin-i T., Spagnuolo A., Stainier D.,
RA Suzuki M.M., Tassy O., Takatori N., Tokuoka M., Yagi K., Yoshizaki F.,
RA Wada S., Zhang C., Hyatt P.D., Larimer F., Detter C., Doggett N.,
RA Glavina T., Hawkins T., Richardson P., Lucas S., Kohara Y., Levine M.,
RA Satoh N., Rokhsar D.S.;
RT "The draft genome of Ciona intestinalis: insights into chordate and
RT vertebrate origins.";
RL Science 298:2157-2167(2002).
RN [2] {ECO:0000313|Ensembl:ENSCINP00000012175.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family.
CC {ECO:0000256|ARBA:ARBA00005733}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; EAAA01000467; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; F6SHF6; -.
DR Ensembl; ENSCINT00000012175.3; ENSCINP00000012175.3; ENSCING00000005904.3.
DR GeneTree; ENSGT00940000168721; -.
DR HOGENOM; CLU_026378_0_0_1; -.
DR Proteomes; UP000008144; Chromosome 10.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003654; OAR_dom.
DR InterPro; IPR043182; PAIRED_DNA-bd_dom.
DR InterPro; IPR001523; Paired_dom.
DR InterPro; IPR043565; PAX_fam.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR PANTHER; PTHR45636; PAIRED BOX PROTEIN PAX-6-RELATED-RELATED; 1.
DR PANTHER; PTHR45636:SF38; PROTEIN GOOSEBERRY-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00292; PAX; 1.
DR PRINTS; PR00027; PAIREDBOX.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00351; PAX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
DR PROSITE; PS00034; PAIRED_1; 1.
DR PROSITE; PS51057; PAIRED_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Paired box {ECO:0000256|ARBA:ARBA00022724};
KW Reference proteome {ECO:0000313|Proteomes:UP000008144};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 1..99
FT /note="Paired"
FT /evidence="ECO:0000259|PROSITE:PS51057"
FT DOMAIN 144..204
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 551..564
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 146..205
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 100..153
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 258..285
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 343..509
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..129
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..153
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..384
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 426..499
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 582 AA; 62220 MW; 663C5B091D305F17 CRC64;
MAAHGVRPCV ISRQLRVSHG CVSKILCRYQ ETGSIKPGAI GGSKPKVATA DVDNKIEEYK
KENPGIFSWE IRERLIKEGI CDRSNVPSVS SISRTLRAKG CDVENESESS ARLDPGNRSS
SSGGEPNEVG GSDSESEPDL PLKRKQRRSR TTFSAEQLDE LERCFERTHY PDIYTREELA
QRTRLTEARV QVWFSNRRAR WRKQMAAQQF HGIPAHHPHH LSPHLGYPHT MPGSVGQAAH
NYMLQTSGAH HVQSMHESFA HTASSHDHGS SLHRPHASVL TSPVHHASQA IRYDTTDPLN
QSAAAAYGLM GASAAAAAAA ARYSQYPSAA AFGDAYPIHY HHHSTSPPSG SSIFGFPSHT
RQTATNPVEL TTGNPHSGPN SSPIDVGSPG EAGNHPLHQQ HFVLAQSARR AHQASNQRNH
HNQHHSGETT TAPSTSPASN GDGHPTTGIE SQSRAYSALA GQQHHNTDLH SSTSVAGQQS
TAMLSSSVDH SSNVFAGQER GAPSSRAGYG VGPSGFATGE VLPTSSAASH QSYASCQYSP
YGQASGDYGS AGIAALRMKS REHSASIGLI PVGGGPSVQH AY
//