ID A0A226F365_FOLCA Unreviewed; 399 AA.
AC A0A226F365;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=Retinal homeobox protein Rx1 {ECO:0000313|EMBL:OXA63887.1};
GN ORFNames=Fcan01_02611 {ECO:0000313|EMBL:OXA63887.1};
OS Folsomia candida (Springtail).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Isotomoidea; Isotomidae; Proisotominae; Folsomia.
OX NCBI_TaxID=158441 {ECO:0000313|EMBL:OXA63887.1, ECO:0000313|Proteomes:UP000198287};
RN [1] {ECO:0000313|EMBL:OXA63887.1, ECO:0000313|Proteomes:UP000198287}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=VU population {ECO:0000313|EMBL:OXA63887.1,
RC ECO:0000313|Proteomes:UP000198287};
RC TISSUE=Whole body {ECO:0000313|EMBL:OXA63887.1};
RA Faddeeva A., Derks M.F., Anvar Y., Smit S., Van Straalen N., Roelofs D.;
RT "The genome of Folsomia candida.";
RL Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family. Bicoid subfamily.
CC {ECO:0000256|ARBA:ARBA00006503}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXA63887.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LNIX01000001; OXA63887.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226F365; -.
DR OMA; CIVIEIN; -.
DR Proteomes; UP000198287; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003654; OAR_dom.
DR InterPro; IPR043562; RAX/RAX2.
DR PANTHER; PTHR46271; HOMEOBOX PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR46271:SF4; HOMEOBOX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF03826; OAR; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50803; OAR; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000198287}.
FT DOMAIN 159..219
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 376..389
FT /note="OAR"
FT /evidence="ECO:0000259|PROSITE:PS50803"
FT DNA_BIND 161..220
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..165
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 339..375
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..49
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 90..107
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 343..375
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 399 AA; 42311 MW; 2362F5C68B9C5311 CRC64;
MSVGTGFEFQ RGGGGGGRGR GGGSSDTSEE SRRGVKRSRI KDLHSSDLSA DEGDYSGIGT
KIRPGSVGSD TSSDFGTGDG LPNGGGGGSV CGGNISQSFV ANNNNNHHHH HQGTKGGHHH
PHNNHDHDDL DINDEDVDGP DEDHPSSPGG ADSIGEPKKK HRRNRTTFTT YQLHELERAF
EKSHYPDVYS REELAMKVNL PEVRVQVWFQ NRRAKWRRQE KMEAARLGLG DYPLSALGRS
PGGSSANQAA MAGLLGAAAS NQAALSFMTD PWLAASPLLG STLSHALPGF LSHPNTCYPS
YLTPPTSAGS LFSGSHHHHH HPHHHTTLIE PRSGVKLNSP LTAHHLSPIN TPTTVGGGGG
SPGATNSDSS PMGRSSSIAA LRMKAKEHIE TLSKVIQIT
//