ID A0A0L0CFS7_LUCCU Unreviewed; 615 AA.
AC A0A0L0CFS7;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE RecName: Full=POU domain protein {ECO:0000256|RuleBase:RU361194};
GN ORFNames=FF38_14336 {ECO:0000313|EMBL:KNC31263.1};
OS Lucilia cuprina (Green bottle fly) (Australian sheep blowfly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Oestroidea;
OC Calliphoridae; Luciliinae; Lucilia.
OX NCBI_TaxID=7375 {ECO:0000313|EMBL:KNC31263.1, ECO:0000313|Proteomes:UP000037069};
RN [1] {ECO:0000313|EMBL:KNC31263.1, ECO:0000313|Proteomes:UP000037069}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=LS {ECO:0000313|EMBL:KNC31263.1,
RC ECO:0000313|Proteomes:UP000037069};
RC TISSUE=Full body {ECO:0000313|EMBL:KNC31263.1};
RX PubMed=26108605; DOI=10.1038/ncomms8344;
RA Anstead C.A., Korhonen P.K., Young N.D., Hall R.S., Jex A.R., Murali S.C.,
RA Hughes D.S., Lee S.F., Perry T., Stroehlein A.J., Ansell B.R.,
RA Breugelmans B., Hofmann A., Qu J., Dugan S., Lee S.L., Chao H., Dinh H.,
RA Han Y., Doddapaneni H.V., Worley K.C., Muzny D.M., Ioannidis P.,
RA Waterhouse R.M., Zdobnov E.M., James P.J., Bagnall N.H., Kotze A.C.,
RA Gibbs R.A., Richards S., Batterham P., Gasser R.B.;
RT "Lucilia cuprina genome unlocks parasitic fly biology to underpin future
RT interventions.";
RL Nat. Commun. 6:7344-7344(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the POU transcription factor family.
CC {ECO:0000256|RuleBase:RU361194}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KNC31263.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JRES01000438; KNC31263.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0L0CFS7; -.
DR STRING; 7375.A0A0L0CFS7; -.
DR EnsemblMetazoa; KNC31263; KNC31263; FF38_14336.
DR EnsemblMetazoa; XM_046956631.1; XP_046812587.1; LOC111690339.
DR OMA; AHTFMRH; -.
DR Proteomes; UP000037069; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0048468; P:cell development; IEA:UniProt.
DR GO; GO:0048699; P:generation of neurons; IEA:UniProt.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:UniProt.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.260.40; lambda repressor-like DNA-binding domains; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR010982; Lambda_DNA-bd_dom_sf.
DR InterPro; IPR013847; POU.
DR InterPro; IPR000327; POU_dom.
DR PANTHER; PTHR11636; POU DOMAIN; 1.
DR PANTHER; PTHR11636:SF89; POU DOMAIN PROTEIN 2, ISOFORM B-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF00157; Pou; 1.
DR PRINTS; PR00028; POUDOMAIN.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00352; POU; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF47413; lambda repressor-like DNA-binding domains; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS00035; POU_1; 1.
DR PROSITE; PS00465; POU_2; 1.
DR PROSITE; PS51179; POU_3; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000037069};
KW Transcription {ECO:0000256|RuleBase:RU361194}.
FT DOMAIN 399..473
FT /note="POU-specific"
FT /evidence="ECO:0000259|PROSITE:PS51179"
FT DOMAIN 501..561
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 503..562
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 14..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 64..182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 282..385
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 560..615
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 14..35
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..95
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..118
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 140..158
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 282..352
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..385
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 565..615
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 615 AA; 68868 MW; 227CB0594DC9BDBB CRC64;
MMVVQQQMLW NTQQSSASAA QNSSKDDSRL LDNSSPPYLK TEILDDNEVE ELHHQQLSNN
HHHYNHHHVK HDDDEEEVVD REHDEHHHES VEQQKLTYLR HMSKSPSPIR SLSGFSSYHE
PQDDLEKYRT PSPTNLSNKR SRELDSEMEN EVLNLARHTP KRVARSPSPI TEKSANYETE
NPLNPNSPNL LSALQSPLAP LLLQNQLGLA AAATSLKPEE MQQAFQLQLQ GYMEMMRQMS
PENPAAAQFL LQNSLQAMLQ LQALQQMKQQ QQQQQQVQEE ILRKSPLNEL KNYSTPQDKS
PLRSPSLSPV SRHGSARLQT PNGTPASTNQ QTTPPNSANL PMSLSSAAMT PNTPGMPPAF
PSNSLSQAAM TYSSTPQNSK GSGANTLSMT ARTLDQSPEE TTDLEELEQF AKTFKQRRIK
LGFTQGDVGL AMGKLYGNDF SQTTISRFEA LNLSFKNMCK LKPLLQKWLE DADNTVSKPG
GIFNLTAMTS SALTTPENIM GRRRKKRTSI ETNVRTTLER AFNINCKPTS EEINQLSEQL
NMDKEVVRVW FCNRRQKEKR TNPSLDLDSP TGTPLSSHAF GYPPQSLNLT SGLEGSSLCG
SSISSLSPHY NGKQE
//