ID A0A0L0BSA7_LUCCU Unreviewed; 447 AA.
AC A0A0L0BSA7;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE SubName: Full=Homeobox protein goosecoid {ECO:0000313|EMBL:KNC22955.1};
GN ORFNames=FF38_13760 {ECO:0000313|EMBL:KNC22955.1};
OS Lucilia cuprina (Green bottle fly) (Australian sheep blowfly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Oestroidea;
OC Calliphoridae; Luciliinae; Lucilia.
OX NCBI_TaxID=7375 {ECO:0000313|EMBL:KNC22955.1, ECO:0000313|Proteomes:UP000037069};
RN [1] {ECO:0000313|EMBL:KNC22955.1, ECO:0000313|Proteomes:UP000037069}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=LS {ECO:0000313|EMBL:KNC22955.1,
RC ECO:0000313|Proteomes:UP000037069};
RC TISSUE=Full body {ECO:0000313|EMBL:KNC22955.1};
RX PubMed=26108605; DOI=10.1038/ncomms8344;
RA Anstead C.A., Korhonen P.K., Young N.D., Hall R.S., Jex A.R., Murali S.C.,
RA Hughes D.S., Lee S.F., Perry T., Stroehlein A.J., Ansell B.R.,
RA Breugelmans B., Hofmann A., Qu J., Dugan S., Lee S.L., Chao H., Dinh H.,
RA Han Y., Doddapaneni H.V., Worley K.C., Muzny D.M., Ioannidis P.,
RA Waterhouse R.M., Zdobnov E.M., James P.J., Bagnall N.H., Kotze A.C.,
RA Gibbs R.A., Richards S., Batterham P., Gasser R.B.;
RT "Lucilia cuprina genome unlocks parasitic fly biology to underpin future
RT interventions.";
RL Nat. Commun. 6:7344-7344(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KNC22955.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JRES01001431; KNC22955.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0L0BSA7; -.
DR STRING; 7375.A0A0L0BSA7; -.
DR EnsemblMetazoa; KNC22955; KNC22955; FF38_13760.
DR EnsemblMetazoa; XM_046955795.1; XP_046811751.1; LOC111687720.
DR OMA; IKSDPNH; -.
DR Proteomes; UP000037069; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR46643:SF1; HOMEOBOX PROTEIN GOOSECOID-2; 1.
DR PANTHER; PTHR46643; HOMEOBOX PROTEIN GOOSECOID-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Reference proteome {ECO:0000313|Proteomes:UP000037069}.
FT DOMAIN 396..447
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT REGION 70..170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 199..224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 264..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 370..400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 85..170
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 199..222
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..296
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 447 AA; 48935 MW; 205997F94E9DA20A CRC64;
MNLSQIKQQQ KVGVNRFQPP LNCTQIILVT PEKKINKILS SLWPAPDQLV GNLEPKLVAN
TAKFKITVIR SPASSPPPTP PLDINSLNPS KLNINTNKMV ETTSPPSGLT IKRCPSSPLS
DHHHNLQQQQ QQHLQQQQQL QQLHQQHLHN SMSNQNNPTT RQSPHSPATP NAAYLTTAML
LNSQQCGYLG QRLQSVFQQQ QQHTQSQTPS SDDGSQSGAT IIDDDRITLR ETAVGSATAT
AAAAASIFSI DSILGSRSVN HNNNNSINNN NIINSKRSDS PTSPSSNSSS AASSPLRPQR
VPAMLQHPGL HLGHLAAAAA SGFAASPSDF LVAYPNFYPN YMHAAAVAHV AAAQMQAHVS
GHSSSLNHNH GHAHHLHHGH HGHHMAGHLG HGPPPKRKRR HRTIFTEEQL EQLEATFEKT
HYPDVVLREQ LALKVDLKEE RVERETN
//