ID B4M799_DROVI Unreviewed; 697 AA.
AC B4M799;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 2.
DT 27-MAR-2024, entry version 96.
DE RecName: Full=Homeobox domain-containing protein {ECO:0008006|Google:ProtNLM};
GN Name=Dvir\GJ16953 {ECO:0000313|EMBL:EDW62666.2};
GN ORFNames=Dvir_GJ16953 {ECO:0000313|EMBL:EDW62666.2};
OS Drosophila virilis (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila.
OX NCBI_TaxID=7244 {ECO:0000313|EMBL:EDW62666.2, ECO:0000313|Proteomes:UP000008792};
RN [1] {ECO:0000313|EMBL:EDW62666.2, ECO:0000313|Proteomes:UP000008792}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Tucson 15010-1051.87 {ECO:0000313|Proteomes:UP000008792};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the paired homeobox family.
CC {ECO:0000256|ARBA:ARBA00005733}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH940653; EDW62666.2; -; Genomic_DNA.
DR RefSeq; XP_002057180.2; XM_002057144.2.
DR AlphaFoldDB; B4M799; -.
DR SMR; B4M799; -.
DR STRING; 7244.B4M799; -.
DR EnsemblMetazoa; FBtr0438083; FBpp0394866; FBgn0204131.
DR eggNOG; KOG0494; Eukaryota.
DR HOGENOM; CLU_016909_0_0_1; -.
DR InParanoid; B4M799; -.
DR Proteomes; UP000008792; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR023339; CVC.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR46892; VISUAL SYSTEM HOMEOBOX 2; 1.
DR PANTHER; PTHR46892:SF1; VISUAL SYSTEM HOMEOBOX 2; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51496; CVC; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008792}.
FT DOMAIN 371..431
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 433..484
FT /note="CVC"
FT /evidence="ECO:0000259|PROSITE:PS51496"
FT DNA_BIND 373..432
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 19..96
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 198..326
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 359..378
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 489..568
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 608..665
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 48..96
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 198..272
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..325
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..523
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..560
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 612..664
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 697 AA; 71021 MW; C613D692FE09DCD7 CRC64;
MQATKMRTPF AIQEILGLGQ QSSQAAAAAT PSANEAPLSP AATPPPRNAN TPQQSAVSSS
AQNAAQTLEQ QQQQQQQQQQ QQQTSSRDSR SISPDVTRLS AAAAAAAAAA AAHCQLPVFN
PANFYAAAGY AAAADPANAA AAHHHHMTTL AAAAYLRGFL PPGFSTPHEF RGHFQQHFGA
QFAEQAAARG QDYASSLANA SGDEQSPSKS GAATSDTGGL LGSPTALSNG QAGGAANGAS
SNGHSAGSGS GSGSGSGSGS GSTASRCSPS APTTAAHHHA ALAAHQHAAA AAAAAHHHAA
AAAAAHHAHH AHAAHHAHGH VHGHVHHAHE AAFLGAAGAG GAANPNGLLA GHGGDVLVGP
GGAGPGGKKK NKRRHGRTIF TSSQLEELEK AFKEAHYPDV SARELLSMKT GLAEDRIQVW
YQNRRAKWRK TEKCWGHSTK MAEYGLYGAM VRHSLPLPET IIKSAKEDES VAPWLLGMHK
KSLEAAELLK SDESDRETPT SDDTNTSYSA GSTHNSPISS FSISRLLFDA PPPPPPGSAG
GKQSKHQQQQ QQQQQQAAAV QHAMHMHHHG TAGYAEAAVA AAAAAAAVAA RNAAAAVAAA
AAVSVSVSVA ESEPEEEPDE DEDMDKDCDA DVDPDADVDA DVDGDADVDG DADADGDIDI
CDDEDEALSH QQLLLKVKQE HVAGGGGGGG GPEGASR
//