ID W5MDQ1_LEPOC Unreviewed; 414 AA.
AC W5MDQ1;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=SRY-box transcription factor 18 {ECO:0000313|Ensembl:ENSLOCP00000006510.1};
OS Lepisosteus oculatus (Spotted gar).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC Lepisosteus.
OX NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000006510.1, ECO:0000313|Proteomes:UP000018468};
RN [1] {ECO:0000313|Proteomes:UP000018468}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT "The Draft Genome of Lepisosteus oculatus.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLOCP00000006510.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHAT01024399; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_015220505.1; XM_015365019.1.
DR AlphaFoldDB; W5MDQ1; -.
DR STRING; 7918.ENSLOCP00000006510; -.
DR Ensembl; ENSLOCT00000006518.1; ENSLOCP00000006510.1; ENSLOCG00000005401.1.
DR GeneID; 102698657; -.
DR KEGG; loc:102698657; -.
DR CTD; 54345; -.
DR eggNOG; KOG0527; Eukaryota.
DR GeneTree; ENSGT00940000156694; -.
DR HOGENOM; CLU_044994_0_0_1; -.
DR InParanoid; W5MDQ1; -.
DR OMA; FSMSHHG; -.
DR OrthoDB; 2902801at2759; -.
DR Proteomes; UP000018468; Linkage group LG18.
DR Bgee; ENSLOCG00000005401; Expressed in zone of skin and 12 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0001525; P:angiogenesis; IBA:GO_Central.
DR GO; GO:0060840; P:artery development; IEA:Ensembl.
DR GO; GO:0001946; P:lymphangiogenesis; IBA:GO_Central.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR GO; GO:0001570; P:vasculogenesis; IBA:GO_Central.
DR CDD; cd22048; HMG-box_SoxF_SOX18; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR033392; Sox7/17/18_central.
DR InterPro; IPR021934; Sox_C.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR10270:SF204; TRANSCRIPTION FACTOR SOX-18; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12067; Sox17_18_mid; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS51516; SOX_C; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000018468}.
FT DOMAIN 80..148
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 294..413
FT /note="Sox C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51516"
FT DNA_BIND 80..148
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..36
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 51..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 256..291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..283
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 414 AA; 46128 MW; 83A39ED670FCD2B6 CRC64;
MNISESNYCR EETSQPRGDC SWVAAHSPSS DRGLGFNQTR IGDPGSVVGA EGRTASPESG
CGLGPASGIS EGKSGAESRI RRPMNAFMVW AKDERKRLAQ QNPDLHNAVL SKMLGQSWKA
LSTVEKRPFV EEAERLRLQH LQDHPNYKYR PRRKKQAKKI KRMEPSLLLH GLSQTCGGEN
YSMNHQNGSQ SGHHQLPPLN HFRDLHSVGS EFESYGLPTP EMSPLDVLEE GDPVFFPPHM
HDDVNLVPWI NYHHNPHHHH HHHLPPPQKS PESHQEKRHI PGPEGTHMSF SQNPVGVAEA
VKGSHTSSVY YNQVYSGPQG GFTAHLGQLS PPPEAPTLDN VEQLNHSEFW TEVDRNEFDQ
YLNMSRTRLE GPGQHYTASM SKVPPRHISC EDSPLISALS DASSAVYYSA CITG
//