ID A0A151MZZ1_ALLMI Unreviewed; 392 AA.
AC A0A151MZZ1;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE SubName: Full=Transcription factor SOX-18 {ECO:0000313|EMBL:KYO30084.1};
GN Name=SOX18 {ECO:0000313|EMBL:KYO30084.1};
GN ORFNames=Y1Q_0021168 {ECO:0000313|EMBL:KYO30084.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO30084.1};
RN [1] {ECO:0000313|EMBL:KYO30084.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO30084.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO30084.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03004329; KYO30084.1; -; Genomic_DNA.
DR RefSeq; XP_006273131.1; XM_006273069.3.
DR AlphaFoldDB; A0A151MZZ1; -.
DR STRING; 8496.A0A151MZZ1; -.
DR GeneID; 102567331; -.
DR KEGG; amj:102567331; -.
DR CTD; 54345; -.
DR eggNOG; KOG0527; Eukaryota.
DR OrthoDB; 2902801at2759; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22048; HMG-box_SoxF_SOX18; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR033392; Sox7/17/18_central.
DR InterPro; IPR021934; Sox_C.
DR PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR10270:SF204; TRANSCRIPTION FACTOR SOX-18; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12067; Sox17_18_mid; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS51516; SOX_C; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 75..143
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 272..391
FT /note="Sox C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51516"
FT DNA_BIND 75..143
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..76
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 392 AA; 44093 MW; 4AF0F91D7FF48CDF CRC64;
MNRPEPSYRA DDIPQARADC SWAEAGLSTA FQPASRTPSP QPGSPPSRSP SPDAGYGYSP
PAGRADGKPG DDSRIRRPMN AFMVWAKDER KRLAQQNPDL HNAVLSKMLG QSWKALSTSD
KRPFVEEAER LRLQHLQDHP NYKYRPRRKK QAKKIKRMEP NILLHNLSQP CSDSFGMNHH
SGSQPAHHQP PPLNHFRELH SMGSDIENYG LPTPEMSPLD VLEQTEPAFF PPHMQEDCNM
MPFRGYHPHH QMEFPQDKSI GRDMAMPYAQ TPSHLADAMR TPHPSSIYYN QICSGPQNGL
STHLGQLSPP PEAHHLDSVD HLNQTELWTD VDRNEFDQYL NMSRTRPEPS GLPYHVSLSK
VTPRSISCEE SSLISALSDA SSAVYYSPCI TG
//