GenomeNet

Database: UniProt
Entry: A0A151MZZ1_ALLMI
LinkDB: A0A151MZZ1_ALLMI
Original site: A0A151MZZ1_ALLMI 
ID   A0A151MZZ1_ALLMI        Unreviewed;       392 AA.
AC   A0A151MZZ1;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   24-JAN-2024, entry version 30.
DE   SubName: Full=Transcription factor SOX-18 {ECO:0000313|EMBL:KYO30084.1};
GN   Name=SOX18 {ECO:0000313|EMBL:KYO30084.1};
GN   ORFNames=Y1Q_0021168 {ECO:0000313|EMBL:KYO30084.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO30084.1};
RN   [1] {ECO:0000313|EMBL:KYO30084.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO30084.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO30084.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03004329; KYO30084.1; -; Genomic_DNA.
DR   RefSeq; XP_006273131.1; XM_006273069.3.
DR   AlphaFoldDB; A0A151MZZ1; -.
DR   STRING; 8496.A0A151MZZ1; -.
DR   GeneID; 102567331; -.
DR   KEGG; amj:102567331; -.
DR   CTD; 54345; -.
DR   eggNOG; KOG0527; Eukaryota.
DR   OrthoDB; 2902801at2759; -.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-UniRule.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   CDD; cd22048; HMG-box_SoxF_SOX18; 1.
DR   Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR   InterPro; IPR009071; HMG_box_dom.
DR   InterPro; IPR036910; HMG_box_dom_sf.
DR   InterPro; IPR033392; Sox7/17/18_central.
DR   InterPro; IPR021934; Sox_C.
DR   PANTHER; PTHR10270; SOX TRANSCRIPTION FACTOR; 1.
DR   PANTHER; PTHR10270:SF204; TRANSCRIPTION FACTOR SOX-18; 1.
DR   Pfam; PF00505; HMG_box; 1.
DR   Pfam; PF12067; Sox17_18_mid; 1.
DR   SMART; SM00398; HMG; 1.
DR   SUPFAM; SSF47095; HMG-box; 1.
DR   PROSITE; PS50118; HMG_BOX_2; 1.
DR   PROSITE; PS51516; SOX_C; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00267}; Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT   DOMAIN          75..143
FT                   /note="HMG box"
FT                   /evidence="ECO:0000259|PROSITE:PS50118"
FT   DOMAIN          272..391
FT                   /note="Sox C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51516"
FT   DNA_BIND        75..143
FT                   /note="HMG box"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT   REGION          1..76
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   392 AA;  44093 MW;  4AF0F91D7FF48CDF CRC64;
     MNRPEPSYRA DDIPQARADC SWAEAGLSTA FQPASRTPSP QPGSPPSRSP SPDAGYGYSP
     PAGRADGKPG DDSRIRRPMN AFMVWAKDER KRLAQQNPDL HNAVLSKMLG QSWKALSTSD
     KRPFVEEAER LRLQHLQDHP NYKYRPRRKK QAKKIKRMEP NILLHNLSQP CSDSFGMNHH
     SGSQPAHHQP PPLNHFRELH SMGSDIENYG LPTPEMSPLD VLEQTEPAFF PPHMQEDCNM
     MPFRGYHPHH QMEFPQDKSI GRDMAMPYAQ TPSHLADAMR TPHPSSIYYN QICSGPQNGL
     STHLGQLSPP PEAHHLDSVD HLNQTELWTD VDRNEFDQYL NMSRTRPEPS GLPYHVSLSK
     VTPRSISCEE SSLISALSDA SSAVYYSPCI TG
//
DBGET integrated database retrieval system