ID A0A151MPL0_ALLMI Unreviewed; 729 AA.
AC A0A151MPL0;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE SubName: Full=Transcription factor SPT20-like protein isoform B {ECO:0000313|EMBL:KYO26442.1};
GN Name=SUPT20H {ECO:0000313|EMBL:KYO26442.1};
GN ORFNames=Y1Q_0010405 {ECO:0000313|EMBL:KYO26442.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO26442.1};
RN [1] {ECO:0000313|EMBL:KYO26442.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO26442.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SIMILARITY: Belongs to the SPT20 family.
CC {ECO:0000256|ARBA:ARBA00009112}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO26442.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03005510; KYO26442.1; -; Genomic_DNA.
DR RefSeq; XP_006267496.1; XM_006267434.3.
DR AlphaFoldDB; A0A151MPL0; -.
DR GeneID; 102569237; -.
DR KEGG; amj:102569237; -.
DR CTD; 55578; -.
DR OrthoDB; 4848120at2759; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0000124; C:SAGA complex; IEA:InterPro.
DR GO; GO:0003712; F:transcription coregulator activity; IEA:InterPro.
DR InterPro; IPR021950; Spt20.
DR InterPro; IPR046468; Spt20-like_SEP.
DR PANTHER; PTHR13526; TRANSCRIPTION FACTOR SPT20 HOMOLOG; 1.
DR PANTHER; PTHR13526:SF8; TRANSCRIPTION FACTOR SPT20 HOMOLOG; 1.
DR Pfam; PF12090; Spt20_SEP; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 75..216
FT /note="Spt20-like SEP"
FT /evidence="ECO:0000259|Pfam:PF12090"
FT REGION 423..525
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 636..671
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 705..729
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 460..481
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 482..497
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 498..525
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 729 AA; 79678 MW; 0B0BCAF811E135E2 CRC64;
MQQALELALD RAEYIIESAR QRPPKRKYLS SGRKSVFQKL YDLYIEECEK EPEIKKLRRN
VNLLEKLVMQ ETLSCLVVNL YPGNEGYSLM LRGKNGSDSE TIRLPYEEGE LLEYLDAEEL
PPILVDLLEK SQVNIFHCGC VIAEIRDYRQ SGNMKSPTYQ SKHILLRPTM QTLICDVHSI
TSDNHKWTQE DKLLLESQLI LATAEPLCLD PSIAVTCTAN RLLYNKQKMN TRPMKRCFKR
YSRSSLNRQQ DVAHCPTPPQ LRILDYLPKR KERKGAQQYD LKISKAGNCV DMWKQNPCCL
TAPSEVDVEK YAKVEKSIKP DDSQPTVWPA HEIKDDYVFE CEVGNQLQKT KLTIFQSLGN
PLYYGKIQTL KGDEENESLV TPSQFLIGSK TDAERVVNQY QELVQNEAKC PVKMFHNSGG
SVNLSHLSPG KEMEPESLSG SVQSSVLGKG VKHRPPPIKL PSSSGSSSSG NIFSPQQSSG
HLKSPTPPPP LPPSKPPSLS RKQSMDLSQV SMLSPAAMSP ASSSQRTAST QVMANSAGLN
FINVVGSVCG AQTLMSGSNT MLGCNTGAIA PAGINLSGIL PSGGLVPNAL PAAMQSASQA
GSTFGLKNTS SLRPLNLLQL PGGSLIFNPL QQQQQQQLSQ FSPQQQSQQP TTSSPQQQGE
QGSEQGACSQ DQTLSAQQAA VINLTGVGNF MQPQATAVAI LAANGYGSSS SSTSSSPTTS
ATFRQPLKK
//