ID A0A016SXW8_9BILA Unreviewed; 822 AA.
AC A0A016SXW8;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Myb-like domain-containing protein {ECO:0000259|SMART:SM00717};
GN Name=Acey_s0162.g3419 {ECO:0000313|EMBL:EYB95226.1};
GN Synonyms=Acey-eif-3.C {ECO:0000313|EMBL:EYB95226.1};
GN ORFNames=Y032_0162g3419 {ECO:0000313|EMBL:EYB95226.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYB95226.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYB95226.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01001498; EYB95226.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016SXW8; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR039467; TFIIIB_B''_Myb.
DR PANTHER; PTHR22929; RNA POLYMERASE III TRANSCRIPTION INITIATION FACTOR B; 1.
DR PANTHER; PTHR22929:SF0; TRANSCRIPTION FACTOR TFIIIB COMPONENT B'' HOMOLOG; 1.
DR Pfam; PF15963; Myb_DNA-bind_7; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000024635}.
FT DOMAIN 255..303
FT /note="Myb-like"
FT /evidence="ECO:0000259|SMART:SM00717"
FT REGION 1..101
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 165..200
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 341..440
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 479..783
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 32..61
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 341..367
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 378..393
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 405..437
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 504..519
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 550..567
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 581..602
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 612..647
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 675..702
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 724..749
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 822 AA; 90878 MW; 5385F036AD94C1D5 CRC64;
MLARRSRLQV KPNISQGSKS AAKPPVEKVV EPINSHSSST PTAEQSSIAD LAASLDNPTP
VENVSKEARG LGHASEKAIP IAEPAKPIIV PPEESDSGFI DPLAPERRVA HSSVEKTKTR
DRLFSSSDCY EPVLPRPRRK FTGDEELDPK KMRMMDMIYW NPKKQKGMSR KYKDTESVVG
DKPAQSSEKP ASVSGSAKTA APQVKIGADG RLVIDEESLV VAKDTTNESV WETVEEDRMT
RKVTSLSFRN RLWRKGTAWT EKETELFYEI LRCTGPDFGL MHEFFPSRAR NELKSKFNKE
ERTNWEKLKE VMSRPALLDD DLYERAADLQ KEIEEEALAK KMKKEKDKKG TGMTFKRRRK
KDREEQEEGE DVSDNSEDLV EEASKIINEM EKEKKKKRKR KARRSRSESS DSEQSDSNAD
DEVNHEIELR EKKPKQLSAK AQALLDETIG KSLSAKAKAL LDQTLGKSLS AKAQALFNEA
TGKSKGAEDA SNEGGNEDVE DPPQTSFDGY EEEDDGNGSP VTIDSDVELV HSSKLLVRNR
HAPRITLTSE PIAGRSSAPT LTPNVQDEDV TEDVRTPSAS PEPDVDSSSS TPEPSNAFAS
AKNESGKKDD LTSGNDNVFT QSDEPTPSIS PSTPAMTPIP SQTDLSRPRR SVRGKPARKP
ILKSALIKKP PNLKASTSDK ATPCPEQEAT PDSTVEQPSV STSPRRERTR TRSMRSVEQP
TRSLILGDKE DKNTHKPLIG DKEGENADIP TPDQGDVTGN NDGESAGDVI VEESTSVEPT
RPKRRRYTVV EGDVADQCVS IGDCLHGLKT LEVFWITVKG RK
//