ID A0A2A4K398_HELVI Unreviewed; 622 AA.
AC A0A2A4K398;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=SANT domain-containing protein {ECO:0000259|PROSITE:PS51293};
GN ORFNames=B5V51_3204 {ECO:0000313|EMBL:PCG78725.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG78725.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG78725.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG78725.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG78725.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the N-CoR nuclear receptor corepressors family.
CC {ECO:0000256|ARBA:ARBA00010097}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG78725.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000176; PCG78725.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4K398; -.
DR STRING; 7102.A0A2A4K398; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR Gene3D; 1.20.5.430; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR031557; N-CoR_GPS2_interact.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR13992; NUCLEAR RECEPTOR CO-REPRESSOR RELATED NCOR; 1.
DR PANTHER; PTHR13992:SF39; SMRTER, ISOFORM G; 1.
DR Pfam; PF15784; GPS2_interact; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51293; SANT; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 493..544
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 17..63
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 263..287
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 547..622
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 352..405
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 42..63
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..602
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 622 AA; 68983 MW; 438D6CE8FF3BAA6E CRC64;
MVWSTQTGEV LIDIVTRPGQ TDHGHGGGAL AYGGKPAAGG VVAPPQPRPP PAAPYAPQPY
PPAQLRLHQP GYGSAALSYR DSSYVRGGGA VGSVGGVGAV GAGAAGAGEY RGGRVSLLGA
ADSSYVRGGG AVGSVGGVGA VGAGAAGAGE YRGGRVSLLG AAYLPPPPAP AAHHPPPADP
PPFKKIRLTA DRTPLSHHQP LRVDTREPVN QYSTVEVLSP NPPSEPTMED QSFRTTKDDL
LQQISKVDRE MALSESTLSK LKKKQEELEQ TASKPARAEE PEETVPRHRS LAQCVYAENR
KKISMDNKRQ MQASSAHAVL SHLGPPVLYP LYNQPQDTEV YHENIRRHRT FRKRLAEHIR
KLKLEAEKRE DALAEAYSRR AAEWLRRVER IEQGQKRKAK DARNREFFEK VFPELRKQRE
ERERFHRLGA RVKSEAELEE IADGLHEQEH EDKKMRSLTV VPPLLRDPSD ATPKYVDTNR
RCMDMESEHK ELQLRNVWSQ AERELFREKY LQHPKNFGQI ASFLPRKSVR DCVRFYYLSK
KAENYKQLLR KPRQRRSARN PPRAQPEPEL PAGVTTRLQR SQGTTARGTE NKEPNMDESM
PQVTDPVTCI PIPTAAPAQP PR
//