ID A0A226N4C0_CALSU Unreviewed; 731 AA.
AC A0A226N4C0;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE RecName: Full=THAP-type domain-containing protein {ECO:0000259|PROSITE:PS50950};
GN ORFNames=ASZ78_013790 {ECO:0000313|EMBL:OXB62446.1};
OS Callipepla squamata (Scaled quail).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Galliformes; Odontophoridae;
OC Callipepla.
OX NCBI_TaxID=9009 {ECO:0000313|EMBL:OXB62446.1, ECO:0000313|Proteomes:UP000198323};
RN [1] {ECO:0000313|EMBL:OXB62446.1, ECO:0000313|Proteomes:UP000198323}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texas {ECO:0000313|EMBL:OXB62446.1,
RC ECO:0000313|Proteomes:UP000198323};
RC TISSUE=Leg muscle {ECO:0000313|EMBL:OXB62446.1};
RA Oldeschulte D.L., Halley Y.A., Bhattarai E.K., Brashear W.A., Hill J.,
RA Metz R.P., Johnson C.D., Rollins D., Peterson M.J., Bickhart D.M.,
RA Decker J.E., Seabury C.M.;
RT "Disparate Historic Effective Population Sizes Predicted by Modern Levels
RT of Genome Diversity for the Scaled Quail (Callipepla squamata) and the
RT Northern Bobwhite (Colinus virginianus): Inferences from First and Second
RT Generation Draft Genome Assemblies for Sympatric New World Quail.";
RL Submitted (JUL-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXB62446.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MCFN01000216; OXB62446.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226N4C0; -.
DR STRING; 9009.A0A226N4C0; -.
DR Proteomes; UP000198323; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR InterPro; IPR025398; DUF4371.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR006612; THAP_Znf.
DR PANTHER; PTHR46289:SF16; 52 KDA REPRESSOR OF THE INHIBITOR OF THE PROTEIN KINASE; 1.
DR PANTHER; PTHR46289; 52 KDA REPRESSOR OF THE INHIBITOR OF THE PROTEIN KINASE-LIKE PROTEIN-RELATED; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR Pfam; PF14291; DUF4371; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00980; THAP; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Reference proteome {ECO:0000313|Proteomes:UP000198323};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00309}.
FT DOMAIN 1..87
FT /note="THAP-type"
FT /evidence="ECO:0000259|PROSITE:PS50950"
FT REGION 92..122
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 93..108
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 731 AA; 84126 MW; 524099C954DBD925 CRC64;
MPNFCAAPNC TRKSTQSDLA FFRFPRDPVR CQRWVENCRR ADLEDKTPDQ LNKHYRLCAK
HFETSMICRS SEDEIRTLKQ QKINEAFERE QATQELNESN AQNTVTEEGG EQQEEKSVPL
TLEERENKDY LKSLFEILIL MGKQNIPLDS HNVDELPEGI FTSDNFQALL EYRINAGDEV
LRRRFEMTAV NLEYCSKTQQ KQMLEICESC IREETLREVR DSHFFSIITD EVVDIAGEEH
LPVLVRFVDD SHNLREEFVG FLPYEADPEI LAVKFHTTIT EKWGLNMEYC RGQAYIVSSG
FASKMKVVAT RLLEKYPQAV YTLCSSCALN IWLAKSVPLV GVSMALGTME EVRCLFSRSP
QLLVELDNTI SALFQNNEEK GNELKEICRS QWTGRHDTFE VLVDLIQALV LCLDTVSNDS
TVRWNNFIAG RAFVLSSALT DFDFIVTIVI LKNVLSFTRA FGKNLQGQTS DVFFAASSLT
AVLHSLNEVM ENIEVYHEFW FEEATNLAAK LDVQIKLPGK FRRSQQGNLD SELTSENYYK
EILSVPTVEH IIQELKDIFS EQHLKALKCL SLVPSVMGQL KFNTSEEHHA DMYKNDLPNP
DTLSAELHCW RIKWKHRGKD IELPATIYEA LHLPDIKYFP NVYALLKVLC ILPVMKVENE
KYEMGRKRLK AYLKNTLTEQ RSSNLALLNI NIDIKHDLDL MVDTYIKLYP GKVEFQADFL
PSNNSEVKES A
//