ID A0A182Y9B5_ANOST Unreviewed; 1302 AA.
AC A0A182Y9B5;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=C2H2-type domain-containing protein {ECO:0000259|PROSITE:PS50157};
OS Anopheles stephensi (Indo-Pakistan malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=30069 {ECO:0000313|EnsemblMetazoa:ASTEI05051-PA, ECO:0000313|Proteomes:UP000076408};
RN [1] {ECO:0000313|Proteomes:UP000076408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Indian {ECO:0000313|Proteomes:UP000076408};
RX PubMed=25244985; DOI=10.1186/preaccept-1262842421127991;
RA Jiang X., Peery A., Hall A.B., Sharma A., Chen X.G., Waterhouse R.M.,
RA Komissarov A., Riehle M.M., Shouche Y., Sharakhova M.V., Lawson D.,
RA Pakpour N., Arensburger P., Davidson V.L., Eiglmeier K., Emrich S.,
RA George P., Kennedy R.C., Mane S.P., Maslen G., Oringanje C., Qi Y.,
RA Settlage R., Tojo M., Tubio J.M., Unger M.F., Wang B., Vernick K.D.,
RA Ribeiro J.M., James A.A., Michel K., Riehle M.A., Luckhart S.,
RA Sharakhov I.V., Tu Z.;
RT "Genome analysis of a major urban malaria vector mosquito, Anopheles
RT stephensi.";
RL Genome Biol. 15:459-459(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASTEI05051-PA}
RP IDENTIFICATION.
RC STRAIN=Indian {ECO:0000313|EnsemblMetazoa:ASTEI05051-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the teashirt C2H2-type zinc-finger protein
CC family. {ECO:0000256|ARBA:ARBA00007158}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 30069.A0A182Y9B5; -.
DR EnsemblMetazoa; ASTEI05051-RA; ASTEI05051-PA; ASTEI05051.
DR VEuPathDB; VectorBase:ASTE001601; -.
DR VEuPathDB; VectorBase:ASTEI05051; -.
DR VEuPathDB; VectorBase:ASTEI20_036835; -.
DR OMA; VPPINPY; -.
DR Proteomes; UP000076408; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0010468; P:regulation of gene expression; IEA:InterPro.
DR InterPro; IPR027008; Teashirt_fam.
DR InterPro; IPR041661; ZN622/Rei1/Reh1_Znf-C2H2.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR12487:SF7; PROTEIN TEASHIRT-RELATED; 1.
DR PANTHER; PTHR12487; TEASHIRT-RELATED; 1.
DR Pfam; PF12756; zf-C2H2_2; 2.
DR Pfam; PF13912; zf-C2H2_6; 1.
DR SMART; SM00355; ZnF_C2H2; 4.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 4.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 2.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000076408};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 462..486
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 565..594
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
SQ SEQUENCE 1302 AA; 132560 MW; 8D351A622D98B6EE CRC64;
MASPRCQSRE SSSGGGRCPS DESIRSEGDR EEQGAGNNLS PVSITLPPTL PPAAAAALLP
QHSAAMAAYL NAAVAAQQNR LLLSSPLAAG LASVRNGSPP VLPLRSPTDE NDDATVLDFS
KKRGSGGGGG DDDDEDDDED DDGGEGGGAG GAGGRGSRGG EGDGDGDGGS DCGDAVNLSQ
KSGIGDNSPL DLSVSHRKRT GNEEGASPPP RKTTRSLDYK PAIVTPWSTP VTPQLPYLAA
AAVAAAGLSP KNNHHQHAMH ADSWNGKHKS VAAAAAAAAA AAAVTNDATK ALEKMSELSR
LGGEDIYRSP STGNSGGGGG GGGGGAGGGG RHSAWQSHWL NKGADSAKDV LKCVWCKQSF
PSLAAMTTHM KETKHCGVNV PPGGGGGGSG GHAQQPIPPP AQGGNNMGGS GGQSGQGHGP
ASKPSPSELN MLIKETMPLP RKLVRGQDVW LGKGAEQTRQ ILKCMWCGQS FRTLAEMTAH
MQQTQHYTNI ISQEQIISWK SSDSDKSGGG GGGGGAGGGP AGAAGGAGPS GGAGPNGGGA
GGGGPGGAAN AAAAAQTNSH VSAVLTCKVC DQAFSSLKEL SNHMVKNAHY KEHIMRSITE
SGSRRRQTRE KRKKSLPVRK LLEMERAQHD YGKNGADTGP NSAAAAAAAV AAAVNAANKP
LRDLGAGKIS CEKCNEKIET TMFVDHIRQC IGGTALLAAQ QQRDKLKSAL LSNTIIPPDS
ISPVTPTGRD GRKSVSDDLA SPLSSQKSPL GSDLLSPVSM LTKKDGNGGG GGEKSSSPSV
LNAIEQLIEK SFDTRSRHGG SNYSGSGGGG GGGQSSTPLG SSILKRLGID ESVDYTKPLV
DPQTMNLLRS YHHQQQQAQY ASQQFGRRER SGSESSSISE RGSSRMDSLT PEKKFDTPSH
STPRGTPDKP YSEQSDHHHP DGSDLHAQPP IKRELHSDDE SPEAETKPAA EQGKIRIKKE
LMPEEGDDHE TDHEMRPPSK AQHRDDRDDQ EPPRRSSVAS SPAPSPRLST ASVPLSPAAS
PLSDHHSIAS RSTPGAGDCS SKKSSNVSNS LGALSSMFDS LTGAGSGSTG GGDAASSGGS
GGKKSSAHPL AALQKLCDKT ETPSAGGGGR GSSSSSALLA ATTNSSGSRT APGAILAFSW
ACNEAVVSED VAGGTIKCAY CDTTFGSKGA YRHHLSKAHF VNDDVIPDPK VPGGGPGALS
LAATSGAQSL KTRSSPAPNS PKSTGSTAPL SLVTGSSTGS SSSSASSTGR ESVERRVGHE
SGNGPSSAKS PPPSQSPAFD ESPHSKFLKY TELAKQLSSK YV
//