ID A0A1P8AUH0_ARATH Unreviewed; 2043 AA.
AC A0A1P8AUH0;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 28-JAN-2026, entry version 44.
DE SubName: Full=Zinc finger C-x8-C-x5-C-x3-H type family protein {ECO:0000313|EMBL:ANM60302.1};
GN Name=SOP1 {ECO:0000313|TAIR:AT1G21580};
GN OrderedLocusNames=At1g21580 {ECO:0000313|Araport:AT1G21580,
GN ECO:0000313|EMBL:ANM60302.1};
GN ORFNames=F24J8.17 {ECO:0000313|EMBL:ANM60302.1}, F24J8_17
GN {ECO:0000313|EMBL:ANM60302.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000313|EMBL:ANM60302.1, ECO:0000313|Proteomes:UP000006548};
RN [1] {ECO:0000313|EMBL:ANM60302.1, ECO:0000313|Proteomes:UP000006548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA Feldblyum T.V., Feng J., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA Lee J.M., Lenz C.A., Li J.H., Li Y., Lin X., Liu S.X., Liu Z.A.,
RA Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL Nature 408:816-820(2000).
RN [2] {ECO:0007829|PubMed:19376835}
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=19376835; DOI=10.1104/pp.109.138677;
RA Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A.,
RA Grossmann J., Gruissem W., Baginsky S.;
RT "Large-scale Arabidopsis phosphoproteome profiling reveals novel
RT chloroplast kinase substrates and phosphorylation networks.";
RL Plant Physiol. 150:889-903(2009).
RN [3] {ECO:0000313|EMBL:ANM60302.1}
RP NUCLEOTIDE SEQUENCE.
RG TAIR;
RA Swarbreck D., Lamesch P., Wilks C., Huala E.;
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
RN [4] {ECO:0000313|EMBL:ANM60302.1}
RP NUCLEOTIDE SEQUENCE.
RA Krishnakumar V., Cheng C.-Y., Chan A.P., Schobel S., Kim M., Ferlanti E.S.,
RA Belyaeva I., Rosen B.D., Micklem G., Miller J.R., Vaughn M., Town C.D.;
RL Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
RN [5] {ECO:0000313|Proteomes:UP000006548}
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP002684; ANM60299.1; -; Genomic_DNA.
DR EMBL; CP002684; ANM60302.1; -; Genomic_DNA.
DR RefSeq; NP_001322596.1; NM_001332512.1.
DR RefSeq; NP_001322599.1; NM_001332515.1.
DR ProteomicsDB; 183161; -.
DR GeneID; 838759; -.
DR Araport; AT1G21580; -.
DR TAIR; AT1G21580; SOP1.
DR Proteomes; UP000006548; Chromosome 1.
DR ExpressionAtlas; A0A1P8AUH0; baseline and differential.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-KW.
DR Gene3D; 4.10.1000.10; Zinc finger, CCCH-type; 1.
DR InterPro; IPR000571; Znf_CCCH.
DR PANTHER; PTHR46156; CCCH ZINGC FINGER; 1.
DR PANTHER; PTHR46156:SF1; ZINC FINGER CCCH DOMAIN-CONTAINING PROTEIN 3; 1.
DR SMART; SM00356; ZnF_C3H1; 2.
DR PROSITE; PS50103; ZF_C3H1; 1.
PE 1: Evidence at protein level;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00723};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A1P8AUH0,
KW ECO:0007829|ProteomicsDB:A0A1P8AUH0};
KW Reference proteome {ECO:0000313|Proteomes:UP000006548};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00723};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00723}.
FT DOMAIN 1936..1965
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT ZN_FING 1936..1965
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT REGION 1..113
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 151..215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 258..284
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 299..376
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 419..483
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 688..724
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1063..1090
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1183..1207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1319..1340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1361..1429
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1566..1586
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1818..1840
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 21..42
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 52..61
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..113
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..183
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..208
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 267..284
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..340
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 358..376
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 421..436
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 695..704
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1183..1204
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1361..1370
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1389..1398
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1827..1838
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2043 AA; 225157 MW; 5B79CBFF075EF4A4 CRC64;
MDSSHYNPTY DPWNSPYSPH LHPPSAPLPP PPPLPPPPPP RQSHPESPNL YGRSTQSNGQ
RQDYLHQYSH HRQDLPPNTV VNQPTSNYYQ HPPPLQQQHQ PLSLQQQQQH PQYIPQQVSY
EPQRISQPLT SSIQCTESQS DWVGFNEKRL DSWTVDSGPG RSRVDDGPSR NYQYDYSRNS
SGVNRGLDGS SRSRDEFRSL GYARKESGVT RTEGNYQARG QLKSESDRYV RGLGDGNRVL
SSSVGYGSDR YGLTVSRDVT RSSASREGAR ETRNEGRTLY PEKKDDYYHS EIEQYFDRGR
REASNELNRT PRKQVQKKSA LLRLETPRSY KNSRENEWSR QHNHHNGNGK RFNSNSYRGK
EHLGHSDRGL VEKQRGRSPV DLDISFKSNV LVAKPVASPT SAGIRSGASV TPRSIKARRA
LLSDKNEKVS VTERNGKLGT HLSDEISVSE GFRRSTRQTT ASKNEKEPDS HSTPSSSDSG
GKLNKVRFVN GVVQDSKVKL TDSGPEASTH DTEKISSFCE TLIEAKDDIN VKHGINTEAC
STEEGVIDGN QSTLKSHEDV LDRTSTDCNA GEALLPKVME MDEILKTKTT INTSPGKLPV
SWPTVADLSG CSEDMDCDED MDCDEDMDCI PSRNIPMMEV NTGFEERKSI NSSDGSLGYG
GKDFQKPYLD ASIYFNREDP GDKVLAKSDI GGIEDDNKRI DKNVDSLSPE NDSSRGRPMG
LDSPASLDIA NVSLDLANSN NSASGDLANA NSFTVGTYMN PMVTSPDKSV VFQMESKNLP
HCKNTVNAPV ENVSGKGYME TTPLNVAAET ADNMDSEEGK QTCVNDTSSS LTKVGVKGSS
NVLSVERTGG CSHSDESDLA MAVPSEGCME NVSTERLVPD EELILKSYHP AEIPCVDSGS
DSRGLKTCLL EPNVSLSKDL TDCARESLVE QDVSQRSAIF CDKLPSLSAF VTETTLAIGI
NGMSGNETVT DTESGLHEIQ PCTTVCKLSP EDRFGYGSSG AIGSVRSLSI DKNLEKDSSK
VSSCLVSDNS VSPCHISPLV AVNEEIQNKI SVKANYSNSQ DGIKHKEDNC TESVEVETHE
EKAKLPGGTS KYRTPVTNIT AGSGGDSLFL CDSLSSSRRR IRSEVHVSAV VDETSKGEEK
SKPSGGIVAV RRDSVFPCDS LSSSPRLSRP LRSEIHVASM VDETSKSIEK IESSGGTSEH
RTPETDIVAG SRDSVFPCNS LSNSQRLSFR QLRSEIHVAN MVDETNRVKE SQNGDSLLDT
LQEQIMTSHE LTQPGSSAHC DLVMKPMGDP IAKLTDITSD VGSQEKDLRN IAKTDTFDGE
AVSSDGQVSG TEIPGGSGVR VSRSYSHADV KFALTHVKEH VVSVPHRDPQ SKTSMNSKYE
IEKRKKKPNY STQKSYPSSL PYVSDTKKDA NPPIHITKRH TWHRKSDASP SSFVAAKPLS
STLSTQQKFP KVTAQSNNSY VRKGNSLLRK PSHGSPGAAL GIPPSAIQLN HFTVEDKSTG
SSNMVDVDNA SSLVKTGEIA TLERQSKPPS DSSTSKLSNA IATSSGKCAL SYSTDHLTTG
LPESIMDSAT SGEANFPHSG GDTLKTSDTL IQTGYASDCQ QKRNPSDLDS SNLKRMVYVK
RKANQLVAAS DIHDVSQNQI PSSDGYFKRS KNQLVRNSES RCNQSISLPD DALDTRSAAN
MVSERPSSSA FSDSAVMRPF KQSKFSLVWT QNDPQPRMPI AHMRNQNIVP QLVPWKRVTY
WRRLMNSVSA FRNGSSLNIS RKLSMMRKRH TIYTRSTNGY SLRKSKVLSV GGSHLKWSKS
IERDSRKANE EATLAVAAYS KKESEKQSGQ NNTSTASRNH LARERVFRFG SLRYKMDSSR
RTLQRISDVD SPCSGPSENG KGVKRPFIPK RLVIGNEEYV RFGNGNQLVR DPKKRTRVLA
NEKVRWSLHN ARLRLAKKKK YCQFFTRFGK CNKDDGKCPY VHDPSKIAVC TKFLNGLCAN
ANCKLTHKVF LLSHCCIIVL TEISNWITYF FHRSFQKGCL IVLIICKVRI STVLRSHCGY
SQY
//