Database: UniProt
Entry: A0A1P8AUJ2_ARATH
ID   A0A1P8AUJ2_ARATH        Unreviewed;      2049 AA.
AC   A0A1P8AUJ2;
DT   12-APR-2017, integrated into UniProtKB/TrEMBL.
DT   12-APR-2017, sequence version 1.
DT   28-JAN-2026, entry version 44.
DE   SubName: Full=Zinc finger C-x8-C-x5-C-x3-H type family protein {ECO:0000313|EMBL:ANM60305.1};
GN   Name=SOP1 {ECO:0000313|TAIR:AT1G21580};
GN   OrderedLocusNames=At1g21580 {ECO:0000313|Araport:AT1G21580,
GN   ECO:0000313|EMBL:ANM60305.1};
GN   ORFNames=F24J8.17 {ECO:0000313|EMBL:ANM60305.1}, F24J8_17
GN   {ECO:0000313|EMBL:ANM60305.1};
OS   Arabidopsis thaliana (Mouse-ear cress).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX   NCBI_TaxID=3702 {ECO:0000313|EMBL:ANM60305.1, ECO:0000313|Proteomes:UP000006548};
RN   [1] {ECO:0000313|EMBL:ANM60305.1, ECO:0000313|Proteomes:UP000006548}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX   PubMed=11130712; DOI=10.1038/35048500;
RA   Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA   Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA   Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA   Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA   Feldblyum T.V., Feng J., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA   Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA   Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA   Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA   Lee J.M., Lenz C.A., Li J.H., Li Y., Lin X., Liu S.X., Liu Z.A.,
RA   Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA   Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA   Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA   Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA   Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA   Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT   "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL   Nature 408:816-820(2000).
RN   [2] {ECO:0007829|PubMed:19376835}
RP   IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX   PubMed=19376835; DOI=10.1104/pp.109.138677;
RA   Reiland S., Messerli G., Baerenfaller K., Gerrits B., Endler A.,
RA   Grossmann J., Gruissem W., Baginsky S.;
RT   "Large-scale Arabidopsis phosphoproteome profiling reveals novel
RT   chloroplast kinase substrates and phosphorylation networks.";
RL   Plant Physiol. 150:889-903(2009).
RN   [3] {ECO:0000313|EMBL:ANM60305.1}
RP   NUCLEOTIDE SEQUENCE.
RG   TAIR;
RA   Swarbreck D., Lamesch P., Wilks C., Huala E.;
RL   Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|EMBL:ANM60305.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Krishnakumar V., Cheng C.-Y., Chan A.P., Schobel S., Kim M., Ferlanti E.S.,
RA   Belyaeva I., Rosen B.D., Micklem G., Miller J.R., Vaughn M., Town C.D.;
RL   Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
RN   [5] {ECO:0000313|Proteomes:UP000006548}
RP   GENOME REANNOTATION.
RC   STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX   PubMed=27862469; DOI=10.1111/tpj.13415;
RA   Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA   Town C.D.;
RT   "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT   genome.";
RL   Plant J. 89:789-804(2017).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CP002684; ANM60304.1; -; Genomic_DNA.
DR   EMBL; CP002684; ANM60305.1; -; Genomic_DNA.
DR   RefSeq; NP_001322601.1; NM_001332517.1.
DR   RefSeq; NP_001322602.1; NM_001332518.1.
DR   ProteomicsDB; 175682; -.
DR   GeneID; 838759; -.
DR   Araport; AT1G21580; -.
DR   TAIR; AT1G21580; SOP1.
DR   Proteomes; UP000006548; Chromosome 1.
DR   ExpressionAtlas; A0A1P8AUJ2; baseline and differential.
DR   GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-KW.
DR   Gene3D; 4.10.1000.10; Zinc finger, CCCH-type; 1.
DR   InterPro; IPR000571; Znf_CCCH.
DR   PANTHER; PTHR46156; CCCH ZINGC FINGER; 1.
DR   PANTHER; PTHR46156:SF1; ZINC FINGER CCCH DOMAIN-CONTAINING PROTEIN 3; 1.
DR   SMART; SM00356; ZnF_C3H1; 2.
DR   PROSITE; PS50103; ZF_C3H1; 1.
PE   1: Evidence at protein level;
KW   Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00723};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0A1P8AUJ2,
KW   ECO:0007829|ProteomicsDB:A0A1P8AUJ2};
KW   Reference proteome {ECO:0000313|Proteomes:UP000006548};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00723};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00723}.
FT   DOMAIN          1936..1965
FT                   /note="C3H1-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50103"
FT   ZN_FING         1936..1965
FT                   /note="C3H1-type"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT   REGION          1..113
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          151..194
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          258..284
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          299..376
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          419..483
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          688..724
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1063..1090
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1183..1207
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1319..1340
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1361..1429
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1566..1586
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1818..1840
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        21..42
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        52..61
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        96..113
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        170..183
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        267..284
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        325..340
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        358..376
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        421..436
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        695..704
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1183..1204
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1361..1370
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1389..1398
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1827..1838
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2049 AA;  225804 MW;  28F767712BC3A7D4 CRC64;
     MDSSHYNPTY DPWNSPYSPH LHPPSAPLPP PPPLPPPPPP RQSHPESPNL YGRSTQSNGQ
     RQDYLHQYSH HRQDLPPNTV VNQPTSNYYQ HPPPLQQQHQ PLSLQQQQQH PQYIPQQVSY
     EPQRISQPLT SSIQCTESQS DWVGFNEKRL DSWTVDSGPG RSRVDDGPSR NYQYDYSRNS
     SGVNRGLDGS SRSRDEFRSL GYARKESGVT RTEGNYQARG QLKSESDRYV RGLGDGNRVL
     SSSVGYGSDR YGLTVSRDVT RSSASREGAR ETRNEGRTLY PEKKDDYYHS EIEQYFDRGR
     REASNELNRT PRKQVQKKSA LLRLETPRSY KNSRENEWSR QHNHHNGNGK RFNSNSYRGK
     EHLGHSDRGL VEKQRGRSPV DLDISFKSNV LVAKPVASPT SAGIRSGASV TPRSIKARRA
     LLSDKNEKVS VTERNGKLGT HLSDEISVSE GFRRSTRQTT ASKNEKEPDS HSTPSSSDSG
     GKLNKVRFVN GVVQDSKVKL TDSGPEASTH DTEKISSFCE TLIEAKDDIN VKHGINTEAC
     STEEGVIDGN QSTLKSHEDV LDRTSTDCNA GEALLPKVME MDEILKTKTT INTSPGKLPV
     SWPTVADLSG CSEDMDCDED MDCDEDMDCI PSRNIPMMEV NTGFEERKSI NSSDGSLGYG
     GKDFQKPYLD ASIYFNREDP GDKVLAKSDI GGIEDDNKRI DKNVDSLSPE NDSSRGRPMG
     LDSPASLDIA NVSLDLANSN NSASGDLANA NSFTVGTYMN PMVTSPDKSV VFQMESKNLP
     HCKNTVNAPV ENVSGKGYME TTPLNVAAET ADNMDSEEGK QTCVNDTSSS LTKVGVKGSS
     NVLSVERTGG CSHSDESDLA MAVPSEGCME NVSTERLVPD EELILKSYHP AEIPCVDSGS
     DSRGLKTCLL EPNVSLSKDL TDCARESLVE QDVSQRSAIF CDKLPSLSAF VTETTLAIGI
     NGMSGNETVT DTESGLHEIQ PCTTVCKLSP EDRFGYGSSG AIGSVRSLSI DKNLEKDSSK
     VSSCLVSDNS VSPCHISPLV AVNEEIQNKI SVKANYSNSQ DGIKHKEDNC TESVEVETHE
     EKAKLPGGTS KYRTPVTNIT AGSGGDSLFL CDSLSSSRRR IRSEVHVSAV VDETSKGEEK
     SKPSGGIVAV RRDSVFPCDS LSSSPRLSRP LRSEIHVASM VDETSKSIEK IESSGGTSEH
     RTPETDIVAG SRDSVFPCNS LSNSQRLSFR QLRSEIHVAN MVDETNRVKE SQNGDSLLDT
     LQEQIMTSHE LTQPGSSAHC DLVMKPMGDP IAKLTDITSD VGSQEKDLRN IAKTDTFDGE
     AVSSDGQVSG TEIPGGSGVR VSRSYSHADV KFALTHVKEH VVSVPHRDPQ SKTSMNSKYE
     IEKRKKKPNY STQKSYPSSL PYVSDTKKDA NPPIHITKRH TWHRKSDASP SSFVAAKPLS
     STLSTQQKFP KVTAQSNNSY VRKGNSLLRK PSHGSPGAAL GIPPSAIQLN HFTVEDKSTG
     SSNMVDVDNA SSLVKTGEIA TLERQSKPPS DSSTSKLSNA IATSSGKCAL SYSTDHLTTG
     LPESIMDSAT SGEANFPHSG GDTLKTSDTL IQTGYASDCQ QKRNPSDLDS SNLKRMVYVK
     RKANQLVAAS DIHDVSQNQI PSSDGYFKRS KNQLVRNSES RCNQSISLPD DALDTRSAAN
     MVSERPSSSA FSDSAVMRPF KQSKFSLVWT QNDPQPRMPI AHMRNQNIVP QLVPWKRVTY
     WRRLMNSVSA FRNGSSLNIS RKLSMMRKRH TIYTRSTNGY SLRKSKVLSV GGSHLKWSKS
     IERDSRKANE EATLAVAAYS KKESEKQSGQ NNTSTASRNH LARERVFRFG SLRYKMDSSR
     RTLQRISDVD SPCSGPSENG KGVKRPFIPK RLVIGNEEYV RFGNGNQLVR DPKKRTRVLA
     NEKVRWSLHN ARLRLAKKKK YCQFFTRFGK CNKDDGKCPY VHDPSKIAVC TKFLNGLCAN
     ANCKLTHKVI PERMPDCSYY LQGKNFHCTK IPLRLLTILI PIMQAYATMR HVHIGMCMSI
     RSPLYVMGF
//
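
The record above follows the UniProtKB flat-file layout: each line opens with a two-letter line code (ID, FT, SQ, and so on), and the sequence block runs from the SQ line to the terminating "//". As a rough illustration of how the feature table and sequence could be read programmatically, here is a minimal Python sketch. The function name parse_uniprot_flatfile and the returned dictionary layout are hypothetical, chosen only for this example. The sketch assumes range features written as start..end and single-line /note qualifiers, as in this entry. It is not a complete parser for the format.

```python
import re


def parse_uniprot_flatfile(text):
    """Extract range features, the sequence, and the declared length
    from one UniProtKB flat-file record (hypothetical helper)."""
    features = []
    seq_parts = []
    declared_len = None
    in_seq = False
    for line in text.splitlines():
        if line.startswith("//"):
            break  # end-of-record marker
        if in_seq:
            # Sequence lines are blocks of 10 residues separated by spaces.
            seq_parts.append("".join(line.split()))
            continue
        code, body = line[:2], line[5:]
        if code == "SQ":
            # e.g. "SEQUENCE   2049 AA;  225804 MW;  ... CRC64;"
            m = re.search(r"(\d+) AA", body)
            declared_len = int(m.group(1)) if m else None
            in_seq = True
        elif code == "FT":
            # Feature header line, e.g. "DOMAIN          1936..1965".
            # Single-position features are not handled in this sketch.
            m = re.match(r"(\S+)\s+(\d+)\.\.(\d+)\s*$", body)
            if m:
                features.append({
                    "key": m.group(1),
                    "start": int(m.group(2)),
                    "end": int(m.group(3)),
                    "note": None,
                })
                continue
            # Qualifier line; only /note is kept, /evidence is skipped.
            q = re.match(r'\s*/note="(.*)"\s*$', body)
            if q and features:
                features[-1]["note"] = q.group(1)
    return features, "".join(seq_parts), declared_len
```

Comparing len(sequence) with the declared length from the SQ line gives a quick sanity check that the record was copied without truncation.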