ID D7LGI5_ARALL Unreviewed; 461 AA.
AC D7LGI5;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=WW domain-containing protein {ECO:0000313|EMBL:EFH56162.1};
GN ORFNames=ARALYDRAFT_483169 {ECO:0000313|EMBL:EFH56162.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348716; EFH56162.1; -; Genomic_DNA.
DR RefSeq; XP_002879903.1; XM_002879857.1.
DR AlphaFoldDB; D7LGI5; -.
DR STRING; 81972.D7LGI5; -.
DR EnsemblPlants; fgenesh2_kg.4__2229__AT2G41020.1; fgenesh2_kg.4__2229__AT2G41020.1; fgenesh2_kg.4__2229__AT2G41020.1.
DR Gramene; fgenesh2_kg.4__2229__AT2G41020.1; fgenesh2_kg.4__2229__AT2G41020.1; fgenesh2_kg.4__2229__AT2G41020.1.
DR eggNOG; KOG0152; Eukaryota.
DR eggNOG; KOG3427; Eukaryota.
DR HOGENOM; CLU_032600_0_0_1; -.
DR OrthoDB; 5403339at2759; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0045087; P:innate immune response; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR CDD; cd00201; WW; 2.
DR Gene3D; 2.20.70.10; -; 2.
DR Gene3D; 3.40.30.10; Glutaredoxin; 1.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR21737; POLYGLUTAMINE BINDING PROTEIN 1/MARVEL MEMBRANE-ASSOCIATING DOMAIN CONTAINING 3; 1.
DR PANTHER; PTHR21737:SF3; POLYGLUTAMINE-BINDING PROTEIN 1; 1.
DR Pfam; PF00397; WW; 2.
DR SMART; SM00456; WW; 2.
DR SUPFAM; SSF51045; WW domain; 2.
DR PROSITE; PS01159; WW_DOMAIN_1; 1.
DR PROSITE; PS50020; WW_DOMAIN_2; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008694}.
FT DOMAIN 190..224
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT DOMAIN 235..269
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 266..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 338..389
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 404..461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..292
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 338..359
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 442..461
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 461 AA; 49821 MW; 0F0784E958FD6EF0 CRC64;
MGEELQHQQN NQTSNSGYGS SLAYDQSQDI ESAATNALLR EQEIETQKII QGQREAGTSV
AGDAEHNTDI LRDRSDPNAL KEHLLKFTAH HRAEAAAKRG GSVSTCGEGN VDVGNGYGIP
GGVAYAGHSE LTGKPEPTDA SNNLPEYLKQ KLRARGILRD GTGAVTSNTQ DTSAVSWNRQ
TTSPFTANAS TLPLGWVDAK DPASGATYYY NQHTRTCQWE RPVELSYTTS SAPPVPPKEE
WIETLDEASG HKYFYNTRTH VSQWEPPASL QKPAPTNSNN AVTQSTANGK GEHPPSQMPR
CSGCGGWGVG LVQRWGYCVH CTRVFNLPEQ QFLPANLNHF TNAGDSGQKD PNQRSSSKPP
MKKVIGKKRA HADDDELDPM DPSSYSDAPR GGWVVGLKGV QPRAADTTAT GPLFQQRPYP
SPGAVLRRNA EVASSQKKKP NSHFTEITKR GDGSDGLGDA D
//