ID D7MCP5_ARALL Unreviewed; 738 AA.
AC D7MCP5;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=Cell wall hydroxyproline-rich glycoprotein {ECO:0000256|ARBA:ARBA00041871};
GN ORFNames=ARALYDRAFT_492992 {ECO:0000313|EMBL:EFH44240.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC -!- SUBCELLULAR LOCATION: Secreted, cell wall
CC {ECO:0000256|ARBA:ARBA00004191}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348719; EFH44240.1; -; Genomic_DNA.
DR RefSeq; XP_002867981.1; XM_002867935.1.
DR AlphaFoldDB; D7MCP5; -.
DR STRING; 81972.D7MCP5; -.
DR EnsemblPlants; fgenesh2_kg.7__2425__AT4G18670.1; fgenesh2_kg.7__2425__AT4G18670.1; fgenesh2_kg.7__2425__AT4G18670.1.
DR Gramene; fgenesh2_kg.7__2425__AT4G18670.1; fgenesh2_kg.7__2425__AT4G18670.1; fgenesh2_kg.7__2425__AT4G18670.1.
DR eggNOG; ENOG502QQ2D; Eukaryota.
DR HOGENOM; CLU_000288_23_3_1; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR GO; GO:0071555; P:cell wall organization; IEA:UniProtKB-KW.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 2.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR013210; LRR_N_plant-typ.
DR PANTHER; PTHR32093; LEUCINE-RICH REPEAT EXTENSIN-LIKE PROTEIN 3-RELATED; 1.
DR PANTHER; PTHR32093:SF111; LEUCINE-RICH REPEAT EXTENSIN-LIKE PROTEIN 5; 1.
DR Pfam; PF00560; LRR_1; 1.
DR Pfam; PF13855; LRR_8; 1.
DR Pfam; PF08263; LRRNT_2; 1.
DR SUPFAM; SSF52058; L domain-like; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Cell wall biogenesis/degradation {ECO:0000256|ARBA:ARBA00023316};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Reference proteome {ECO:0000313|Proteomes:UP000008694};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}.
FT DOMAIN 85..117
FT /note="Leucine-rich repeat-containing N-terminal plant-
FT type"
FT /evidence="ECO:0000259|Pfam:PF08263"
FT REGION 407..676
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 698..738
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 408..676
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 738 AA; 79588 MW; 4CAF5EAB223AE314 CRC64;
MKTKMMMKKT SQIFVLLFTL FFFFTSLTHS LPLTFNGDLS DNQVRLITQR QLLYFRDELG
DRGENVVVDP SLVFENPRLR NAYIALQAWK QAILSDPNNF TTNWIGSDVC SYTGVYCAPA
LDNRRIRTVA GIDLNHADIA GYLPQELGLL TDLALFHVNS NRFCGTVPHR FNRLKLLFEL
DLSNNRFAGI FPTVVLQLPS LKFLDLRFNE FEGPVPRELF SKDLDAIFIN HNRFRFELPD
NLGDSPVSVI VVANNHFHGC IPTSLGDMKN LEEIIFMNNG FNSCLPPQIG RLKNVTVFDF
SFNELVGSLP ASIGGMVSLE QLNVAHNRFS GKIPASICQL PRLENFTFSY NFFTGEPPVC
LGLPGFDDRR NCLPSRPAQR SPVQCAAFSS LPPVDCGSFG CGRSTRPPVV VPSPPTTPSP
GGSPPSPSTV PSPPTTPSPG VSPPSPSISP SPPITAPSPP STPSNPPIIL PSPPSTPPTP
ISPGQHSPPV IPSPPFTGPS PPSSPSPPSP PIIPSPPGLG PSSPYPGPPS PPVVPRYSPP
SQPPTYSPSP SPPPPYSPST SPSPPPTYSP FPSPPPPPPQ TYYPPQPSPP TPPQTPIYYT
PPPSPPPHSP SSPQFSPPPP VPYYYSSPPP HSPPPPPPTP LHPPPPPSPQ PCIEYSPPPP
PTVHYNPPPP PTPAHYSPPP SPPVYYYNSP PPPPAVHYSP PPPPVIHHSP PPPTPIYEGP
LPPIPGISYA SPPPPPFY
//