ID D7MFT1_ARALL Unreviewed; 668 AA.
AC D7MFT1;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 27-MAR-2024, entry version 52.
DE RecName: Full=Cell wall hydroxyproline-rich glycoprotein {ECO:0000256|ARBA:ARBA00041871};
GN ORFNames=ARALYDRAFT_491276 {ECO:0000313|EMBL:EFH43401.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC -!- SUBCELLULAR LOCATION: Secreted, cell wall
CC {ECO:0000256|ARBA:ARBA00004191}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348719; EFH43401.1; -; Genomic_DNA.
DR RefSeq; XP_002867142.1; XM_002867096.1.
DR AlphaFoldDB; D7MFT1; -.
DR STRING; 81972.D7MFT1; -.
DR EnsemblPlants; fgenesh2_kg.7__709__AT4G33970.1; fgenesh2_kg.7__709__AT4G33970.1; fgenesh2_kg.7__709__AT4G33970.1.
DR Gramene; fgenesh2_kg.7__709__AT4G33970.1; fgenesh2_kg.7__709__AT4G33970.1; fgenesh2_kg.7__709__AT4G33970.1.
DR eggNOG; ENOG502QRPA; Eukaryota.
DR HOGENOM; CLU_000288_23_3_1; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR GO; GO:0071555; P:cell wall organization; IEA:UniProtKB-KW.
DR GO; GO:0009860; P:pollen tube growth; IEA:EnsemblPlants.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 2.
DR InterPro; IPR001611; Leu-rich_rpt.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR013210; LRR_N_plant-typ.
DR PANTHER; PTHR32093; LEUCINE-RICH REPEAT EXTENSIN-LIKE PROTEIN 3-RELATED; 1.
DR PANTHER; PTHR32093:SF147; POLLEN-SPECIFIC LEUCINE-RICH REPEAT EXTENSIN-LIKE PROTEIN 4; 1.
DR Pfam; PF00560; LRR_1; 1.
DR Pfam; PF08263; LRRNT_2; 1.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SUPFAM; SSF52058; L domain-like; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Cell wall biogenesis/degradation {ECO:0000256|ARBA:ARBA00023316};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Reference proteome {ECO:0000313|Proteomes:UP000008694};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..668
FT /note="Cell wall hydroxyproline-rich glycoprotein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003103151"
FT DOMAIN 75..107
FT /note="Leucine-rich repeat-containing N-terminal plant-
FT type"
FT /evidence="ECO:0000259|Pfam:PF08263"
FT REGION 392..668
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 410..462
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 483..497
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 498..637
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 668 AA; 71735 MW; 856E07067DAC23E7 CRC64;
MAKPPSFGCC FFLLFFSFLS SFFVSFALTD TEAAFIVQRQ LLTLPDNGEL PDDIEYEVDL
KATFANTRLK RAYIALQAWK KAIFSDPFNT TGNWHGPHVC SYTGVVCAPA LDDSDVTVVA
GVDLNGADIA GHLPAELGLM TDVAMFHLNS NRFCGIIPKS FEKLKLMHEF DVSNNRFVGP
FPKVVLSWPN VKYIDLRFND FEGQVPPELF KKELDAIFLN DNRFTSVIPE SLGESPASVV
TFANNKFTGC IPKSIGNMKN LNEIVFMDND LGGCFPSEIG KLSNVTVFDA SKNSFIGRLP
TSFVGLTNVE EFDISGNKLT GLVPDNICNL PNLVNFTYSY NYFNGQGGSC VPGGGRKEIA
LDDTRNCLPA RPDQRSSQEC AVVINRPVDC SKDKCAGGGG GGSSTPSKPS PVHKPTPVPT
TPVPKPTPVP TTPVHKPSPV PTTPVHKPTP VPTTPVPKPT PFQLRPLTNH RQFQLRQFHL
GRDEQAQPSS TSSIPTSGVH SPPPPPPVHS PPPPVFSPPP PPVHSPPPPP PPVYSPPPPP
PVNSPPPPVH SPPPPVHSPP PPPVHSPPPP VHSPPPPAPV HSAPPPVHSP PPPASSPPQT
PLKPSPSPTI FSPPPPQFPP VVYSPPPRPP KINSPPAQAP APSDDEFIIP PFIGHQYASP
PPPMFSGY
//