ID C1MMH2_MICPC Unreviewed; 1180 AA.
AC C1MMH2;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 47.
DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEH59052.1};
GN ORFNames=MICPUCDRAFT_56528 {ECO:0000313|EMBL:EEH59052.1};
OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876};
RN [1] {ECO:0000313|EMBL:EEH59052.1, ECO:0000313|Proteomes:UP000001876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH59052.1,
RC ECO:0000313|Proteomes:UP000001876};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC -!- SIMILARITY: Belongs to the sel-1 family.
CC {ECO:0000256|ARBA:ARBA00038101}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG663737; EEH59052.1; -; Genomic_DNA.
DR RefSeq; XP_003057407.1; XM_003057361.1.
DR AlphaFoldDB; C1MMH2; -.
DR STRING; 564608.C1MMH2; -.
DR GeneID; 9682846; -.
DR KEGG; mpp:MICPUCDRAFT_56528; -.
DR eggNOG; KOG1550; Eukaryota.
DR OrthoDB; 66487at2759; -.
DR Proteomes; UP000001876; Unassembled WGS sequence.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR006597; Sel1-like.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR11102:SF147; FIBRONECTIN TYPE-II DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR11102; SEL-1-LIKE PROTEIN; 1.
DR Pfam; PF13385; Laminin_G_3; 1.
DR Pfam; PF08238; Sel1; 5.
DR SMART; SM00671; SEL1; 8.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF81901; HCP-like; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001876};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1180
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002910546"
FT REGION 249..270
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1180 AA; 127406 MW; 666DAA45BD928044 CRC64;
MRASLAAAAV LLAVLASSAV GVGADHPRTV SRTPRFKTFA RPSRAKAWVP EAHVFANKTR
DTDVLDGVLA VDGTYALVVS NDQLPSLAPI SSTFTIAMSL FLHDDLNVAS IDAHRGIFWK
GGASGDRTPS AWLIPHSNRV TYRVSTSSAE EVWGTSAAAL PTRRWVHLAF SVGADRIMRL
YVDGKLDSAV EIVGEVVAND GPLYLGKDAH LTGINAFIAS VQMHAHALED EDIAQAAAHA
LRAAPDFDGE ASAEARAGAD ANREARERHH RDGDAFASLL SSLPMPMHAR QAAAMTSSAR
AEMEVRTSQA RADAIANAEE KEKMARRLFI DAEYSRGGIA LNDAMKSENR NAGGTYRYRR
GPRESGLAKA REHYLRAAAM GHDKARYALA TMHQLGYGGP ISHGKALYHK LRAAARGDPG
SNLAVGSAAF AAASAAASAG GGGGEAACRV AEYYLLHAAN VAYDNSGKPG GQNRVERLRL
YEGVEKERQD HKGPNDQKVQ YLVQTASHGD ALASLAMGNA YYWGNFGLRR NFQAALFYYE
SAHAQGALHG TVGVAKMNLK GEGLAGGVKN VSRAMEMYEQ AAKRDSPDAL NGLGYIYFYG
DADIEKNTTT ALSYFRKAAA LGNADGHMNS GLMLRAGIGE RANLTEAHEH FSVCAKARHT
SCIYQIGLMH SEGSIPGAER DCFAAAQRFR RVAQSGEWME PLSDGLKAHL AGNASLARWT
YDYVAGFGMP VARYNAAWLH TIHAREIRVA LESPARDSGR LSDESDLEAI ASGRRSAGLL
SYYASEMTKD PATTLTDRAY AAMLQADCLY HGEESRPGAR CARRLPDALA AYERAGRIAR
VAARTSPNRN GDDGESGELR DGDARLLLAR ALYSEAWMRA RGEGCAVDRT RAKAALWSAL
AEGRWQSSLA VAPSFAWFVA EDAFRYLVPS RGRGRDGAAA ADPDAIARAS ASVTSGSGLA
SRLARRAAAP WRLTARVVAA VMATSSGLFD RFEPVLASWW ATPPVWLIVG AYALGWYVVY
RCVMAGYGGE YNPLVLHRRR LALAREDRWR ARLFGGTRAR TVEERQQRND LIRNLHALHD
LQEEAARTST PGEMERIRDE RLPDIARIIR GEEDEDDGGF GRTFGTLMME ILGIDAARLE
PANRPATDAE VEVSVTEVVE TLVGELVAEM EGWGDGASAE
//