ID C1MUW2_MICPC Unreviewed; 1638 AA.
AC C1MUW2;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 22-FEB-2023, entry version 36.
DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEH56393.1};
GN ORFNames=MICPUCDRAFT_40297 {ECO:0000313|EMBL:EEH56393.1};
OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876};
RN [1] {ECO:0000313|EMBL:EEH56393.1, ECO:0000313|Proteomes:UP000001876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH56393.1,
RC ECO:0000313|Proteomes:UP000001876};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG663740; EEH56393.1; -; Genomic_DNA.
DR RefSeq; XP_003059261.1; XM_003059215.1.
DR GeneID; 9685010; -.
DR KEGG; mpp:MICPUCDRAFT_40297; -.
DR OMA; IDARVGC; -.
DR OrthoDB; 102982at2759; -.
DR Proteomes; UP000001876; Unassembled WGS sequence.
DR CDD; cd00603; IPT_PCSR; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR SUPFAM; SSF81296; E set domains; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001876};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1638
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002910673"
SQ SEQUENCE 1638 AA; 168414 MW; B93A6A89BA4A54DD CRC64;
MLRLPTSARR LALAVVLFLS TMVQPVEPAT RGHAHGAVLE SVTPDVGTAY AFTDVELYGT
GFSRHLPTLG CRFGENVKKK LAVSSRADAV ECVAPALRPG FVVVGLAHVT GKSYNPGRDD
ISREAGHNVF EYVTPWRISN VNPEEFPDVG GVALFATGAH FRPDLLCNFR GGNSSGVSTT
FVSSALVICE AQSSSSAGKS AAFMLQHDSH PTPDGQQATL RRRGVPTVSR GGPRVVAIGG
SVAISSSAVD ETTPAMLSWQ SRVGCSFGGV WVAATAGATA LDVKCVVPAT NVGELFVTTS
DLNSRTPLPF FDDGVKHAGE PKHLGRRAVT VTESPVVSEF VPSAGTPIIR GTSMNVDVYG
DHLHVDTTSA PLLHLCLTLS RTLDDVRLNC VVAINAGTSA LEAANAFSVG FNAVNVRGGA
SDRANVEVQY MLQSPPQIVT ATTTAASAGD VITAIGEHFV DAITPLWCFA GNSMRAADVV
SSALARCVVP WGHHMPESTL HRAGASSALK FGVISSENLQ STSNLVSITW SPRPMHATGV
VPNVGFSHGG APVSVVLEKG GANAARQRAS VSCRFGTIHP VSASVANAEA ISCLSPALAP
GSVSVGAPVP SLEYVVLDGA ATLTTGSIST APSDAAVAEF FLFSSGIDSR MAIGCAMPSL
QDALVPARLS ADGRTTCQLP AMMKPGFKTL DVAVAYFGSP KSFDGLVEFN PPLDVWSSVP
APNLDTHAAT PSRGAARLHD GMELSITHPP PTVVVQRPMT GGETRPGDLV FIAAPAGGLD
ARAWCVLSVG TDEYKVAATF ISSAIAVCET PTMGHHGVPS FDAVVSVCAS HSRCASGTNA
SSVVVLGTEK TVTEVSPREG GTGGGVKVRL HHRGFSNHRA SCKIGSIGPI AAASASKGDT
TAELECATPA RAPGVVAVAV GMGAGWAGES AFTFTFVDDG VEETTTAAPA LVSRAAANEA
NALSSVFDCP DVAVGEIATV SPSSGASEGG TEMIVTADVS PSRQGSECGL LFAACRVGTI
WPVLGYATPR GVVCVAPARA PGSVVDVSAP KMVTGAGRPF AFVDALSKND TEPALNFSMS
ATEVAKASSS AFDCVDVAEG EVVAVSPSSS ASSGGTEITL TASVAASRPS GSCGILFAAC
RFGTTWPVLG YQSSLGVVCA SPARAPGSVD VSAPKLVTGG GRPLLYRGVD ATTDEKTLAP
KADELVDVHV KITPKNSDSG ALVDATTATA MSRLVSCVFE SPVRSAAPWL VAATAHVISS
VVTRCEIPTT MPVTGVTVVE KSSLRAVDAT AKHFSAYRES PSCEISRLST TFGSTTGGSV
TRVDARCLAG ATLTSIDARV GCRYGTIGPI TASMSDGDDG KIQCVSPGKV AGVVAFALTT
NWRDASFEPA SGVPAATFAY LNDETRTSDA EDETLALRQS QSQNALPLMS NVVPWLVWGG
NQLVHVTGRD MPVGFDAVCL VGSSLVAAVP ISSALTLCDP FPMSTLDRVS NAHVAGSMRE
VQLAVTSRDA HSIARAQNVS QMKLPLMIIS AADVVGIDVF NGWEHGGSHV NVELGGWAPT
GLIDCHFGTV AVHGREGGGA GWQSRAAMGR TGEWWSEATV ATDVECVTPA HSSGRVPIGV
SLAHSTSPTY GKVEYLYL
//