ID A0A0M0JEB5_9EUKA Unreviewed; 3983 AA.
AC A0A0M0JEB5;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=Fibronectin type-III domain-containing protein {ECO:0000259|PROSITE:PS50853};
GN ORFNames=Ctob_003055 {ECO:0000313|EMBL:KOO24563.1};
OS Chrysochromulina tobinii.
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Prymnesiales;
OC Chrysochromulinaceae; Chrysochromulina.
OX NCBI_TaxID=1460289 {ECO:0000313|EMBL:KOO24563.1, ECO:0000313|Proteomes:UP000037460};
RN [1] {ECO:0000313|Proteomes:UP000037460}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP291 {ECO:0000313|Proteomes:UP000037460};
RX PubMed=26397803; DOI=10.1371/journal.pgen.1005469;
RA Hovde B.T., Deodato C.R., Hunsperger H.M., Ryken S.A., Yost W., Jha R.K.,
RA Patterson J., Monnat R.J. Jr., Barlow S.B., Starkenburg S.R.,
RA Cattolico R.A.;
RT "Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin:
RT Metabolic Tools for Enhanced Algal Fitness in the Prominent Order
RT Prymnesiales (Haptophyceae).";
RL PLoS Genet. 11:e1005469-e1005469(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KOO24563.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JWZX01003080; KOO24563.1; -; Genomic_DNA.
DR OrthoDB; 1387371at2759; -.
DR Proteomes; UP000037460; Unassembled WGS sequence.
DR GO; GO:0017154; F:semaphorin receptor activity; IEA:InterPro.
DR CDD; cd00063; FN3; 1.
DR CDD; cd00102; IPT; 1.
DR CDD; cd00603; IPT_PCSR; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR002909; IPT_dom.
DR InterPro; IPR031148; Plexin.
DR PANTHER; PTHR22625; PLEXIN; 1.
DR PANTHER; PTHR22625:SF44; PLEXIN-B; 1.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF01833; TIG; 3.
DR SMART; SM00060; FN3; 1.
DR SMART; SM00429; IPT; 4.
DR SUPFAM; SSF81296; E set domains; 5.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000037460};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..3983
FT /note="Fibronectin type-III domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005601821"
FT DOMAIN 41..142
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 742..773
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 752..770
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3983 AA; 419568 MW; 7BDBCA5C2468BD9F CRC64;
MPASTFFKVR AAAYMALLAC LLVERVFAQD VAPKGTDVPD QPAPPVFVAS NSTSLSVSWA
MPEVDGGSPV LWYTLYGGDA AESLRPLYDP FRAGAAPEPN VRNFTIRRLR PSTQYRFMIA
ARNAVGQSKF SDTATFSTTA PVPTVSHVVP AAGPLRGGTL VRVHGADLVF GGTSVYACRF
GQTVVPATLT EAAWPEHVPT RPIRLPRDEG DGRAVVECIS PRARQARAID LGPGSTTGQP
LRLSTQLVEL QVALDGRQFG AVIGTTFVYY EPPSWAPAIV SPASGPLAGG TVVSIHGLFP
RADNGFFGPS SASLLSLPAS FDFPEAACNF SGAIVPAMLM PLRQSYSNAH PRPRALDAPT
PAPAVGMPSH IIAENNTGVP TDALERTLVC TSPYVGLSSY QRTVAQSFSF GEEEEAIAAF
IEPETTVRVN AESRSRVDWA AHGAVEHGLQ GAVEPGLGAT VASSMAHVDD RRVEPPAGLR
PIPVVRVSAS SEHPLGGRAA ENLINGSGMR TAPDGQRLSA LCNGTYDLAC HQQCWASDPA
APLDEQWIEF TFDRPRYVDG MHLFGYNAPH GRAFLRSLAA AEVQVPSSTR RGEWVPIGRL
SGLPLPSLRD GDAGVNVRRA SMPSYFLPGA YRRGAALLTA SWLGFEAAAV RLARMVHHGV
DQYGASVGAC EVLLLEEDPD VSIQLVGASA VRARMLQLTP GSPYASAAAE ADMYLPSTFG
TETMDASAQD MRPRLRAHVN KAHGGATEPT LQEGHATPTT TPTTQSPLPC GPAIDQEPCA
AAVDPMLALP TQPPPPAKAV IVQAQRGINA LTLEARQLEH PPGTVSSAAA SPVAQYVGGH
GTLSGLEAPA EEEEGEGAML DWEAEPRIRD SDCVEDEGGE DGHPQLLRLT ELGWLPDASD
VMPAGRAGSM RTRCAWTYED GRAGGVTRFP TGIRTRERDE SASATVLLGG AFIPLASGGV
GLTYLHVRLD LYVGVEAALE GGGLWLCYSD LVPEDLGRVV QKVEGVHSVG EAGAGLCASL
WLPDPAVDAY APASAILVHA GRVVARENWR STRELISPGT WLPLELIISS EGCSLRHARR
TLLDRVPLEG WEPADHWRLA LVGQSRAVPG APFWVDNLRV TAGPLVESAR ASFEVGLNGQ
DFSAAGFIFQ YHGVPRVLGI SPTDGPLHGG TTVRIRGKHL AHGAGYTCRF GAQNVSASFE
AASETVLCTS PPALASLGLN GSRVAFAIAL NGQNYVLGPD FGFHAQLRLS HVHPSTGPID
GGIRINISGA GLVGGSAYMC CLVAESEPCV VREDGRSPFT PATFDEAFGH VTCRAPAVYN
DAGAYQVRVA LNGQQYSSQP LETLGFWALT NIVQFELGDQ GMVPSAGPYL GGSILIVRLA
EPILALPSLT QIDELAFATE GSAFANRDAI AFAPACRFGT AIVPAVRRSD SEVTCTAPTA
LAAGASVVLA PRTPEQLNTT LSLHGGASLL AGDLDDGGAG GGGDDEGAVR LTGRYPHESG
SIVFRRPALA WLPEPAAPAL HDFELSFHLS FGSLESHDGV SVSYGPNLGF ERHFGGDGGL
RVAFLARERQ ILAWSNERAV GLARMPDSAN LGNARIPVVV RVERDVSTDY LTRGESIDSA
VLTVTFANVV VLRSQLPGWT ALVTSEWRVG LGGGTAHESL FGDKTDTYHG VRALTLRMGA
AITRAAVYFG ISYNGQDFYP SVEALQAQLA MQALPGAEAL EEAAIVRVAD FLFGRPLITV
IKPRSGVTGS TVELRGALLP FGRNLEASGG YRCRYQQLTR TEYMSDSWAA RFDPLAQYDD
EGWMLLPGGA QQEGDADFAG SEDDEGAEER AYDALRCDVL PMPHGRTYLE ISLHDTMYSA
DRTQFITLPA RHEGTALRAV PATAPAYFST PPPPPPLVPY TGPVVPPTLA AFMPSRNVTT
TLVTGPHLDG GFAYACRPLP RYPAAAGGSS GGLPATYVPA LRAVRCALAQ PSANVTLPLA
LTLHVSLDGE QYNAGGAPFT FYPAPIISRV LPAGGPVRGG AAVSVIGQHL GHSNHSAMEC
RIGASRSRAV PLSEYARGFV NASDEPHSAM SCPAPEARFA GLLFGSVDGW PLPNLTGVWR
DSYGHDVHVC TRDGGFAATV EPPWSASLGQ RVAAVGLWDE QHLAYVGMWQ QWRESLADAL
TFGVADALPV ADRVRLHAGE MGAVNWPTPQ LLAAHRGCAE QLSFRASVYF GGGVGPEGGG
DGLSLVLGHV TTASTFAFTD THAVANVRGS CGDVPLVDLP FTVTNEGFSP MQPDTRSSGR
WTDGQWVDPF DPAEYGPPPT LEHRPFRYYS ERSLVASSPA LGHVSGGALI RINGTGLDVG
VGFSCAFGIR SDEVPDMAPG WTWAADGPLH PRYSTATWLA AEQVVACRAP QVPYASSVQL
RVSLNAQQFS VAALNYTFFA PVITEFTPSS GPVAGGTTLV VRGTQLDVEV GWWSTPPPLA
RCVFNQTMQT PATFSRSANT LSCSSARSPS STSARVPFGI SLNGQELTTD PFFFFFIGAG
MATAPEPPPP PPFSDQEDVE EAFRYPTPGG RPGGDGYDAT GLVARFVNRF SGNVSGGALV
RIWIDPQPPE ATSSPTCRFG EAVVPATVQS STTLVCITPN TTMAGPVTLG VSLNAQDYSQ
GGGQFCFVNV TGLLAEPTEG PVSGGTRILS RGHGLVPDGC HSDTAREPKR CRWFNPALNQ
RRTVAAAVDL GRGALVCYSP PNPMGWGAGD PLVLSVSLNA YDYLPATMYR YEPVPPLPAL
SPGNGPITGG TMLALFGASM RDVDGLACFF GGLLRLALPN CNAPLQTFAL QVEVLFRSMA
QTAANLVDEG LFEGGLAVEY GPDPDAIDHL DGHSWGDVCE DGTRRNQPWA AQGSASTAEE
ATRGLNISLL ASARALRPSV RHNETELLAL ELVEALSADF GPPMDVWMGL HVTIAADPVT
HEAYLNLSLA GTPLSPPLAL NGWGAQSGWT LRLRAIGGRG VGVSVYRVRL LSGSTAALTP
IGLKVVVGPL FVSPAAPFYF LAPPLVPWAG VAPTAGPVEG GTVVRVRVAN YSSTVGTLPD
VSLALRCRFG NGLASADEFT ALAVHWSEAS GDGVSIRVGL ECTAPSHLSA GPEMVRLRIS
TNAQQFFDAA NFSFYARHAW GYAAPRSGPT GGGALLTVRG WPMDSGMRSG GAYACRVGGV
AVAGTYRLQP ERIQCRTPAL GVTGFTAVPV ELSLNAQQFV PLSRTNGSFF VFADPVDAVV
ALPASGPANG GTLVEVHVSG GGLGANLTET LDYRCAFGHV VRGLLDEMLV EGAAVVPATR
VDDGRLRCIS PTAEVAGASD FLEPLRAAAG RSGRRHVNGA AITPAAAAPT IDAERPWRGS
WPLASSGGHE GGENLVENLA RLTGPNGECG SVLVAPSRSA ASHARTEWIE HTMFVRSMMP
TPSWSGGGRV ASLGGFSWSY GPLRPGPHAT VGAAGKGLGL IARLAPQPDG RVLWEVLRAG
ELLYSEPLPV SERDALIREE WRRLWLRLDE RGMHLQFGNS TLVAAADSGT STSPHGLALH
PKHLRAAPHW TFAWGACSVA PEQPEPGAWL LANVTMRAGA RLRYSDEQLR VSLNGQQFLA
WTNLTFRYYA APRPLEITPS SAPRAGGTLL TIRGEGLAEA SAFECEFADG GTADERLRTA
AELLATSTIN NNNSTDGAGG AGTLRCPTPT VLPRLGEWSL GIRVDRAGQN YGSHEDVAAV
PAPLWVYPSL SADATYAPAS GPGAGGTLVR IELPRYNTSA IAFANNASDA GPLLETVAGL
WPRHGAKCRF GAAYHDGAYG PTIVDASHDP AGAPGEAMLC ASPALPGGEA PLQLALNGHD
FEPGPALPFS VYGRPRLVDS VPAGATPGTS IILVGENLGG DGLPGVNHTC RFGTRVVLGQ
LLVEPRVVGN QWRTNTVRCV VPEMPLASST AAGQYVRSVP LRISLNGQQY SPEPVRFSFP
WLPARPVAND ALPTAGHWVS GDH
//