ID L1JA84_GUITC Unreviewed; 5723 AA.
AC L1JA84;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE RecName: Full=Cadherin domain-containing protein {ECO:0000259|PROSITE:PS50268};
GN ORFNames=GUITHDRAFT_108698 {ECO:0000313|EMBL:EKX45431.1};
OS Guillardia theta (strain CCMP2712) (Cryptophyte).
OC Eukaryota; Cryptophyceae; Pyrenomonadales; Geminigeraceae; Guillardia.
OX NCBI_TaxID=905079 {ECO:0000313|EMBL:EKX45431.1};
RN [1] {ECO:0000313|EMBL:EKX45431.1, ECO:0000313|Proteomes:UP000011087}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP2712 {ECO:0000313|EMBL:EKX45431.1,
RC ECO:0000313|Proteomes:UP000011087};
RX PubMed=23201678; DOI=10.1038/nature11681;
RG DOE Joint Genome Institute;
RA Curtis B.A., Tanifuji G., Burki F., Gruber A., Irimia M., Maruyama S.,
RA Arias M.C., Ball S.G., Gile G.H., Hirakawa Y., Hopkins J.F., Kuo A.,
RA Rensing S.A., Schmutz J., Symeonidi A., Elias M., Eveleigh R.J.,
RA Herman E.K., Klute M.J., Nakayama T., Obornik M., Reyes-Prieto A.,
RA Armbrust E.V., Aves S.J., Beiko R.G., Coutinho P., Dacks J.B.,
RA Durnford D.G., Fast N.M., Green B.R., Grisdale C.J., Hempel F.,
RA Henrissat B., Hoppner M.P., Ishida K., Kim E., Koreny L., Kroth P.G.,
RA Liu Y., Malik S.B., Maier U.G., McRose D., Mock T., Neilson J.A.,
RA Onodera N.T., Poole A.M., Pritham E.J., Richards T.A., Rocap G., Roy S.W.,
RA Sarai C., Schaack S., Shirato S., Slamovits C.H., Spencer D.F., Suzuki S.,
RA Worden A.Z., Zauner S., Barry K., Bell C., Bharti A.K., Crow J.A.,
RA Grimwood J., Kramer R., Lindquist E., Lucas S., Salamov A., McFadden G.I.,
RA Lane C.E., Keeling P.J., Gray M.W., Grigoriev I.V., Archibald J.M.;
RT "Algal genomes reveal evolutionary mosaicism and the fate of
RT nucleomorphs.";
RL Nature 492:59-65(2012).
RN [2] {ECO:0000313|Proteomes:UP000011087}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP2712 {ECO:0000313|Proteomes:UP000011087};
RA Kuo A., Curtis B.A., Tanifuji G., Burki F., Gruber A., Irimia M.,
RA Maruyama S., Arias M.C., Ball S.G., Gile G.H., Hirakawa Y., Hopkins J.F.,
RA Rensing S.A., Schmutz J., Symeonidi A., Elias M., Eveleigh R.J.,
RA Herman E.K., Klute M.J., Nakayama T., Obornik M., Reyes-Prieto A.,
RA Armbrust E.V., Aves S.J., Beiko R.G., Coutinho P., Dacks J.B.,
RA Durnford D.G., Fast N.M., Green B.R., Grisdale C., Hempel F., Henrissat B.,
RA Hoppner M.P., Ishida K.-I., Kim E., Koreny L., Kroth P.G., Liu Y.,
RA Malik S.-B., Maier U.G., McRose D., Mock T., Neilson J.A., Onodera N.T.,
RA Poole A.M., Pritham E.J., Richards T.A., Rocap G., Roy S.W., Sarai C.,
RA Schaack S., Shirato S., Slamovits C.H., Spencer D.F., Suzuki S.,
RA Worden A.Z., Zauner S., Barry K., Bell C., Bharti A.K., Crow J.A.,
RA Grimwood J., Kramer R., Lindquist E., Lucas S., Salamov A., McFadden G.I.,
RA Lane C.E., Keeling P.J., Gray M.W., Grigoriev I.V., Archibald J.M.;
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblProtists:EKX45431}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH992999; EKX45431.1; -; Genomic_DNA.
DR RefSeq; XP_005832411.1; XM_005832354.1.
DR STRING; 905079.L1JA84; -.
DR PaxDb; 55529-EKX45431; -.
DR EnsemblProtists; EKX45431; EKX45431; GUITHDRAFT_108698.
DR GeneID; 17302097; -.
DR KEGG; gtt:GUITHDRAFT_108698; -.
DR eggNOG; KOG3599; Eukaryota.
DR HOGENOM; CLU_223107_0_0_1; -.
DR OrthoDB; 52189at2759; -.
DR Proteomes; UP000011087; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR011936; Myxo_disulph_rpt.
DR InterPro; IPR002859; PKD/REJ-like.
DR NCBIfam; TIGR02232; myxo_disulf_rpt; 1.
DR PANTHER; PTHR46730:SF1; PLAT DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR46730; POLYCYSTIN-1; 1.
DR Pfam; PF13948; DUF4215; 1.
DR Pfam; PF02010; REJ; 1.
DR SMART; SM00060; FN3; 4.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR PROSITE; PS50268; CADHERIN_2; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000011087};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 12..33
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 5409..5431
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1861..1948
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2304..2391
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT REGION 5373..5405
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5472..5686
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5379..5394
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5481..5503
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5521..5559
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5560..5574
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5575..5650
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5651..5668
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 5723 AA; 617451 MW; 66DF7B37643BE4DB CRC64;
MKKVSNRRYR YLAAAALILL LLAGIAIFVV FTLPKKDMML HPQGIVLHVA SERDAIVFNA
TATAPSQASM LHDIQFESAI CNVSFLSATY YWLHLVQINI DGPVQVMGQS SDSQSLRGGV
NVTDVDFSQF MSWHGALLRL KVECFLEVKA DLYWGLYQKT QTRNLTKFLS YEMGKISRRS
TDSQSLASVL NLALNLFQAE DLKDFASMFV VPKEEIALIL KPRLPLIFDS SGLISAVYVH
VPEISFDIGA EAFYGTESQF QESSSKQPNG VVRIGTLPTQ WLLISQKGNG STHLDQISRT
RANFFITCAH GKCNSSSIRP IVQVFTNYLT THPVSGISVT FLTSGSSSAL FRNLLGSRNS
FFIKLYPVQT NFHRKMHPRS GWVGNQNLNC FEADFVQRWR LQVCGSLENN LISEQKIPNF
SFLFDPHKLP SLEITFADIS YFLSGQGDPV IYGHANVAEF TKIDLVRAYE LIGFNAGQSQ
PYLDVLDDSL SLICALDVEV GSTVLVKHSL EVDIGIASKT GTGSLEGSSI LFAIVRLVDS
QGTMNWEWVN TAARRQLNKV DQLFLLPLNW TLSSKMQTSS QDWYMADAAL LAREVREMGK
IEGLASVSFS IGDSTILDGS IDGGLVLRND EISINLIAKD RKDRKEKFRV TLFLGDNKPG
NNGFEGIFLL ADIRWENRSL LKTDTSLSFI LPGQGSVDLQ GKINFELNSH RLLNWNIHAG
ASWPQMSLLD GNFGAVLNVE DNNHKLIYFD ANLINNLKAN LLSMNGDVTL LFQNNKLLHW
TGSIAGKYTK TDQYTPIPSS SSTEAGTTFD DVTQREELLA ANYSTSLGLG PLAGGGSLES
SGGLTFSTYY HQIRACSSSC SYGSCWSSCD EPKTERKLHL APGLVALWNQ KKKIDMTMTL
DGATLNEALD GSSVLHVGYD SEPLVDWTGS IAGKYTKTDQ YTPWPSSSST EAGTTFDDVT
QREELLAANY STSLGLGPLA GGGSLESSGG LTFSTYYHQI RACSLSCSSG SCWPSCDEPK
TERKLHLAPG LVALWNQKRQ VDIEMSLNGV VSDARLAGSS SLNALIGSNN LVGFANASGD
CLGWSCLPSE IHLSSSWAES KLTSIDLVIQ GLLSRGWDYA HGSVLASLWI GERHWFALDV
CGDLRELVML KPGGMVKLIS RAALLGNELI DLSLIAEYDV RGKGLNANMT VTNLLTISNR
QLYAIAAEGS YSLNAGPYGV TARLRKDGEE NFLLIGRGDY DWNSEQRLLN GSITTILIIP
HTPVKVDLDH IVTLSLGNGG KYAGAVDVLE RRGAPLGFLV PSFPSLLHVK ASAVLRDMPD
NTPASFLKRF ALATAGLDVA MSVAVSGKTA VVVTALVESD GDMRGDETRM SCLEVDLGFH
GPLWITLYGD LRSTRFSLPW ALESSKNYEA LTALLQYILV SWRINNLIDS VQDWNGLYKN
QVPQCDVSSA GKLLITLGDL PHPAWPPKAT GMPNAADVIS TTPEPTYAMP SQEPMATNIS
TTSVYIPSSE KSVTFFVLKL PFLTTDEDSK KLDFVTSVAT NISYVSSQGF FQCAMIESSN
NLFKVTPSLD LNGSLHLQLE GNASGDAFWN ISLVDGDRTH YSPIPLHIVV RPVNDPPSFV
VASQLLLLLD LGSNLVPNAL RNISLGAPDE DLSQNISFSV TFLGGPQNLL LNTLAISPSG
TLSFNATSLG TGLTTFSVVA RDDGGTMNGG VNVSTKQALK LAVVSRPRSV FDVVLTQISL
NRVSIQWSYK DPGQDFEAAQ YHFNVSGRYA RASSGSYYGF ERMENLTVCL GVCSIVLDDV
LPKSTLAVTI TAVNIAGKSP EQNRSIVLED KNVAPTFSIN STFVRLEDAG LVTIGGAAYD
ISAGSLDEVQ QKLTFIVSSS SPAAFTRLPS IDPTNGSLTF QTALNVNGNF NCTVLLVDDG
GTRFGGVNVS QQHSFVIKVV PVNDAPSFLL KQPVITLLED AGLYVYTGLA NSISSGPSDE
SWQSLTFSMT WTGGADVGIF QVAPSLNQSG TLMLKANPNM SGTSNWTIQL HDNGGEDNNG
QAWSATRTLQ IIIVPVNDPP SFVVASQLLL LLDLGSNLVP NALRNISLGA PDEDLSQNIS
FSVTFLGGPQ NLLLNTLAIS PSGTLSFHAT SLGTGLTTFS VVARDDGGTM NGGVNVSTKQ
ALKLAVVSRP RSVFDVVLTQ ISLNRVSIQW SYKDPGQDFE AAQYHFNVSG RYARASSGSY
YGFERMENLT VCLGVCSIVL DDVLPKSTLA VTITAVNIAG KSPEQNRSIV LEDKNVAPTF
SINSTFVRLE DAGLVTIGGA AYDISAGSLD EVQQKLTFIV SSSSPAAFTR LPSIDPTNGS
LTFQTALNVN GNFNCTVLLV DDGGTRFGGV NVSQQHSFVI KVVPVNDAPS FLLKQPVITL
LEDAGLYVYT GLANSISSGP SDESWQSLTF SMTWTGGADV GIFQVAPSLN QSGTLMLKAN
PNMSGTSNWT IQLHDNGGED NNGQAWSATR TLQIIIVPVN DPPSFVVASQ LLLLLDLGSN
LVPNALRNIS LGAPDEDLSQ NISFSVTFLG GPQNLLLNTL AISPSGTLSF NATSLGTGLT
TFSVVARDDG GTMNGGVNVS TKQALKLAVV SRPRSVFDVV LTQISLNRVS IQWSYKDPGQ
DFEAAQYHFN VSGRYARASS GSYYGFERME NLTVCLGVCS IVLDDVLPKS TLAVTITAVN
IAGKSPEQNR SIVLEDKNVA PTFSIVSTFV RLEDAGLVTI GGAAYDISAG SLDEVQQKLT
FIVSSSSPAA FTRLPSIDPT NGSLTFQTAL NVNGNFTARF LLKQPVITLL EDAGLYVYTG
LANSISSGPS DESWQSLTFS MTWTGGADVG IFQVAPSLNQ SGTLMLKANP NMSGTSNWTI
QLHDNGGEDN NGQAWSATRT LQIIIVPVND PPSFSMIGSL TVYQRVGIIS SSIAWDISAG
PPDESKQNMT FEMDLLSGRS DLFSEFPRID QQGLLTFSRT TGTFGRSDWK VSLRDDGGVL
NGGISSSSKV FTMVLIGIPS PVFNIQYQQP RNGFVLITWN HSDFSLTRQE LGETYNSARS
FLVSFTECRS SCILTATVLV KSSECSKICN VTMQLAIGVT YSVSIVAQNE AGQALPVLAN
VFLPDLTPKL EYLSVSSGDV SLATLCTVAI SNFQESASSK YLIRINDIKP LPVVDLFFVA
ATASTAPKVT LSFNIPPWNT TSIVNISVGL IAAPDVRVIF PFEYFSSSVP RVSLVFPTMA
STAGGDLVRI TVSNVFESNP DTSAWLFNLG NTSFHPTSAV SISQSEVSLI GQIPPQAQFT
SVATIDVHLL YKQVKMSFLL QMQPPCNYRV FCPAAKSSYL ANDYLLKFDP PMTTSCDMKY
CLDSSTFTAM RLISSSPSTG STKGGSPVLI SIASDTLISA YSGGLKLLYN GVPLAITVVS
TIRLTSMNFT LPQYVSPCCC GRNYTCVSTF QVVDLLTTRS LSIPFEFIAD IEGPPNIVSI
YPGCTSSRPD SCTSSPVLTL QRSSILLDLE NFPKIETQGG IQVYIDFSVD PNATVTVLSS
TNALTKMNIS FVTPASSGIF TGRIWAAGYE KSNSFQIYVQ SPPNPQVLSY FPSKFVEGDS
IQIQAQLSDW SSYFDLSNVE AVSSTNQTHP AEFKLLLGAT LGSATLKVSF VARLTSSEYQ
LRIQVRRVAN QETFSIPLNF QVSANPYIGY VFPSSGQVLT RTRVNVVVYN SFSIANSQLF
ASFEGQIVTV TDVGQQGTQN TSSRFLTFLT PNVSSVGTIK FCITSNQNQC GGLTSQFQFM
APAPLASRRA SSQGGTNIFI SFYGTNDDLK VYAFLPSRLS GVVKSELKAT RTSTCTDQTF
CTQQVAIEAP VSSDGSDVLL QLENGKLSPQ NLAISYFLMP SIISVNPTQS SVMGGDSVQI
SMKNFPAIST ALDVSVVFDS QQARVMQVLG YNDFICVSPS HKSVGSIQLS IIPTKIANAQ
EKADMTVTSQ FSYSRPDSKV LSMIPSRGSL KGGAIVTMIF SHFPTKLTET DVSVVFSSKQ
QATVNRVVQS DSFSGESIVE FLMPALPVGL QQATLTVSDG YNSTAALFFE SYDATVQLTC
NRLQMQSLDD PIKVTGGCQG GVDGTDFLLI AVTNIGQIQS FSGLAISFGT EYAVMGKLVN
STMQKTFVIV AVPANSFFSK STVIDLSVTN AKTMAGMSTL QGSGKFEYIA APSVLSASFD
AMGSSIMIVF SESTNIFSLP VAQLSTCQSF LEIGTSQPFG KSAKCTWQDE QTLSIILSYD
ATVAVNDNLQ IKMNLVKHVS GLGPYVSGTV KVASPSILIK PVFQLLGPQE VGPCDDAQIM
ALGTSSRPLT FSWTCMGCPD VIQSKLAVTV SDKIVISSSD LAQSQSSYRI QVYGVNFLGT
KSETLSLQFY RTSLPIPVVS LSGFDLYKKS DDILLGAQTA FSKCSEEMEL QFKWEQVNQD
NAATNNGIPA SVLARNSPTF FLSKGTLAVP GLYKLKLTVT LPTTPPTQTA VSTSFTLLAS
DLIAIVKNGD SKIGRSQKLS LDAGDSKDPD YRGSSPDVNL KFTWTCKVQG MMCRDRKSNN
PLVFDSSSVL SMSSDGFEVN VQYIFSLEVM KDSRSASTSV SMVFVEEFVP LTTLTSPSTM
LVKGENVLNP LQTLNLLEYS CTSCSSYKWQ LIDQGTATVL PATTNELFLS SSTFVQGRRY
TVLLTSYQSP QEGQVCNGLC QGSASFEFRV NLAPSGGQCS VQPTQGYELN TQFVVQCGSF
SDADLPLSFQ FNYKVKNGTE ASLAPSSSPM RSLYLPSGEI NISVIAIDSL GAQSTYFLDL
VVVLPIPSVS PVVAQQALDS FVLLGKTSEF SNYAISLTQK LSRDSSTLLA RRRRAAASLR
VNDVQLSESA LIRSNTTLSL LRIIEKQVPS AGNLLENIGT LFFLVNSSEP LTMDATVAAV
EILSNTTILI PQVLQYSLER TTVQTIIFIA STLKDQLQQS VLSMSEKTRC MSMLVTSAAT
AMYHGIFDFI DIKNNLVCDH TCVPTTDHIF TYKWKSAFTK ILHPQDIAVE SLFMLANISA
GSSAVVPASS EQSASFYTFM YVWRTPLATN TDTFISDSHG FILGISDLNG VRNSVSQFDY
DARIPVSQSD LSSILWNCVM WKQGWSSETC SPVGFSPNIA QCSCKETGLV AVSVSSVSVC
GDGRVTGKEE CDDGNRVSQD GCSQSCTLES GFTCQPGNSI LRTRCVNSSS TTTACKPQML
GPHCETLCLG EVVGDSCTAE TLTPLLVQPA VVVGSLGGQV SLASWGNLTF PKDFYGSTLD
MTVKLYAGVP LASGSSVGPT LVLEPTGATF LKPLTLSLST SHVTDLVAQP VALYYLNSLT
KQWEYVESRV DVQQNLLTAQ ILHFSVYAVL RKPPENQSPS PKEAAPPPPA ASSTPPGTTK
DTKSSKVPVI VGVTVSCGVV ALVAAGLFAY YRRRIAEKLK EGASPAVTPA PEPANTQENQ
VNDEGLEPYL ESPMADASSR SSSSEDDEEI GRDRGDTDRP HLDAIPPGHV AQMVEKQNRL
VLESSKTDSE EKRPQKLKSS LEYQDKVNDE DKDKGDEDKE EDKGEEEDNK EEGEGEGDYD
KKGEGEGDYD KKGEGEGDYD KKGEGEGDDD KKGEGEGDDD KKGEGEGDDD KKGEGEEKDH
KERQEKPAGE TSPEQQEQQE QLPPLVSTSK EATAEAMEGS HQPEPAALPV EHDQELRVMS
YSLVFAEPQR SLGPWGLDGP HNC
//