ID W5JQ54_ANODA Unreviewed; 3068 AA.
AC W5JQ54;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Cadherin {ECO:0000313|EMBL:ETN64889.1};
GN ORFNames=AND_003344 {ECO:0000313|EMBL:ETN64889.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN64889.1};
RN [1] {ECO:0000313|EMBL:ETN64889.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN64889.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN64889.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC003344-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004651};
CC Multi-pass membrane protein {ECO:0000256|ARBA:ARBA00004651}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02000839; ETN64889.1; -; Genomic_DNA.
DR STRING; 43151.W5JQ54; -.
DR EnsemblMetazoa; ADAC003344-RA; ADAC003344-PA; ADAC003344.
DR VEuPathDB; VectorBase:ADAC003344; -.
DR VEuPathDB; VectorBase:ADAR2_012245; -.
DR eggNOG; KOG4289; Eukaryota.
DR HOGENOM; CLU_000158_1_0_1; -.
DR OMA; YTFLRGN; -.
DR OrthoDB; 4006628at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0004930; F:G protein-coupled receptor activity; IEA:UniProtKB-KW.
DR GO; GO:0048513; P:animal organ development; IEA:UniProt.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProt.
DR GO; GO:0007166; P:cell surface receptor signaling pathway; IEA:InterPro.
DR GO; GO:0016043; P:cellular component organization; IEA:UniProt.
DR GO; GO:0001736; P:establishment of planar polarity; IEA:UniProt.
DR GO; GO:0007163; P:establishment or maintenance of cell polarity; IEA:UniProt.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR GO; GO:0048731; P:system development; IEA:UniProt.
DR CDD; cd15441; 7tmB2_CELSR_Adhesion_IV; 1.
DR CDD; cd11304; Cadherin_repeat; 8.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00055; EGF_Lam; 1.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.220.50; -; 1.
DR Gene3D; 2.60.40.60; Cadherins; 8.
DR Gene3D; 4.10.1240.10; GPCR, family 2, extracellular hormone receptor domain; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 1.20.1070.10; Rhodopsin 7-helix transmembrane proteins; 1.
DR Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR032471; GAIN_dom_N.
DR InterPro; IPR046338; GAIN_dom_sf.
DR InterPro; IPR017981; GPCR_2-like_7TM.
DR InterPro; IPR036445; GPCR_2_extracell_dom_sf.
DR InterPro; IPR001879; GPCR_2_extracellular_dom.
DR InterPro; IPR000832; GPCR_2_secretin-like.
DR InterPro; IPR000203; GPS.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR002049; LE_dom.
DR PANTHER; PTHR24026; FAT ATYPICAL CADHERIN-RELATED; 1.
DR PANTHER; PTHR24026:SF51; PROTOCADHERIN-LIKE WING POLARITY PROTEIN STAN; 1.
DR Pfam; PF00002; 7tm_2; 1.
DR Pfam; PF00028; Cadherin; 7.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF16489; GAIN; 1.
DR Pfam; PF00053; Laminin_EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR PRINTS; PR00205; CADHERIN.
DR SMART; SM00112; CA; 7.
DR SMART; SM00181; EGF; 6.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00180; EGF_Lam; 1.
DR SMART; SM00303; GPS; 1.
DR SMART; SM00008; HormR; 1.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49313; Cadherin-like; 8.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF81321; Family A G protein-coupled receptor-like; 1.
DR PROSITE; PS00232; CADHERIN_1; 4.
DR PROSITE; PS50268; CADHERIN_2; 8.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS01248; EGF_LAM_1; 1.
DR PROSITE; PS50027; EGF_LAM_2; 1.
DR PROSITE; PS50227; G_PROTEIN_RECEP_F2_3; 1.
DR PROSITE; PS50261; G_PROTEIN_RECEP_F2_4; 1.
DR PROSITE; PS50221; GPS; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043}; Cell membrane {ECO:0000256|ARBA:ARBA00022475};
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW G-protein coupled receptor {ECO:0000256|ARBA:ARBA00023040};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Laminin EGF-like domain {ECO:0000256|ARBA:ARBA00023292,
KW ECO:0000256|PROSITE-ProRule:PRU00460};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Receptor {ECO:0000256|ARBA:ARBA00023170};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transducer {ECO:0000256|ARBA:ARBA00023224};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2289..2309
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2321..2340
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2360..2380
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2392..2412
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2432..2452
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2473..2495
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2501..2524
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2..69
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 70..176
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 177..281
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 282..384
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 385..494
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 495..600
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 601..707
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 726..826
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 969..1005
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1043..1241
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1244..1281
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1285..1450
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1452..1489
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1586..1632
FT /note="Laminin EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50027"
FT DOMAIN 1605..1691
FT /note="G-protein coupled receptors family 2 profile 1"
FT /evidence="ECO:0000259|PROSITE:PS50227"
FT DOMAIN 2284..2525
FT /note="G-protein coupled receptors family 2 profile 2"
FT /evidence="ECO:0000259|PROSITE:PS50261"
FT REGION 1728..1748
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2050..2081
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2131..2192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2580..2701
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2730..2781
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2911..3068
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2137..2151
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2580..2623
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2631..2667
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2675..2690
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2933..2962
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2963..2977
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3038..3068
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 995..1004
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1271..1280
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1479..1488
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1586..1598
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
FT DISULFID 1606..1615
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00460"
SQ SEQUENCE 3068 AA; 337670 MW; 9A9C49CEE36FF14B CRC64;
MDQEDSQKFR IDARSGTIST RSALDREVSG MYTIAVTASD MATPQTERKS ATTTVLVKIL
DDNDNYPQFS ERTYTVQVRE DQWTNENNVI AHIQATDADQ GNNAAIRFAI IGGNTQSQFS
IDSMSGDVSL VKPLDYESVR SYRLVIRAQD GGSPSRSNTT QLLVNVLDAN DNAPRFYTSQ
FQEAVLESVP VGYNIVRVQA YDADEGANSE ITYSILNRDD SMPLAVDPRT GWIHTTKGLD
REEQSRYSFQ VVAVDGGIPP KSASTSVIVT IQDVNDNDPT FSPKYYEAML AEDQPPGTPV
TTVTATDPDE DARLHYEITA GNTRGRFAIT SQNGRGLITI AQPLDYKQER RFALTITATD
SGQRTDTAIV NINITDANNF APVFENAPYS ASVFEDAPIG TTVLVVSATD SDVGINAQIT
YLLNDESVNG LGANEPFTIN AQTGAVVTNA KLDRETTAGY LLTVTAKDGG NPSLSDTTDV
EIAVTDVNDN APVFKVPLYQ ATIPEDALIG TSVVQIGATD LDMGLNGRVK YALSQKDMDE
GSFVVDPISG VIRTNKGLDR ESIPVYHLSA IASDKGTPTM SSTVEVQIRL DDVNDSPPTF
ASDKLTLYVP ENSPVGSVVG EIYAHDPDEG VNAIVHYSII GGDDSNSFSL VTRPGSDRAQ
LLTMTELDYE SSRKRFELII RAASPPLRND VSVEILVTDV NDNAPVLRDF QVIFNNFRDC
FPSGVIGRIP AFDADVTDKL TYRILSGNNA NLLRLNTSTG GLTLSPQLNT NVPKFATMEV
SVTDGINEAK AIMQLIVRLI TEDMLFNSVT VRLDEMTEEA FLSPLLSFFL DGLAAIIPCP
RENIFLFSLQ EDIDVNSKIL NVSFSARRPD VAFEEYYTSQ YLQERIYLNR AILARLATVQ
VLPFDDNLCV REPCLNYEQC LSVLKFGNAS GFIHSDTVLF RPIHPVNTFA CKCPEGFTGS
KEHYLCDTEV DLCYSDPCQN GGSCVRREGG YTCVCGEQYT GVNCETSIAS LKPCISEVCG
DGYSCLTSGH GGHWPPYTKT CELMSRSFSP NSFLTFPGMR QRHRFNIRLK FATVRDSGLL
LYNGRYNEQH DFIALEIIDG RVVFSFSLGD QRQSVSVNQQ QHRRVSDGNW HTVEVKYFNR
TVLLSLDGCD TATALAGLGE RWNCANQTTL VLDRRCASLV EPCHRFFDLT GPLQIGGLPK
ISAHFQIPSH SFVGCISDLY IDHRYVDLGA YTADNGTIAG CPQKAASCAS EPCFNGGTCR
EGWGEGWECD CPDGFTGNAC QESVALPWRF QGDGILSFNP LLRPIQLPWL TAFSIRTRKR
DSFVMEIQVG QNSSAIVSLR SGTLQYAYNG EVLQLAGAEL ADGRWHRVEI KWMGAEVALT
VDYGQRTTVL PVSQKIQGLY VGRIVIGGSI AGGGMAGESN FEGCIQDVRV GGVQSVLKRP
TVRENVIDGC ASNAKCPEDG GCPQESVCVS NWDEAYCECL HGYVGEECKP VCTVKPCSDD
GVCRADTFNA RGYRCECNSS LSSGEYCENR IQQPCPAGWW GERSCGPCKC NVKQGYHPNC
DKTTGQCYCR ENHYQPANDR SACLPCECYT VGSYGKSCNS SGQCECREGV IGRRCDSCSN
PYAEVTLNGC EVVYDGCPKS HSAGLWWPRT AFGELAVENC PAPARGKGTR RCDQVQSGWG
APDMFNCTSE AFLELRKQLA QIEVDGFELN TFVSVKVASS LRQACGTVGG RQSAETGTGQ
RGLERSPEDS RVRDFYTVET GKSSLWREED FELDYLADIG DQEAFRERKL YGADLLITDR
LLHELMRYES YQAGLNLSHS QDKHFVRNLV ESAGEVLDRR YAAEWKRVQA LTQHGPDDLV
EAFNRYMIVL ARSQHDTYTN PFEIVHSNMV LGMDIVTAES LFGYETQMVK QQAKTQQQHY
VHSHPAETVI LPDTSTFLQT APKQKTPFVA FPKYNNYIQD RSKFDRHTRV LVPLDMLGIA
VPDRNEVINQ LAEHRAIVSY AQYKDAGTLF PANFDETVTR RWGVEMQIAS SVVSIAIVTP
ESAERESLVA ANGERNGERS NAVETIAPPT KHGERQSEKL SMNNEIKISI HDMSDREDGL
DSLDQHAPEL PVVEDGGQEF FHEGNVPETV ILSPRSDEPM DGDESASPHA TSIRRKRRRS
IVSAPESGGQ GGTVESDERD TEASRTNYVP LGHPHLPQAV KLQMWLNIPR NRFASRSNPQ
CVRWNTHAKL WTRIGCQTEI PNYDSIGGHN DTIIVNCTCN QLATYAVLVD IIDPEDIPEP
SLLVQITSYS AFLIALPVLF AVIISLALLR GLQTNSNSIH QNLLFCIFAA EVLFFVAIQA
RRELLDNEFP CKLVAIGLHY SWLAAFAWTT VDCVHLYRML TEMRDINHGP MGFYHTIGYG
APALLVGLAV GVRVHEYGNS LFCWLSVYES VIWWMVGPIA IVSVFDLFIL FLSVRAAFTI
KDHVLGFGNL RTLLWLSVVS LPLLGIMWVL AVLSASDNSQ LLNMLLSAVV ILHALFSLVG
YCIINKRVRE NLHNACLRCL GRKVPLLDSS IAISNSSQNV GSPKTPGFAG GAGQYETARR
NIGISTSSTT SRSTAKTSSS PYRSDGQFRH TSTSTSNYNS DGVASYMRGH YDDGAMKKSK
NANGRSGDGE RRSHRRQRRD SDSGSETDGR SLELASSHSS DDEESRVGRN SSTHRSTGVC
STSYLPNITE HVATTPPELH VVQSPQLFPN VTPTRWPNQN AGNYLPPGNG RWSQETGSDN
EAHPHKSPTN GGSLPNPDIT ETSYLHQNRM NMPPSILENI QESYNIGYST TDLHSDRYSN
YGPTENYVPP AADYGKRYES PTVAQNPHAS SSTLPHHYAS SVGAANVPLA DSRHTGSMQI
INHMRAYPTE NPYALKESLY DRSRTLGYGA ESPYHGHPMA PPGGDLYSPP GVMSFKSSVQ
SLLKNDYQQH QQQQQQQQRQ HKHHPASGAD SDRMSEGSDK NPYNFPYTAE EDHLVHSHSN
NGGRMHHGLS EGLNGLDNGG TPPPPQRMLR ATDGLSPAPL QSMGSHLTNG SSLASGNDPT
NDDDETTV
//