ID G1LX36_AILME Unreviewed; 1165 AA.
AC G1LX36;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 84.
DE SubName: Full=Crumbs cell polarity complex component 2 {ECO:0000313|Ensembl:ENSAMEP00000011639.2};
GN Name=CRB2 {ECO:0000313|Ensembl:ENSAMEP00000011639.2};
OS Ailuropoda melanoleuca (Giant panda).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000011639.2, ECO:0000313|Proteomes:UP000008912};
RN [1] {ECO:0000313|Ensembl:ENSAMEP00000011639.2, ECO:0000313|Proteomes:UP000008912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20010809; DOI=10.1038/nature08696;
RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA Wang J.;
RT "The sequence and de novo assembly of the giant panda genome.";
RL Nature 463:311-317(2010).
RN [2] {ECO:0000313|Ensembl:ENSAMEP00000011639.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9646.ENSAMEP00000011639; -.
DR Ensembl; ENSAMET00000012134.2; ENSAMEP00000011639.2; ENSAMEG00000011056.2.
DR eggNOG; KOG1217; Eukaryota.
DR GeneTree; ENSGT00950000183101; -.
DR HOGENOM; CLU_000827_2_0_1; -.
DR TreeFam; TF316224; -.
DR Proteomes; UP000008912; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 9.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.10.25.10; Laminin; 10.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR24049; CRUMBS FAMILY MEMBER; 1.
DR PANTHER; PTHR24049:SF19; PROTEIN CRUMBS HOMOLOG 2; 1.
DR Pfam; PF00008; EGF; 7.
DR Pfam; PF12661; hEGF; 1.
DR Pfam; PF02210; Laminin_G_2; 1.
DR PRINTS; PR00010; EGFBLOOD.
DR SMART; SM00181; EGF; 11.
DR SMART; SM00179; EGF_CA; 10.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 9.
DR PROSITE; PS00010; ASX_HYDROXYL; 6.
DR PROSITE; PS00022; EGF_1; 10.
DR PROSITE; PS01186; EGF_2; 7.
DR PROSITE; PS50026; EGF_3; 11.
DR PROSITE; PS01187; EGF_CA; 4.
DR PROSITE; PS50025; LAM_G_DOMAIN; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1165
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030171141"
FT DOMAIN 68..107
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 109..145
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 147..183
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 185..222
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 224..260
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 262..319
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 321..357
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 359..395
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 397..437
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 432..605
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 607..643
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 809..845
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 97..106
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 135..144
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 173..182
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 250..259
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 309..318
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 347..356
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 385..394
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 427..436
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 633..642
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 835..844
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1165 AA; 123571 MW; 6D14F7BBE0F3D10A CRC64;
MALAGPGTPA SRPLASLLLL LLLLAPVLSL LGGTVPSEPP SVCDSAPCAP GTQCQAMENG
SYSCAPTEPR GCATQPCHHG ALCVPQGPDP DGFRCYCVPG FQGPRCELDI DECASRPCHH
GATCHNLADR YECRCPLGYE GVTCEAEVDE CASAPCLHGG SCLDGVGSYR CVCAPGYGGA
SCQLDLDECH SQPCAHGGRC RDLVNGFRCD CADTGYEGVR CEQEVLECAS APCANNASCV
EGLGSFRCLC WPGYSGQRCE VDEDECESGP CQHGGQCLQR SDPALYGGVQ ATFPGAFSFR
HAAGFLCRCA PGFEGDECGV DVDECASQPC LNGGRCQDLP NGFQCHCLDG YTGVACQEDV
DECLSEPCLH GGTCDDTVAG YICQCPEAWG GHDCSVRLTG CQGHTCPPAA TCIPIFKAGV
HSYACRCPPG SRGPFCGQNT TFSVVAGSPV QTSVPAGGTR GLALRFRTTL PAGALAARTD
TQDSLELALA GGTLQATLWD HGNTTVLTLK LPDLALNDGR WHEVEVSLRL AVLELRLWHE
DCPAGLCVAS RPVAPAPVAS EAPTPAGFCS IQLGGRAFEG CLQDVHVDGH LVLPEDLGKN
VLLGCERREQ CQPPPCAHGG ACVDMWTHFH CRCTRPYSGP TCADEVPAAT FGLGGTLSSA
SFLLYQLPGP NLTVSFLLRT REPAGLLLQL ANDSVAGLTV FLSEGQVQAE VLGHPTLVLP
GRWDDGLRHL VTLSFGPNQL QGLGQRVHVG GRLLPADAQP WGGPFRGCLQ DLRLNDLHLP
FFPLLLGNSS QPGELGSRES WNLTLGCVSE DTCSPDPCLN GGICLVTWND FHCTCPVNFT
GPTCAQQLWC PGQPCLPPAT CEEVPDGFVC VAEATFREGP AAAFSGHNAS SGVSLSGLSL
AFRTRDSEAG LLRASAGGQE AVWLEVHNGX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXG TAVPS
//