ID G7NWI2_MACFA Unreviewed; 1406 AA.
AC G7NWI2; A0A2K5TN71;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Crumbs cell polarity complex component 1 {ECO:0000313|Ensembl:ENSMFAP00000001477.1};
GN Name=CRB1 {ECO:0000313|Ensembl:ENSMFAP00000001477.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000001477.1, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000001477.1, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000001477.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005540376.1; XM_005540319.2.
DR STRING; 9541.ENSMFAP00000001477; -.
DR Ensembl; ENSMFAT00000000506.2; ENSMFAP00000001477.1; ENSMFAG00000037248.2.
DR GeneID; 102143170; -.
DR KEGG; mcf:102143170; -.
DR CTD; 23418; -.
DR VEuPathDB; HostDB:ENSMFAG00000037248; -.
DR eggNOG; KOG1217; Eukaryota.
DR GeneTree; ENSGT00940000155152; -.
DR OrthoDB; 2877476at2759; -.
DR Proteomes; UP000233100; Chromosome 1.
DR Bgee; ENSMFAG00000037248; Expressed in cerebellum and 3 other cell types or tissues.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 14.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 2.10.25.10; Laminin; 17.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR24049; CRUMBS FAMILY MEMBER; 1.
DR PANTHER; PTHR24049:SF22; DROSOPHILA CRUMBS HOMOLOG; 1.
DR Pfam; PF00008; EGF; 12.
DR Pfam; PF12661; hEGF; 4.
DR Pfam; PF02210; Laminin_G_2; 3.
DR PRINTS; PR00010; EGFBLOOD.
DR SMART; SM00181; EGF; 17.
DR SMART; SM00179; EGF_CA; 16.
DR SMART; SM00282; LamG; 3.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF57196; EGF/Laminin; 12.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 10.
DR PROSITE; PS00022; EGF_1; 15.
DR PROSITE; PS01186; EGF_2; 11.
DR PROSITE; PS50026; EGF_3; 17.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000233100}.
FT DOMAIN 70..108
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 110..146
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 148..184
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 186..222
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 224..260
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 262..299
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 301..337
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 339..395
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 397..439
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 441..481
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 485..670
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 672..708
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 714..885
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 887..923
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 950..1137
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 1139..1175
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1177..1212
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1214..1250
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1260..1295
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1297..1333
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DISULFID 79..96
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 98..107
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 136..145
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 174..183
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 212..221
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 250..259
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 327..336
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 385..394
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 471..480
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 698..707
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 913..922
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1165..1174
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1181..1191
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1202..1211
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1240..1249
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1285..1294
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1323..1332
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1406 AA; 153870 MW; A0667980AE4235A2 CRC64;
MALKNINYLL IFYLSFSLLI YIKNSFCNKN NTRCLSNSCQ NNSTCKGFSK DNDCSCSGTA
NNVDKDCDNV KDPCFSNPCQ GSATCVNTPG ERSFLCKCPP GYSGTICETT IGTCGKNSCQ
HGGICHQDPI YPVCICPAGY AGRFCEIDHD ECASSPCQNG AVCQDGIDGY SCFCVPGYQG
RHCDLEVDEC ASDPCKNEAT CLNEIGRYTC ICPRDYSGIN CELEIDECWS QPCLNGATCQ
DALGAYFCDC APGFLGDHCE LNIDECASQP CLHGGLCVDG ENRYSCNCTG SGFTGTHCET
LMPLCWSKPC HNNATCEDSV DNYTCHCWPG YTGAQCEIDI NECNSNPCQS DGECVELSSE
KQYGHITGLP STFSYHEASG YVCICQPGFT GIHCEEDVNE CSSNPCQNGG TCENLPGNYT
CHCPFDNLSR TFYGGRDCSD ILLGCTHQQC LNNGICIPHF QDGQHGFGCL CPSGYTGSLC
EIATTLSFEG DGFLWVKTGS ATTKGSVCNI ALRFQTVQPV ALLLFRGNRD VFVKLELLSG
YIHLSIQVNS QPKVLLYISH NTSDGEWHFV EVIFAEAVTL TLIDDSCKEK CISKAPSPLE
SDQSICAFQN SFLGGLPVGT TSDGVALLKF YNIPSTPSFV GCLQDIKIDW NHITLENISS
VSSLNVKAGC VRKDWCESQP CQSRGRCINL WLSYQCDCHR PYKGPNCLRE FVAGRFGQND
STGYVVFTLD ESYGDTVSLS MFVQTLQPSG LLLALENSTY QYIRVWLEHG RLAMLTPNSP
KLVVKFVLND GNVHLISLKI KPNKIELYQS SQNLGFISAS TWKIQKGDVI YIGGLPDKQE
TELNGGFFKG CIQDVRLNNQ NLEFFPNSTN NVSLKPVLVN VTQGCPGDNS CKSNPCRNGG
VCHSLWDDFS CSCPAHMSGK ACEEVQWCGF SPCPHEAQCQ PVLQGFECIA NAVFNEQSSQ
ILFRSNGNIT RELTNITFGF RTRDANVIIL HAEKEPEFLN ISIQDSRLFF QLQSGNSFYM
LSLTSLQSVN DGMWHEVTLS MTDPMSQTSR WQMEVDNQTP FVTSTIATGS LNFLKDNTDI
YVGDGAIDNI KGLQGCLSTI EIGGIYLSYF ENVHGFTNKP QEEQFLKIST NSVVTGCLQL
SVCNSNPCLH GGNCEDIYSS YHCSCPLGWS GKHCELNTDE CFSNPCIHGN CSDRVAAYHC
TCEPGYTGGN CEVDIDNCQS HQCANGATCI SDTNGYSCLC FGNFTGKFCR QSRLPSTVCG
NEETNLTCYN GGNCTEFQAE LKCMCRPGFT GERCEKDIDE CASDPCVNGG LCQDLLNKFQ
CLCDVAFAGE RCEVDLADDL ISDIFTAIGS VTLALLLILL LAIVASVVTS NKRATQGTYS
PSRQEKEGSR VEMWNLMPPP AMERLI
//