ID A0A151PE78_ALLMI Unreviewed; 4337 AA.
AC A0A151PE78;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Protocadherin Fat 4 {ECO:0008006|Google:ProtNLM};
GN ORFNames=Y1Q_0001205 {ECO:0000313|EMBL:KYO47397.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO47397.1};
RN [1] {ECO:0000313|EMBL:KYO47397.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO47397.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO47397.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03000422; KYO47397.1; -; Genomic_DNA.
DR STRING; 8496.A0A151PE78; -.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG3594; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR CDD; cd11304; Cadherin_repeat; 29.
DR CDD; cd00054; EGF_CA; 5.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.40.60; Cadherins; 29.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR InterPro; IPR039808; Cadherin.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR24027; CADHERIN-23; 1.
DR PANTHER; PTHR24027:SF422; NEURAL-CADHERIN 2-RELATED; 1.
DR Pfam; PF00028; Cadherin; 28.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF12661; hEGF; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR PRINTS; PR00205; CADHERIN.
DR SMART; SM00112; CA; 29.
DR SMART; SM00181; EGF; 6.
DR SMART; SM00179; EGF_CA; 5.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49313; Cadherin-like; 29.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS00232; CADHERIN_1; 15.
DR PROSITE; PS50268; CADHERIN_2; 29.
DR PROSITE; PS00022; EGF_1; 7.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043}; Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 3853..3878
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 7..46
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 42..145
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 146..245
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 246..348
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 349..452
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 453..562
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 563..667
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 668..772
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 780..881
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 896..981
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 982..1092
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1093..1193
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1194..1296
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1297..1403
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1403..1506
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1507..1611
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1612..1716
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1717..1820
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1821..1921
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1922..2023
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2024..2127
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2127..2226
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2227..2337
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2338..2443
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2444..2548
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2549..2652
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2653..2758
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2759..2864
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2863..2970
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 3156..3214
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3216..3252
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3254..3290
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3292..3328
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3329..3513
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 3516..3552
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 3571..3751
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 3779..3816
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 3890..3936
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4036..4065
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4101..4126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4147..4261
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4310..4337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3890..3907
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4102..4123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4164..4178
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4200..4216
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4226..4249
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 3204..3213
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 3242..3251
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 3280..3289
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 3318..3327
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 3542..3551
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 3806..3815
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 4337 AA; 472927 MW; 1AFAAEC5FA6D6C53 CRC64;
MAPRRVFRLD PVSGKLSTIT QLDREEQSYY SLQVLATDLG SPPLSSVARV NENEPAGSYL
TTVSATDPDM GLNGTVKYSI SAGDTSRFQI HTQTGVITTK IALDREEKTA YQLQIMATDG
GHLHSQNQAI VTITVLDTQD NPPVFSQDVY SFVVFENIAL GYHVGTVYAS TMDLNTNITY
LITTGDQRGM FAINKVTGQI TTASIIDREE QAFYQLKVVA SGGAITGDTM VNITVKDLND
NSPHFIHAVE SVNVVENWKA GHTIFQAKAV DPDEGVNGMV LYSLKQNPKG LFSINEQNGN
ISLEGPLDII AGSYQVEILA SDMGVPQLSS SFILAVSVHD VNDNSPVFDQ LSYEVTILES
EPVNSRFFKV QASDKDSGAN GEVAYTITEG NTGDAFGIFP DGQLYIKSEL DRELQERYIL
LVVASDRAVE PLSATVNVTI ILEDVNDNRP LFNSTNYVFY FEEEQSGGSF VGKINAVDKD
FGPNGEVRYS FENMQPDFEL NTGTGEITST YQFDRESLMR QRGAAVFSLT VIATDQGLPK
PLKDQATVQI YMKDINDNAP KFLKDLYQAT ISELAANLTQ VLRVSASDVD EGNNGLIHYS
VIKGNEENQF AIDSGTGQVT LVGKLDHEAT ASYSLIIQAV DSGTVSLSST CTLSIDVLDE
NDNSPSFPKS TLFVDVLENM RIGELVSSVT ATDSDSGDNA DLHYSITGTN NHGTFSISPN
TGSIFLAKKL DFETQSLFKL NITAKDQGRP PRSSTMSVVI HVRDFNDNPP NFPPGDIFKS
IVENIPVGSS VISVTARDPD ADINGQLMYA IIQQMPRGNH FRIDEIRGTI FTNAEIDREF
ANLFELTVKA TDQAVPVESR RFAVKNVTIL VTDQNDNVPV FISQNALAAD PSVVIGSVLT
TIIAADPDEG ANGEVEYEII NGDTETFIVD RYSGDLRVAS ALVPSQLIYN LIVAATDLGP
ERRKSTTEMT VILQGVDGPV FTQPKYITIL KEGEPIGTNV IAIEAASPRG SEAQVEYYIV
SVRCEDKSLG RLFTIGRHTG VIQTAAILDR EQGARLYLVD VYAIEKSAVL PRTQRAEVEI
TLQDINDNPP VFPTDMLDLT VEENIGDGSK IMQLTAMDAD EGANALVTYT IISGADDSFH
IDPESGDLIA TKRLDRERRS KYSLLVRADD GLQSSDMRIN ITVSDVNDHT PKFSKPVYSF
DIPEDTTPGS LVAAILATDD DSGVNGEITY TVSEDDEDGI FFLNPVTGVF NLTRILDYEM
QQYYILTVRA EDGGGQYTAI RVYFNILDVN DNPPLFSMVS YSTSLVEDLP PGSTILNFNV
TDADDGPNSQ LSYSIASGDS LGQFNIDKDG ILSIKKILDR ESQSFYSLIV QVHDMASLPA
SRFTSTAQVS IILLDVNDNP PNFISPKLTY IPENTPIDTI VFKAQATDPD SGPNSYIEYT
LQRPLGNKFS IGTIDGEVRL TGELDREAVS NYTLTVVATD KGQPSLSSST DVVVIVLDIN
DNNPLFAQKQ YKVEVDENTL TGTDLIQVFA TDGDEGTNGQ VRYTIISGNT NNEFRIDSVT
GVITVAKPLD REKEPSYTLT VQSSDRGSSP RTDTTTVNIV LMDINDFIPT FELSPYSVNV
PENLETLPKV ILQVVARDDD QGLNSKLTYI LISGNEDGAF TLSATGELRL VKSLDRETKE
KYVLLITAAD SGSPALTGTG TIAVTVDDVN DNVPTFAFNM YSTTIPEDAP TGTDILLVNS
SDADASVNAV ISYKLIGGNS QFTINPSTGQ IITSALLDRE TKENYTLVVV ASDGGFPKAL
SSSTSVLVSV ADVNDNPPKF QHHPYVTHIP SPTTSGSFVF AVTVTDADSG PNAELHYSLR
GKNSEKFHID PTRGAIMAAD SLTGDSEVTF SVHVKDGGLY PKTDSTTVTV RFMNKAQFPQ
VQAEQHTFMF PENQAISTLV TTVSGSSSRG GSLSYYIASG NLGNTFQIDQ LTGQFSICQS
LDFEAIQKYV VWIEARDMGF PPFSSYEKLE ITVVDVNDNA PEFERDPFIA EIIENLSPRK
ILTVSAVDKD SGPNGQLNYE IIDGNKENSF TINRATGEIR SVRPLDREKL AQYVLTIKAF
DKGTPLQSTT VKVIVNILDE NDNAPRFSQI FSASVPENAP LGYTVTRVTT SDEDIGVNAI
SRYSIRDPSL PFVINPSTGD ITVSRPLNRE DTDRYRMRVS AHDSGWTVST DVTIFVTDIN
DNAPRFTKPS YYLECPELTE VGLRVTQVSA TDPDEGFNGQ IFYFIKSQSE FFRINATSGE
IFNKQYLKYQ NSSGSSNFNI NRHSFIVTSS DRGSPPLLSE TTVTINIVDS NDNAPRFLAS
KYFTPVTKNV GIGTNLIKVT AVDDKDFGLN SEVGYFISNE NNTNKFKLDS RTGWISVASS
LMADLNQDFL IKVKAKDKGN PPLSAEVTVE IVITEENYHT PVFSQSHMSI TIPESHAVGA
IIRTVSARDR DVAMNGLIKY NISSGNEAGI FSINTSTGAL TLAKSLDYEL YQKHEIIVSA
TDGGWVARTG YCTVTVNVID VNDNSPAFSP EDYFPNVLEN APSGTTVIRL NATDADSGPN
AVIAYAIQSS DSDLFVIDPN TGIITTQGFL DYETKQSYHL TVKAFNVPDE ERCSFASVNI
QLEGTNEYVP RFVSKLYYFE VSEAACRGTV VGEVFASDRD MGTDGEVHYL IFGNSRKKGF
QIDGRSGQMY VSGPLDREKE ERISLKVLAK NFGSIRGADI DEVIVNITIL DANDPPVFSL
EVYNIQISEG VPPGTHVTFV SAFDSDSVPS WSRFSYFIGS GNDNGAFSIN PQTGQVTVTA
ELDRETLPVY NLTVLAIDSG SPSATGSASL LVTLEDINDN GPTLSTSQGE VMENNRAGTL
VMTLQSSDPD LPPNQGPFSY YLLSTGPATS YFSLSTAGVL TTTREIDREQ ISDFFLSVIT
RDSGIPQMSS TGTVQIKVVD QNDNPSQPRT VEIFVHYYGN LFPGGILGNV KPQDPDVLDS
FHCSLTSGVT SLFNIPGGTC DLNSQARSTD GTFDLTVLSN DGLHSAVTSS VRVFFAGFNN
GTIDNSILLR LSVHTVKDFL TNHYLHFLRI ANSQLTGLGT AVQLYGVYED SNRTFLMAAV
KRNNNLYVSP SGVATFFESI KEILFRQSGV WIESVDHDSC IQNPCQNGGS CLRRLAVSPV
LKSHESIPVI IMANEPLQPF VCRCLPGYDG NLCETDIDEC LPSPCHNNGT CHNLVGGFSC
SCPDGFTGMA CERDINECLS SPCKNGAVCQ NFPGSFNCVC KTGYTGKTCD SAVNYCECNP
CFNGGSCQSG VEGYYCHCPF GVFGNHCELN SYGFEELSYM EFPSMDPNNN YIYIKFSTIK
SNALMLYNYD NQTGEQAEFL ALEITEERLR FSYNLGSGTY KLTTMKKVSD GQFHTVIARR
AGMAASLTVD SCSEDQEPGY CTVSNVAVAT DWTLDVQPNR VTVGGIRSLE PILQRRGQVE
SHDFVGCIME FAVNGRPLEP SQALAAHGIL DQCPRLEGAC SISPCQNGGT CVDHWSWQQC
HCKEGVTGKH CEKYMTADTA LSLEGKGRLD YHMSQNRKRE YLMRHGTRDT ILEPPNVNRL
EVKFRTRSEN GILIHVQESS NYTTVKIRSG KVHYTSDAGI AGKVERNIQE VYTADGQWHS
LLIEKNGSAT ILSVDRAYSR DILHVTQDFG GLNVLTVSLG GIPPNHAPRS TSAGFDGCIE
YVKYGGESLP FTGKHSLATI SKTDPSVKTG CRGPNVCASN PCWGELMCIN QWYAYKCVPP
GACASNPCQN GGSCEPGTHS GFTCSCPESY AGRTCEKVVA CLGILCAQGH VCKGGVNGGH
MCVPSPHPAE LSLPLWAVPA IVGSCATVLA LLVLSLILCN QCRGKKAKGQ KEEKKKEKKK
KGSENVAFDD PDNIPPYGDD MTVRKQPEGN PKPDIIEREN PYLIYDETDI PHNTETIPSA
PLASPEPEIE HYDIENASSI APSDADIIQH YKQFRSHTPK FSIQRHSPLG FARQSPMPLG
ASSLTYQPSY SQGLRTTSLS HSACPTPNPL SRHSPAPFSK SSTFYRNSPA RELHLSIREG
SPLEMHNDVC QPGIFNYATR LGRRSKSPQT MATHGSSRPG SRLKQPIGQI PLETAPPVGL
SIEEVERLNT PRPRNPSICS ADHGRSSSEE DCRRPLSRTR NPADGIPAPE SSSDSDSHES
FTCSEMEYDR DKPMAYTSRM PKLSQVNESD ADDEDNYGAR LKPRRYPGRR AEGGPVGTQA
TTSNVAENTL PMKLGQQAGN FNWDNLLNWG PGFGHYVDVF KDLASLPEKA AAAAANEESK
GGTIKPVSKD GEAEQYV
//