ID A0A2K5UG51_MACFA Unreviewed; 1470 AA.
AC A0A2K5UG51;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Neogenin 1 {ECO:0000313|Ensembl:ENSMFAP00000011382.2};
GN Name=NEO1 {ECO:0000313|Ensembl:ENSMFAP00000011382.2};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000011382.2, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000011382.2, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000011382.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the immunoglobulin superfamily. DCC family.
CC {ECO:0000256|ARBA:ARBA00009588}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9541.ENSMFAP00000011382; -.
DR Ensembl; ENSMFAT00000048943.2; ENSMFAP00000011382.2; ENSMFAG00000014361.2.
DR VEuPathDB; HostDB:ENSMFAG00000014361; -.
DR GeneTree; ENSGT00940000156684; -.
DR Proteomes; UP000233100; Chromosome 7.
DR Bgee; ENSMFAG00000014361; Expressed in colon and 13 other cell types or tissues.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 6.
DR CDD; cd05722; IgI_1_Neogenin_like; 1.
DR CDD; cd05723; IgI_4_Neogenin_like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR010560; Neogenin_C.
DR PANTHER; PTHR44170:SF14; NEOGENIN; 1.
DR PANTHER; PTHR44170; PROTEIN SIDEKICK; 1.
DR Pfam; PF00041; fn3; 6.
DR Pfam; PF07679; I-set; 2.
DR Pfam; PF13895; Ig_2; 1.
DR Pfam; PF13927; Ig_3; 1.
DR Pfam; PF06583; Neogenin_C; 1.
DR PRINTS; PR00014; FNTYPEIII.
DR SMART; SM00060; FN3; 6.
DR SMART; SM00409; IG; 4.
DR SMART; SM00408; IGc2; 4.
DR SUPFAM; SSF49265; Fibronectin type III; 3.
DR SUPFAM; SSF48726; Immunoglobulin; 4.
DR PROSITE; PS50853; FN3; 6.
DR PROSITE; PS50835; IG_LIKE; 4.
PE 3: Inferred from homology;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000233100};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..33
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 34..1470
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030050930"
FT TRANSMEM 1113..1137
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 52..147
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 152..238
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 241..336
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 341..426
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 461..555
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 561..651
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 656..751
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 761..851
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 876..972
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 977..1074
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 1069..1106
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1147..1169
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1183..1215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1244..1285
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1298..1388
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1196..1215
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1298..1360
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1371..1388
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1470 AA; 160815 MW; 62D1464561642838 CRC64;
MAAERGALRL LSTTSFWLYC LLLLGRRTPG AAAARSGSAP QSPGASIRTF TPFYFLVEPV
DTLSVRGSSV ILNCSAYSEP SPKIEWKKDG TFLNLASDDR RQLLPDGSLF ISNVVHSKHN
KPDEGYYQCV ATVESLGTIV SRTAKLTVAG LPRFTSQPEP SSVYAGNSAI LNCEVNADLV
PFVRWEQNRH PLLLDDRVIK LPSGMLVISN ATEGDGGLYR CIVESGGPPK YSDEVELKVL
PDTEVTSDLV FLKQPSPLVR VIGQDVVLPC VASGLPTPTI KWMKNEEALD TESSERLVLL
AGGSLEISDV TEDDAGTYFC IADNGNETIE AQAELTVQAQ PEFLKQPTNI YAHESMDIVF
ECEVTGKPTP TVKWVKNGDM VIPSDYFKIV KEHNLQVLGL VKSDEGFYQC IAENDVGNAQ
AGAQLIILEH DVAIPTLPPT SLTSATTDHL APATTGPLPS APRDVVASLV STRFIKLTWR
TPASDPHGDN LTYSVFYTKE GIARERAENT SRPGEMQVTI QNLMPATVYV FRVMAQNKHG
SGESSAPLRV ETQPEVQLPG PAPNIRAYAT SPTSITVTWE TPVSGNGEIQ NYKLYYMEKG
TDKEQDVDVS SHSFTINGLK KYTEYSFRVV AYNKHGPGVS TQDVAVRTLS DVPSAAPQNL
SLEVRNSKSI MIHWQPPAPA TQNGQITGYK IRYRKASRKS DVTETLVSGT QLSQLIEGLD
RGTEYNFRVA ALTINGTGPA TDWLSAETFE SDLDETRVPE VPSSLHVRPL VTSIVVSWTP
PENQNIVVRG YAIGYGIGSP HAQTIKVDYK QRYYTIENLD PSSHYVITLK AFNNVGEGIP
LYESAVTRPH TDTSEVDLFV INAPYTPVPD PTPMMPPVGV QASILSHDTI RITWADNSLP
KHQKITDSRY YTVRWKTNIP ANTKYKNANA TTLSYLVTGL KPNTLYEFSV MVTKGRRSST
WSMTAHGTTF ELVPTSPPKD VTVVSKEGKP KTIIVNWQPP SEANGKITGY IIYYSTDVNA
EIHDWVIEPV VGNRLTHQIQ ELTLDTPYYF KIQARNSKGM GPMSEAVQFR TPKASGSGGK
GSRLPDLGSD YKPPMSGSNS PHGSPTSPLD SNMLLVIIVS VGVITIVVVV IIAVFCTRRT
TSHQKKKRAA CKSVNGSHKY KGNSKDVKPP DLWIHHERLE LKPIDKSPDP NPIMTDTPIP
RNSQDITPVD NSMDSNIHQR RNSYRGHESE DSMSTLAGRR GMRPKMMMPF DSQPPQPVIS
AHPIHSLDHP HHHFHSSSLA SPARSHLYHP GSPWPIGTSM SLSDRANSTE SVRNTPSTDT
MPASSSQTCC TDHQDPEGAT SSSYLASSQE EDSGQSLPTA HVRPSHPLKS FAVPAVPPPG
PPTYDPALPS TPLLSQQALN HHIHSVKTAS IGTLGRSRPP MPVVVPSAPE VQETTRMLED
SESSYEPDEL TKEMAHLEGL MKDLNAITTA
//