ID F6PYN4_HORSE Unreviewed; 1414 AA.
AC F6PYN4;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=Neogenin 1 {ECO:0000313|Ensembl:ENSECAP00000022424.3};
GN Name=NEO1 {ECO:0000313|Ensembl:ENSECAP00000022424.3,
GN ECO:0000313|VGNC:VGNC:20685};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000022424.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000022424.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022424.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000022424.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022424.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the immunoglobulin superfamily. DCC family.
CC {ECO:0000256|ARBA:ARBA00009588}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSECAT00000026833.3; ENSECAP00000022424.3; ENSECAG00000024743.3.
DR VGNC; VGNC:20685; NEO1.
DR GeneTree; ENSGT00940000156684; -.
DR Proteomes; UP000002281; Chromosome 1.
DR Bgee; ENSECAG00000024743; Expressed in brainstem and 23 other cell types or tissues.
DR ExpressionAtlas; F6PYN4; baseline.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 6.
DR CDD; cd00096; Ig; 1.
DR CDD; cd05722; IgI_1_Neogenin_like; 1.
DR CDD; cd05723; IgI_4_Neogenin_like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 10.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR013098; Ig_I-set.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR003598; Ig_sub2.
DR InterPro; IPR010560; Neogenin_C.
DR PANTHER; PTHR44170:SF14; NEOGENIN; 1.
DR PANTHER; PTHR44170; PROTEIN SIDEKICK; 1.
DR Pfam; PF00041; fn3; 6.
DR Pfam; PF07679; I-set; 2.
DR Pfam; PF13895; Ig_2; 1.
DR Pfam; PF13927; Ig_3; 1.
DR Pfam; PF06583; Neogenin_C; 1.
DR PRINTS; PR00014; FNTYPEIII.
DR PRINTS; PR01832; VEGFRECEPTOR.
DR SMART; SM00060; FN3; 6.
DR SMART; SM00409; IG; 4.
DR SMART; SM00408; IGc2; 4.
DR SUPFAM; SSF49265; Fibronectin type III; 3.
DR SUPFAM; SSF48726; Immunoglobulin; 4.
DR PROSITE; PS50853; FN3; 6.
DR PROSITE; PS50835; IG_LIKE; 4.
PE 3: Inferred from homology;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Immunoglobulin domain {ECO:0000256|ARBA:ARBA00023319};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1057..1081
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 12..107
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 112..198
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 203..296
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 301..386
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 421..515
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 521..611
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 616..711
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 721..814
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 820..916
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 921..1018
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 1013..1050
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1087..1113
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1127..1159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1188..1229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1243..1335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1140..1159
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1243..1304
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1315..1332
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1414 AA; 154999 MW; C14257402BE5996E CRC64;
MTIGSGVRTF TPFYFLVEPV DTLSSRGSSV ILNCSAYSEP SPKIEWKKDG TFLNLVSDDR
RQLLPDGSLF ITNVVHAKHN KPDEGYYQCV ATVENLGTIV SRTAKLTVAS LPRFASQPEP
SSVYAGNSAV LNCEVNADLV PFVRWEQNRQ PLLLDDRVIK LPSGTLVISN VTERDGGLYR
CVVESGGPPK YSDEAELKVL PDPEVTSNLV FLKQPSSLVR VIGQSAVLPC VALGLPTPTI
RWMKNEEALD TESSERLVLL AGGSLEISDV TEDDAGTYFC IADNGNETIE AQAELTVQAQ
PEFLKQPTNI YAHESMDIIF ECEVTGKPAP TVKWVKNGDM VIPSDYFKIV KEHNLQVLGL
VKSDEGFYQC IAENDVGNAQ AGAQLIILEH DVAIPTLPPT SLTSATTDHL APATTGPLPS
APRDVVASLV STRFIKLTWR TPASDPHGDN LTYSVFYTKE GIARERVENT SHPGEMQVTI
QNLMPATVYI FRVMAQNKHG SGESSAPLRV ETQPEVQLPG PAPNIRAYAT SPTSITVTWE
TPLSGNGEIQ NYKLYYMEKG TDKEQDVDVS SHSHTINGLK KYTEYSFRVV AYNKHGPGVS
TQDVAVRTLS DVPSAAPQNL SLEVRNSKSI MIHWQPPPPA TQNGQITGYK IRYRKASRKS
DVTETLVTGT QLSQLIEGLD RGTEYNFRVA ALTINGTGPA TDWLSAETFE SDLDETRVPD
VPSSLHVRPL VTSIVVSWTP PENQNIVVRG YAIGYGIGSP HAQTIKVDYK QRYYTIENLD
PSSHYVITLK AFNNVGEGIP LYESAVTRPH TVPDPTPMMP PVGVQASILS HDTIRITWAD
NSLPKHQKIT DSRYYTVRWK TNIPANTKYK NANATTLSYL VTGLKANTLY EFSVMVTKGR
RSSTWSMTAH GTTFELVPTS PPKDVTVVSK EGKPRTIIVN WQPPSEANGK ITGYIIYYST
DVNAEIHDWV IEPVVGNRLT HQIQELTLDT PYYFKIQARN SKGMGPMSEA VQFRTPKASG
SAGKGSRLPD LGSDYKPPMS GSNSPHGSPT SPLDSNMLLV IIVSVGVITI VVVVIIAVFC
TRRTTSHQKK KRAACKSVNG SHKYKGNSKD VKPPDLWIHH ERLELKPIDK SPDPNPIMTD
TPIPRNSQDI TPVDNSMDSN IHQRRNSYRG HESEDSMSTL AGRRGMRPKM MMPFDSQPPQ
PVISAHPIHS LDNPHHHFHS SSLASPARSH LYHPGSPWPI GTSMSLSDRA NSTESIRNTP
STDTMPASSS QTCCTDHQDP EGATSSSYLA SSQEEDSGQS LPTAHVRPSH PLKSFAVPAI
PPPGPPTYDP TLPSTPLLSQ QALNHHLHSV KTASIGTLGR SRPPMPVVVP SAPDVQETTR
MLEDSESSYE PDELTKEMAH LEGLMKDLNA ITTA
//