ID A0A3Q2HLV9_HORSE Unreviewed; 1360 AA.
AC A0A3Q2HLV9;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Nidogen 2 {ECO:0000313|Ensembl:ENSECAP00000035822.3};
GN Name=NID2 {ECO:0000313|Ensembl:ENSECAP00000035822.3,
GN ECO:0000313|VGNC:VGNC:55919};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000035822.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000035822.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000035822.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000035822.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000035822.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000035822; -.
DR PaxDb; 9796-ENSECAP00000035822; -.
DR Ensembl; ENSECAT00000040423.3; ENSECAP00000035822.3; ENSECAG00000015369.4.
DR VGNC; VGNC:55919; NID2.
DR GeneTree; ENSGT00940000157901; -.
DR InParanoid; A0A3Q2HLV9; -.
DR OMA; TCEHNHG; -.
DR Proteomes; UP000002281; Chromosome 1.
DR Bgee; ENSECAG00000015369; Expressed in synovial membrane of synovial joint and 20 other cell types or tissues.
DR GO; GO:0005604; C:basement membrane; IBA:GO_Central.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IBA:GO_Central.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00255; nidG2; 1.
DR CDD; cd00191; TY; 2.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 1.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 4.10.800.10; Thyroglobulin type-1; 2.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR PANTHER; PTHR46513:SF15; NIDOGEN-2 ISOFORM X1; 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12662; cEGF; 1.
DR Pfam; PF12947; EGF_3; 2.
DR Pfam; PF07645; EGF_CA; 1.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF00058; Ldl_recept_b; 3.
DR Pfam; PF06119; NIDO; 1.
DR Pfam; PF00086; Thyroglobulin_1; 2.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 4.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00135; LY; 4.
DR SMART; SM00539; NIDO; 1.
DR SMART; SM00211; TY; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 2.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01186; EGF_2; 4.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51120; LDLRB; 3.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
DR PROSITE; PS00484; THYROGLOBULIN_1_1; 2.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 2.
PE 1: Evidence at protein level;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022869};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A3Q2HLV9};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..1360
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5040491007"
FT DOMAIN 105..270
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 515..745
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 746..787
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 788..830
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 835..877
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 878..916
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 923..991
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 1001..1069
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT REPEAT 1139..1182
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1183..1225
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1226..1270
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REGION 278..300
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 319..450
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 978..1000
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1058..1084
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 397..411
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1069..1084
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 846..863
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 960..967
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 1039..1046
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ SEQUENCE 1360 AA; 146881 MW; E5D6FEEC94D21A18 CRC64;
MRGDRAAVRP ALPLLLWLPL LWPRAGALRP GELFPHGPAR GDRLLRDGDD ESSGAVELAR
PLRFYGARVR RLYVGTNGII STQDFPRETQ YVDDGFPTDF PAIAPFLADV DTSQGRGRVL
FREDTSPEVL ALAARYVRTG FPRAAHFAPT HAFLATWAQV GAYEEAARGA PPSGERNTFQ
AVLASDESDT YALFLYPAGG LQFFGTRPKE SYNVQLELPA RVGFCRGEAT GLKGEAPYFS
LTSTEQSVKN LYQHSNLGVP GVWAFHIGST SPLDNVRPAA AGGGLSAAQT PAPPGQPFSH
VAALESDDAE DDLDYYDGNE EEVEYPPGDP EEAAKGHSSV DVPLHVEADP GPLGESATLD
PQTEEGRPVG ETDASDAKGP TESSEQLETG GLAPPGTEGH PPAPPREGPA PHPETRSLQP
HPAAGTPPSG LDVPPHRPVL DHHPPLGHGR QVVGVEDDIG SNTEVFTYNA GKETCEHNHG
RCSRHAFCTD YATGFCCHCQ SRFYGNGQQC LPEGAPHRVN GKVSGHLRVG PTPVHFTDVD
LHAYIVSNDG RAYMAISHIP QPAARALLPL TPIGGLFGWL FALEKPGWTN GFSLTGAAFT
HDMEVTFHPG GERLLVTQTA EGLDPENYLS IRTHIRGQAP YIPANLTVHV APYKELYHYA
DSAVTSASSR DYALAAGAVN QTRSYRVLQN ITFQACRHAP RPRAAPAVQQ LSVDRVFAWY
AEDEGVLRFA LTSLVGPAGG DSEPTPVNPC YDGSHACAMT ARCRPGAGVD YSCECAPGYQ
GDGRSCADVN ECATGFHRCG PNAMCVNLPG SYRCECRPGY EFADDGHTCR VVAPPPSPCE
DGSHSCAPGQ AQCIPRGGGA FSCACLPGYS GDGHQCSDVD ECSQDPCHPA ASCHNTPGSF
SCRCRPGYHG DGLRCAPDPL SGLKPCERQQ HEAQAQLAVP GAQLHVPQCD EHGHFVPLQC
DSSTGSCWCV DPDGHEVPGS QTRPGSAPPH CGPPEPTPRP RTVCERWRET LLELYGGAPG
DDQYVPQCDE WGHFTPLQCH GRSDFCWCVD RDGREVPGTR SQPGTTPACV PTVAPPTVRP
TPRPDVTPPP VGTYLLYAQG QQIGHLPLNG TRLQKDAART LLSLHGSIVV GIDYDCRERM
VYWTDVAGRT ISRASLEPGA EPETIISSGL MSPEGLAIDH VRRTMYWTDS GLDKIERARL
DGSERKALFH TDLVNPRAIA VDPIRGNLYW TDWNREAPKI ETSSLEGENR RILVNKDIGL
PNGLTFDAFS KLLCWADAGT KRLECTLPDG AGRRVIQSSL NYPFSLVSYA GHFYHTDWRR
DGVISVDRDS GQFTDEYLPE QRSHLYGITA VHPYCPAGRK
//