ID G1L943_AILME Unreviewed; 1199 AA.
AC G1L943;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 80.
DE SubName: Full=Nidogen 1 {ECO:0000313|Ensembl:ENSAMEP00000003414.2};
GN Name=NID1 {ECO:0000313|Ensembl:ENSAMEP00000003414.2};
OS Ailuropoda melanoleuca (Giant panda).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000003414.2, ECO:0000313|Proteomes:UP000008912};
RN [1] {ECO:0000313|Ensembl:ENSAMEP00000003414.2, ECO:0000313|Proteomes:UP000008912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20010809; DOI=10.1038/nature08696;
RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA Wang J.;
RT "The sequence and de novo assembly of the giant panda genome.";
RL Nature 463:311-317(2010).
RN [2] {ECO:0000313|Ensembl:ENSAMEP00000003414.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G1L943; -.
DR STRING; 9646.ENSAMEP00000003414; -.
DR Ensembl; ENSAMET00000003550.2; ENSAMEP00000003414.2; ENSAMEG00000003184.2.
DR eggNOG; KOG1214; Eukaryota.
DR GeneTree; ENSGT00940000156318; -.
DR HOGENOM; CLU_003163_1_0_1; -.
DR TreeFam; TF320666; -.
DR Proteomes; UP000008912; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00255; nidG2; 1.
DR CDD; cd00191; TY; 1.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 1.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 4.10.800.10; Thyroglobulin type-1; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR026823; cEGF.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR PANTHER; PTHR46513:SF6; NIDOGEN-1; 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12662; cEGF; 1.
DR Pfam; PF12947; EGF_3; 3.
DR Pfam; PF14670; FXa_inhibition; 1.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF00058; Ldl_recept_b; 3.
DR Pfam; PF06119; NIDO; 1.
DR Pfam; PF00086; Thyroglobulin_1; 1.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00135; LY; 5.
DR SMART; SM00539; NIDO; 1.
DR SMART; SM00211; TY; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 1.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51120; LDLRB; 3.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
DR PROSITE; PS00484; THYROGLOBULIN_1_1; 1.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00500};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022869};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..1199
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030170653"
FT DOMAIN 106..268
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 427..664
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 665..706
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 713..756
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 757..793
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 801..871
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT REPEAT 942..984
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 985..1027
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1028..1072
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REGION 310..352
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 841..848
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ SEQUENCE 1199 AA; 131398 MW; D23146462A151FA0 CRC64;
MLAAMQRSGA AWTRTLLLQL LLAGPGGCLS RQELFPFGPG QGDLELKAGD DHVSPALELK
RALRFFDRSD IDSVYVTTNG IIATSEPPAK ESYPGFFPPT FGVVAPFLAD LDTTDGLGKV
YYREDLSPSV TQLAAECVQR GFPEVSFQPS SAVVVTWESV APYQGPSKDP DQEGKRNTFQ
AVLASSDSSS YAIFLYPEDG LQFYTTFSKK DEKQVPAVVA FSQGVQGLLW KSEGAYNIFA
NDRESLGNLA KSSNSGQQGI WVFEIGSPAT ASGVVPSDVS LGLDDGAEYD DEDYELVTHL
DLDDLSTTPF PYEARRRGDP GTHSAPRVLS PRRLPTERLP GPPTERTRSF QPPAETFPQQ
HAQVIDVDEV EETGVVFSYN TDSRQTCAHN RHQCSVHAEC RDFATGFCCS CAAGYTGNGR
QCVAEGSPQR VNGKVKGRIF VGNDPVPIVF ENTDLHSYVV MNHGRSYTAI STIPETVGYS
LLPLAPVGGI IGWMFAVEQD GFKNGFSITG GEFTRQAEVT FVGYPGKLVI KQQFSGIDEH
GHLTINTELE GHVPHIPFGS SVHIEPYTEL YHYSRGVITS SSTREYTVME PEQDGAAPSK
IYTYQWRQTI TFQECAHDTS QPALPNTQQL SVDSVFVLYN QEEKILRYAL SNSIGPVREG
SPDALQNPCY IGTHGCDTNA ACRPGPGVQF TCECSIGFRG DGRTCSAVVD QRPINYCMTG
LHDCDIPQRA QCLYTGGSSY TCSCLPGFSG DGRACQDVDE CQSSRCHPDA FCYNTPGSFT
CQCKPGYHGD GFHCVPEEME KTRCQLEREH ILGAAGLTHP QRPGLFVPEC DEHGHYVPTQ
CHGSTGQCWC VDRDGRELEG TRTRPGMRPP CLSTVAPPIH HGPSVPTAVI PLPPGTHLLF
AQTGKIERLP LEGSTMQKTE AKTLLHAPDK VIIGLAFDCV DKMVYWTDIS EPSIGRASLH
GGEPTTIIRQ DLGSPEGIAL DHLGRNIFWT DSHLDRIEVA KLDGTQRRVL FETDLVNPRG
IVTDSVRGNL YWTDWNRDSP KIETSYMDGT NRRVLVQDDL GLPNGLTFDA YSSQLCWVDA
GTNRVECLKP GQSGRRKVLE GLQYPFAVTS YGKHLYYTDW KTNSVVAVDL AISKETDTFH
PHKQTRLYGI TTALSQCPQG HNYCSVNNGG CTHLCLATPG SRTCRCPDNT LGVDCIERK
//