ID G3VVP5_SARHA Unreviewed; 1254 AA.
AC G3VVP5;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 81.
DE SubName: Full=Nidogen 2 {ECO:0000313|Ensembl:ENSSHAP00000007250.2};
GN Name=NID2 {ECO:0000313|Ensembl:ENSSHAP00000007250.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000007250.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000007250.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000007250.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VVP5; -.
DR STRING; 9305.ENSSHAP00000007250; -.
DR Ensembl; ENSSHAT00000007312.2; ENSSHAP00000007250.2; ENSSHAG00000006301.2.
DR eggNOG; KOG1214; Eukaryota.
DR GeneTree; ENSGT00940000157901; -.
DR TreeFam; TF320666; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00191; TY; 1.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 2.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 4.10.800.10; Thyroglobulin type-1; 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR003886; NIDO_dom.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR PANTHER; PTHR46513:SF15; NIDOGEN-2 ISOFORM X1; 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12947; EGF_3; 2.
DR Pfam; PF07645; EGF_CA; 1.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF00058; Ldl_recept_b; 2.
DR Pfam; PF06119; NIDO; 1.
DR Pfam; PF00086; Thyroglobulin_1; 1.
DR SMART; SM00181; EGF; 5.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00135; LY; 5.
DR SMART; SM00539; NIDO; 1.
DR SMART; SM00211; TY; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 1.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 2.
DR PROSITE; PS01186; EGF_2; 3.
DR PROSITE; PS50026; EGF_3; 4.
DR PROSITE; PS01187; EGF_CA; 2.
DR PROSITE; PS51120; LDLRB; 3.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
DR PROSITE; PS00484; THYROGLOBULIN_1_1; 1.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00500};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022869};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 134..300
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 462..716
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 717..758
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 759..801
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 806..849
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 850..886
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 896..964
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT REPEAT 1034..1077
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1078..1120
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1121..1165
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DISULFID 934..941
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ SEQUENCE 1254 AA; 140631 MW; 39F5C5E48646B0F6 CRC64;
MFGVPDSGLV VLGCGFALLL KQSREHTMKV ERDTVQKRQV LLLLLHLHWL VPLSSALHPQ
DLFQYGEAWG DQFLQEGDDE SSSLVKLKRP LRFYEAQFNS LYVGTNGIIS TQDFPRERQY
VNDVFPTEFP AIAPFLSDLD TSNGRGKISY REDDSVEILN KAALYIRTGF PKTARNFTPT
HAFLATWEEV GAYEKVMHNV LPSRLNNTFQ VVLTFDEFDT YALFLYPTNG LQFFATRPKE
SYDTQLELPA RVGFSQGKDD YQKKEGLHFS VTSTEQSVKN LYQISNLGIP GVWAFHIGST
FRLNNVEPAN FRGNLSINHS LEHFLTNAEE YSVLESNYAE DKMNYVGSFY NEVNMMNDES
EHSPNGQELV SNSHSKTEIL LDLNDSSHSQ VDHSEPKHES DLLHHQIEKG AQGEEVFLIS
QSNTASSQQR GIKNPAPPDT KGEFLDLLKE ASPSYLKNET IQPYPDSGTM SSEMEALSDY
SEGGVLTNYP FPENKLPLNH GRYIMGMEED SNFSTNVSIH QAASKETCEQ NHSQCSQHAF
CTDYSTGFCC HCQSKYYGNG KHCLLEGATF SNHMDITFYP GEEKVHIIHT AEGLDSENYL
SVKTNIQGKV PFIPANFTAY IAPYKEIYHY SNLAVTSTTS REYYLSFGEI NQTFSYHLHQ
NITYQDCKHT PNPRIVPTTQ QLNVDRIFAL YNEEEKVLRY AMTNQIGPKE GDKDPSMTNP
CYDGSHACDI RAQCLVGTGL GYSCECTTGY QGDGRSCFDV NECTTGSHLC GPNSVCVNLP
GSYRCECQSG YEFGEDKHTC ILIASSFNPC EEGNHNCASV EKAQCIYHGG SSYSCICLSG
YVGNGHECTD VDECIEERCH PAAACYNVPG SFSCHCQHGY KGDGFQCFPE STQGPQTVCE
RWRESLLEHY GGRPREDQYV PQCDEFGHFN PLQCHGNSNY CWCVDKNGRE VEGTRSQPGI
TPACIPSIAP PTILPSPYPN VIPPSVGTFL LYAQGQQIGY LPLNGTRLQK ETAKTLLSLH
GSIAVGIDYD CQEKMVYWTD VAGRVISRAS LELGNEPEIV ISSGLMSPEG LAIDYFHRTM
FWTDSGLDKI ESAKLDGSER KVLFDTDLVN PRAIAVDPIR GNLFWTDWNR EGPKIETSSI
DGTNRRILVN KDIGLPNGLT FDPFSKLICW ADAGTKKLEC ALSNGTGRHI IQNNLNYPFS
IVSYANHFYH TDWRRDGVIA VNKENGQFTD EYLPEQRSHL YGITAIYPYC PKVE
//