ID T1KLJ0_TETUR Unreviewed; 1313 AA.
AC T1KLJ0;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE RecName: Full=Nidogen {ECO:0008006|Google:ProtNLM};
GN Name=107365142 {ECO:0000313|EnsemblMetazoa:tetur14g02610.1};
OS Tetranychus urticae (Two-spotted spider mite).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Acari;
OC Acariformes; Trombidiformes; Prostigmata; Eleutherengona; Raphignathae;
OC Tetranychoidea; Tetranychidae; Tetranychus.
OX NCBI_TaxID=32264 {ECO:0000313|EnsemblMetazoa:tetur14g02610.1, ECO:0000313|Proteomes:UP000015104};
RN [1] {ECO:0000313|Proteomes:UP000015104}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=London {ECO:0000313|Proteomes:UP000015104};
RA Rombauts S.;
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:tetur14g02610.1}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAEY01000211; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_015788092.1; XM_015932606.1.
DR STRING; 32264.T1KLJ0; -.
DR EnsemblMetazoa; tetur14g02610.1; tetur14g02610.1; tetur14g02610.
DR GeneID; 107365142; -.
DR KEGG; tut:107365142; -.
DR eggNOG; KOG1214; Eukaryota.
DR HOGENOM; CLU_003163_1_0_1; -.
DR OMA; PGTGNQF; -.
DR OrthoDB; 25347at2759; -.
DR Proteomes; UP000015104; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 1.
DR Gene3D; 2.10.25.10; Laminin; 10.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR024730; MSP1_EGF_1.
DR InterPro; IPR003886; NIDO_dom.
DR PANTHER; PTHR46513:SF34; NIDOGEN (ENTACTIN); 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12947; EGF_3; 3.
DR Pfam; PF12946; EGF_MSP1_1; 1.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF00058; Ldl_recept_b; 3.
DR Pfam; PF06119; NIDO; 1.
DR SMART; SM00181; EGF; 12.
DR SMART; SM00179; EGF_CA; 4.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00135; LY; 5.
DR SMART; SM00539; NIDO; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01186; EGF_2; 10.
DR PROSITE; PS50026; EGF_3; 9.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51120; LDLRB; 3.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022869};
KW Reference proteome {ECO:0000313|Proteomes:UP000015104};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022869}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1313
FT /note="Nidogen"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004580864"
FT DOMAIN 98..255
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 328..552
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 546..584
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 595..635
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 637..676
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 684..725
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 727..767
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 774..814
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 860..901
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 903..944
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 945..987
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 1037..1078
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1079..1121
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1122..1166
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT DISULFID 694..711
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 870..887
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 913..930
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 956..973
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1313 AA; 143302 MW; 47BA82DA0F053DE6 CRC64;
MANLQAKIVF VIAICLTINS ANGITRQDLF PFAGPDDEVL TGKDEGSSQQ IKLSSNVILF
ERSYDFLFVS INGVISFLSD AANYLNYALP LSIPMIAPFY ADIDTRITGK VYYRQTTNPD
QLNLANGYIS SVFTSGLNFQ ANSLFIATWV DVGPYRNNLS FIDPNNEKKN TFQVVIATDG
KESYAFFLYP QKGITWAKSD TKEAREPALA QVGFIDLNGY QFYPLDVSGT SRTLNLDRMS
NIKEAGIFIF RIGKLGVSGG VIEPDRPSSS RESDVVPSDV MDRCETTFDP CPPEAHCEQY
STGFCCKCKE GFIGNGRQCI KSDTLDTSPI RLVGQVHGTI NDQKIGDADI YVYADVNSGQ
IYTTISRLQN IQPSDFQSLI PLTDLIGWLF AKGQPGTVNG FSLTGGVFNR TARITFPQTG
EQVTVIERYL GLDVFGNLKM DIVITGNLPA IPMGSRVEFD DAIEVYASKE PGMFTSSSGR
SYRLTGASMD IPISIEQEIQ FTSCAYAPKD QQVKTIKISA NRYNIDVRTI QDPIRFTAEY
SVVPPDDNPC KNGNAVCGAN SQCVVDGENY RCVCNPGFRS DDQSDPNSVR PLCLDIDECL
TNQSRCHKDA RCQNQPGNYR CICNDGFVGD GYRCRPQDSM CGDKFCDPNA DCLPSTDGSK
KCICLPGFTG NGIICRSVLG QPGSPNACPA DLRCDRNAQC AINPVTREYG CVCNPGFTGN
GLVCELVGKV CSSDANCGEN GQCKTSNSGV GYCVCKQGYY GDGYTCIRTT EPKQIENCFT
SKICDINAVC NDLPNGETKC ICNEGYTGDG RTCAIISDCK EDSDCPASSK CDPLPEIVDG
GRCTCIQGFA MIENTCVSND KAPCNIVKNC HADAACKFNL EEKQFKCECN KGFKGDGKNC
ESTQIPCNVL NTCGSHAACN YNPIELGYRC MCLSGYIGDG YTCVPSSSCR DYPYQCHRDA
ECVFSPTTRE YKCRCNNGFS GDGLDCSPNN RYDGPYLVAV QGMSFARLPM APGRQGEFLF
VLNNNLPVGI AFDCAKGSLY YTDIIDKAIS KAGLNGSHDI TILTGLISPE GLAIDWISRN
IYWTDSLKRA IQVSRLNGSS VRTLISDDLV NPRGIALHPG LGKMYFTDWN RAAPKIEMAN
MDGTGRVVLV SEAVKLPNML AIDYMTNELC WTDAGLKRIE CIDLYGSKRR LVHYTSGYPF
DLAIVEDYIF WTDWETNNVH RVGRNGGKEE TLMIPKGSNG RLHGIVAMAE ACPAMSNPCR
IRNGGCKHLC LPSGPRSRTC VCPDDDGSGE ECTLMLPPLP RPSPSGVGII GIG
//