ID A0A182VE75_ANOME Unreviewed; 1319 AA.
AC A0A182VE75;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE RecName: Full=Nidogen {ECO:0008006|Google:ProtNLM};
OS Anopheles merus (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=30066 {ECO:0000313|EnsemblMetazoa:AMEM013471-PA, ECO:0000313|Proteomes:UP000075903};
RN [1] {ECO:0000313|Proteomes:UP000075903}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MAF {ECO:0000313|Proteomes:UP000075903};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Howell P., Walton C., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles merus MAF (V2).";
RL Submitted (JAN-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMEM013471-PA}
RP IDENTIFICATION.
RC STRAIN=MAF {ECO:0000313|EnsemblMetazoa:AMEM013471-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 30066.A0A182VE75; -.
DR EnsemblMetazoa; AMEM013471-RA; AMEM013471-PA; AMEM013471.
DR VEuPathDB; VectorBase:AMEM013471; -.
DR VEuPathDB; VectorBase:AMEM21_011477; -.
DR Proteomes; UP000075903; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR CDD; cd00053; EGF; 1.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00255; nidG2; 1.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 1.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR003886; NIDO_dom.
DR PANTHER; PTHR46513:SF34; NIDOGEN (ENTACTIN); 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12947; EGF_3; 3.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF00058; Ldl_recept_b; 2.
DR Pfam; PF06119; NIDO; 1.
DR SMART; SM00181; EGF; 10.
DR SMART; SM00179; EGF_CA; 3.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00135; LY; 4.
DR SMART; SM00539; NIDO; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01186; EGF_2; 5.
DR PROSITE; PS50026; EGF_3; 6.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51120; LDLRB; 3.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022869};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022869}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1319
FT /note="Nidogen"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008139718"
FT DOMAIN 100..257
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 320..543
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 584..623
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 760..801
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 803..844
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 882..923
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 925..965
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 968..1008
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 1055..1097
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1098..1141
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REPEAT 1142..1187
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REGION 261..282
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..282
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 770..787
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 891..908
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1319 AA; 145589 MW; DF862465A44E81BB CRC64;
MSVKSEVWQG LVVYAVLLLL PSIVRAVDPR DLYSYANEAS EVLPRGDEEF QYMDLDMPAY
FYNEKYDRVY INTNGILSFG GELVGFLNLP FPLGNPLIAP FYANVDTTLP NDTATIVYFK
SRDPTLLHRT TELVRDNFGS LLARTRGFEA LQVFVATWEH VGHYSMKNEV QNSFQVAIIQ
GATDTFVQFL YPEEGISWIQ GDTGDSGLPD VRAQAGFAAE DDRTFMLPGS GTDNVRHLTI
SSNINQPGVW LYHVGPLAPE GNVEQPDRQH AEPSEPRSCA DGGRYKCHSA ASCVNSNWGF
CCQCKSGYYG NGFSCVKSDI PLRVAGKVLG NVNGEPLDTQ LQSYVVMVDG RTYTAISPLE
EQLGTDLQLT QILGGTIAWL FAKPLGNSMN GYQLTGGKFN HTSTLQFETG ESFYITQQYT
GLNVWDQLAL DVHLSGQLPS IPALEKLHLD DYSVAFRRSG KDRLQAVSAH SFRVESQARN
VSYTLYQDIT YEGCSGNEDT IAGKDESMLK ITRINLGYEP RERAVRMGML SKMVIGDQFN
PCDEANCGDN TVCVPKPDDT FDCNCKNGFT YIPYGTSDRI NCVDIDECSG VNICDENAQC
YNEPGGYSCR CNPGYEGNGY VCDKVVPGGY SQPTSSSYTV STPASYGGHS YNEVDSDPTQ
EEPEQCERCA EFADCVEGQC QCRSGYDGDG VSYCQSLCDP ESVWNGEECV KETYVEEEGI
EPFCTILGCT CPTGYTLIEY AFNQICRRVE LDPEEVPQEG MPPCDVENNC SPHANCEWRD
SSYRHECICN PGYDGNGYTC VEKEVSCLDD EEICDIHASC TYMLNRKSVC VCNKGYEGDG
RTCHLAPECA VDDDCGMNSE CQQGVCVCQE GYERDLSDFC VRAGSCGGAY CAENAVCVID
PVQKIPYCHC PQGFVGDGVS QCRSIPPPCN VRNNCGLHAA CVPSYRDSSS YECMCNQGFF
GDGFVCTPER NCANIPSLCD PNARCESTTN GYQCICNDGF IGNGSVCNTA HRLDDGFLLI
SQGVANIRVP LNGGLGYPVT MAFMSIGLDR DCAEGRIYWS DIAAQQIVSA KYDGTDQKPF
ITKDIVSPEG VAVDWISRRL YWTDSAKDTI EVASLDNPEQ RTVLISKFLV NPRGIAVDPH
QTKLYWSDWN RDGPKIEWSN LDGTEREQLV GSPQVALPNS IQVSMATGEL CYADAGTKKV
ECIDTYSRQI RTIASNLTYP FGLAVTDDLF YWTDWMTKKI ESINLYGVRQ KPINSPVFGN
HKMYGMTAVT DKCPLFHSPC VSNNGDCPED KICLINPRAP SGRSCKCTRN CNNDVVLDY
//