ID B4HMM9_DROSE Unreviewed; 1338 AA.
AC B4HMM9;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 27-MAR-2024, entry version 99.
DE SubName: Full=GM21210 {ECO:0000313|EMBL:EDW47241.1};
GN Name=Dsec\GM21210 {ECO:0000313|EMBL:EDW47241.1};
GN ORFNames=Dsec_GM21210 {ECO:0000313|EMBL:EDW47241.1};
OS Drosophila sechellia (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7238 {ECO:0000313|Proteomes:UP000001292};
RN [1] {ECO:0000313|EMBL:EDW47241.1, ECO:0000313|Proteomes:UP000001292}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Rob3c / Tucson 14021-0248.25
RC {ECO:0000313|Proteomes:UP000001292};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH480816; EDW47241.1; -; Genomic_DNA.
DR RefSeq; XP_002033228.1; XM_002033192.1.
DR STRING; 7238.B4HMM9; -.
DR EnsemblMetazoa; FBtr0204195; FBpp0202687; FBgn0176091.
DR HOGENOM; CLU_003163_0_0_1; -.
DR OMA; PGTGNQF; -.
DR PhylomeDB; B4HMM9; -.
DR Proteomes; UP000001292; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:EnsemblMetazoa.
DR GO; GO:2001197; P:basement membrane assembly involved in embryonic body morphogenesis; IEA:EnsemblMetazoa.
DR GO; GO:0007160; P:cell-matrix adhesion; IEA:InterPro.
DR GO; GO:0110011; P:regulation of basement membrane organization; IEA:EnsemblMetazoa.
DR CDD; cd00054; EGF_CA; 1.
DR CDD; cd00255; nidG2; 1.
DR Gene3D; 2.40.155.10; Green fluorescent protein; 1.
DR Gene3D; 2.10.25.10; Laminin; 6.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR006605; G2_nidogen/fibulin_G2F.
DR InterPro; IPR009017; GFP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR000033; LDLR_classB_rpt.
DR InterPro; IPR003886; NIDO_dom.
DR PANTHER; PTHR46513:SF34; NIDOGEN (ENTACTIN); 1.
DR PANTHER; PTHR46513; VITELLOGENIN RECEPTOR-LIKE PROTEIN-RELATED-RELATED; 1.
DR Pfam; PF12947; EGF_3; 3.
DR Pfam; PF07645; EGF_CA; 1.
DR Pfam; PF07474; G2F; 1.
DR Pfam; PF06119; NIDO; 1.
DR SMART; SM00181; EGF; 11.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00682; G2F; 1.
DR SMART; SM00135; LY; 3.
DR SMART; SM00539; NIDO; 1.
DR SUPFAM; SSF54511; GFP-like; 1.
DR SUPFAM; SSF57184; Growth factor receptor domain; 2.
DR SUPFAM; SSF63825; YWTD domain; 1.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS01186; EGF_2; 7.
DR PROSITE; PS50026; EGF_3; 5.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51120; LDLRB; 1.
DR PROSITE; PS51220; NIDO; 1.
DR PROSITE; PS50993; NIDOGEN_G2; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022869};
KW Reference proteome {ECO:0000313|Proteomes:UP000001292};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022869}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1338
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002808774"
FT DOMAIN 107..260
FT /note="NIDO"
FT /evidence="ECO:0000259|PROSITE:PS51220"
FT DOMAIN 325..550
FT /note="Nidogen G2 beta-barrel"
FT /evidence="ECO:0000259|PROSITE:PS50993"
FT DOMAIN 591..631
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 794..835
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 838..880
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 961..1002
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1003..1043
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 1159..1204
FT /note="LDL-receptor class B"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00461"
FT REGION 645..697
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 666..684
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 804..821
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 849..866
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 971..988
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1338 AA; 148165 MW; 1EA528C355FC8ED0 CRC64;
MPTFGSKLLA CLLLSSVILV SGQFEHYLDS LRASELYEFE EGSLGSIQLL PKGDSETIVL
QLEQPIHFYG EQYEQLYINT NGILTFNSEF PEYLNQPFPL EYASIAAFYS NVDTSFSDEG
TWISLFESKE QSILDRASSL VRYAFSSQSE FEARQVIVAT WRNVGYFDSK TDRLNTFQVA
LIANEQSTFV QFIYPDGGLN WLQGETAGLG LPDIRAQAGF VAEDGRFYTL NGSGSENARF
LSESTNLGVP GVWLFEVAPI ESEQNVRTPD NAESLTESPA LALSCQAHAH QCHEKAECHD
KAEGYCCVCG SGFYGNGKSC LANDQPIRVT GTLAGELNKQ PVSEDAQLQS YVVTSEGRTY
TTINPLTPEL GAQLRLVLPL LTTVPWLFAK SVGGVVNGYQ LTGGVYTHVS RLQFDSGENL
HVNQTFEGLN YWDQLSVKIE IYGEVPAVAA DAVLVLPDYV EEYTFERPGE LKSVQVLNIN
ITEEQRVLGL QVEQRILYRS CLRDDEADPS ATKVLQKISN MALDYVERDQ ALRIGAMSKV
GVTPESNACN DGTAECVENS VCVPYKDTYR CDCYHGFAAQ LDERGVEVCL DIDECATGSH
VCDENAVCDN TEGGFNCYCT EGFEGNGYRC LSNSTADNIE YPPAVEGQAE PTSEPNPNPI
PYPDQGQDQE REREDERERD REQYPQPNPY PYPEEQIPQH PDECYRCSKD ADCYQGRCTC
HEGFDGDGYT CTNICGHGEV WENGRCEPLL LERHDVDLLC DALGECRCPY GYELSEDSLR
CTYVQEFDGE RNADLIPCDV DENCHINATC NWYGQEFRHI CTCQPGFRGD GYNCDPISDD
SCAIRPNICD VHADCVYEEQ LGKSECHCQA GYTGNGFNCQ LAAECQSAEH CGENAFCDDG
VCRCQADFER DVSDRCVPAG RCGSVFCGSN AICKWDSVEG VQYCDCLKGY QGDALTGCTS
KPLSCHVLNN CGIHATCEPT EDPANYECQC IAGFKGDGYV CIEEQNCLNN PTLCDMNAQC
RSTNSGLVCV CNQGFFGNGS LCQERQHQNS EFLIVSHGVM IARVPLNGRN VRPISMAQMA
IGLDIDCLEG RVYCGDISTK KIVSAKYDST DLRLFITTIM NLPRDTIEVA SLDDPSLRTV
IINKQLVNPR GIAVDPYREK LFWSDWDRES PKIEMSNLDG TGRELLLGKD AVTLPNSLVV
LENSGEVCYA DAGTKKVECI EPQNRQIRTI SNQLTYPFGI TFTHDQFYWT DWTTKKVEIV
DSLGARQTPI QPPFFGSHKM YGMTVVEQHC PQYQSPCQIS NGGCTDSRLC LVNRQAPSGK
SCKCTSASTG CTVPAPGY
//