ID B4GSI5_DROPE Unreviewed; 1864 AA.
AC B4GSI5;
DT 23-SEP-2008, integrated into UniProtKB/TrEMBL.
DT 23-SEP-2008, sequence version 1.
DT 27-MAR-2024, entry version 76.
DE SubName: Full=GL26543 {ECO:0000313|EMBL:EDW25344.1};
GN Name=Dper\GL26543 {ECO:0000313|EMBL:EDW25344.1};
GN ORFNames=Dper_GL26543 {ECO:0000313|EMBL:EDW25344.1};
OS Drosophila persimilis (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7234 {ECO:0000313|Proteomes:UP000008744};
RN [1] {ECO:0000313|EMBL:EDW25344.1, ECO:0000313|Proteomes:UP000008744}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MSH-3 / Tucson 14011-0111.49
RC {ECO:0000313|Proteomes:UP000008744};
RX PubMed=17994087; DOI=10.1038/nature06341;
RG Drosophila 12 genomes consortium;
RA Clark A.G., Eisen M.B., Smith D.R., Bergman C.M., Oliver B., Markow T.A.,
RA Kaufman T.C., Kellis M., Gelbart W., Iyer V.N., Pollard D.A., Sackton T.B.,
RA Larracuente A.M., Singh N.D., Abad J.P., Abt D.N., Adryan B., Aguade M.,
RA Akashi H., Anderson W.W., Aquadro C.F., Ardell D.H., Arguello R.,
RA Artieri C.G., Barbash D.A., Barker D., Barsanti P., Batterham P.,
RA Batzoglou S., Begun D., Bhutkar A., Blanco E., Bosak S.A., Bradley R.K.,
RA Brand A.D., Brent M.R., Brooks A.N., Brown R.H., Butlin R.K., Caggese C.,
RA Calvi B.R., Bernardo de Carvalho A., Caspi A., Castrezana S.,
RA Celniker S.E., Chang J.L., Chapple C., Chatterji S., Chinwalla A.,
RA Civetta A., Clifton S.W., Comeron J.M., Costello J.C., Coyne J.A., Daub J.,
RA David R.G., Delcher A.L., Delehaunty K., Do C.B., Ebling H., Edwards K.,
RA Eickbush T., Evans J.D., Filipski A., Findeiss S., Freyhult E., Fulton L.,
RA Fulton R., Garcia A.C., Gardiner A., Garfield D.A., Garvin B.E., Gibson G.,
RA Gilbert D., Gnerre S., Godfrey J., Good R., Gotea V., Gravely B.,
RA Greenberg A.J., Griffiths-Jones S., Gross S., Guigo R., Gustafson E.A.,
RA Haerty W., Hahn M.W., Halligan D.L., Halpern A.L., Halter G.M., Han M.V.,
RA Heger A., Hillier L., Hinrichs A.S., Holmes I., Hoskins R.A., Hubisz M.J.,
RA Hultmark D., Huntley M.A., Jaffe D.B., Jagadeeshan S., Jeck W.R.,
RA Johnson J., Jones C.D., Jordan W.C., Karpen G.H., Kataoka E.,
RA Keightley P.D., Kheradpour P., Kirkness E.F., Koerich L.B., Kristiansen K.,
RA Kudrna D., Kulathinal R.J., Kumar S., Kwok R., Lander E., Langley C.H.,
RA Lapoint R., Lazzaro B.P., Lee S.J., Levesque L., Li R., Lin C.F., Lin M.F.,
RA Lindblad-Toh K., Llopart A., Long M., Low L., Lozovsky E., Lu J., Luo M.,
RA Machado C.A., Makalowski W., Marzo M., Matsuda M., Matzkin L.,
RA McAllister B., McBride C.S., McKernan B., McKernan K., Mendez-Lago M.,
RA Minx P., Mollenhauer M.U., Montooth K., Mount S.M., Mu X., Myers E.,
RA Negre B., Newfeld S., Nielsen R., Noor M.A., O'Grady P., Pachter L.,
RA Papaceit M., Parisi M.J., Parisi M., Parts L., Pedersen J.S., Pesole G.,
RA Phillippy A.M., Ponting C.P., Pop M., Porcelli D., Powell J.R.,
RA Prohaska S., Pruitt K., Puig M., Quesneville H., Ram K.R., Rand D.,
RA Rasmussen M.D., Reed L.K., Reenan R., Reily A., Remington K.A.,
RA Rieger T.T., Ritchie M.G., Robin C., Rogers Y.H., Rohde C., Rozas J.,
RA Rubenfield M.J., Ruiz A., Russo S., Salzberg S.L., Sanchez-Gracia A.,
RA Saranga D.J., Sato H., Schaeffer S.W., Schatz M.C., Schlenke T.,
RA Schwartz R., Segarra C., Singh R.S., Sirot L., Sirota M., Sisneros N.B.,
RA Smith C.D., Smith T.F., Spieth J., Stage D.E., Stark A., Stephan W.,
RA Strausberg R.L., Strempel S., Sturgill D., Sutton G., Sutton G.G., Tao W.,
RA Teichmann S., Tobari Y.N., Tomimura Y., Tsolas J.M., Valente V.L.,
RA Venter E., Venter J.C., Vicario S., Vieira F.G., Vilella A.J.,
RA Villasante A., Walenz B., Wang J., Wasserman M., Watts T., Wilson D.,
RA Wilson R.K., Wing R.A., Wolfner M.F., Wong A., Wong G.K., Wu C.I., Wu G.,
RA Yamamoto D., Yang H.P., Yang S.P., Yorke J.A., Yoshida K., Zdobnov E.,
RA Zhang P., Zhang Y., Zimin A.V., Baldwin J., Abdouelleil A., Abdulkadir J.,
RA Abebe A., Abera B., Abreu J., Acer S.C., Aftuck L., Alexander A., An P.,
RA Anderson E., Anderson S., Arachi H., Azer M., Bachantsang P., Barry A.,
RA Bayul T., Berlin A., Bessette D., Bloom T., Blye J., Boguslavskiy L.,
RA Bonnet C., Boukhgalter B., Bourzgui I., Brown A., Cahill P., Channer S.,
RA Cheshatsang Y., Chuda L., Citroen M., Collymore A., Cooke P., Costello M.,
RA D'Aco K., Daza R., De Haan G., DeGray S., DeMaso C., Dhargay N., Dooley K.,
RA Dooley E., Doricent M., Dorje P., Dorjee K., Dupes A., Elong R., Falk J.,
RA Farina A., Faro S., Ferguson D., Fisher S., Foley C.D., Franke A.,
RA Friedrich D., Gadbois L., Gearin G., Gearin C.R., Giannoukos G., Goode T.,
RA Graham J., Grandbois E., Grewal S., Gyaltsen K., Hafez N., Hagos B.,
RA Hall J., Henson C., Hollinger A., Honan T., Huard M.D., Hughes L.,
RA Hurhula B., Husby M.E., Kamat A., Kanga B., Kashin S., Khazanovich D.,
RA Kisner P., Lance K., Lara M., Lee W., Lennon N., Letendre F., LeVine R.,
RA Lipovsky A., Liu X., Liu J., Liu S., Lokyitsang T., Lokyitsang Y.,
RA Lubonja R., Lui A., MacDonald P., Magnisalis V., Maru K., Matthews C.,
RA McCusker W., McDonough S., Mehta T., Meldrim J., Meneus L., Mihai O.,
RA Mihalev A., Mihova T., Mittelman R., Mlenga V., Montmayeur A., Mulrain L.,
RA Navidi A., Naylor J., Negash T., Nguyen T., Nguyen N., Nicol R., Norbu C.,
RA Norbu N., Novod N., O'Neill B., Osman S., Markiewicz E., Oyono O.L.,
RA Patti C., Phunkhang P., Pierre F., Priest M., Raghuraman S., Rege F.,
RA Reyes R., Rise C., Rogov P., Ross K., Ryan E., Settipalli S., Shea T.,
RA Sherpa N., Shi L., Shih D., Sparrow T., Spaulding J., Stalker J.,
RA Stange-Thomann N., Stavropoulos S., Stone C., Strader C., Tesfaye S.,
RA Thomson T., Thoulutsang Y., Thoulutsang D., Topham K., Topping I.,
RA Tsamla T., Vassiliev H., Vo A., Wangchuk T., Wangdi T., Weiand M.,
RA Wilkinson J., Wilson A., Yadav S., Young G., Yu Q., Zembek L., Zhong D.,
RA Zimmer A., Zwirko Z., Jaffe D.B., Alvarez P., Brockman W., Butler J.,
RA Chin C., Gnerre S., Grabherr M., Kleber M., Mauceli E., MacCallum I.;
RT "Evolution of genes and genomes on the Drosophila phylogeny.";
RL Nature 450:203-218(2007).
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CH479189; EDW25344.1; -; Genomic_DNA.
DR RefSeq; XP_002021501.1; XM_002021465.1.
DR STRING; 7234.B4GSI5; -.
DR EnsemblMetazoa; FBtr0192158; FBpp0190650; FBgn0164125.
DR eggNOG; KOG3544; Eukaryota.
DR HOGENOM; CLU_002023_1_0_1; -.
DR PhylomeDB; B4GSI5; -.
DR Proteomes; UP000008744; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:EnsemblMetazoa.
DR GO; GO:0071711; P:basement membrane organization; IEA:EnsemblMetazoa.
DR GO; GO:0055013; P:cardiac muscle cell development; IEA:EnsemblMetazoa.
DR GO; GO:0060729; P:intestinal epithelial structure maintenance; IEA:EnsemblMetazoa.
DR GO; GO:0035848; P:oviduct morphogenesis; IEA:EnsemblMetazoa.
DR GO; GO:0048621; P:post-embryonic digestive tract morphogenesis; IEA:EnsemblMetazoa.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF880; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01413; C4; 1.
DR Pfam; PF01391; Collagen; 16.
DR SMART; SM00111; C4; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000008744};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1864
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5002804299"
FT DOMAIN 1537..1864
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 90..1516
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1231..1250
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1864 AA; 184084 MW; 4EC88F0107CDC06E CRC64;
MLPFWKRLLY AAVIAGALVG ANAQFWKTAG TSGSIQDSVR HQSREDPRDS FPIDDSYDIV
DSASIQRGGQ PPKNCTAGYA GCVPKCIAEK GNRGLPGPLG PTGPKGVSGF PGQDGPLGDK
GQKGDPGPYG LRGDKGERGS QGIPGLPGVG GVQGSSGNPG LPGINGKDGC DGSDGLPGLQ
GLSGMAGPRG FPGSPGGKGE KGEPAKENGN YAKGEKGEPG FTGRSGYAGP QGNPGDKGER
GDTGPYGSKG LRGDRGIKGE KGDSCFGPLM PGKQGEKGEK GDPAVPIPVT GSERVSGPRG
DPGQKGEVGP YGEKGELGPG GEPGQDGQKG EKGLPGGSGD RGRQGNFGPP GPSGQKGDRG
ETGLNGLPGS FGQKGEPGRP GQSGQRGLVG PPGPPGGGRG QPGAPGPKGP RGYTGTPGPQ
GLNGADGQPG PQGYVGQKGG SGMPGRPGGE GPPGQKGEKG IGGRVGPQGP VGPIGHTGPP
GPEGQKGDAG TPGYGIEGPK GDDGNPGYPG QKGGKGERGF KGNSGVPGDS KLGRPGTPGT
AGAPGQKGDA GRPGTPGAKG DMGIKGDIGG KCSSCPPGLK GDKGTRGLDG IPGNPGVRGP
PGEPGHQGER GHDGINGQTG APGEKGEDGL TGAPGATGAP GKGAIINYSQ IKTEKGEKGQ
PGLPGAAGVK GDRGESGVPG LAGVKGEMGF KGDKGYTGQA GTDGIPGAPG RDGYDGAPGL
SIKGESGSRG MDGLKGDKGT AGYVGPKGEP GTCEASELKI PAKGNKGDRG ATGMPGVDGP
IGQKGEIGYR GEKGSTGPQG PVGPVGPRGM TGPRGEKGNI GPIGAAGNPG KDGTRGLPGR
NGDRGQKGEE GIPMPGPPGP QGREGLQGLK GTRGLPGPQG LPGQEGNAGY PGDKGDAGVP
GIPGRVGAVG EKGDLGPLGP IGQPGPPGIP GIDGVQGRDG SKGEPGNPGL VGMPGNKGDR
GAPGNDGPKG FNGITGSPGK RGPAGIPGIS GMKGDKGARG LTGNDGPMGG RGPPGPPGLL
GVKGEPGLLG APGQRGLDGL NGEKGNQGLP GYDGPQGLPG DASEKGQKGE PGLSGLRGDP
GVSGEPGLPG EKGLPGMAVY GRPGPQGEKG DTGRSGADGI NGLNGEKGDI GLPGFSGRNG
EKGSVGLPGL PGAPGIDGQP GLQGAPGPVG YPGAQGYKGQ AGLPGLSGIK GDTGPIGPQG
YIGNAGIKGE RGPRGQPGLP ATVNVDMLRG EKGSVGERGL TGEKGDQGER GEQGLAGYPG
VKGDQGLEGP RGLNGLNGAP GLKGDAGLVG EPGYPGPVVK GEKGLPGRAG STDVKVSLPG
APGIKGERGL PGLAGTPGLV GLPGPIGPAG DKGERGLAGT PGQPGLDGLS GAPGLKGDQG
YPGPKGDRGL AGFEGQKGDK GDQGHAGLPG LAGLAGAKGD TGYPGLDGPE GPQGSPGEKG
FTGPKGRDGR DGIPGLQGQK GRNQASPGAR GLTGIQGQRG EKGDRGLIGP SGAIGRPGPK
GDRGEQGPVG REGAVGAMGL KGDAGLACSG AQDYLTGILV TRHSQNDQVP QCAAGHSQLW
TGYSLLYVDG NDYAHNQDLG SPGSCVQRFS TLPVLSCGQN NICNYASRND KTFWLSTSAP
IPMMPVENYE ISKYISRCVV CEAPANVIAM HSQSLEIPEC PSGWEGLWXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXPV
//