ID T1PFE5_MUSDO Unreviewed; 1832 AA.
AC T1PFE5;
DT 13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2013, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Collagen {ECO:0000313|EMBL:AFP62126.1};
OS Musca domestica (House fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Muscoidea;
OC Muscidae; Musca.
OX NCBI_TaxID=7370 {ECO:0000313|EMBL:AFP62126.1};
RN [1] {ECO:0000313|EMBL:AFP62126.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=ALHF {ECO:0000313|EMBL:AFP62126.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:AFP62126.1};
RA Liu N., Zhang L., Li M., Reid W.;
RT "Transcriptome of adult Musca domestica launches a platform for comparative
RT house fly gene expression and characterization of differential gene
RT expression among resistant and susceptible house flies.";
RL Submitted (AUG-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KA647497; AFP62126.1; -; mRNA.
DR VEuPathDB; VectorBase:MDOA010996; -.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF533; COLLAGEN ALPHA-6(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 16.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 2: Evidence at transcript level;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:AFP62126.1};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..50
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 51..1832
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004593285"
FT DOMAIN 1542..1764
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 79..237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 259..364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 392..503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 524..550
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 579..781
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 922..950
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 979..1099
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1213..1278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1379..1417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1519..1540
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1784..1832
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 288..309
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 671..685
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1791..1818
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1832 AA; 186772 MW; 2B5C6428D351A551 CRC64;
MLPLPRGLLG LLGANATPKW HRQPQQQQLT MKSVFLVLCL VLLSGQFADA KEQQQQPDRN
CGGLACDCKG LKGRPGDIGL PGFQGYEGPA GDMGPPGPPG RPGEWGDAGE YGEQGEKGHR
GDAGEPGLPG APGVRGPPGE DGPHGPRGID GCAGKPGKHG DNGAPGRHGP RGDVGKPGPP
GPQGDAGEGG INSKGTKGSR GDRGPDGYDG QTGFPGMKGY KGDIGFPGAD GPKGEMGPKG
FKGEMAEDAN IILQLQGEQG EKGEPGEAEE FPFEPNGNIP KGYAGDVGER GDQGRKGEQG
EKGDMGRDGF PGARGDSGEP GERGKPGKPG ETGFPGAKGV KGAPGYNGRD GEDGLKGEMG
DDGYDGIPGV QGYAGPPGIY DPNLDESLPG PIGPQGDIGP PGDPGLPGIP GKQGRLGPRG
NTGPPGDPGL PGMPGRRGIS IKGDEGDYGF MGPAGPMGNP GRPGPVGRPG ARGADGRNVV
GPKGYAGQPG MPGLPGHRGD RGEIGFSGEK GLPGLGVNIV GPPGAHGPPG VRGPPGIDGQ
PGYRGLTGDK GVRGDDCGIC PAGPKGMRGI RGDDGFPGVH GVTGPHGLPG ERGPKGQQGK
PGFMGFKGQP GPDGIPGESG RPGMPGPPGK VMRVGSLTKA EKGDMGDMGE RGVQGLTGDR
GLNGAHGLHG QKGERGIRGD FGEPGRPGRD GAPGKPGKDG RPGRDANTPK LYLIGEKGYD
GRKGVAGEPG DMGPKGEKGQ PHPGEIFDNR GEPGDVGEPG PVGPQGPKGE KGTNGDNGER
GDIGLPGIVI QGPMGAKGYP GVTGEVGLHG AHGMEGLDGA PGIDGVAGVK GVRGDPGPYI
LPGEMGPDGP EGPKGMYGDM GFRGRPGVTG RPGVKGVRGE KGDIGPYGLQ GLPGNKGVIG
DTLVGFQGAA GEPGINGRIA PHGRKGQKGE TGVPGVQGVQ GAKGDIGFPG RRGPHGDRGF
QGIPGVIGMQ GLVGIPGEQG ERGELGEDGR HGDMGQRGSI GSMGPKGQMG DVGPYGRRGN
DGIPGRKGVE GDRGYPGRVG AKGFASRSGI KGEYGEPGQR GPRGYDGMPG EKGVQGAPGD
EAYGQDGPMG RKGETGAPGV DGINGLDGLK GMRGDYGIMG LIGAIGDRGD KGQPGYPGRP
GLPGIDGAVG PMGEMGYQGQ VGERGDEGYA GYVGQIGDRG DAGEPGAFGP KGEQGDEGFP
GRPGVLLAGY AQRGDKGQPG LRGQQGPMGE TGMEGAPGYP GRKGERGDFG FAGAPGADGY
PGVDGERGDK GYPGPPGMTP DYAEPGDEGD VGYDGLPGRP GRVGPKGAPG DMGDYGFNGI
KGEMGMSIMG PKGMQGDIGY PGPPGHNGLH GMVGFKGERG DVGPQGMRGE PGYVIHGMRG
DRGDAGPPGA RGPQGLKGEM GMHGRPGRTG PMGARGPRGP TGDAGFDGRN GLDGLPGPRG
EPGVTFPFHM ARKGERGEPG IDGFKGEMGD VGAEGEVGFQ GAYGLKGYQG ERGLTGQMGL
DGPKGERGMQ GPPGLAGFTG LAGAKGPEGD PAPPPPRPKS RGFIFARHSQ SVLIPECPAN
TNLMWVGYSL AGNIANSRAV AQDLGRSGSC LQRFSTMPYM TCDGSVCNYG QTNDDSMWLA
TDEPMNFAMV PIQANVIQKY ISRCAVCETT TKVIALHSQS MSIPDCPNGW EEMWTGYSYF
MTTTDNTGGM GQNLVSPGSC LEEFRAQPII ECHGQGNCNL YNPVTSFWLA VIEEHEQWQM
PVQRTLKKDQ TSKISRCSVC RRRNDSFVTR LERVDTSARE LRRGYEQVVP APQQHQPTYQ
RRPNTNYHRR TGQNWPGRNY RSRYPRADTT AP
//