ID   T1PFE5_MUSDO            Unreviewed;      1832 AA.
AC   T1PFE5;
DT   13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT   13-NOV-2013, sequence version 1.
DT   27-MAR-2024, entry version 36.
DE   SubName: Full=Collagen {ECO:0000313|EMBL:AFP62126.1};
OS   Musca domestica (House fly).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Muscoidea;
OC   Muscidae; Musca.
OX   NCBI_TaxID=7370 {ECO:0000313|EMBL:AFP62126.1};
RN   [1] {ECO:0000313|EMBL:AFP62126.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=ALHF {ECO:0000313|EMBL:AFP62126.1};
RC   TISSUE=Whole body {ECO:0000313|EMBL:AFP62126.1};
RA   Liu N., Zhang L., Li M., Reid W.;
RT   "Transcriptome of adult Musca domestica launches a platform for comparative
RT   house fly gene expression and characterization of differential gene
RT   expression among resistant and susceptible house flies.";
RL   Submitted (AUG-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KA647497; AFP62126.1; -; mRNA.
DR   VEuPathDB; VectorBase:MDOA010996; -.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF533; COLLAGEN ALPHA-6(IV) CHAIN; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 16.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   2: Evidence at transcript level;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:AFP62126.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..50
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           51..1832
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004593285"
FT   DOMAIN          1542..1764
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          79..237
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          259..364
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          392..503
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          524..550
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          579..781
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          922..950
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          979..1099
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1213..1278
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1379..1417
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1519..1540
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1784..1832
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        288..309
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        671..685
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1791..1818
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1832 AA;  186772 MW;  2B5C6428D351A551 CRC64;
     MLPLPRGLLG LLGANATPKW HRQPQQQQLT MKSVFLVLCL VLLSGQFADA KEQQQQPDRN
     CGGLACDCKG LKGRPGDIGL PGFQGYEGPA GDMGPPGPPG RPGEWGDAGE YGEQGEKGHR
     GDAGEPGLPG APGVRGPPGE DGPHGPRGID GCAGKPGKHG DNGAPGRHGP RGDVGKPGPP
     GPQGDAGEGG INSKGTKGSR GDRGPDGYDG QTGFPGMKGY KGDIGFPGAD GPKGEMGPKG
     FKGEMAEDAN IILQLQGEQG EKGEPGEAEE FPFEPNGNIP KGYAGDVGER GDQGRKGEQG
     EKGDMGRDGF PGARGDSGEP GERGKPGKPG ETGFPGAKGV KGAPGYNGRD GEDGLKGEMG
     DDGYDGIPGV QGYAGPPGIY DPNLDESLPG PIGPQGDIGP PGDPGLPGIP GKQGRLGPRG
     NTGPPGDPGL PGMPGRRGIS IKGDEGDYGF MGPAGPMGNP GRPGPVGRPG ARGADGRNVV
     GPKGYAGQPG MPGLPGHRGD RGEIGFSGEK GLPGLGVNIV GPPGAHGPPG VRGPPGIDGQ
     PGYRGLTGDK GVRGDDCGIC PAGPKGMRGI RGDDGFPGVH GVTGPHGLPG ERGPKGQQGK
     PGFMGFKGQP GPDGIPGESG RPGMPGPPGK VMRVGSLTKA EKGDMGDMGE RGVQGLTGDR
     GLNGAHGLHG QKGERGIRGD FGEPGRPGRD GAPGKPGKDG RPGRDANTPK LYLIGEKGYD
     GRKGVAGEPG DMGPKGEKGQ PHPGEIFDNR GEPGDVGEPG PVGPQGPKGE KGTNGDNGER
     GDIGLPGIVI QGPMGAKGYP GVTGEVGLHG AHGMEGLDGA PGIDGVAGVK GVRGDPGPYI
     LPGEMGPDGP EGPKGMYGDM GFRGRPGVTG RPGVKGVRGE KGDIGPYGLQ GLPGNKGVIG
     DTLVGFQGAA GEPGINGRIA PHGRKGQKGE TGVPGVQGVQ GAKGDIGFPG RRGPHGDRGF
     QGIPGVIGMQ GLVGIPGEQG ERGELGEDGR HGDMGQRGSI GSMGPKGQMG DVGPYGRRGN
     DGIPGRKGVE GDRGYPGRVG AKGFASRSGI KGEYGEPGQR GPRGYDGMPG EKGVQGAPGD
     EAYGQDGPMG RKGETGAPGV DGINGLDGLK GMRGDYGIMG LIGAIGDRGD KGQPGYPGRP
     GLPGIDGAVG PMGEMGYQGQ VGERGDEGYA GYVGQIGDRG DAGEPGAFGP KGEQGDEGFP
     GRPGVLLAGY AQRGDKGQPG LRGQQGPMGE TGMEGAPGYP GRKGERGDFG FAGAPGADGY
     PGVDGERGDK GYPGPPGMTP DYAEPGDEGD VGYDGLPGRP GRVGPKGAPG DMGDYGFNGI
     KGEMGMSIMG PKGMQGDIGY PGPPGHNGLH GMVGFKGERG DVGPQGMRGE PGYVIHGMRG
     DRGDAGPPGA RGPQGLKGEM GMHGRPGRTG PMGARGPRGP TGDAGFDGRN GLDGLPGPRG
     EPGVTFPFHM ARKGERGEPG IDGFKGEMGD VGAEGEVGFQ GAYGLKGYQG ERGLTGQMGL
     DGPKGERGMQ GPPGLAGFTG LAGAKGPEGD PAPPPPRPKS RGFIFARHSQ SVLIPECPAN
     TNLMWVGYSL AGNIANSRAV AQDLGRSGSC LQRFSTMPYM TCDGSVCNYG QTNDDSMWLA
     TDEPMNFAMV PIQANVIQKY ISRCAVCETT TKVIALHSQS MSIPDCPNGW EEMWTGYSYF
     MTTTDNTGGM GQNLVSPGSC LEEFRAQPII ECHGQGNCNL YNPVTSFWLA VIEEHEQWQM
     PVQRTLKKDQ TSKISRCSVC RRRNDSFVTR LERVDTSARE LRRGYEQVVP APQQHQPTYQ
     RRPNTNYHRR TGQNWPGRNY RSRYPRADTT AP
//