GenomeNet

Database: UniProt
Entry: M4AWK7_XIPMA
LinkDB: M4AWK7_XIPMA
Original site: M4AWK7_XIPMA 
ID   M4AWK7_XIPMA            Unreviewed;      1223 AA.
AC   M4AWK7;
DT   01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   27-MAR-2024, entry version 63.
DE   SubName: Full=Collagen alpha-1(XXVIII) chain-like {ECO:0000313|Ensembl:ENSXMAP00000018852.2};
OS   Xiphophorus maculatus (Southern platyfish) (Platypoecilus maculatus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC   Xiphophorus.
OX   NCBI_TaxID=8083 {ECO:0000313|Ensembl:ENSXMAP00000018852.2, ECO:0000313|Proteomes:UP000002852};
RN   [1] {ECO:0000313|Proteomes:UP000002852}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RA   Walter R., Schartl M., Warren W.;
RL   Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000002852}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RX   PubMed=23542700; DOI=10.1038/ng.2604;
RA   Schartl M., Walter R.B., Shen Y., Garcia T., Catchen J., Amores A.,
RA   Braasch I., Chalopin D., Volff J.N., Lesch K.P., Bisazza A., Minx P.,
RA   Hillier L., Wilson R.K., Fuerstenberg S., Boore J., Searle S.,
RA   Postlethwait J.H., Warren W.C.;
RT   "The genome of the platyfish, Xiphophorus maculatus, provides insights into
RT   evolutionary adaptation and several complex traits.";
RL   Nat. Genet. 45:567-572(2013).
RN   [3] {ECO:0000313|Ensembl:ENSXMAP00000018852.2}
RP   IDENTIFICATION.
RC   STRAIN=JP 163 A {ECO:0000313|Ensembl:ENSXMAP00000018852.2};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_014324400.1; XM_014468914.1.
DR   AlphaFoldDB; M4AWK7; -.
DR   STRING; 8083.ENSXMAP00000018852; -.
DR   Ensembl; ENSXMAT00000018880.2; ENSXMAP00000018852.2; ENSXMAG00000018799.2.
DR   GeneID; 102220250; -.
DR   KEGG; xma:102220250; -.
DR   CTD; 100332225; -.
DR   eggNOG; KOG1217; Eukaryota.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000163195; -.
DR   HOGENOM; CLU_009158_0_0_1; -.
DR   InParanoid; M4AWK7; -.
DR   OMA; ESSKCCF; -.
DR   OrthoDB; 2906665at2759; -.
DR   Proteomes; UP000002852; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd22628; Kunitz_collagen_alpha1_XXVIII; 1.
DR   CDD; cd01472; vWA_collagen; 1.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF878; COLLAGEN ALPHA-1(XXVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00014; Kunitz_BPTI; 1.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00759; BASICPTASE.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00131; KU; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF57362; BPTI-like; 1.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR   PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002852};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          85..267
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          837..1016
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1170..1220
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   REGION          282..810
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1048..1104
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1124..1160
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        420..436
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        658..689
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        707..721
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1088..1104
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1136..1150
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1223 AA;  126639 MW;  02CB0958B8D2615A CRC64;
     MNKSHSPSLN ESLHLLQVLS LKMFLHRSPI RGVGLWLLLL ASTHGASGQR RKEAKSSNYK
     LQDDGGNGIF VTKSLYPLGR NCSLEVIFIL DSSESAKTKL FQQEKIFVLR FSTKLSMLKM
     TGVSLKVRMA ALQYSSSVSV EHRFVDWKDL DLFHSKINDM SYIGQGTFTS FAITNATQML
     LQETQKEAVR IAVLMTDGVD HPRGPNVIVA AEEAKRHGIK IFTVGLSYVS FQKENKEKLQ
     AIASSPAERF VHSMEDQQLQ EKLLKEMGAV AVEGCPPCVC EKGEKGSSGS PGRKGDQGDE
     GPSGQKGAKG EPGLNGKPGN DGSKGFPGFK GNKGMKGNCG LPGGKGATGL EGPPGPPGLK
     GEQGEIGPVG DVGPEGPAGP KGDRGHSGEP GPPGDFGIGP PGAKGEKGIQ GKPGGIGPAG
     KGDPGPPGPQ GPAGPQGKPG IPGEGFPGPK GDRGFEGPRG NRGPPGIGIK GDKGNYGLPG
     PQGPVGEPGI GLPGEKGVQG PVGLTGPRGA PGIGLTGQKG NQGLPGEPGL PGERGAGAPG
     PKGDSGVPGS SGFPGIPGED GSPGQKGDVG LPGPRGPDGI PGRGVPGGKG DKGDRGSRGQ
     PGPSGPMGPL GPKGDSGNVG LPGATGPPGR GISGPKGDQG PPGPVGQMGE PGVGLPGPKG
     DRGPPCPPGP PGPKGDGFVG PPGLPGPPGL PGETGLDGIG LPGPKGDRGF PGPPGPAGPP
     GIGLFGPKGS PGPAGPPGLP GLPGEGAQGE KGDRGFQGIP GPRGPPGQGL QGDKGDRGLR
     GETGKKGDRG QTGQSGDKGS AGRMGQKGEA GLTETEIIEL IRKICNCSES CKQKPLELVF
     VIDSSESVGP QNFQVIKDLV NAVVDRTTVS WNATRVGVVL YSDINVVVVD LKQEATADEV
     KSAVYAMDYL GEGTYTGSAI EKANQMFEAA RRDVRKVAVI ITDGQTDTRD VVSLESAVLK
     ANESQIERFV IGVVNESDPN SEEFKKELNF IASDPDQDYM FLIKDFKVLK VLEKRLLRCV
     FEEGKVALFD HPTIATFLLP GISQGTGKNG RAPFRTGGDT PTFPGDSRRD KIVPGYPASQ
     LEPELVLPER PNTDEDRNPT EPQRFPFFDR ELYRPVEEFL PPFDANKPKH QEPGANGQAG
     ASTRALTKKE SSLVISPPLK PSDSWRSAGC SQNLDPGPCR DYVVKWYYDA TSNSCAQFWF
     GGCQGNQNRF DTEKKCRETC VKV
//
DBGET integrated database retrieval system