ID M4AWK7_XIPMA Unreviewed; 1223 AA.
AC M4AWK7;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Collagen alpha-1(XXVIII) chain-like {ECO:0000313|Ensembl:ENSXMAP00000018852.2};
OS Xiphophorus maculatus (Southern platyfish) (Platypoecilus maculatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Xiphophorus.
OX NCBI_TaxID=8083 {ECO:0000313|Ensembl:ENSXMAP00000018852.2, ECO:0000313|Proteomes:UP000002852};
RN [1] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RA Walter R., Schartl M., Warren W.;
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RX PubMed=23542700; DOI=10.1038/ng.2604;
RA Schartl M., Walter R.B., Shen Y., Garcia T., Catchen J., Amores A.,
RA Braasch I., Chalopin D., Volff J.N., Lesch K.P., Bisazza A., Minx P.,
RA Hillier L., Wilson R.K., Fuerstenberg S., Boore J., Searle S.,
RA Postlethwait J.H., Warren W.C.;
RT "The genome of the platyfish, Xiphophorus maculatus, provides insights into
RT evolutionary adaptation and several complex traits.";
RL Nat. Genet. 45:567-572(2013).
RN [3] {ECO:0000313|Ensembl:ENSXMAP00000018852.2}
RP IDENTIFICATION.
RC STRAIN=JP 163 A {ECO:0000313|Ensembl:ENSXMAP00000018852.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_014324400.1; XM_014468914.1.
DR AlphaFoldDB; M4AWK7; -.
DR STRING; 8083.ENSXMAP00000018852; -.
DR Ensembl; ENSXMAT00000018880.2; ENSXMAP00000018852.2; ENSXMAG00000018799.2.
DR GeneID; 102220250; -.
DR KEGG; xma:102220250; -.
DR CTD; 100332225; -.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000163195; -.
DR HOGENOM; CLU_009158_0_0_1; -.
DR InParanoid; M4AWK7; -.
DR OMA; ESSKCCF; -.
DR OrthoDB; 2906665at2759; -.
DR Proteomes; UP000002852; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd22628; Kunitz_collagen_alpha1_XXVIII; 1.
DR CDD; cd01472; vWA_collagen; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF878; COLLAGEN ALPHA-1(XXVIII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00014; Kunitz_BPTI; 1.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00759; BASICPTASE.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00131; KU; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF57362; BPTI-like; 1.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000002852};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 85..267
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 837..1016
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1170..1220
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 282..810
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1048..1104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1124..1160
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..436
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 658..689
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 707..721
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1088..1104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1136..1150
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1223 AA; 126639 MW; 02CB0958B8D2615A CRC64;
MNKSHSPSLN ESLHLLQVLS LKMFLHRSPI RGVGLWLLLL ASTHGASGQR RKEAKSSNYK
LQDDGGNGIF VTKSLYPLGR NCSLEVIFIL DSSESAKTKL FQQEKIFVLR FSTKLSMLKM
TGVSLKVRMA ALQYSSSVSV EHRFVDWKDL DLFHSKINDM SYIGQGTFTS FAITNATQML
LQETQKEAVR IAVLMTDGVD HPRGPNVIVA AEEAKRHGIK IFTVGLSYVS FQKENKEKLQ
AIASSPAERF VHSMEDQQLQ EKLLKEMGAV AVEGCPPCVC EKGEKGSSGS PGRKGDQGDE
GPSGQKGAKG EPGLNGKPGN DGSKGFPGFK GNKGMKGNCG LPGGKGATGL EGPPGPPGLK
GEQGEIGPVG DVGPEGPAGP KGDRGHSGEP GPPGDFGIGP PGAKGEKGIQ GKPGGIGPAG
KGDPGPPGPQ GPAGPQGKPG IPGEGFPGPK GDRGFEGPRG NRGPPGIGIK GDKGNYGLPG
PQGPVGEPGI GLPGEKGVQG PVGLTGPRGA PGIGLTGQKG NQGLPGEPGL PGERGAGAPG
PKGDSGVPGS SGFPGIPGED GSPGQKGDVG LPGPRGPDGI PGRGVPGGKG DKGDRGSRGQ
PGPSGPMGPL GPKGDSGNVG LPGATGPPGR GISGPKGDQG PPGPVGQMGE PGVGLPGPKG
DRGPPCPPGP PGPKGDGFVG PPGLPGPPGL PGETGLDGIG LPGPKGDRGF PGPPGPAGPP
GIGLFGPKGS PGPAGPPGLP GLPGEGAQGE KGDRGFQGIP GPRGPPGQGL QGDKGDRGLR
GETGKKGDRG QTGQSGDKGS AGRMGQKGEA GLTETEIIEL IRKICNCSES CKQKPLELVF
VIDSSESVGP QNFQVIKDLV NAVVDRTTVS WNATRVGVVL YSDINVVVVD LKQEATADEV
KSAVYAMDYL GEGTYTGSAI EKANQMFEAA RRDVRKVAVI ITDGQTDTRD VVSLESAVLK
ANESQIERFV IGVVNESDPN SEEFKKELNF IASDPDQDYM FLIKDFKVLK VLEKRLLRCV
FEEGKVALFD HPTIATFLLP GISQGTGKNG RAPFRTGGDT PTFPGDSRRD KIVPGYPASQ
LEPELVLPER PNTDEDRNPT EPQRFPFFDR ELYRPVEEFL PPFDANKPKH QEPGANGQAG
ASTRALTKKE SSLVISPPLK PSDSWRSAGC SQNLDPGPCR DYVVKWYYDA TSNSCAQFWF
GGCQGNQNRF DTEKKCRETC VKV
//