ID M4A0Z0_XIPMA Unreviewed; 1114 AA.
AC M4A0Z0;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 60.
DE SubName: Full=Collagen alpha-2(V) chain {ECO:0000313|Ensembl:ENSXMAP00000008134.2};
OS Xiphophorus maculatus (Southern platyfish) (Platypoecilus maculatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Xiphophorus.
OX NCBI_TaxID=8083 {ECO:0000313|Ensembl:ENSXMAP00000008134.2, ECO:0000313|Proteomes:UP000002852};
RN [1] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RA Walter R., Schartl M., Warren W.;
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RX PubMed=23542700; DOI=10.1038/ng.2604;
RA Schartl M., Walter R.B., Shen Y., Garcia T., Catchen J., Amores A.,
RA Braasch I., Chalopin D., Volff J.N., Lesch K.P., Bisazza A., Minx P.,
RA Hillier L., Wilson R.K., Fuerstenberg S., Boore J., Searle S.,
RA Postlethwait J.H., Warren W.C.;
RT "The genome of the platyfish, Xiphophorus maculatus, provides insights into
RT evolutionary adaptation and several complex traits.";
RL Nat. Genet. 45:567-572(2013).
RN [3] {ECO:0000313|Ensembl:ENSXMAP00000008134.2}
RP IDENTIFICATION.
RC STRAIN=JP 163 A {ECO:0000313|Ensembl:ENSXMAP00000008134.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; M4A0Z0; -.
DR STRING; 8083.ENSXMAP00000008134; -.
DR Ensembl; ENSXMAT00000008144.2; ENSXMAP00000008134.2; ENSXMAG00000007999.2.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000168325; -.
DR HOGENOM; CLU_001074_2_3_1; -.
DR InParanoid; M4A0Z0; -.
DR OMA; KPDQEFG; -.
DR Proteomes; UP000002852; Unassembled WGS sequence.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.1000; -; 1.
DR Gene3D; 6.20.200.20; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1108; ENDOSTATIN DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00093; VWC; 1.
DR SMART; SM00038; COLFI; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000002852};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 1..59
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 880..1114
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 76..828
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 852..874
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 811..826
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1114 AA; 108592 MW; 581994944348F23E CRC64;
ESCKVDGELY HHNDIWKPAP CRVCVCDNGV SICDEIQCEP LANCEKVTTP EGECCPVCDS
FASAIMGFQG QKGEPGDIPY VVGNPGHQGP MGPPGPQGRT GPRGFKGRRG EPGVPGNPGE
PGPPGDSGPP GLQGERGRIG PTGPLGKRGS NGHPGKPGPL GPIGIPGMPG FPGGPGMKGE
AGPTGVRGSR GQQGPRGDGG RVGPPGPIGQ QGAAGPDGAP GARGSSGVAG LQGAPGLVGP
AGPPGPQGTT GAPGPKGQLG DAGLAGLKGE AGHKGERGPP GNRGFPGADG LPGQKQGAQG
ERGLPGSPGI KGTLGDPGRN GEPGLTGARG LAGEDGKPGP AGSVGTRGGA GTMGLPGPKG
FGPAGTRGEQ GPQGQTGFQG LPGPSGPPGE AGKPGDEGLA GEAGSVGETG PRGERGAPGE
RGETGPNGLQ GPKGGPGAPG SDGPKQGSTG PKGAAGEPGG PGLQGMPGER GISGPSGPKG
DPQGETGPPG PTGRRGTRGI PGSSGEPGPT GAVGFPGPSG QDGQPGVKGE TGEPGPKGEA
GASGPQGMAG KPGEQGATGA DGPVGAPGKK GPPGVGGENG APGRQGENGP PGPAGSPGEK
GGPGEDGPLV GPDGPPGPAG TTGQRGAVGL PGVRGERGSV GLPGPAGPPG KPGATGAQGT
NGPPGGVGLP GATGPRGDPG PEGDRGNPGP EGLPGVLGSP GNEGPVGTIG GPGQRGGPQG
PQGEKGGAGE PGERGQKGHR GFTGLQGLPG ITQGATGDAG ARGIFGPAGQ RGPPGVVGPP
GKEGNIGQPG PMGAPGGRGT IGDLGAQGPP GEPGPAGPPG PPGPPTAAAE DLYAVDYDAH
GEVQEAVELG EYDDTADPPP PPEFNKDEAK PNNNILGAET GVRATLKTLN GHLQNLRSPD
GSKTNPAKTC QDIRQCYPQK RSEYWLDPNQ GSTKDAIRVL CNMETGETCI PANPASIPRK
AWWTKSTPSP SKPTWFGADM NSGAKFFYGS KEEQPNAVAV QMKLLQLLSK ESHQNVTYHC
RNSVAFKDEQ AGNFKKALVL RGANGQEMRA QGNSRLRYAV VEDGCSKPNG EWGKTVIEYR
TQTTTRLPIV DLAPMDVGKA DQEFGLDIGP VCFS
//