ID A0A087XCU1_POEFO Unreviewed; 1784 AA.
AC A0A087XCU1;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 2.
DT 27-MAR-2024, entry version 50.
DE RecName: Full=Collagen, type XIV, alpha 1b {ECO:0008006|Google:ProtNLM};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000003594.2, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000003594.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01012825; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01012826; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01012827; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 48698.ENSPFOP00000003594; -.
DR Ensembl; ENSPFOT00000003601.2; ENSPFOP00000003594.2; ENSPFOG00000002928.2.
DR eggNOG; KOG1217; Eukaryota.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000153769; -.
DR OMA; REMQSDX; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 7.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 7.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 7.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1784
FT /note="Collagen, type XIV, alpha 1b"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5001832694"
FT DOMAIN 30..121
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 156..328
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 359..449
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 450..540
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 541..630
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 631..720
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 736..827
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 828..918
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1036..1209
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 102..145
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1003..1024
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1457..1617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1643..1784
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..131
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1008..1024
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1652..1666
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1784 AA; 191787 MW; A106CB342CD4DFC3 CRC64;
MQFSVRLSFF LLGFVFFAVL PHTAKGQVSS PRRFRVKVLG QDKLEVSWKE PKGDFEGYKV
IYVTRPGGQQ KVLELAKQET KLVIKDYDPS KDYNFKISAV NGGKESKPLQ GKHEAQQSGS
ETAQTQGTKK SDVTRENNEI SEGEDGFMCK TPAIADIVIL VDGSWSIGRI NFRLVRAFLE
NLVNAFSVEF DKTRIGLAQY SGDPRIEWHL NAHTTKEAVI EAVKNLPYKG GNTLTGLALT
FILENSFKVE SGSRPGVPKI GILITDGKSQ DDVIPPAQSL KDAGIELFAI GVKNADENEL
KAIASPPEET HVYNVADFSV MSDIVEGLTK TVCDRVEQLD KKIKGGGGEP APPVTSIAPP
RNLVTSEITA RSFRVTWTHA PGQVEKYRVV YYPASGGQPE EKVVLGTENS VELTHLNSLT
EYQVAVFAVY RSSASEALRG SATTLALPTV NNLELFDVTH STMRVRWKIA AGASEYMILY
APLTEQEAAD EKEVKVGDSV NEVELDGLTP ATEYTVTVYA MYGEEASDPM TSQQTTLPLI
PASNLRFKDV DHSSVRITWD SPSRLVRGYR IMYVKTSGVQ TTEVDLGAVT SYLLKNLTSL
TEYTVGVFAV YKEGEAPAVT ESFTTKTVPD PMDLKSSDIS SEGFRVSWQH PASDVVLYRL
TWTPTDGGDS EEVLVNGNIN TYLIKGLEPA SEYEVLLAAI YANEVESDEM VLIETTAKRT
TTVATTTTTT LTPRYAVKNL KIDEETTFSM RVSWQAVDSR NVRHYRLTYI SARGDRAEET
RTVPSGQTSL VLQPLLSDTQ YKVTVIPVYN EGDGPASSQM GRTHPLSAPR NLRVSEEWYN
RFRISWDVPP SPTMGYRVVY QPLSAPGQAL ETFVGEDVNT MLIVNLLSGT EYSVKVIASY
TTGSSEALTG KAKTLYLGVT NLSTYQVRMS GVCAQWVPHQ HASTYRLVIQ SVTGSQKQET
KLGGGASRHC FSSLKPNMEY KISIFSQLQD GTEGPAVTAN VKTLPIPTPA PAKRPTTTPP
PTIPPAKEVC KAAKADLVFL VDGSWSIGDD NFLKIIRFLY STVGALDRIG PDGTQVAIAQ
FSDDARTEFK LNSYGNKERL LDAINKISYK GGNTKTGRAI QHVKENIFTA EGGVRRGIPN
VLVVLTDGRS QDDVNKVSKE MQMSGYIVFA IGFADADYGE LVSIASKPSD RHVFFVDDLD
AFQRIEEKLV TFVCEAATAT CPSIPMSGST TPGFRMMELF GLVENKYNSI AGVSMVPGTF
NAFPCFHLHS NALMAQPTRF IHPEGLPSDY TITLLFRVLP DTPEEPFAIW EILNKNNDPL
TGVILDNGGK TLTFFNNDYR GDFQTVTFEG PEIKKLFFGS FHKLHIAISK TSAKVFIDCK
MVAEKVINAA GNITTDGVEV LGRMVRSREN KDNSAPFQLQ NFDIICSTSW ANRDKCCELP
GLRKEVDCPA LPKACTCTQD SKGPAGPPGV PGGPGIRGAR GDRGEPGPVG PAGPVGDTGV
PGPQGPPGRQ GPSGRSIIGP PGPAGERGQK GEAGQQGQQG IPGRPGPAGR EGPPGPRGLI
GKDGPQGRQG PPGTMGTPGA PGSPGNTGPQ GKQGTLGPPG PPGTKGEKGE RGDLQSTASV
QAIARQVCEQ LIQSHMSRYN TILNQAPSPP VSIRTVPGPP GEPGREGAPG PQGEQGPPGR
PGFPGQNGQN GNPGERGQPG EKGEKGSAGV GVQGPRGPPG PPGPQGQGRP GSQGPTGRPG
NPGTSGRPGV PGPVGPPGPP GYCDQNSCLG YNVGASPENH YGDY
//