GenomeNet

Database: UniProt
Entry: A0A087XCU1_POEFO
ID   A0A087XCU1_POEFO        Unreviewed;      1784 AA.
AC   A0A087XCU1;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 2.
DT   27-MAR-2024, entry version 50.
DE   RecName: Full=Collagen, type XIV, alpha 1b {ECO:0008006|Google:ProtNLM};
OS   Poecilia formosa (Amazon molly) (Limia formosa).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC   Poecilia.
OX   NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000003594.2, ECO:0000313|Proteomes:UP000028760};
RN   [1] {ECO:0000313|Proteomes:UP000028760}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA   Schartl M., Warren W.;
RL   Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSPFOP00000003594.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AYCK01012825; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AYCK01012826; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AYCK01012827; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 48698.ENSPFOP00000003594; -.
DR   Ensembl; ENSPFOT00000003601.2; ENSPFOP00000003594.2; ENSPFOG00000002928.2.
DR   eggNOG; KOG1217; Eukaryota.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000153769; -.
DR   OMA; REMQSDX; -.
DR   Proteomes; UP000028760; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 7.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00041; fn3; 7.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 8.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 6.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50853; FN3; 7.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..1784
FT                   /note="Collagen, type XIV, alpha 1b"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001832694"
FT   DOMAIN          30..121
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          156..328
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          359..449
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          450..540
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          541..630
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          631..720
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          736..827
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          828..918
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1036..1209
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          102..145
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1003..1024
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1457..1617
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1643..1784
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        112..131
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1008..1024
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1652..1666
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1784 AA;  191787 MW;  A106CB342CD4DFC3 CRC64;
     MQFSVRLSFF LLGFVFFAVL PHTAKGQVSS PRRFRVKVLG QDKLEVSWKE PKGDFEGYKV
     IYVTRPGGQQ KVLELAKQET KLVIKDYDPS KDYNFKISAV NGGKESKPLQ GKHEAQQSGS
     ETAQTQGTKK SDVTRENNEI SEGEDGFMCK TPAIADIVIL VDGSWSIGRI NFRLVRAFLE
     NLVNAFSVEF DKTRIGLAQY SGDPRIEWHL NAHTTKEAVI EAVKNLPYKG GNTLTGLALT
     FILENSFKVE SGSRPGVPKI GILITDGKSQ DDVIPPAQSL KDAGIELFAI GVKNADENEL
     KAIASPPEET HVYNVADFSV MSDIVEGLTK TVCDRVEQLD KKIKGGGGEP APPVTSIAPP
     RNLVTSEITA RSFRVTWTHA PGQVEKYRVV YYPASGGQPE EKVVLGTENS VELTHLNSLT
     EYQVAVFAVY RSSASEALRG SATTLALPTV NNLELFDVTH STMRVRWKIA AGASEYMILY
     APLTEQEAAD EKEVKVGDSV NEVELDGLTP ATEYTVTVYA MYGEEASDPM TSQQTTLPLI
     PASNLRFKDV DHSSVRITWD SPSRLVRGYR IMYVKTSGVQ TTEVDLGAVT SYLLKNLTSL
     TEYTVGVFAV YKEGEAPAVT ESFTTKTVPD PMDLKSSDIS SEGFRVSWQH PASDVVLYRL
     TWTPTDGGDS EEVLVNGNIN TYLIKGLEPA SEYEVLLAAI YANEVESDEM VLIETTAKRT
     TTVATTTTTT LTPRYAVKNL KIDEETTFSM RVSWQAVDSR NVRHYRLTYI SARGDRAEET
     RTVPSGQTSL VLQPLLSDTQ YKVTVIPVYN EGDGPASSQM GRTHPLSAPR NLRVSEEWYN
     RFRISWDVPP SPTMGYRVVY QPLSAPGQAL ETFVGEDVNT MLIVNLLSGT EYSVKVIASY
     TTGSSEALTG KAKTLYLGVT NLSTYQVRMS GVCAQWVPHQ HASTYRLVIQ SVTGSQKQET
     KLGGGASRHC FSSLKPNMEY KISIFSQLQD GTEGPAVTAN VKTLPIPTPA PAKRPTTTPP
     PTIPPAKEVC KAAKADLVFL VDGSWSIGDD NFLKIIRFLY STVGALDRIG PDGTQVAIAQ
     FSDDARTEFK LNSYGNKERL LDAINKISYK GGNTKTGRAI QHVKENIFTA EGGVRRGIPN
     VLVVLTDGRS QDDVNKVSKE MQMSGYIVFA IGFADADYGE LVSIASKPSD RHVFFVDDLD
     AFQRIEEKLV TFVCEAATAT CPSIPMSGST TPGFRMMELF GLVENKYNSI AGVSMVPGTF
     NAFPCFHLHS NALMAQPTRF IHPEGLPSDY TITLLFRVLP DTPEEPFAIW EILNKNNDPL
     TGVILDNGGK TLTFFNNDYR GDFQTVTFEG PEIKKLFFGS FHKLHIAISK TSAKVFIDCK
     MVAEKVINAA GNITTDGVEV LGRMVRSREN KDNSAPFQLQ NFDIICSTSW ANRDKCCELP
     GLRKEVDCPA LPKACTCTQD SKGPAGPPGV PGGPGIRGAR GDRGEPGPVG PAGPVGDTGV
     PGPQGPPGRQ GPSGRSIIGP PGPAGERGQK GEAGQQGQQG IPGRPGPAGR EGPPGPRGLI
     GKDGPQGRQG PPGTMGTPGA PGSPGNTGPQ GKQGTLGPPG PPGTKGEKGE RGDLQSTASV
     QAIARQVCEQ LIQSHMSRYN TILNQAPSPP VSIRTVPGPP GEPGREGAPG PQGEQGPPGR
     PGFPGQNGQN GNPGERGQPG EKGEKGSAGV GVQGPRGPPG PPGPQGQGRP GSQGPTGRPG
     NPGTSGRPGV PGPVGPPGPP GYCDQNSCLG YNVGASPENH YGDY
//