ID H2L5M0_ORYLA Unreviewed; 1869 AA.
AC H2L5M0;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 73.
DE SubName: Full=Collagen type XIV alpha 1 chain {ECO:0000313|Ensembl:ENSORLP00000001100.2};
GN Name=COL14A1 {ECO:0000313|Ensembl:ENSORLP00000001100.2};
OS Oryzias latipes (Japanese rice fish) (Japanese killifish).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Beloniformes; Adrianichthyidae; Oryziinae;
OC Oryzias.
OX NCBI_TaxID=8090 {ECO:0000313|Ensembl:ENSORLP00000001100.2, ECO:0000313|Proteomes:UP000001038};
RN [1] {ECO:0000313|Ensembl:ENSORLP00000001100.2, ECO:0000313|Proteomes:UP000001038}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000001100.2,
RC ECO:0000313|Proteomes:UP000001038};
RX PubMed=17554307; DOI=10.1038/nature05846;
RA Kasahara M., Naruse K., Sasaki S., Nakatani Y., Qu W., Ahsan B., Yamada T.,
RA Nagayasu Y., Doi K., Kasai Y., Jindo T., Kobayashi D., Shimada A.,
RA Toyoda A., Kuroki Y., Fujiyama A., Sasaki T., Shimizu A., Asakawa S.,
RA Shimizu N., Hashimoto S., Yang J., Lee Y., Matsushima K., Sugano S.,
RA Sakaizumi M., Narita T., Ohishi K., Haga S., Ohta F., Nomoto H., Nogata K.,
RA Morishita T., Endo T., Shin-I T., Takeda H., Morishita S., Kohara Y.;
RT "The medaka draft genome and insights into vertebrate genome evolution.";
RL Nature 447:714-719(2007).
RN [2] {ECO:0000313|Ensembl:ENSORLP00000001100.2}
RP IDENTIFICATION.
RC STRAIN=Hd-rR {ECO:0000313|Ensembl:ENSORLP00000001100.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_011478960.1; XM_011480658.1.
DR STRING; 8090.ENSORLP00000001100; -.
DR Ensembl; ENSORLT00000001101.2; ENSORLP00000001100.2; ENSORLG00000000896.2.
DR GeneID; 101171503; -.
DR KEGG; ola:101171503; -.
DR GeneTree; ENSGT00940000153769; -.
DR HOGENOM; CLU_002527_2_0_1; -.
DR InParanoid; H2L5M0; -.
DR OrthoDB; 5353225at2759; -.
DR Proteomes; UP000001038; Chromosome 11.
DR Bgee; ENSORLG00000000896; Expressed in muscle tissue and 8 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0005614; C:interstitial matrix; IBA:GO_Central.
DR CDD; cd00063; FN3; 8.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 7.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 7.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000001038};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1869
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017216055"
FT DOMAIN 30..122
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 156..328
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 358..448
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 449..539
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 540..629
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 630..719
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 734..825
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 826..916
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1034..1207
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 106..135
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1455..1616
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1645..1775
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1793..1869
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 116..135
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1650..1664
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1742..1756
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1869 AA; 201002 MW; 6A6D2EC6C827A13E CRC64;
MQSSVTPSLC LLGCILIALL PNSAQGQVSS PRRFRARILS PTKLDVAWKE PKGEFEGFKV
SYIMNPGGRQ KMVELSKQKT NLLIEDFDST KEYVFKITAV GGGRESKPLH GKLKAQRSTL
ESTQSQRGQS GSAIQENNEI SEDMEGFMCK TPAIADIVIL VDGSWSIGRI NFRLVRTFLE
NLVRAFSVDF DKTRIGLAQY SGDPRIEWHL NTHSSKEAVI EAVKNLPYKG GNTLTGLALT
FILENSFKPE SGSRPGVPKI GILITDGKSQ DDVIPPAQSL KDAGIELFAI GVKNADENEL
KAIASPPEET HVYNVADFSV MSDIVEGLTK GVCDRVDQLD KQIKGGGESA PPPDSLAPPR
DLVIADITAR SFRVTWTHAT GQVEKYRVVY YPASGGQPEE KVVQGTDNSV ELNYLNSLTE
YQVAVFAIYR SSASQALRGS ATTLALPTVN NLELHDITHS TMRVRWRAAI GATGYMILYA
PLTEGESADE KEVKVADSVN EVELEGLSPD TEYTVTVYAM YGEEASDPMT SQETTLPLIP
ARNLRFSEVD HSSARLTWES TSRLVRGYRV MYVKTNGVQT TEVDVGKVTT YLLKNLTSLT
EYTVGVFAIY DEGEAEAVTE SFTTKVVPDP LDLRSSDITA ESFRVSWQHP ATDVTLYRIT
WTPTDGGDSK DVLVDSNVNT YKITGLSPDS EYEVLLAAIY ANEIESDEVI LVENTAKRTT
TVATTSSKPS PRHGVRNMKI DDETTFSLRV SWQPVDSRNV RQYRLSYISM RGDRATETRT
VPPAQNSIVL QPLLSDTEYK ITLIPVYPDG DGPVASQVGR TLPLSAPKNL RVSEEWYNRF
RISWDVPPSP TMGYRVVYQP LSAPGPALET FVGEDVNTML IVNLLSGTEY SVKVIASYTT
GSSEALSGRA KTLYLGVTNL STYQVRMTSV CAQWLTHRHA SAYRVVIQPL LGSQKQEIRL
GAGSNLHCFS NLKPNTEYKI SVYAQLQDGT EGPAATATVK TLPVPTQAPT KPPATTPLPT
IPSAKEVCRA AKADLVFLVD GSWSIGDDNF LKIIRFLYST VGALDRIGPD GTQVAIAQFS
DDARTEFKLN SYTNKERLLD AVNKISYKGG NTKTGRAIQH VKENIFTAEG GVRRGIPNVL
VVLTDGRSQD DVNKVSKEMQ MEGYIVFAIG FADADYGELV SIASKPSDRH VFFVDDLDAF
QKIEEKLVTF VCEAATATCP SVPMSGSTTP GFRMMELFGL VEDRYNSIYG VSMVPGTFNA
FPSFHLHSNA LLAQPTRFIH PEGLPSDYTV SLLFRLLHDT PEEPFALWEI LNNNNEPLAG
VILDNGGKTL TFFNNDYKGD FQTVTFEGPE IKKLFFGSFH KLHVAISKTS AKVFVDCKMV
SERAINAAGN ITTDGLEVLG RMVRSRGNKD NSAPFQLQNF DIVCSTSWAS RDKCCELPGL
RKEAECPALP KACTCTQDSK GPPGPPGVPG GPGIRGARGD RGEPGPVGPA GAVGDMGVPG
PQGPPGPQGP SGRSIIGPPG SPGERGQKGD PGQQGQQGIP GRPGAPGREG PPGPRGLVGK
DGPQGRQGPP GSMGTPGAPG SPGSTGPPGK QGELGPPGSP GSRGEKGDRG DVQSTASVQA
IARQVCEQLI QSHMARYNTL LTQVPSPPVS IRTVPGPPGE PGRQGSPGPQ GEQGPPGRPG
FPGQNGQNGN PGERGQPGEK GEKGSQGVGV QGPRGPPGPP GAAGQGRPGS QGQSGRPGNP
GAPGRPGVPG PVGPPGPQGY CDQNSCVGYN VGEGEDVTDR GAVSAVQLPP NVFQNYGEVE
EDDPYRYYQP NYPAPQPVSP EDPSLAYGDI ELRSPGVHRS SRSVGSEGEK VGPKRRRKSR
AKELPGLTN
//