ID K7FM37_PELSI Unreviewed; 1809 AA.
AC K7FM37;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE SubName: Full=Collagen type XIV alpha 1 chain {ECO:0000313|Ensembl:ENSPSIP00000009097.1};
GN Name=COL14A1 {ECO:0000313|Ensembl:ENSPSIP00000009097.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000009097.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000009097.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01192358; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192359; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192360; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192361; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192362; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192363; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192364; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192365; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01192366; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSPSIT00000009143.1; ENSPSIP00000009097.1; ENSPSIG00000008034.1.
DR GeneTree; ENSGT00940000153769; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 7.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 7.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 7.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1809
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003901894"
FT DOMAIN 33..123
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 162..337
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 362..451
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 452..543
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 544..634
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 635..727
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 743..835
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 837..927
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1044..1217
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 114..152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1474..1625
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1651..1809
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 114..139
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1573..1587
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1722..1750
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1790..1809
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1809 AA; 195380 MW; FC9D7D4BBAFDD82A CRC64;
MKVCQYKIRS QFFWAFLVIT VHFISPAQGQ VAPPTRLRYN LVTHDSVQIS WKAPKGKFTG
YKLLVTPSEG GKTNQLTLQN SATKAIIQGL IPDQSYVVQI ITYHKDQESK AAQGQFRIKD
LERKKETSSK SKVKGKEGTN GSKPHPSTEE NQFTCKTPAI ADIVILVDGS WSIGRFNFRL
VRLFLENLVG AFNVGSEKTR IGSLGLAQFS GDPRIEWHLN AFSTKDGVLD AVRNLPYKGG
NTLTGLALTF ILENSFKPEA GARSNVSKIG ILITDGKSQD DVIPPAKNLR DAGVELFAIG
VKNADINELK EIASEPDSTH VYNVADFSYM NTIVESLTRT VCSRVEEQEK EIKGTFVTTL
GAPTDLVTSE VTARGFRVSW THAPGKVEKY RVVYYPTRGG QPEEVVVDGS VSTAVLKNLM
SLTEYQIAVF AVYVSAASEG LRGTETTLAL PMASNLQLYD VSQSSMRAKW SAVMGATGYM
ILYAPLTEGL AADEKEMKIG EALTDIQLDG LLPNTEYTVT VYAMFGEEAS DPLTGQETTL
PLSPPRNLRF SDIGHSTARI TWEPASKKVK GYRIMYVKTD GTETNEVEVG RVSTQTLKRL
TSHTEYTVAI FSLYEEGQSE PLTGSFTTQK VPAPQYLDVD EVSTDSFRVS WKPMSSDIAH
YKLAWIPLNG GTSEEVVLSG DKDTHVVEGL LSNTEYEVSL LAVYSDESES DVVAVLGTTH
DFCTKPRTTT LSTSIATSIF RTGIRNLVID DETTSSLRVK WDISDYNVHQ FRVTYLTSKG
DRAEEVVRMV IVPGRQNNLL LQPLLSDTIY KVTVTPIYSD GEGVSLSAPG KTLPLSAPRN
LRVSDEWYNR IRISWDAPPS PTMGYRIVYK PINIPGPALE TFVGDDINTI LVLNLFSGTD
YSVKVFASYS TGFSDALTGT AKTLYLGVTN LDVYQVRMTS ICAQWQLHRH ATAYRIVVES
LVDGRKKEVI LGGGTPSHCF FELTPGTEYK VSVYAQLQEL EGPGVSIMET TLPFPTQPPT
PPSTTVPPPT IPPAKEVCKA AKADLVFLVD GSWSIGDDNF NKIIGFLYST VGALDRIGPD
GTQVAIAQFS DDPRTEFKLN LYKTKETLLE AIRQIAYKGG NTKTGKAIKH AREALFTTDS
GIRRGIPKVL VVITDGRSQD DVNKVSREMQ LDGFSIFAIG VADADYSELV NIGSKPSERH
VFFVDDFDAF EKIEDELITF VCETASATCP LVYKDGNSLA GFKMMEMFGL VEKEFSTVEG
VSMEPGTFNA YPCYRLHKDA LISQPTKYLH PEGLPSDYTI TFLFRILPDT PQEPFALWEI
LNEKYEPLVG VILDNDGKTL IFFNYDYRGD FQTVTFEGPE IKKIFYGSFH KLHVVISKTM
AKIIIDCKQV SEKTINAAGN ITSDGIEVLG RMVRSRGQRD NSAPFQLQMF DIICTTSWAN
RDKCCELPAL RDEESCPSLP HSCSCSEFSK GPLGPPGPPG GPGVRGPKGQ HGDQGPKGLD
GPRGEVGAPG PQGPPGPQGP SGLSIQGLPG TPGEKGEKGD RGLPGQQGVP GASGSPGRDG
SQGQRGLPGK DGPTGPQGPP GPVGIPGAPG VPGVMGNTGP QGGVGPPGAP GVKGERGERG
DVQSQTMVRA VARQVCEQII QSHMARYNSL LNQIPSQSDS TRTIPGPPGE PGRPGSPGPQ
GEQGASGRPG FPGNPGQPGR PGERGLPGEK GERGTPGVGT QGPRGPPGPP GPSGESRTGS
PGPPGSPGPR GPVGHAGVPG PQGPSGQPGY CDPSSCAGYD VGEPRIPQPE FTPVREEMET
IEEVRSPGI
//