GenomeNet

Database: UniProt
Entry: K7FM37_PELSI
LinkDB: K7FM37_PELSI
Original site: K7FM37_PELSI 
ID   K7FM37_PELSI            Unreviewed;      1809 AA.
AC   K7FM37;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 54.
DE   SubName: Full=Collagen type XIV alpha 1 chain {ECO:0000313|Ensembl:ENSPSIP00000009097.1};
GN   Name=COL14A1 {ECO:0000313|Ensembl:ENSPSIP00000009097.1};
OS   Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC   Trionychidae; Pelodiscus.
OX   NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000009097.1, ECO:0000313|Proteomes:UP000007267};
RN   [1] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG   Soft-shell Turtle Genome Consortium;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX   PubMed=23624526; DOI=10.1038/ng.2615;
RA   Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA   White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA   Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA   Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA   Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT   "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT   into the development and evolution of the turtle-specific body plan.";
RL   Nat. Genet. 45:701-706(2013).
RN   [3] {ECO:0000313|Ensembl:ENSPSIP00000009097.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCU01192358; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192359; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192360; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192361; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192362; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192363; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192364; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192365; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01192366; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   Ensembl; ENSPSIT00000009143.1; ENSPSIP00000009097.1; ENSPSIG00000008034.1.
DR   GeneTree; ENSGT00940000153769; -.
DR   Proteomes; UP000007267; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 7.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00041; fn3; 8.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 8.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 7.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50853; FN3; 7.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..29
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           30..1809
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003901894"
FT   DOMAIN          33..123
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          162..337
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          362..451
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          452..543
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          544..634
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          635..727
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          743..835
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          837..927
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1044..1217
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          114..152
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1474..1625
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1651..1809
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        114..139
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1573..1587
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1722..1750
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1790..1809
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1809 AA;  195380 MW;  FC9D7D4BBAFDD82A CRC64;
     MKVCQYKIRS QFFWAFLVIT VHFISPAQGQ VAPPTRLRYN LVTHDSVQIS WKAPKGKFTG
     YKLLVTPSEG GKTNQLTLQN SATKAIIQGL IPDQSYVVQI ITYHKDQESK AAQGQFRIKD
     LERKKETSSK SKVKGKEGTN GSKPHPSTEE NQFTCKTPAI ADIVILVDGS WSIGRFNFRL
     VRLFLENLVG AFNVGSEKTR IGSLGLAQFS GDPRIEWHLN AFSTKDGVLD AVRNLPYKGG
     NTLTGLALTF ILENSFKPEA GARSNVSKIG ILITDGKSQD DVIPPAKNLR DAGVELFAIG
     VKNADINELK EIASEPDSTH VYNVADFSYM NTIVESLTRT VCSRVEEQEK EIKGTFVTTL
     GAPTDLVTSE VTARGFRVSW THAPGKVEKY RVVYYPTRGG QPEEVVVDGS VSTAVLKNLM
     SLTEYQIAVF AVYVSAASEG LRGTETTLAL PMASNLQLYD VSQSSMRAKW SAVMGATGYM
     ILYAPLTEGL AADEKEMKIG EALTDIQLDG LLPNTEYTVT VYAMFGEEAS DPLTGQETTL
     PLSPPRNLRF SDIGHSTARI TWEPASKKVK GYRIMYVKTD GTETNEVEVG RVSTQTLKRL
     TSHTEYTVAI FSLYEEGQSE PLTGSFTTQK VPAPQYLDVD EVSTDSFRVS WKPMSSDIAH
     YKLAWIPLNG GTSEEVVLSG DKDTHVVEGL LSNTEYEVSL LAVYSDESES DVVAVLGTTH
     DFCTKPRTTT LSTSIATSIF RTGIRNLVID DETTSSLRVK WDISDYNVHQ FRVTYLTSKG
     DRAEEVVRMV IVPGRQNNLL LQPLLSDTIY KVTVTPIYSD GEGVSLSAPG KTLPLSAPRN
     LRVSDEWYNR IRISWDAPPS PTMGYRIVYK PINIPGPALE TFVGDDINTI LVLNLFSGTD
     YSVKVFASYS TGFSDALTGT AKTLYLGVTN LDVYQVRMTS ICAQWQLHRH ATAYRIVVES
     LVDGRKKEVI LGGGTPSHCF FELTPGTEYK VSVYAQLQEL EGPGVSIMET TLPFPTQPPT
     PPSTTVPPPT IPPAKEVCKA AKADLVFLVD GSWSIGDDNF NKIIGFLYST VGALDRIGPD
     GTQVAIAQFS DDPRTEFKLN LYKTKETLLE AIRQIAYKGG NTKTGKAIKH AREALFTTDS
     GIRRGIPKVL VVITDGRSQD DVNKVSREMQ LDGFSIFAIG VADADYSELV NIGSKPSERH
     VFFVDDFDAF EKIEDELITF VCETASATCP LVYKDGNSLA GFKMMEMFGL VEKEFSTVEG
     VSMEPGTFNA YPCYRLHKDA LISQPTKYLH PEGLPSDYTI TFLFRILPDT PQEPFALWEI
     LNEKYEPLVG VILDNDGKTL IFFNYDYRGD FQTVTFEGPE IKKIFYGSFH KLHVVISKTM
     AKIIIDCKQV SEKTINAAGN ITSDGIEVLG RMVRSRGQRD NSAPFQLQMF DIICTTSWAN
     RDKCCELPAL RDEESCPSLP HSCSCSEFSK GPLGPPGPPG GPGVRGPKGQ HGDQGPKGLD
     GPRGEVGAPG PQGPPGPQGP SGLSIQGLPG TPGEKGEKGD RGLPGQQGVP GASGSPGRDG
     SQGQRGLPGK DGPTGPQGPP GPVGIPGAPG VPGVMGNTGP QGGVGPPGAP GVKGERGERG
     DVQSQTMVRA VARQVCEQII QSHMARYNSL LNQIPSQSDS TRTIPGPPGE PGRPGSPGPQ
     GEQGASGRPG FPGNPGQPGR PGERGLPGEK GERGTPGVGT QGPRGPPGPP GPSGESRTGS
     PGPPGSPGPR GPVGHAGVPG PQGPSGQPGY CDPSSCAGYD VGEPRIPQPE FTPVREEMET
     IEEVRSPGI
//
DBGET integrated database retrieval system