ID K7FMG5_PELSI Unreviewed; 1115 AA.
AC K7FMG5;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 08-NOV-2023, entry version 30.
DE RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000009225.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000009225.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01057094; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01057095; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; K7FMG5; -.
DR Ensembl; ENSPSIT00000009271.1; ENSPSIP00000009225.1; ENSPSIG00000008248.1.
DR GeneTree; ENSGT00940000156850; -.
DR HOGENOM; CLU_308962_0_0_1; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR025615; TILa_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR46160; ALPHA-TECTORIN-RELATED; 1.
DR PANTHER; PTHR46160:SF4; ZONADHESIN; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF01826; TIL; 3.
DR Pfam; PF12714; TILa; 2.
DR Pfam; PF00094; VWD; 3.
DR SMART; SM00832; C8; 3.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR PROSITE; PS51233; VWFD; 3.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267}.
FT DOMAIN 1..178
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 384..559
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 780..959
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
SQ SEQUENCE 1115 AA; 122611 MW; 9AF1182C98519A22 CRC64;
TCSASGDPHY NTFDGRVHHF MGNCTYTLSK VCNASGGLPP FDVSTTNEHR GSNTKVSYVK
SVHVEVYGTQ ISLLQNRKVN VNGTRRNLPV EIENKIKIQV SGGYVLLETD FGLWVRFDGN
HYAEVSVPCD YQGQLCGLCG NYNGDPKDDN RKPNGSIAAD STELGESWLA AENNTVCSPD
QPGICDPQLE SEARKNTACG MITDPSGIFK DCHGHVAPQQ FLENCVYDMC HTDDPTASLC
NGLQAYAESC TNAGICREWR NSTLCPISCA EGSQYKSCGS RCPSSCVTPS AFSPCSSLPV
EGCFCEEGYT LSGDTCVPES KCGCVDGNNH YHQLGESWFT HDACKERCTC NSNNSIACAA
WECGTLEECR VEEGVLGCYS SGRASCQVAG DPHYFTFDKV MHTFLGTCTY TLMKVCNSSS
VVPVTISGKN EERGLRGATY LKEVYIDVYG NRITLQKDRA ILFNKERIRT PVKNRLRGVS
IGNVGIYLVV ETDFGLLVKY DGNQHLEISL PKSYFSKVCG MCGNYNDQRG DELLMPNGMQ
ASNVTQFGNS WKADTLLSPL KYALSVLPCS CLPDTREDLG PPCSVEDKPV VEKQCNVLKS
DIFKPCHHLV EPDLFIQTCV YDMCKYNGML STLCGIAQAY VDTCKQQGVI LKWRNSTFCP
LPCPSNSYYT DCASPCPATC NDIYAPSLCE KPDECAEGCV CSEGYVLSDD QCVPLSECGC
RDKDDSYYSA GESWITAHCT YKCKCQKDNI IKCQPYGCDA SEVCVLNNKG KYHCKPTGFG
KCLVIGDPHY MTFDGLMHHF QGKKTYVLSQ TVSSISDRLP AFSIEGKNKL VTKRPDLSFL
QEIHISVYNH TVWLGQKKQL VVDGVNTIPP AQPHEGLRIY QKPMRIYLET DFGLSLSYDG
VENLDMVLAN TYKNKVEGLC GNFDGKYKND FTKPDGTRVQ DVDVFGESWK VPVTRTVSRR
RRDVLSEEQL DALELNTGLT RGCSSSQLSL VNSTSKCGIL TDPNGPFAKC HPNVSVSFSQ
MGCVFDLCAE ADNSAVLCRH LEQYAQTCQQ NGVTLENWRQ QTSCEMQCPL NSKYSSCMSA
CPDSCSNLAA SSECEAPCVE GCECLSGYVL SGYNC
//