ID A0A0N8K2F7_SCLFO Unreviewed; 1334 AA.
AC A0A0N8K2F7;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
GN ORFNames=Z043_102950 {ECO:0000313|EMBL:KPP77609.1};
OS Scleropages formosus (Asian bonytongue) (Osteoglossum formosum).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Osteoglossocephala;
OC Osteoglossomorpha; Osteoglossiformes; Osteoglossidae; Scleropages.
OX NCBI_TaxID=113540 {ECO:0000313|EMBL:KPP77609.1, ECO:0000313|Proteomes:UP000034805};
RN [1] {ECO:0000313|EMBL:KPP77609.1, ECO:0000313|Proteomes:UP000034805}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Aro1 {ECO:0000313|EMBL:KPP77609.1};
RA Tan M.H., Gan H.M., Croft L.J., Austin C.M.;
RT "The genome of the Asian arowana (Scleropages formosus).";
RL Submitted (AUG-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KPP77609.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARO02000721; KPP77609.1; -; Genomic_DNA.
DR STRING; 113540.ENSSFOP00015037482; -.
DR Proteomes; UP000034805; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR025615; TILa_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR46160; ALPHA-TECTORIN-RELATED; 1.
DR PANTHER; PTHR46160:SF4; VWFD DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF01826; TIL; 3.
DR Pfam; PF12714; TILa; 3.
DR Pfam; PF00094; VWD; 4.
DR SMART; SM00832; C8; 3.
DR SMART; SM00215; VWC_out; 2.
DR SMART; SM00216; VWD; 4.
DR SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR PROSITE; PS51233; VWFD; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000034805};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..180
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 392..571
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 780..965
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1168..1334
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
SQ SEQUENCE 1334 AA; 145967 MW; 0DDD5EA50A16003D CRC64;
MGDPHYRTFD GQYFTFMGNC TYIMTKNCQV DSNHPGFQVE TKNDKNANSQ VTSVGMVTIN
ILGTTITVVR NEYGYVRVDY EVWSLPITME RGNNIVELYQ SGMSFFMEAD FGLTVQYDWQ
QYITITVADT FAGRVCGLCG NFNGKQGDDL TTPNGSEASS VVALGKSWRV QGAPGDANCG
DECSGQCEDC KSGLLGHLEG EIFCSLLTRI MEGPFKRCNA VIEPKFYKQM CLYDFCMGEG
VKKYLCDTLQ VYTDACQRAG IKVLDWRKLA RCPNPQCPEN SHYELCGNPC PTTCDNPSAP
SMCNTDCVET CACDEGYIRS GNQCVPPSEC GCQYEGHYVQ AGKSFWGDSN CSKRCQCSKT
GGKVTCEETS CQSGQLCMVL DGIRDCHPTS YATCLVSGDP HYLTFDGQRY NFQGTCVYQM
AGVCSKDATL EPFDVLVQND FRGNRVSSTT KLVEVRVYGQ TIVISEQYPG VVMVNGELAN
LPVTLADENV MIYKSGLFAV VQISFGVKIS FDWNSVAFVI VPSTYEGAMC GLCGNYNQNP
KDDMKMKDGE IAANGTELGQ SWRVAEIPGC VHGCKGPCPD CDITQKVQYE TNQYCGLLQD
PQGPFSNCFS TVDPSGFFQD CLYDVCLYKG QNAMQCKTLT AYTAACQSKG VKLDEWRTPN
FCELNCPANS HYELCSGGCP ATCDNLSPHI GCKELCQEGC TCNKSFILSG NQCVPFEKCG
CTYDGRYYSF GETFYPHGQC QEECKCTSDG KTECKNFSCS ADEKCEVKNG VRGCYPVGKA
VCTIVGDPHY KTFDNSTYDF QGTCTYVAAK GCYLEGTQLT PFTVVVENEQ WYPMVPNRNV
SVAKLVALEA YGNVLILRKN QIGKIMVNGV LVNLPLSINN GAIQAYQEGY YDVIKTDFGV
TIKYDLVYHI TIAVPADYEE KTCGLCGNFN GNKNDDFQLP DGKTTKNLST FGAMWKVSTP
GVICDDGCTG DLCPKCKKDM VVYEEECDII VNPNGPFAAC QKVIDPGSFF RDCVYDVCMS
EGDRKVLCSS IAAYVANCQN VKVEIKSWRT PSFCPLSCPV NSHYEICDET CSTTCPGLTD
VVNCPTTCVE GCTCNTGYFF NGTGCVSWDQ CSCYANGLTY KIGESIITEN CDEVCTCQPS
GVVVCETMQC TASETCRIEK GVRGCYQNQC LLQAGGVFTL FSGMSGTLTS AGAYEIVEVC
DDTLVAEWFR VVADLQMCGQ TGTATVAAVY TFFEDMAIAV NSKHSVWVNG KTVSLPIMLN
NEISITVSDK NLIIENQSGL LVTYSLSLDL SVTVSATLSG KMCGACGKIS GNNTVIASIQ
NYMNSWRAPD FPSW
//