ID A0A3B1JN91_ASTMX Unreviewed; 1498 AA.
AC A0A3B1JN91;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=VWFD domain-containing protein {ECO:0000259|PROSITE:PS51233};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000043788.1, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000043788.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSAMXT00000045431.1; ENSAMXP00000043788.1; ENSAMXG00000034755.1.
DR GeneTree; ENSGT00940000162219; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000034755; Expressed in pharyngeal gill and 3 other cell types or tissues.
DR CDD; cd19941; TIL; 3.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR InterPro; IPR025155; WxxW_domain.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF371; MUCIN-5B; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF13330; Mucin2_WxxW; 1.
DR Pfam; PF01826; TIL; 1.
DR Pfam; PF00094; VWD; 5.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM00832; C8; 3.
DR SMART; SM00215; VWC_out; 1.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57567; Serine protease inhibitors; 3.
DR PROSITE; PS51233; VWFD; 3.
PE 4: Predicted;
KW Copper {ECO:0000256|ARBA:ARBA00023008};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 13..163
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 327..472
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 745..912
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT REGION 1080..1201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1315..1485
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1315..1446
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1453..1485
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1498 AA; 164424 MW; DEAD9D5D65DBF8DF CRC64;
MFHPGLNVDH SGQVCSTWGN FHFNTFDGQF FQLPYACNYI LATMCDSIMP DFNIQLRRTY
DNDIVTSTFT LWLQENVIKL HNVRIERSTD YVKIRSSKLG LTVYWDKQNS LWIEINEKYS
GQICGLCGDF NGIKDNEFTE NGRAKLGPEN NLHLKQLTRA MCEEYFYIKA FSGCHSLLPV
AEFVEACIKD LCQCNSSQAD CLCDTLSEYS RQCAHAGGQP QSWRTNALCD KKCPFNLEHS
ECSRPCQDTC SNQQTSQVCA EHCVDGCFCP SGTVFNDIEK NGCVSVNECP CTHNGNIYQA
GESYTRTCQK CKCTNGRWNC VKLDCLGTCS LLGGSHITTY DGKAYTFQGN CYYILSKDKE
QTVSGSLAQC GTQTCLTEVQ LSIQGTTVNR LTIFRPSTFF IIAQTPSFQL TIQMVPIMQV
YILVSPEKKG TLNGLCGNFN DIQADDFTTE SGQKEGTATT FANIWKARSD CPDQQIHTTD
IYYSYNYFVI SEKYAKEWCH ILTDTDGVFS PCHAEVNPNN YKDRCVYDTC QCANSEECMC
AALSSYAHVC AAKGVLLDGW RNNTCGKFSK CEGNMQYSYS QTSCGSTCRS LSGIDYTCKV
KHTPVDGCGC AEGTYLNDKD ECVSDSSCPC YYNNQNLIKW LCILMHVHFL VTDCKAPMYY
FNCSDADPGA KGIECQKSCQ TQSPYCISTP CQSGCMCPNG QLVNRHGTCV TEDLCECFHN
GLYYQPYETI QENWQLICTT KKCPATCTIF GEGHYKTFDG TRYDFNGDCE YLLAQDYCSN
SFTGSFRIIT ENIPCGTTGT TCSKSIKIFL GERILLNENI TDIPYNYSKK IPYKIHTVGI
YRIIEASNGL VLYWDNKTSL HIRLSPSFQG KVCGLCGNYD GNGKNDFVTR AGEEVVEPLK
FGNSWRVFTT CSKAKIVSSP CELRPHRHTW AAKHCNIINE EGVFGLCHNL VDRSKYYDIC
MQDTCACDTG GDCECFCTAV AAYAAACREQ GICISWRNPT TCPLFCDYYN SPGSCEWHYQ
SCGDPCIKTC TNPLGHCSNI TSIKVEGCFP KCPEDKPYFD EKNMTCVECC NNTCTEKQTN
TTTNTTTPST PSTTIPTTTP STASTTIPTT TPSTTPSTTP TTTPSTTPST TTPSTTPSTT
TPSTTPSTTT PSTTPSTTTP STTPSTASTT TPSTTPSTTP STTPSTTPST ASTTPSTTTT
TTSTYFCPVL CQWSEWLDTH KIYYEPNIAE DEFESIESMW KSKKISCEHP QEIKCRHTDV
ELATQHTDVE LSDLKENLIC NVSVGLICKH KDQKKGQCYN HEIQVKCCTP SCAPHVHSST
STPSTTTHST TTQSTTTQLT TPSTTTQSTT PSTTPSTTPS TTPSTTPSTT TQSTTPSTTL
STTPTTPSTL PSTTPSTTPS TTPSTTPSTT PTTTLSTTPR TPSTTPSTTP STTPTTTLST
TPTTPSTTPS TTPPTPSTTT SSTTPSTTTS PTTTSTTTPP TTTPCSCTYQ TKMFHPGK
//