ID G3WK21_SARHA Unreviewed; 514 AA.
AC G3WK21;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 24-JAN-2024, entry version 59.
DE RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN Name=MATN4 {ECO:0000313|Ensembl:ENSSHAP00000015776.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015776.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000015776.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000015776.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3WK21; -.
DR STRING; 9305.ENSSHAP00000015776; -.
DR Ensembl; ENSSHAT00000015905.2; ENSSHAP00000015776.2; ENSSHAG00000013164.2.
DR eggNOG; KOG1217; Eukaryota.
DR GeneTree; ENSGT00940000157086; -.
DR HOGENOM; CLU_008905_1_1_1; -.
DR TreeFam; TF330078; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR Gene3D; 1.20.5.30; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR036337; Matrilin_cc_sf.
DR InterPro; IPR019466; Matrilin_coiled-coil_trimer.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF14; MATRILIN-4; 1.
DR Pfam; PF10393; Matrilin_ccoil; 1.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM01279; Matrilin_ccoil; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF58002; Chicken cartilage matrix protein; 1.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 9..188
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 279..454
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
SQ SEQUENCE 514 AA; 56924 MW; 0C69D5143E67953B CRC64;
SKCRTGPLDL IFLIDSSRSV RPFEFETMRR FLVNIIRGLD IGPNATHVGV IQYSSQVQSV
FPLGAFFRRE DMERAIQAIV PLAQGTMTGL AIQYTMNVAF SVAEGARPPQ ARVPRVVVIV
TDGRPQDRVA EVAAQARNRG IEIYAVGVQR ADVGSLRAMA SSPLDEHVFL VESFDLIQQF
GFHFQGRLCV IDHCSFGNHS CQHECVNFPE GPRCRCRQGF ELQSDGRHCK ARDYCNGVDH
GCEYQCVSSG SSYSCICPEG RQLQADGKSC DRCRAGHVDL VLVIDGSKSV RPQNFELVKR
FVNQIVDFLD VSPEGTRVGL VQYSSRVRTE FPLGRYGTAD EVKQAVLAIE YMEKGTMTGL
ALRHLVEHSF SEAQGARPRA QNVPRVGLVF TDGRSQDDIS VWAARAKEEG IIMYAVGVGK
AVEEELREIA SDPPEQHVSY SPDFNTMTHM LENLKVNICP EEGKGVTELR SPCDCESLVE
FQSSALAALG SLEQKLTQLT ARLEDLENQL ANQK
//