GenomeNet

Database: UniProt
Entry: G3WK21_SARHA
LinkDB: G3WK21_SARHA
Original site: G3WK21_SARHA 
ID   G3WK21_SARHA            Unreviewed;       514 AA.
AC   G3WK21;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   24-JAN-2024, entry version 59.
DE   RecName: Full=VWFA domain-containing protein {ECO:0000259|PROSITE:PS50234};
GN   Name=MATN4 {ECO:0000313|Ensembl:ENSSHAP00000015776.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015776.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000015776.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000015776.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JUL-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3WK21; -.
DR   STRING; 9305.ENSSHAP00000015776; -.
DR   Ensembl; ENSSHAT00000015905.2; ENSSHAP00000015776.2; ENSSHAG00000013164.2.
DR   eggNOG; KOG1217; Eukaryota.
DR   GeneTree; ENSGT00940000157086; -.
DR   HOGENOM; CLU_008905_1_1_1; -.
DR   TreeFam; TF330078; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   Gene3D; 1.20.5.30; -; 1.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR036337; Matrilin_cc_sf.
DR   InterPro; IPR019466; Matrilin_coiled-coil_trimer.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF14; MATRILIN-4; 1.
DR   Pfam; PF10393; Matrilin_ccoil; 1.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00181; EGF; 2.
DR   SMART; SM00179; EGF_CA; 2.
DR   SMART; SM01279; Matrilin_ccoil; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF58002; Chicken cartilage matrix protein; 1.
DR   SUPFAM; SSF57196; EGF/Laminin; 1.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT   DOMAIN          9..188
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          279..454
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
SQ   SEQUENCE   514 AA;  56924 MW;  0C69D5143E67953B CRC64;
     SKCRTGPLDL IFLIDSSRSV RPFEFETMRR FLVNIIRGLD IGPNATHVGV IQYSSQVQSV
     FPLGAFFRRE DMERAIQAIV PLAQGTMTGL AIQYTMNVAF SVAEGARPPQ ARVPRVVVIV
     TDGRPQDRVA EVAAQARNRG IEIYAVGVQR ADVGSLRAMA SSPLDEHVFL VESFDLIQQF
     GFHFQGRLCV IDHCSFGNHS CQHECVNFPE GPRCRCRQGF ELQSDGRHCK ARDYCNGVDH
     GCEYQCVSSG SSYSCICPEG RQLQADGKSC DRCRAGHVDL VLVIDGSKSV RPQNFELVKR
     FVNQIVDFLD VSPEGTRVGL VQYSSRVRTE FPLGRYGTAD EVKQAVLAIE YMEKGTMTGL
     ALRHLVEHSF SEAQGARPRA QNVPRVGLVF TDGRSQDDIS VWAARAKEEG IIMYAVGVGK
     AVEEELREIA SDPPEQHVSY SPDFNTMTHM LENLKVNICP EEGKGVTELR SPCDCESLVE
     FQSSALAALG SLEQKLTQLT ARLEDLENQL ANQK
//
DBGET integrated database retrieval system