GenomeNet

Database: UniProt
Entry: A0A2Y9S448_PHYMC
ID   A0A2Y9S448_PHYMC        Unreviewed;      2121 AA.
AC   A0A2Y9S448;
DT   12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT   05-JUN-2019, sequence version 2.
DT   27-MAR-2024, entry version 21.
DE   SubName: Full=LOW QUALITY PROTEIN: collagen alpha-4(VI) chain-like {ECO:0000313|RefSeq:XP_023970945.2};
GN   Name=LOC102979304 {ECO:0000313|RefSeq:XP_023970945.2};
OS   Physeter macrocephalus (Sperm whale) (Physeter catodon).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC   Physeteridae; Physeter.
OX   NCBI_TaxID=9755 {ECO:0000313|Proteomes:UP000248484, ECO:0000313|RefSeq:XP_023970945.2};
RN   [1] {ECO:0000313|RefSeq:XP_023970945.2}
RP   IDENTIFICATION.
RC   TISSUE=Muscle {ECO:0000313|RefSeq:XP_023970945.2};
RG   RefSeq;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_023970945.2; XM_024115177.2.
DR   KEGG; pcad:102979304; -.
DR   InParanoid; A0A2Y9S448; -.
DR   OrthoDB; 5359724at2759; -.
DR   Proteomes; UP000248484; Chromosome 1.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd01472; vWA_collagen; 3.
DR   CDD; cd01450; vWFA_subfamily_ECM; 2.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 8.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00092; VWA; 8.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00327; VWA; 9.
DR   SUPFAM; SSF53300; vWA-like; 9.
DR   PROSITE; PS50234; VWFA; 8.
PE   4: Predicted;
KW   Collagen {ECO:0000313|RefSeq:XP_023970945.2};
KW   Reference proteome {ECO:0000313|Proteomes:UP000248484};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..2121
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5030062149"
FT   DOMAIN          33..209
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          234..412
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          429..601
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          632..809
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          842..1011
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1023..1196
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1578..1721
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1781..1938
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          1408..1541
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2059..2089
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1503..1517
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2060..2079
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2121 AA;  233251 MW;  C3816C22BCDC252F CRC64;
     METWKTFWGI IFLAAGFGFK SQRIVCRDDS VGDVVFLVDT SNNPQNTHSA RSFLYTVVNG
     VSVGGEAVCV GLAWYSDQPQ SEFLLSTYHR KGEVLRRIQR FPFTPGGHKM GLALQFLLDH
     HFQETAGSRA SQGVPQVAVV ISSSPAEDHV QEPADDFRRA GILLYAVGVG DAVSADLKEI
     SSSPVEKFVA FVPNFSALGS FALKLRQELC DTLAKAAPPV GHVSPACRET VLADIVFLVD
     SSTSIGXQNF QKVKNFLHSV ALGLDISSDQ VRVGLAQYND NIYPAFWLNQ HRLKSVVLEH
     IWNLPYHTGG TNTGSALEFI RTNYLTEAAG SRAKDGVPQV VILVTDGESN DEVQEAADRL
     KQDGVVVYVV GVNVQNVQEL QTIASEPLEK FLFNAENFNI LQDFSGSILQ TLCLAVEGKI
     KDSAQHYADV VFLADTSQNT SWTSFQWMQT FISRVDGMLD VGRDKYQIGM AQYGGQGHTE
     FLFNTYQTQD EMMTHIHEHF VLWGGSSRTG KALQYLYQTF FQEAEGSRFL QGIPQYAVVI
     TSGKSEDEVH EAAETLREKG VKVMSVGVQD SDKRELQGMG TPSLVYEMQG QDRVRQVMHH
     VSGVIQGTGQ LETKNEAKME PIEACLTAIP ADLVFLIEEF SRAQQSNFQH VVNFLKTTVS
     SLSIHPDVVR IGLVFYSEEP XLEFSLDTFQ NSAKILEHLD KLTCRRRGER TKTGAALDFL
     RNEVFIQEKG SQFQQGVQQV GVVIVEDFSQ DNVSXPASLL RRMGVTVYAV GTQLPLESMD
     LEKIASYPPC KHVIPLESFL QLAVVGSKIK NQLCPEITGK RASVSGMSSA LQEGCVHIEK
     AELYFLIDGS DSIHQDDFLE MKVFMNEVIK MFHIGPDRVQ FGVFQYSDEI SSQFTLSQHT
     SVAGLKVAID GIQQNGGGTM TGQALGSLRQ VFADTALSNV PWYLIIITDG KSMDPVAEVV
     EALRGDGVTI YAIGIRDANT IELQEIAEDR MFFVNDSDSL KAIQQEVLQD ICSSETCKNR
     KADIIFLTDG SESISLKDFE KIKGFMKRMV NESNIGADEI QIAFXTLLQF SSNPQEEFRL
     NRRYSSKVDI HGAISDVKQI NDDTYTGKAL NFTLPFFGSS RGGRPSVHQN LIVITDGVAR
     DNVAIPARAL RNRNIIIFAI GVGEAKHSQL LEITNDQSKV YYEEKFEFLQ NLEKKMLYQV
     CIPQGECNVD FSVAIDLSTP TRQVQQRLQG LLPELMQELA MLSNISCGVP GQTNVMLRYL
     VPGSKGQLIF DSGFEKYSDE TIQKFLIHQA ASSNHMDVDF LQSLGHSAIH LSSAKVKVLL
     VFTDGLDDDL ERLKEISKLL RSKGFSRLLT VGLEGVHKLX ELEFGGGFAY TQPLSITQRS
     LPSILLKQLD TIVERSCCNM YAKCFGEDGH RGDGGSPGRK GERGPQGLPG PRGEEGCWGM
     RGPKGVSGFS REKGNPGEEG PDGLDGEQGY HGVPGSSGEK GNWGNRGEPG FPGYPGAQGE
     DGDLGHQGEK GAKEIRGKRG NAGLPGFVGT PGDPGPVGRL GIKGPKGVVD MMPCXIINFA
     RENSPCSGVS KCPCFPTEVV FALDVSNDVS QLDFERMGGV LLSLLMKMEI SGSNCPTGAR
     VAIVSYDSKT DYVVRFSDHK ERPALLQEVR GLSLAGSSSS RNLGDSMRFV VRHVFKCVRA
     GRLLRKVAVF FQAGWTQDAG SISTATLELS ALDITSVVIT FMEDHKLPDA LLMDGTNKFH
     LYVWETENQQ DMAHCTLCYD QCSPAPECGL GVPRPLAVGM DVAFVVDSSV GVGTDLYRTA
     LTLVDTTLDD LEVAAQASAS PHGARMALVT HTTPYFWPGA GWPPVREGFH LTSYAXRTQM
     QRHVREALDH PLRGAPALGH ALEWTLEKVL LANPLPRKVQ VLFTIVASET SSWDREKLRT
     PSLDAKCKRI TLFVLALGPG MGTHVASAPS EQHMLHLEGL LDAEVAYARG FTRAFLNLLK
     SGTNQYPPPE LIEECGDPSR GDTFLQPILS VKRLPKHQFG KSGLADDLEA LKATGSFLEE
     NRKAMMTSFT QQEALENYEK SGYNAEENEQ EKPTKPKGMG KERNLGTAFG PCSLDPMEGD
     YVLKWSYNEK EQACRQFWYG G
//
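
The FT (feature table) lines in the entry above follow a regular layout: a feature key with a `start..end` range, followed by indented `/qualifier="value"` lines. As a minimal sketch of how those lines could be read programmatically, the Python below parses a short excerpt copied from this entry. The names `parse_features`, `FEATURE_RE`, `QUALIFIER_RE`, and `RECORD_EXCERPT` are hypothetical and introduced only for illustration. The sketch assumes simple numeric ranges and single-line qualifiers, as seen in this entry; it is not a complete UniProt flat-file parser.

```python
import re

# Excerpt of FT lines copied from the entry above (not the full feature table).
RECORD_EXCERPT = """\
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   DOMAIN          33..209
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          234..412
FT                   /note="VWFA"
FT   REGION          1408..1541
FT                   /note="Disordered"
"""

# A feature line: "FT", three spaces, the feature key, then "start..end".
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# A qualifier line: "FT", indentation, then /name="value".
QUALIFIER_RE = re.compile(r'^FT\s+/(\w+)="(.*)"\s*$')


def parse_features(text):
    """Return a list of dicts, one per feature, with any qualifiers attached."""
    features = []
    for line in text.splitlines():
        feature = FEATURE_RE.match(line)
        if feature:
            features.append({
                "key": feature.group(1),
                "start": int(feature.group(2)),
                "end": int(feature.group(3)),
            })
            continue
        qualifier = QUALIFIER_RE.match(line)
        if qualifier and features:
            # Qualifiers belong to the most recently seen feature.
            features[-1][qualifier.group(1)] = qualifier.group(2)
    return features


if __name__ == "__main__":
    for feat in parse_features(RECORD_EXCERPT):
        length = feat["end"] - feat["start"] + 1  # ranges are inclusive
        print(feat["key"], feat["start"], feat["end"], length, feat.get("note", ""))
```

Because the positions are 1-based and inclusive, a feature's length is `end - start + 1`. Ranges with uncertain boundaries (for example a leading `<` or `?`), which do not occur in this entry, would need extra handling.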