ID A0A2Y9S448_PHYMC Unreviewed; 2121 AA.
AC A0A2Y9S448;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 2.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=LOW QUALITY PROTEIN: collagen alpha-4(VI) chain-like {ECO:0000313|RefSeq:XP_023970945.2};
GN Name=LOC102979304 {ECO:0000313|RefSeq:XP_023970945.2};
OS Physeter macrocephalus (Sperm whale) (Physeter catodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Physeteridae; Physeter.
OX NCBI_TaxID=9755 {ECO:0000313|Proteomes:UP000248484, ECO:0000313|RefSeq:XP_023970945.2};
RN [1] {ECO:0000313|RefSeq:XP_023970945.2}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_023970945.2};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_023970945.2; XM_024115177.2.
DR KEGG; pcad:102979304; -.
DR InParanoid; A0A2Y9S448; -.
DR OrthoDB; 5359724at2759; -.
DR Proteomes; UP000248484; Chromosome 1.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd01472; vWA_collagen; 3.
DR CDD; cd01450; vWFA_subfamily_ECM; 2.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 8.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00092; VWA; 8.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00327; VWA; 9.
DR SUPFAM; SSF53300; vWA-like; 9.
DR PROSITE; PS50234; VWFA; 8.
PE 4: Predicted;
KW Collagen {ECO:0000313|RefSeq:XP_023970945.2};
KW Reference proteome {ECO:0000313|Proteomes:UP000248484};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..2121
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030062149"
FT DOMAIN 33..209
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 234..412
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 429..601
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 632..809
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 842..1011
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1023..1196
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1578..1721
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1781..1938
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1408..1541
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2059..2089
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1503..1517
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2060..2079
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2121 AA; 233251 MW; C3816C22BCDC252F CRC64;
METWKTFWGI IFLAAGFGFK SQRIVCRDDS VGDVVFLVDT SNNPQNTHSA RSFLYTVVNG
VSVGGEAVCV GLAWYSDQPQ SEFLLSTYHR KGEVLRRIQR FPFTPGGHKM GLALQFLLDH
HFQETAGSRA SQGVPQVAVV ISSSPAEDHV QEPADDFRRA GILLYAVGVG DAVSADLKEI
SSSPVEKFVA FVPNFSALGS FALKLRQELC DTLAKAAPPV GHVSPACRET VLADIVFLVD
SSTSIGXQNF QKVKNFLHSV ALGLDISSDQ VRVGLAQYND NIYPAFWLNQ HRLKSVVLEH
IWNLPYHTGG TNTGSALEFI RTNYLTEAAG SRAKDGVPQV VILVTDGESN DEVQEAADRL
KQDGVVVYVV GVNVQNVQEL QTIASEPLEK FLFNAENFNI LQDFSGSILQ TLCLAVEGKI
KDSAQHYADV VFLADTSQNT SWTSFQWMQT FISRVDGMLD VGRDKYQIGM AQYGGQGHTE
FLFNTYQTQD EMMTHIHEHF VLWGGSSRTG KALQYLYQTF FQEAEGSRFL QGIPQYAVVI
TSGKSEDEVH EAAETLREKG VKVMSVGVQD SDKRELQGMG TPSLVYEMQG QDRVRQVMHH
VSGVIQGTGQ LETKNEAKME PIEACLTAIP ADLVFLIEEF SRAQQSNFQH VVNFLKTTVS
SLSIHPDVVR IGLVFYSEEP XLEFSLDTFQ NSAKILEHLD KLTCRRRGER TKTGAALDFL
RNEVFIQEKG SQFQQGVQQV GVVIVEDFSQ DNVSXPASLL RRMGVTVYAV GTQLPLESMD
LEKIASYPPC KHVIPLESFL QLAVVGSKIK NQLCPEITGK RASVSGMSSA LQEGCVHIEK
AELYFLIDGS DSIHQDDFLE MKVFMNEVIK MFHIGPDRVQ FGVFQYSDEI SSQFTLSQHT
SVAGLKVAID GIQQNGGGTM TGQALGSLRQ VFADTALSNV PWYLIIITDG KSMDPVAEVV
EALRGDGVTI YAIGIRDANT IELQEIAEDR MFFVNDSDSL KAIQQEVLQD ICSSETCKNR
KADIIFLTDG SESISLKDFE KIKGFMKRMV NESNIGADEI QIAFXTLLQF SSNPQEEFRL
NRRYSSKVDI HGAISDVKQI NDDTYTGKAL NFTLPFFGSS RGGRPSVHQN LIVITDGVAR
DNVAIPARAL RNRNIIIFAI GVGEAKHSQL LEITNDQSKV YYEEKFEFLQ NLEKKMLYQV
CIPQGECNVD FSVAIDLSTP TRQVQQRLQG LLPELMQELA MLSNISCGVP GQTNVMLRYL
VPGSKGQLIF DSGFEKYSDE TIQKFLIHQA ASSNHMDVDF LQSLGHSAIH LSSAKVKVLL
VFTDGLDDDL ERLKEISKLL RSKGFSRLLT VGLEGVHKLX ELEFGGGFAY TQPLSITQRS
LPSILLKQLD TIVERSCCNM YAKCFGEDGH RGDGGSPGRK GERGPQGLPG PRGEEGCWGM
RGPKGVSGFS REKGNPGEEG PDGLDGEQGY HGVPGSSGEK GNWGNRGEPG FPGYPGAQGE
DGDLGHQGEK GAKEIRGKRG NAGLPGFVGT PGDPGPVGRL GIKGPKGVVD MMPCXIINFA
RENSPCSGVS KCPCFPTEVV FALDVSNDVS QLDFERMGGV LLSLLMKMEI SGSNCPTGAR
VAIVSYDSKT DYVVRFSDHK ERPALLQEVR GLSLAGSSSS RNLGDSMRFV VRHVFKCVRA
GRLLRKVAVF FQAGWTQDAG SISTATLELS ALDITSVVIT FMEDHKLPDA LLMDGTNKFH
LYVWETENQQ DMAHCTLCYD QCSPAPECGL GVPRPLAVGM DVAFVVDSSV GVGTDLYRTA
LTLVDTTLDD LEVAAQASAS PHGARMALVT HTTPYFWPGA GWPPVREGFH LTSYAXRTQM
QRHVREALDH PLRGAPALGH ALEWTLEKVL LANPLPRKVQ VLFTIVASET SSWDREKLRT
PSLDAKCKRI TLFVLALGPG MGTHVASAPS EQHMLHLEGL LDAEVAYARG FTRAFLNLLK
SGTNQYPPPE LIEECGDPSR GDTFLQPILS VKRLPKHQFG KSGLADDLEA LKATGSFLEE
NRKAMMTSFT QQEALENYEK SGYNAEENEQ EKPTKPKGMG KERNLGTAFG PCSLDPMEGD
YVLKWSYNEK EQACRQFWYG G
//