ID A0A643BY15_BALPH Unreviewed; 1086 AA.
AC A0A643BY15;
DT 22-APR-2020, integrated into UniProtKB/TrEMBL.
DT 22-APR-2020, sequence version 1.
DT 28-JAN-2026, entry version 19.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAB0392435.1};
GN ORFNames=E2I00_013453 {ECO:0000313|EMBL:KAB0392435.1};
OS Balaenoptera physalus (Fin whale) (Balaena physalus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC Balaenopteridae; Balaenoptera.
OX NCBI_TaxID=9770 {ECO:0000313|EMBL:KAB0392435.1, ECO:0000313|Proteomes:UP000437017};
RN [1] {ECO:0000313|EMBL:KAB0392435.1, ECO:0000313|Proteomes:UP000437017}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FinWhale-01 {ECO:0000313|EMBL:KAB0392435.1};
RX PubMed=31553763;
RA Westbury M.V., Petersen B., Lorenzen E.D.;
RT "Genomic analyses reveal an absence of contemporary introgressive admixture
RT between fin whales and blue whales, despite known hybrids.";
RL PLoS ONE 14:0-e0222004(2019).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAB0392435.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; SGJD01003852; KAB0392435.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A643BY15; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000437017; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR010363; DUF959_COL18_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF910; COLLECTIN-12; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06121; DUF959; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000437017}.
FT DOMAIN 1..118
FT /note="DUF959"
FT /evidence="ECO:0000259|Pfam:PF06121"
FT DOMAIN 994..1042
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT REGION 1..81
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 194..233
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 262..371
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 409..576
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 595..751
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 763..794
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 830..861
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 926..945
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 66..78
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..286
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..329
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..371
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 462..483
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 595..609
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 701..712
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1086 AA; 114228 MW; D23FBEF2D096A6CC CRC64;
MSWLWPGNAA GSTVAPASTP PGSSPVRPTE DTTTHVAPQD DPTQQWKALA SPEPPLERPE
VGQGQAPAVP SAASSASPDT KEENIAGVGA KILNVALGIR SFVQLSSETF PEAVIGHGGR
EVPGRPDPAL GGQTPGAHRT LTHGHSHAVT CTHSHVHTGG LIRALIDQRQ DAIVGDLVPP
SADLRLREDP QVSPLHCLDE DEEDDDDRGA PLGPRLPEAP PVTSPPLAGV GNQEDFRTEE
IEEETTVSSL GAQTLPSLST VTTWAGSEWS PGRGLKEGDP GEDGKPGDTG PQGFPGTPGD
VGPKGEKGDP GVGPRGPPGP QGPPGPPGPS FRRDRLTFID MEGSGFGGDL ESLRGPPGPP
GKDGQPGQTG QKGSLNIDAL LDLDILFLVS ILVALIMEKL SRLDKPVTGA RGFRRGRRPR
TQGTDKAPVP PLFLDGMGPP AEGVGGDPGV MGPPGTKGEV GADGAPGAPG LPGREGAAGL
QGPKGEKGPQ GEKGPAGPKG DLGSRGQQGL PGPKGEKGEP GMVFSPDGRA LTSAQKGAKG
EPGFRGPPGR GKGAQGPKSR VRPAHLGPQG PRAFPGLLSM TATRLWSLAA LDPRDCQPTK
QENKCDVGHR PGKSQESWTP GHCQRLRRGS GQRGLSGHHP QWGREGRCGE GSGADDTGTE
EEQQGGGETT RPRSAPLQRG RIPETPAWPR GGRNPVTSDP SEGRTCHLRD QPRTGGNVPL
ITEQEGRTAE HRRHNRDGPT EQGACGEERP QWELETLELK DKAVSEGQRC SQGRGWDSRP
SLRGPRWQWG HPALRKDPGQ ELRAVNMSRV ALGRRQFPFD LLHLGAEMKG EKGDQGAAGQ
KGERGEPGGG GFFSSSMPGP PGPPGYPGIP GLVADAARTF KLARPQPGDI GDSALCFAFH
ARGPRERASR ASLALLDLRD PLASATRDTR GLLGPPDLQG PRDPHPFLAL TGRLSVFLAP
QAHLGPQDRP DPWAPPQGGL LCAGDPIGLG FQQVRVWATY QTLLDQVPEV PEGWLVYVAD
REELYVRVRN GFRKVLLEAR TPLPRGTDEV LSPSWEALFS GSEGQLKPGA RIFSFDGRDV
LQHPAW
//