Database: UniProt
Entry: A0A643BY15_BALPH
ID   A0A643BY15_BALPH        Unreviewed;      1086 AA.
AC   A0A643BY15;
DT   22-APR-2020, integrated into UniProtKB/TrEMBL.
DT   22-APR-2020, sequence version 1.
DT   28-JAN-2026, entry version 19.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KAB0392435.1};
GN   ORFNames=E2I00_013453 {ECO:0000313|EMBL:KAB0392435.1};
OS   Balaenoptera physalus (Fin whale) (Balaena physalus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC   Balaenopteridae; Balaenoptera.
OX   NCBI_TaxID=9770 {ECO:0000313|EMBL:KAB0392435.1, ECO:0000313|Proteomes:UP000437017};
RN   [1] {ECO:0000313|EMBL:KAB0392435.1, ECO:0000313|Proteomes:UP000437017}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=FinWhale-01 {ECO:0000313|EMBL:KAB0392435.1};
RX   PubMed=31553763;
RA   Westbury M.V., Petersen B., Lorenzen E.D.;
RT   "Genomic analyses reveal an absence of contemporary introgressive admixture
RT   between fin whales and blue whales, despite known hybrids.";
RL   PLoS ONE 14:0-e0222004(2019).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAB0392435.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; SGJD01003852; KAB0392435.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A643BY15; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000437017; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR010363; DUF959_COL18_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF910; COLLECTIN-12; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06121; DUF959; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000437017}.
FT   DOMAIN          1..118
FT                   /note="DUF959"
FT                   /evidence="ECO:0000259|Pfam:PF06121"
FT   DOMAIN          994..1042
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   REGION          1..81
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          194..233
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          262..371
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          409..576
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          595..751
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          763..794
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          830..861
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          926..945
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        66..78
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        274..286
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        315..329
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        362..371
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        462..483
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        595..609
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        701..712
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1086 AA;  114228 MW;  D23FBEF2D096A6CC CRC64;
     MSWLWPGNAA GSTVAPASTP PGSSPVRPTE DTTTHVAPQD DPTQQWKALA SPEPPLERPE
     VGQGQAPAVP SAASSASPDT KEENIAGVGA KILNVALGIR SFVQLSSETF PEAVIGHGGR
     EVPGRPDPAL GGQTPGAHRT LTHGHSHAVT CTHSHVHTGG LIRALIDQRQ DAIVGDLVPP
     SADLRLREDP QVSPLHCLDE DEEDDDDRGA PLGPRLPEAP PVTSPPLAGV GNQEDFRTEE
     IEEETTVSSL GAQTLPSLST VTTWAGSEWS PGRGLKEGDP GEDGKPGDTG PQGFPGTPGD
     VGPKGEKGDP GVGPRGPPGP QGPPGPPGPS FRRDRLTFID MEGSGFGGDL ESLRGPPGPP
     GKDGQPGQTG QKGSLNIDAL LDLDILFLVS ILVALIMEKL SRLDKPVTGA RGFRRGRRPR
     TQGTDKAPVP PLFLDGMGPP AEGVGGDPGV MGPPGTKGEV GADGAPGAPG LPGREGAAGL
     QGPKGEKGPQ GEKGPAGPKG DLGSRGQQGL PGPKGEKGEP GMVFSPDGRA LTSAQKGAKG
     EPGFRGPPGR GKGAQGPKSR VRPAHLGPQG PRAFPGLLSM TATRLWSLAA LDPRDCQPTK
     QENKCDVGHR PGKSQESWTP GHCQRLRRGS GQRGLSGHHP QWGREGRCGE GSGADDTGTE
     EEQQGGGETT RPRSAPLQRG RIPETPAWPR GGRNPVTSDP SEGRTCHLRD QPRTGGNVPL
     ITEQEGRTAE HRRHNRDGPT EQGACGEERP QWELETLELK DKAVSEGQRC SQGRGWDSRP
     SLRGPRWQWG HPALRKDPGQ ELRAVNMSRV ALGRRQFPFD LLHLGAEMKG EKGDQGAAGQ
     KGERGEPGGG GFFSSSMPGP PGPPGYPGIP GLVADAARTF KLARPQPGDI GDSALCFAFH
     ARGPRERASR ASLALLDLRD PLASATRDTR GLLGPPDLQG PRDPHPFLAL TGRLSVFLAP
     QAHLGPQDRP DPWAPPQGGL LCAGDPIGLG FQQVRVWATY QTLLDQVPEV PEGWLVYVAD
     REELYVRVRN GFRKVLLEAR TPLPRGTDEV LSPSWEALFS GSEGQLKPGA RIFSFDGRDV
     LQHPAW
//