GenomeNet

Database: UniProt
Entry: A0A087XCB9_POEFO
LinkDB: A0A087XCB9_POEFO
Original site: A0A087XCB9_POEFO 
ID   A0A087XCB9_POEFO        Unreviewed;       661 AA.
AC   A0A087XCB9;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   27-MAR-2024, entry version 40.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSPFOP00000003422.1};
OS   Poecilia formosa (Amazon molly) (Limia formosa).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC   Poecilia.
OX   NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000003422.1, ECO:0000313|Proteomes:UP000028760};
RN   [1] {ECO:0000313|Proteomes:UP000028760}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA   Schartl M., Warren W.;
RL   Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSPFOP00000003422.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AYCK01014962; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A087XCB9; -.
DR   STRING; 48698.ENSPFOP00000003422; -.
DR   Ensembl; ENSPFOT00000003428.2; ENSPFOP00000003422.1; ENSPFOG00000003278.2.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000154865; -.
DR   OMA; ETMQVPF; -.
DR   Proteomes; UP000028760; Unassembled WGS sequence.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..661
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5001832666"
FT   REGION          116..482
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          515..624
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        408..428
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   661 AA;  65555 MW;  FB9BD3A42C024BCA CRC64;
     MGASRDKVSL LSGLLLLVLC VASSLCQDEY SGYHSGEKHT YESSAVNHYN NKGSRRPGRT
     PPTVKYYDEE QQLVEEDYDP PRVTMVTEDI FPFATTTESH YAKEDTETMQ QVPFKYPAGP
     ETEESGGDAA LGEEGPTECN CEPGEPGFAG FAGQKGSRGL QGQTGLPGIQ GREGYKGAKG
     TRGRGGDTGP VGDAGPEGEE GASGFSGGMG EPGLQGDPGE RGEPGLKGDV GMEGPRGGGG
     AAGETGPRGD PGPPGSTGGK GVKGSRGDLG KQGSQGKDGN KGQTGAPGFP GDVGERGNAG
     YPGQTGPFGP NGPKGSKGSG GFPGSNGDPG EDGPIGVPGV IGEPGLFGVK GSKGDRGVRG
     PRGRLGRLGA QGEPGDTGIP GKHGPKGLQG PTGAKGETGP DGQKGAGGRK GLKGEKGQKG
     EKGVHGDRGQ RGPVGTGGRN GVPGPVGPPG IPGSRGDRGP IGDPGGKGRT GPKGAPGPAG
     PGLTDEQVLQ LCRGVVTAQL AQYAASIRAK CTQGCPINNR TLIGPPGARG PAGPPGRAGK
     AGKAGEKGAR GVQGQRGLEG QKGAEGQRGP KGAKGTTGDP GKGLQGPDGP QGPRGEQGHQ
     AEPKDGTEGP RGPRGFPGSV GPPGLVGFPG VPGFCEMRDC GIYAPVMRKE QGLVKGPASS
     S
//
DBGET integrated database retrieval system