GenomeNet

Database: UniProt
Entry: T1FSS5_HELRO
LinkDB: T1FSS5_HELRO
Original site: T1FSS5_HELRO 
ID   T1FSS5_HELRO            Unreviewed;       280 AA.
AC   T1FSS5;
DT   16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT   16-OCT-2013, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:ESO06926.1, ECO:0000313|EnsemblMetazoa:HelroP191247};
GN   Name=20211872 {ECO:0000313|EnsemblMetazoa:HelroP191247};
GN   ORFNames=HELRODRAFT_191247 {ECO:0000313|EMBL:ESO06926.1};
OS   Helobdella robusta (Californian leech).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Clitellata;
OC   Hirudinea; Rhynchobdellida; Glossiphoniidae; Helobdella.
OX   NCBI_TaxID=6412 {ECO:0000313|EnsemblMetazoa:HelroP191247, ECO:0000313|Proteomes:UP000015101};
RN   [1] {ECO:0000313|Proteomes:UP000015101}
RP   NUCLEOTIDE SEQUENCE.
RA   Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA   Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA   Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA   Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA   Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL   Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ESO06926.1, ECO:0000313|Proteomes:UP000015101}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=23254933; DOI=10.1038/nature11696;
RA   Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA   Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA   Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA   Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA   Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT   "Insights into bilaterian evolution from three spiralian genomes.";
RL   Nature 493:526-531(2013).
RN   [3] {ECO:0000313|EnsemblMetazoa:HelroP191247}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMQM01003721; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KB096275; ESO06926.1; -; Genomic_DNA.
DR   RefSeq; XP_009015022.1; XM_009016774.1.
DR   AlphaFoldDB; T1FSS5; -.
DR   STRING; 6412.T1FSS5; -.
DR   EnsemblMetazoa; HelroT191247; HelroP191247; HelroG191247.
DR   GeneID; 20211872; -.
DR   KEGG; hro:HELRODRAFT_191247; -.
DR   CTD; 20211872; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   HOGENOM; CLU_994906_0_0_1; -.
DR   InParanoid; T1FSS5; -.
DR   OMA; SHDANAC; -.
DR   Proteomes; UP000015101; Unassembled WGS sequence.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF966; COLLAGEN ALPHA-1(XXII) CHAIN-LIKE; 1.
DR   Pfam; PF01391; Collagen; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000015101}.
FT   REGION          35..111
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          134..264
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        78..97
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        195..209
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        215..229
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   280 AA;  28309 MW;  E789A93A0CA953E8 CRC64;
     MVKDCSQMAE VLGTNNFEVE KSIIFNNFQN SQMFRDSYPE YQSGPPGFPG TQGRTGSPGP
     PGPRGPSGDS GIPGTSGSPG SPGPPGNPGP PGPQGPRGQA GMPGVPGQAA RMHTVAEVKE
     ICAYVLQERM HELSLNMKGP PGPPGKGVPG RRGPPGRQGI PGDQGVPGLP GDKGLIGPVG
     PQGPPGIPGS KGDKGDQGPE GECSSEYDLK EVEAMPGPPG PPGQPGIGLP GNPGPRGEPG
     HPGIQGPPGN VGPRGPPAQC PSNCNYDQYF QLLQQAQESS
//
DBGET integrated database retrieval system