GenomeNet

Database: UniProt
Entry: T1G769_HELRO
ID   T1G769_HELRO            Unreviewed;       209 AA.
AC   T1G769;
DT   16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT   16-OCT-2013, sequence version 1.
DT   27-MAR-2024, entry version 50.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   Name=20216916 {ECO:0000313|EnsemblMetazoa:HelroP88834};
GN   ORFNames=HELRODRAFT_88834 {ECO:0000313|EMBL:ESN93324.1};
OS   Helobdella robusta (Californian leech).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Clitellata;
OC   Hirudinea; Rhynchobdellida; Glossiphoniidae; Helobdella.
OX   NCBI_TaxID=6412 {ECO:0000313|EnsemblMetazoa:HelroP88834, ECO:0000313|Proteomes:UP000015101};
RN   [1] {ECO:0000313|Proteomes:UP000015101}
RP   NUCLEOTIDE SEQUENCE.
RA   Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA   Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA   Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA   Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA   Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL   Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ESN93324.1, ECO:0000313|Proteomes:UP000015101}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=23254933; DOI=10.1038/nature11696;
RA   Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA   Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA   Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA   Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA   Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT   "Insights into bilaterian evolution from three spiralian genomes.";
RL   Nature 493:526-531(2013).
RN   [3] {ECO:0000313|EnsemblMetazoa:HelroP88834}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMQM01007456; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KB097620; ESN93324.1; -; Genomic_DNA.
DR   RefSeq; XP_009028536.1; XM_009030288.1.
DR   AlphaFoldDB; T1G769; -.
DR   STRING; 6412.T1G769; -.
DR   EnsemblMetazoa; HelroT88834; HelroP88834; HelroG88834.
DR   GeneID; 20216916; -.
DR   KEGG; hro:HELRODRAFT_88834; -.
DR   CTD; 20216916; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   HOGENOM; CLU_001074_13_3_1; -.
DR   InParanoid; T1G769; -.
DR   OMA; FRSTWGC; -.
DR   Proteomes; UP000015101; Unassembled WGS sequence.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF880; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000015101}.
FT   REGION          1..195
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        69..97
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   209 AA;  19436 MW;  30ADA972EB8048E8 CRC64;
     PGPAGQVGNP GQNGAQGPAG QPGLMGDTGF SGPQGPPGQP GQNGSPGYDG YPGQPGLTGQ
     PGLPGNPGVQ GFTGSTGYRG PQGFSGNIGS PGSTGGTGYP GNPGLPGPQG TQGYIGLPGQ
     TGPIGLQGNK GNAGAAGPQG TSGFPGPMGA TGAQGSPGLS GNPGLPGLPG PPANTGATGL
     TGPIGSPGNI GPIGRNGEKG CIEVLYYHQ
//