ID T1G769_HELRO Unreviewed; 209 AA.
AC T1G769;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN Name=20216916 {ECO:0000313|EnsemblMetazoa:HelroP88834};
GN ORFNames=HELRODRAFT_88834 {ECO:0000313|EMBL:ESN93324.1};
OS Helobdella robusta (Californian leech).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Clitellata;
OC Hirudinea; Rhynchobdellida; Glossiphoniidae; Helobdella.
OX NCBI_TaxID=6412 {ECO:0000313|EnsemblMetazoa:HelroP88834, ECO:0000313|Proteomes:UP000015101};
RN [1] {ECO:0000313|Proteomes:UP000015101}
RP NUCLEOTIDE SEQUENCE.
RA Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESN93324.1, ECO:0000313|Proteomes:UP000015101}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23254933; DOI=10.1038/nature11696;
RA Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT "Insights into bilaterian evolution from three spiralian genomes.";
RL Nature 493:526-531(2013).
RN [3] {ECO:0000313|EnsemblMetazoa:HelroP88834}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMQM01007456; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KB097620; ESN93324.1; -; Genomic_DNA.
DR RefSeq; XP_009028536.1; XM_009030288.1.
DR AlphaFoldDB; T1G769; -.
DR STRING; 6412.T1G769; -.
DR EnsemblMetazoa; HelroT88834; HelroP88834; HelroG88834.
DR GeneID; 20216916; -.
DR KEGG; hro:HELRODRAFT_88834; -.
DR CTD; 20216916; -.
DR eggNOG; KOG3544; Eukaryota.
DR HOGENOM; CLU_001074_13_3_1; -.
DR InParanoid; T1G769; -.
DR OMA; FRSTWGC; -.
DR Proteomes; UP000015101; Unassembled WGS sequence.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF880; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000015101}.
FT REGION 1..195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 69..97
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 209 AA; 19436 MW; 30ADA972EB8048E8 CRC64;
PGPAGQVGNP GQNGAQGPAG QPGLMGDTGF SGPQGPPGQP GQNGSPGYDG YPGQPGLTGQ
PGLPGNPGVQ GFTGSTGYRG PQGFSGNIGS PGSTGGTGYP GNPGLPGPQG TQGYIGLPGQ
TGPIGLQGNK GNAGAAGPQG TSGFPGPMGA TGAQGSPGLS GNPGLPGLPG PPANTGATGL
TGPIGSPGNI GPIGRNGEKG CIEVLYYHQ
//