GenomeNet

Database: UniProt
Entry: R7U833_CAPTE
ID   R7U833_CAPTE            Unreviewed;       206 AA.
AC   R7U833;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 37.
DE   RecName: Full=Collagen IV NC1 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=CAPTEDRAFT_130099 {ECO:0000313|EMBL:ELT99280.1};
OS   Capitella teleta (Polychaete worm).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Annelida; Polychaeta;
OC   Sedentaria; Scolecida; Capitellidae; Capitella.
OX   NCBI_TaxID=283909 {ECO:0000313|EMBL:ELT99280.1};
RN   [1] {ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|Proteomes:UP000014760};
RA   Hellsten U., Grimwood J., Chapman J.A., Shapiro H., Aerts A., Otillar R.P.,
RA   Terry A.Y., Boore J.L., Simakov O., Marletaz F., Cho S.-J.,
RA   Edsinger-Gonzales E., Havlak P., Kuo D.-H., Larsson T., Lv J., Arendt D.,
RA   Savage R., Osoegawa K., de Jong P., Lindberg D.R., Seaver E.C.,
RA   Weisblat D.A., Putnam N.H., Grigoriev I.V., Rokhsar D.S.;
RL   Submitted (DEC-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:ELT99280.1, ECO:0000313|Proteomes:UP000014760}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=I ESC-2004 {ECO:0000313|EMBL:ELT99280.1,
RC   ECO:0000313|Proteomes:UP000014760};
RX   PubMed=23254933; DOI=10.1038/nature11696;
RA   Simakov O., Marletaz F., Cho S.J., Edsinger-Gonzales E., Havlak P.,
RA   Hellsten U., Kuo D.H., Larsson T., Lv J., Arendt D., Savage R.,
RA   Osoegawa K., de Jong P., Grimwood J., Chapman J.A., Shapiro H., Aerts A.,
RA   Otillar R.P., Terry A.Y., Boore J.L., Grigoriev I.V., Lindberg D.R.,
RA   Seaver E.C., Weisblat D.A., Putnam N.H., Rokhsar D.S.;
RT   "Insights into bilaterian evolution from three spiralian genomes.";
RL   Nature 493:526-531(2013).
RN   [3] {ECO:0000313|EnsemblMetazoa:CapteP130099}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (JUN-2015) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMQN01010088; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KB307016; ELT99280.1; -; Genomic_DNA.
DR   AlphaFoldDB; R7U833; -.
DR   STRING; 283909.R7U833; -.
DR   EnsemblMetazoa; CapteT130099; CapteP130099; CapteG130099.
DR   HOGENOM; CLU_001074_13_3_1; -.
DR   OMA; IRMYLIH; -.
DR   Proteomes; UP000014760; Unassembled WGS sequence.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; MACROPHAGE RECEPTOR MARCO; 1.
DR   Pfam; PF01391; Collagen; 2.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000014760}.
FT   REGION          1..206
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        33..83
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        91..105
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        171..185
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   206 AA;  19137 MW;  FCABC3B6A7812846 CRC64;
     MTGATGLDGE KGEKGSAGPS GQPGVAGPTG PQGPQGQRGS IGLSGQTGQS GITGPTGVDG
     QKGSQGPTGQ QGPQGQSGVQ GPKGRQGESG PSGPQGLQGP SGQSGQRGER GEAGPTGPSG
     PSGPQGPRGP TGTRGEPGSP GATGGPGSKG ETGRIGPSGP DGPSGPTGPS GPSGEAGSQG
     SRGETGQRGE IGVSGPTGKI QRRKKS
//
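
The SQ line above declares the sequence length (206 AA), and the residue lines that follow carry the sequence in blocks of ten. A minimal sketch of how that declared length could be checked against the residues actually present is given below. It assumes the usual flat-file layout (a two-letter line code, sequence lines indented by five spaces, `//` as the record terminator); `RECORD` and `parse_sequence` are hypothetical names introduced only for this illustration, and the embedded record is abbreviated to the ID, AC and SQ lines.

```python
import re

# Abbreviated copy of the entry above (ID, AC and SQ block only).
# RECORD and parse_sequence are illustrative names, not part of any library.
RECORD = """\
ID   R7U833_CAPTE            Unreviewed;       206 AA.
AC   R7U833;
SQ   SEQUENCE   206 AA;  19137 MW;  FCABC3B6A7812846 CRC64;
     MTGATGLDGE KGEKGSAGPS GQPGVAGPTG PQGPQGQRGS IGLSGQTGQS GITGPTGVDG
     QKGSQGPTGQ QGPQGQSGVQ GPKGRQGESG PSGPQGLQGP SGQSGQRGER GEAGPTGPSG
     PSGPQGPRGP TGTRGEPGSP GATGGPGSKG ETGRIGPSGP DGPSGPTGPS GPSGEAGSQG
     SRGETGQRGE IGVSGPTGKI QRRKKS
//
"""


def parse_sequence(record: str) -> tuple[int, str]:
    """Return (declared length from the SQ line, concatenated residues)."""
    declared = None
    chunks = []
    in_sequence = False
    for line in record.splitlines():
        if line.startswith("//"):
            # Record terminator: stop reading.
            break
        if line.startswith("SQ   "):
            # The SQ header has the form "SEQUENCE   <n> AA;  <mw> MW;  <crc> CRC64;".
            match = re.search(r"SEQUENCE\s+(\d+)\s+AA;", line)
            if match is None:
                raise ValueError("SQ line does not declare a length")
            declared = int(match.group(1))
            in_sequence = True
        elif in_sequence and line.startswith("     "):
            # Sequence lines: drop the spaces separating the ten-residue blocks.
            chunks.append(line.replace(" ", ""))
    if declared is None:
        raise ValueError("no SQ line found")
    return declared, "".join(chunks)


declared_length, sequence = parse_sequence(RECORD)
if declared_length != len(sequence):
    raise ValueError("declared length and residue count disagree")
print(declared_length, len(sequence))
```

For this entry the two values should agree at 206. The sketch does not verify the MW or CRC64 fields; a maintained parser such as Biopython's `Bio.SwissProt` module is likely a better choice for anything beyond a quick consistency check.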