GenomeNet

Database: UniProt
Entry: W2TJP4_NECAM
LinkDB: W2TJP4_NECAM
Original site: W2TJP4_NECAM 
ID   W2TJP4_NECAM            Unreviewed;       353 AA.
AC   W2TJP4;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 39.
DE   SubName: Full=Nematode cuticle collagen domain protein {ECO:0000313|EMBL:ETN81232.1};
GN   ORFNames=NECAME_08672 {ECO:0000313|EMBL:ETN81232.1};
OS   Necator americanus (Human hookworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae; Bunostominae;
OC   Necator.
OX   NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN81232.1, ECO:0000313|Proteomes:UP000053676};
RN   [1] {ECO:0000313|Proteomes:UP000053676}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24441737; DOI=10.1038/ng.2875;
RA   Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA   Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA   Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA   Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA   Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT   "Genome of the human hookworm Necator americanus.";
RL   Nat. Genet. 46:261-269(2014).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KI658811; ETN81232.1; -; Genomic_DNA.
DR   RefSeq; XP_013303459.1; XM_013448005.1.
DR   AlphaFoldDB; W2TJP4; -.
DR   STRING; 51031.W2TJP4; -.
DR   EnsemblMetazoa; NECAME_08672; NECAME_08672; NECAME_08672.
DR   GeneID; 25348701; -.
DR   KEGG; nai:NECAME_08672; -.
DR   CTD; 25348701; -.
DR   OMA; CPGREGD; -.
DR   OrthoDB; 2882577at2759; -.
DR   Proteomes; UP000053676; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0042302; F:structural constituent of cuticle; IEA:InterPro.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   InterPro; IPR002486; Col_cuticle_N.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24637; COLLAGEN; 1.
DR   PANTHER; PTHR24637:SF276; CUTICLE COLLAGEN LON-3; 1.
DR   Pfam; PF01484; Col_cuticle_N; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   SMART; SM01088; Col_cuticle_N; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:ETN81232.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053676};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..353
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004825536"
FT   DOMAIN          7..57
FT                   /note="Nematode cuticle collagen N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM01088"
FT   REGION          76..127
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          146..353
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        85..126
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        243..259
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        322..336
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   353 AA;  35229 MW;  A608B4D900CB3C71 CRC64;
     MSASGATSGA LLFSGATLLV SLVAAAAIYS QVNSIWSELD AEMNNFKVLT DDLWKDMIGL
     GAGTPSNRLR RQSYGGYAAS GAQPPAPTPL SSSVNAYGGG APQNSDYAGS SNTPLSNPSS
     NFGFPTGPGS FVPGGNARCV CTMESSCPPG APGATGEPGP DGLDGLDGIP GFDGLDAEDI
     SNEAPQGCFT CPQGLPGPQG PSGPPGIRGM RGAKGQSGRP GKDGNPGMPG EMGPPGPPGE
     DGHPGKPGDK GDDAEKPVGR PGLRGPPGDQ GPEGPEGTPG RDAYPGPQGP IGEPGVPGYQ
     GAAGPDGEEG PPGPQGDVGK DAEYCKCPDR EAHRPLQSPP HGSGYGRKKY RKH
//
DBGET integrated database retrieval system