GenomeNet

Database: UniProt
Entry: W2T6H9_NECAM
LinkDB: W2T6H9_NECAM
Original site: W2T6H9_NECAM 
ID   W2T6H9_NECAM            Unreviewed;       560 AA.
AC   W2T6H9;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   27-MAR-2024, entry version 30.
DE   SubName: Full=Nematode cuticle collagen domain protein {ECO:0000313|EMBL:ETN77493.1};
GN   ORFNames=NECAME_03157 {ECO:0000313|EMBL:ETN77493.1};
OS   Necator americanus (Human hookworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae; Bunostominae;
OC   Necator.
OX   NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN77493.1, ECO:0000313|Proteomes:UP000053676};
RN   [1] {ECO:0000313|Proteomes:UP000053676}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24441737; DOI=10.1038/ng.2875;
RA   Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA   Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA   Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA   Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA   Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT   "Genome of the human hookworm Necator americanus.";
RL   Nat. Genet. 46:261-269(2014).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KI660167; ETN77493.1; -; Genomic_DNA.
DR   RefSeq; XP_013299720.1; XM_013444266.1.
DR   AlphaFoldDB; W2T6H9; -.
DR   STRING; 51031.W2T6H9; -.
DR   EnsemblMetazoa; NECAME_03157; NECAME_03157; NECAME_03157.
DR   GeneID; 25343195; -.
DR   KEGG; nai:NECAME_03157; -.
DR   CTD; 25343195; -.
DR   OMA; GAHQHED; -.
DR   OrthoDB; 2883557at2759; -.
DR   Proteomes; UP000053676; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0042302; F:structural constituent of cuticle; IEA:InterPro.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   InterPro; IPR002486; Col_cuticle_N.
DR   PANTHER; PTHR24637; COLLAGEN; 1.
DR   PANTHER; PTHR24637:SF422; GENE, 37797-RELATED; 1.
DR   Pfam; PF01484; Col_cuticle_N; 1.
DR   SMART; SM01088; Col_cuticle_N; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000313|EMBL:ETN77493.1}; Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053676};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        6..29
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          6..57
FT                   /note="Nematode cuticle collagen N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM01088"
FT   REGION          80..560
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        336..355
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        397..491
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        503..536
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   560 AA;  51981 MW;  628F509484EE38E6 CRC64;
     MGTTTLVGGV ATMSAVAVVI SLVSVTYIVN DINTFYEDAL LELDNFKDIA NSAWHKMRTA
     SEVAREKRAV LIRRRNAGGS ACNCGHQASS CPAGPPGPPG EAGQPGDDGE AGALGQPGRD
     GSSEGGNGSN GSCIQCPAGP PGPPGPDGDA GPAGPDGNPG APGAASGPGP AGPQGPPGDA
     GQPGAPGEAG APGEVGAPGT SGKGLPGPAG PAGSIGAPGQ PGADGDAAGA GNPGPPGPSG
     PAGSPGQSGT DGAPGNPGAD GTPGSDGEYC PCPARSSAPM DAPSSANYEN AALADNPEPA
     EIGKDDNDSN SGSDSISGSD SSSDEGEDEK SRKALHRVAA TRRLAAKKVV KKVARNAAKK
     PALPAGGASI GAPARGGHGQ YSASLPGAAS GPGSPGVPSA PACPGLPAGP AGPGGPGLAP
     PPAPPSCPGC PAGPAGPAGP AGPGRPFPLV PGAPGCPGGP GGPAGPGPDP PPGAPGFPSG
     PAGPASPSGP GGPGGPTGHV MQLPFEPSPP PELPSRPGCP GAPGSPSSPG SPASPGGPGG
     PAGQLLACWP QLHPDPPPVA
//
DBGET integrated database retrieval system