ID W2SMC9_NECAM Unreviewed; 299 AA.
AC W2SMC9;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Collagen triple helix repeat protein {ECO:0000313|EMBL:ETN70775.1};
GN ORFNames=NECAME_14535 {ECO:0000313|EMBL:ETN70775.1};
OS Necator americanus (Human hookworm).
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae; Bunostominae;
OC Necator.
OX NCBI_TaxID=51031 {ECO:0000313|EMBL:ETN70775.1, ECO:0000313|Proteomes:UP000053676};
RN [1] {ECO:0000313|Proteomes:UP000053676}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24441737; DOI=10.1038/ng.2875;
RA Tang Y.T., Gao X., Rosa B.A., Abubucker S., Hallsworth-Pepin K., Martin J.,
RA Tyagi R., Heizer E., Zhang X., Bhonagiri-Palsikar V., Minx P., Warren W.C.,
RA Wang Q., Zhan B., Hotez P.J., Sternberg P.W., Dougall A., Gaze S.T.,
RA Mulvenna J., Sotillo J., Ranganathan S., Rabelo E.M., Wilson R.K.,
RA Felgner P.L., Bethony J., Hawdon J.M., Gasser R.B., Loukas A., Mitreva M.;
RT "Genome of the human hookworm Necator americanus.";
RL Nat. Genet. 46:261-269(2014).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI668895; ETN70775.1; -; Genomic_DNA.
DR RefSeq; XP_013293002.1; XM_013437548.1.
DR AlphaFoldDB; W2SMC9; -.
DR STRING; 51031.W2SMC9; -.
DR EnsemblMetazoa; NECAME_14535; NECAME_14535; NECAME_14535.
DR GeneID; 25354562; -.
DR KEGG; nai:NECAME_14535; -.
DR CTD; 25354562; -.
DR Proteomes; UP000053676; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR PANTHER; PTHR24637:SF422; GENE, 37797-RELATED; 1.
DR Pfam; PF01391; Collagen; 2.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:ETN70775.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000053676};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REGION 72..100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 114..229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 72..87
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..207
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 299 AA; 30783 MW; 985D35B7E4D0FF6F CRC64;
MAIFLSTCLF RETTEFQSQI TEDLRKFKIH ANDLWKEILR TRIYISITTD SQQSREKREE
AAGCSCSEIC PPGPPGPAGP EGIPGFPGIP GPDGSHGLSG DQLNMETKEC IKCPQGAPGI
QGTPGSAGLP GNPGPRGLDG SIGIAGPPGI PGTPGVPGSN GIRRISIPGP PGPPGPQGSE
GPLGEDAAIV PPPPIGPPGP PGTPGSPGNP GRDGIAGPQG SAGAPGSDAA YCACPPRTRQ
DAYVQPEAPK MQENTYTYGE RKGGNPMEPP STTYVAKPPV NFDRYKLGNS LYTHIPFTQ
//