ID A0A016WP63_9BILA Unreviewed; 407 AA.
AC A0A016WP63;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Nematode cuticle collagen N-terminal domain-containing protein {ECO:0000259|SMART:SM01088};
GN Name=Acey_s0570.g108 {ECO:0000313|EMBL:EYC41401.1};
GN ORFNames=Y032_0570g108 {ECO:0000313|EMBL:EYC41401.1};
OS Ancylostoma ceylanicum.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Strongyloidea; Ancylostomatidae;
OC Ancylostomatinae; Ancylostoma.
OX NCBI_TaxID=53326 {ECO:0000313|EMBL:EYC41401.1, ECO:0000313|Proteomes:UP000024635};
RN [1] {ECO:0000313|Proteomes:UP000024635}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=HY135 {ECO:0000313|Proteomes:UP000024635};
RX PubMed=25730766; DOI=10.1038/ng.3237;
RA Schwarz E.M., Hu Y., Antoshechkin I., Miller M.M., Sternberg P.W.,
RA Aroian R.V.;
RT "The genome and transcriptome of the zoonotic hookworm Ancylostoma
RT ceylanicum identify infection-specific gene families.";
RL Nat. Genet. 47:416-422(2015).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EYC41401.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JARK01000170; EYC41401.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A016WP63; -.
DR STRING; 53326.A0A016WP63; -.
DR Proteomes; UP000024635; Unassembled WGS sequence.
DR GO; GO:0042302; F:structural constituent of cuticle; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR InterPro; IPR002486; Col_cuticle_N.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24637:SF433; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24637; COLLAGEN; 1.
DR Pfam; PF01484; Col_cuticle_N; 1.
DR Pfam; PF01391; Collagen; 2.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM01088; Col_cuticle_N; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000024635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 6..58
FT /note="Nematode cuticle collagen N-terminal"
FT /evidence="ECO:0000259|SMART:SM01088"
FT REGION 121..147
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 164..365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..194
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..286
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 315..353
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 407 AA; 42637 MW; 8E3AA5E7F87B344F CRC64;
MLATQTVAQL SVASAAVLLT ISIITIFLTA QEINEVYNEA LRELDEWKHY SNEAWLDMKS
LLQRAPRSTN RAKTHSAKTV FRRNSYYVAS TASNPQPYYS QDAYVEGDMC NCGPQPNCPP
GPPGPPGPRG YDGEPGYPGE PGRRGADGIA LGLYKQEAPG CIRCPVGPPG RPGPTGYPGE
QGPPGPPGPP GATAYQGKPG PCGPPGDRGS DGQPGLPGPP GEPGRSFTVQ IGLPGPKGAP
GRPGPLGRPG PPGFCPPPGP PGPVGPMGPP GQPGPLGPPG APGYPGAPGE PGQDGEYCPC
PPKSGGVNRF EQPPPQNYQP APQAPYQPPP PQPAPQAPYQ QAPPPPPPPQ AQNYNFEQYP
NKYGVTSYGS NAGPIYTSPA EAQTEYRRRV IARMLRRRRL HAQKKQA
//