GenomeNet

Database: UniProt
Entry: A0A0B2VRT8_TOXCA
LinkDB: A0A0B2VRT8_TOXCA
Original site: A0A0B2VRT8_TOXCA 
ID   A0A0B2VRT8_TOXCA        Unreviewed;      1149 AA.
AC   A0A0B2VRT8;
DT   04-MAR-2015, integrated into UniProtKB/TrEMBL.
DT   04-MAR-2015, sequence version 1.
DT   28-JAN-2026, entry version 46.
DE   SubName: Full=Collagen alpha-1(XVIII) chain {ECO:0000313|EMBL:KHN84383.1};
GN   Name=COL18A1 {ECO:0000313|EMBL:KHN84383.1};
GN   ORFNames=Tcan_12230 {ECO:0000313|EMBL:KHN84383.1};
OS   Toxocara canis (Canine roundworm).
OC   Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC   Spirurina; Ascaridomorpha; Ascaridoidea; Toxocaridae; Toxocara.
OX   NCBI_TaxID=6265 {ECO:0000313|EMBL:KHN84383.1, ECO:0000313|Proteomes:UP000031036};
RN   [1] {ECO:0000313|EMBL:KHN84383.1, ECO:0000313|Proteomes:UP000031036}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PN_DK_2014 {ECO:0000313|EMBL:KHN84383.1};
RA   Zhu X.-Q., Korhonen P.K., Cai H., Young N.D., Nejsum P.,
RA   von Samson-Himmelstjerna G., Boag P.R., Tan P., Li Q., Min J., Yang Y.,
RA   Wang X., Fang X., Hall R.S., Hofmann A., Sternberg P.W., Jex A.R.,
RA   Gasser R.B.;
RT   "Genetic blueprint of the zoonotic pathogen Toxocara canis.";
RL   Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KHN84383.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JPKZ01000987; KHN84383.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A0B2VRT8; -.
DR   STRING; 6265.A0A0B2VRT8; -.
DR   OMA; AREANFR; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000031036; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050938; Collagen_Structural_Proteins.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR37456:SF6; COLLAGEN ALPHA-1(XXIII) CHAIN-LIKE ISOFORM X2; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KHN84383.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000031036};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..1149
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5002077538"
FT   DOMAIN          623..668
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          966..1131
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          151..172
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          277..327
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          353..493
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          549..618
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          716..784
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          814..955
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        285..299
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        367..391
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        417..426
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        427..439
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        460..472
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        574..583
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        588..600
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        601..610
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        735..777
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        814..831
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        832..859
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        872..912
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        924..947
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1149 AA;  127328 MW;  4D926BBA0BD9A84C CRC64;
     MPPQRALLLA FVALAFVLVR NASAAVANPS SKQNDKSNAQ VQRDVEDLIV NEGFLLTTVK
     QPQNQPKNVP TVQPKRPPFH GPVAFKPHIA SLNRTRLSLR RPKRTHRPKV HKTVRPHATT
     PIRNLSRPSV LKSIKPLVKV APPVRVPTRL EQRLRRKANN EEEGSDSREP VPLDPIFVEQ
     NGGNAQITVV APGGEGPTIA SSENQPTRDA PFQPHIVEEE ENVFPIAFEE VQVVEPEGGD
     FSAEYDQEGE LDDGVATQTF LPEDHLGAFQ EMRISDDPND AAVSCDDENE GSGMEEEAEA
     ISKVVEPEPA TPSGERPSYP QDGRKLQLIV SDHSDSLILP FYHLDTKAYK QHATSVSTNH
     LPEYNEPDAK PGRGEKGEKG EKGDKGDKGD QGEPGPPGPV GPPGVCMAEC RDGRDGMPGP
     RGEQGPMGPP GPPGPPGPPG QVIQQGMEDS GYPIQAIAGP PGPQGPPGPQ GPMGPRGDPG
     VGIPGPPGPP GRYMGLTQED IDRIVSDPRL RAQQSECCTP ARQFPGTETG TDELPAYNPA
     VHRFTMKGEK GERGEHGPMG APGPMGPQGP MGPMGPMGIQ GPVGPVGPAG PPGPPGPPGP
     AGSAQPGSPS CPQTPPGATP GCVQVFQTSI ELFSSSGGIP LGSLSFSISS QQLFIRVNGG
     WKEVRLEGFH PTLEQRPSLE INLDSQNENY IPYWLSDEAA VSVHAPCVRR SVPVEVSIQP
     HPPQPQERPQ YVPESRPEYR PEHRPEYRPE YPPHRRPDYE VLRRPHPGPE HRPEPRPHYR
     PGQRPEITIE RHPYHHHERP SESLPKQLQP HEWPEVLPEH HEHRPEPEEG RSPGYRPEQY
     SGAQRQPEPI QQQSSQIPIR RPEIQPEQQV PHPEERAEQV AQEEHPEPQL EHRPELVSER
     QRQIEGELRP IEHPAPTIYP PYPERPRYPE PERQPHYRER PTPEPHRPRY PPTRRPYTLG
     EKDHVLHLIA LNTPMTGNMR GVRGADLACY QQARQAGFRT TFRAFLSSHV QDLNKVVHFG
     DRETPVVNLR GERLFNSWSD IFRERPMFDA PLYSFNRRNV FDDNTWREKR VWHGSDASGT
     RSESGYCNAW RSSDPSQVGS SSSIGPRLPL LDGCQNVDCS HEFVVLCVEN MSKYSVDKRL
     GKKRMHYEE
//
DBGET integrated database retrieval system