ID A0A158REQ2_HYDTA Unreviewed; 1758 AA.
AC A0A158REQ2;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Collagen IV NC1 domain-containing protein {ECO:0000313|WBParaSite:TTAC_0000780801-mRNA-1};
GN ORFNames=TTAC_LOCUS7793 {ECO:0000313|EMBL:VDM32251.1};
OS Hydatigena taeniaeformis (Feline tapeworm) (Taenia taeniaeformis).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Hydatigera.
OX NCBI_TaxID=6205 {ECO:0000313|Proteomes:UP000046396, ECO:0000313|WBParaSite:TTAC_0000780801-mRNA-1};
RN [1] {ECO:0000313|WBParaSite:TTAC_0000780801-mRNA-1}
RP IDENTIFICATION.
RG WormBaseParasite;
RL Submitted (APR-2016) to UniProtKB.
RN [2] {ECO:0000313|EMBL:VDM32251.1, ECO:0000313|Proteomes:UP000274429}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Pathogen Informatics;
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; UYWX01020405; VDM32251.1; -; Genomic_DNA.
DR STRING; 6205.A0A158REQ2; -.
DR WBParaSite; TTAC_0000780801-mRNA-1; TTAC_0000780801-mRNA-1; TTAC_0000780801.
DR Proteomes; UP000046396; Unplaced.
DR Proteomes; UP000274429; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF533; COLLAGEN ALPHA-6(IV) CHAIN; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 11.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000274429};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..1758
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035112158"
FT DOMAIN 1527..1753
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 79..394
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 438..581
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 634..837
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 865..910
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 926..1520
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 172..186
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 221..238
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 465..479
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 991..1005
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1065..1082
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1300..1314
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1758 AA; 176993 MW; 04E7ACA2FC5587A4 CRC64;
MPIVDKLHST CLGTVAIALF FSLIAPALAQ IPWQTDPARC QTLCTPDQCR CVGIQGPRGP
PGEVGPMGAW GPPGDVGLPG LPGEQGDRGY PGAPGLQGQK GEPGVQGPPG YAGVHGSQGH
PGVRGIKGER GATGCEGPRG PDGSPGISGP PGIQGPPGER GPRGPKGNPG FMNTGPPGPP
GQPGDVGAVG PPGYKGAIGP RGPPGRDAPI ITIEGQKGDQ GRRGMPGRDG RPCEILELRS
GPRGRSGYQG PRGPKGDRGE RGTEGVPGRD GYSGLAGEKG LPGREGDRGQ VGDIGDPGRV
GEPGDRGFDG DTGRIGPVGV PGERGPPGVD GLPGFRGPKG RPGSLRDCVG PRGRKGEIGP
PGTKGFRGYG GRQGPTGYKG PKGSPGPRGR PGLPGRMCET GEMGLAGEKG DTGACLNYQI
TGLHVLCTYS NGMPGIRGED GDPGLMGPQG PRGPDGLTPM YGPVGEPGIP GPPGRVGPKG
DPGPRGDRGD KGYRGTPTVG MPGPRGPPGD IGEAGRSGAP GSMGPPGPRA PPCDACAAGE
PGGRGETGED GETGQPGQPG QRGQDGDRGI PGSRGPPGLP GLDGYRVSFK YDLFDFQYCD
LSLCHEGQQG VAGIPGPKGE PGETGFVTIT NREVIPGSQG DSGRDGLPGA MGDRGEMGPP
GLPGPAGPRG PKGQPGVDGM AGYLGSPGDK GEPGEPGEDG YGVKGAKGEP GDMGISGPPG
RRGPRGIQGL AGQRGSRGDR GPRGPPGLPG DVGIPGPSGQ DGREGSFGPS GVPGQPGPVG
DAGYPGRGGE KGERGEPGDS GIRVSGPPGE KGRPGLSGLP GDRGPPGRTG SMGLRGRPGK
RLFSLPLHFS SLFNPLVLPN SYFSSGHSGS SGIPGTSGRM GPPGEKGPKG LPSTRMSSGA
PGPPGVQGPR VEPWHLIRFV LFLGPVGSKG QRGPQGRCYG RGPSGDMGSP GDRGYPGMQG
SRGPEGAIGL PGFQGRSGRA GEKGERGPRG PTPEPGRPGP AGPSGRPGYP GRSGRPGEVG
IPGFPGQKGS IGLPGISGPK GRRGRPGISR GGRPGATGRP GLEGPKGEKG IRGIDGKDGK
PGEPGRPGVG SSGPPGPKGY RGPVGDPGSP GMDPVPGPVG DVGYEGRVGP KGSRGLTGPV
GFTGREGLPG KKGERGEPGD DGLPGLPGDQ GDAGYPGPSP RFMKGRKGYE GRSGLKGYPG
PAGPYGETGQ RGTLASVSRF IANKPCKRRT GTAGPSGRPG ARGPDGYPGL PGDKGAPGYP
GRPGANGTMG PPGDSGLPGF QGQAGRQGPS GLKGAKGRPG FCPVPPPGPK GEPGPRGRPG
RDGSQGERGV EGLPGLKGSP GQPRMGPEGR KGERGFDGQP GNMATCPSLK GSPGDHGDRG
IPGYRANRGE PGDMGPPGLR GERGPMGLPG NNGPPGYRGA PGKDGVKGEP GLPGISGRPG
PDGPIGDHGE PGIRGIDGPM GTSGRMGPPG EKGYKGSAGY ISGPRGPSGE RGPPGDMGYQ
GTKGQRGPQG YRGPDGPNGT LGLGDGGYLF AVHSQKDTPP SCPEFTHELY TGYSLVTVQG
DDDSITMDLG SPGSCARVFS IMPFASCFSS PPSGACQFNM RNGRSFWLST LEHYMTEPLP
VESIQRYISR CVVCESRTNL VAFHSQRPYL EESCPPGWEH AWNGFSMPMV VSAAVSGGVQ
PLHSPGSCLM NFRALPFIEC NNHHGNCFHW QDGRSYWLRA MTFNEQFTKP VGATYKTDLL
RQVSRCSVCR KSMEVLVK
//