ID A0A0V1MPG8_9BILA Unreviewed; 1293 AA.
AC A0A0V1MPG8;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:KRZ73675.1};
GN Name=Col15a1 {ECO:0000313|EMBL:KRZ73675.1};
GN ORFNames=T10_13275 {ECO:0000313|EMBL:KRZ73675.1};
OS Trichinella papuae.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=268474 {ECO:0000313|EMBL:KRZ73675.1, ECO:0000313|Proteomes:UP000054843};
RN [1] {ECO:0000313|EMBL:KRZ73675.1, ECO:0000313|Proteomes:UP000054843}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS1980 {ECO:0000313|EMBL:KRZ73675.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRZ73675.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDO01000060; KRZ73675.1; -; Genomic_DNA.
DR Proteomes; UP000054843; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd06263; MAM; 1.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR000998; MAM_dom.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1034; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF00629; MAM; 1.
DR SMART; SM00409; IG; 1.
DR SMART; SM00137; MAM; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50060; MAM_2; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:KRZ73675.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000054843};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 52..127
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 144..316
FT /note="MAM"
FT /evidence="ECO:0000259|PROSITE:PS50060"
FT REGION 546..660
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 672..735
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 748..800
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 827..857
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 943..965
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 755..791
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 834..852
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1293 AA; 143262 MW; CA2934C0B894E7DE CRC64;
MKFRFNTTKV IVMCKMCLDA KKEAEMEIRR SVKVVIENEK DVLKVNEIFV ASCVIETNSQ
RSQRRWYFPN ETVVPFDTGI ERAYVSFWKN HTILRLVIEN VQPEDSGTYI CGKKHRNKLY
GQSVHLTVKP LYVKQPSDNG FKPIVCKFEK LNQSEFCQWK VEAESTNSWR LAYIDPYAPN
LSVTDGHESS HQQVMLFESD STMARTIGRL ISPVLPSDFS FHYCFNFAYR IRPESNGKIF
IYALPQPEQF DNQVLLMHFK LSLAHGQEHH QWQNCAIKLL PIKHNFRIAI EVIKSAGEKI
FIAIDSLQLM PGICKSACNQ PSPAANHRQL QSIHQPLQYS SVKDDEVEID LLKAVKAPLD
PNIYHAKGKQ GLPGFGFHQG ANVVAPYRFY MPRRFFRDFA ILITVRPDDD RGGYLFAVVN
PFDTVVELGV LLEPLTDRKA NLSLVYSDHK RDTECRAIAS FQIPTITGKW TELALKIEGT
EVSLYYNCQH YETQTVQRRQ KQLNFDDASK LYIAQAGPII NKPFVGALQE LKIFANPAEA
ETQCDDIAFD GNGSGDNELF TDVGLSGESP DPPSLTDPPP HYPDLDNSKE PLLSEIQGPP
GPKGDPGVPG LSIKGEKGDP GPPGVCSCPA VGEVEKSIQT VGPVGPKGEP GEKGEPCKSD
VDYEAIIKKY ALKGDRGYPG PPGPPGSPGQ KGDTGDAGPV GSIGPPGFPG PPGKPGATEV
VHSAPKEVAR SRYGHRRHEY DQYELRMEQL QGPPGPPGHR GPEGPQGPPG YPGLPGNPGP
PGPPGPPGPA GLPVSVGSRG TSYTNGFSNI IPGAERLRIF QGEEKQRQNY GYPGYPGPPG
PPGPPGPPGP PGSSAYANAA VKGAQGEQLR NAHGANYITP GTLVYSTNLD TFYASEHIQI
GSLAFSLSSE ELFIRVKGGW RQVKLEGFQS SMEEKMSLLS DATARKTETV RPPVDASSKH
TSSEPMAAND LNKKEVLHLI ALNDPFTGNM HGVRGADFAC YHQARAAGFT TTFRAFVSSQ
VQDLDKIVHH SDRGTPVVNL RGEVLFNSWD DMFRDGGAFF SLNTPIYSFD RKDVFSHHGW
PEKYVWHGSD TKGTRRSGKF CDAWRSNSPK RTGMASSLYA RSLLGQKDFP CNSTLVVLCI
ENMSKGNVDR RLAKKRFGPT LRDQIYLMSV TALMPALYKI ASTNAQLNQP SERFLKKTDL
KKKYSLLLLL KKLCNTSQQQ AAQALNNKIK RKNYSYILHG TNLSFVKKEK KYKFIIGHTM
YNNGNNNKWK ARNRINDTYI HPPRESSHLK TTR
//