ID A0A3Q3LHS7_9TELE Unreviewed; 3456 AA.
AC A0A3Q3LHS7;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Collagen, type XII, alpha 1b {ECO:0000313|Ensembl:ENSMAMP00000013718.2};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSMAMP00000013718.2};
OS Mastacembelus armatus (zig-zag eel).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Anabantaria; Synbranchiformes; Mastacembelidae; Mastacembelus.
OX NCBI_TaxID=205130 {ECO:0000313|Ensembl:ENSMAMP00000013718.2, ECO:0000313|Proteomes:UP000261640};
RN [1] {ECO:0000313|Ensembl:ENSMAMP00000013718.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSMAMT00000014095.2; ENSMAMP00000013718.2; ENSMAMG00000008858.2.
DR GeneTree; ENSGT00940000154923; -.
DR InParanoid; A0A3Q3LHS7; -.
DR Proteomes; UP000261640; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 23.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 24.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 1.
DR Pfam; PF00041; fn3; 22.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 24.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 16.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 23.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000261640};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 30..120
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 141..313
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 337..427
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 441..617
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 635..727
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 728..819
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 821..913
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 914..1001
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1002..1091
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1093..1183
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1204..1376
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1393..1476
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1477..1567
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1568..1658
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1659..1748
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1749..1837
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1839..1927
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1929..2018
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2019..2107
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2109..2198
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2199..2293
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2379..2468
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2469..2559
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2560..2650
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2651..2738
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2739..2829
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2857..3034
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 797..822
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1084..1111
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3299..3319
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3362..3441
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 797..815
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1086..1107
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3456 AA; 376717 MW; 023B796F220A42FC CRC64;
IWVRLCFVLS ASQKIQIFPL SRLLFFLFDP PTDLRFKILN ENTVQMLWSR PQSRIQGYRV
QVTSDTDEST KEFTLPASAT KTSISDLSPN VDYVVTISTY AGSEESLPIS GQITIESAGR
GPAKKPDSSE SVKCSAGTFA DVVFLVDGSW SVGRPNFKYV RNFISAAAGA FQIGEDKTRV
GVVQYSSDAR TEFNLNEHRT RPALLRAIGS LPYKGGNTMT GDALGYLLRN TFTETAGARK
DFPKLLVIIT DDKSEDPVET YAKQLRSRGV EIFVLGIQQA DEEEMKLMAS TPHRSHIYNV
ATFDLIKNVQ KDLITEMCAG VDDQRSSLVS GEEVVEAASN LQVLEVASKS MRVIWDASIG
DVSGYKVQLN PMMPGSKRQQ LYVGPTQTSV VVRGLSPDTE YQISLFAMSG GMLSEPIEVM
QKTEPVKVSV ECSLGVDVQA DVVLLVDGSY SIGLENFAKV KAFLEVLVNT FDIGPDKVQI
SLVQYSREAH TEFYLNTHHN LNTVVKALRS FPYRGGSTNT GKAMTFVREK IFQAARGARA
NVPRVTILIT DGKSSDAFQE PATKLRNADV EIFAVGVKDA VRNELEAIAN TPAETHVYTV
EDFDAFQRIS TELTQSICLR IEQELRNIFQ RQLVQPKDLS FSEIGPRSFR ASWDIDSTNV
ESYLLQFKPA DDVDGHYVSM SVPGDTLTAL LPQLTPLTRY EVNIYAQYDK GESLPLTGYE
TTITEQGPVR NLRVSEETTD SFRVSWQPAP GVVTRYLLTY EPVGDESSRL ETTTVGSETS
IVLRELQPQT TYRVTVTPEH RSGPGVPQQT DGTTKEGEEG SPRDLRVFDE TVSSMKVSWE
PAPGKVLQYR LAYRPSAGGP RREMSVKGDN TAALLKNLEP GTEYDISVSA RYSSGLGDAL
EGRGTTLEDL GPPKNLVTSD VTDTSFAVSW TAAPGNVQLY RVRWKSLFSE EAGNKVVPGD
VSGTVLEGLS PETLYQVSVV AAYGHKDSDP LTGQETTDAS SAGKKLTVSD ETERTMKVTW
TPAPGKVLHY RLKYVPSSGG KEVALKIPAT ATSTVMKRLQ PMTTYNIIVH PIYKRGEGKA
RQGVGTTLSP YKSPRNLQTS EPTRNSFRVS WDPAPGDVKG YKVTFSPAGS DIDLGELLVG
PYDNTVVLEE LRAETTYAVS VFGMFDGGES LPLAGQESTT LSDDPELPLP DPSDTQCKTT
AKADIVLLVD GSWSIGRMNF KTIRNFIGRM VGVFDIGPDR IQIGLAQYSG DPRTEWHLNT
YQNKEALLNA IANLPYKGGN TMTGMALNYI LQNNFKSNVG MRAGARKIGV LITDGKSQDE
VVMNSQNLRD SDIELYAIGV KNADENELRS IASDPDEIHM YNVNDFKFLL DIVDDLSENL
CNSVKGSGDE LEAPTNLVTS EVTHHSFRAT WTGPKEVTRL GELLVFPRLL MVDGTVNTVV
LENLNPLTEY LVSVYSVVGG ESSEPLKGTE TTLPLPPVER MNVYDEAVTT MRVSWEAVDG
ATGYMLLYKS INASEPQLEQ ELRVGEDVAN VQLVQLIPNT AYTITLYALH GEFASDPLED
KGVTLPVPPA GVLRITDVTH STMRLNWDAA PGAVRKYIIT YKPEDGDVKE AEVNGDITTL
LLTNLKSQTE YDVTVTPVYD EGPGARMIGN AITDVVPAPK NLRFSEVTQT SFRATWDHGA
PDVSLYRIGW TKKGKTDFQF AILGENENTH VLPNLDPDTE YTVTVTAIYP DESESEDLMG
SERTLISGPP RNLRVFNATT STLQVKWDPA VGPVQNYEVI YKPVAGGEPV STQVGGKKTS
TVLQKLEPDT QYSVTVAAVY PNGDRKDISG EGKTKPLGGV RNLQVLNPTM TTLNVRWDPA
EGRVKEYKVI YAPAAGGAES METVSAGTTT TMLRGLQPDT LYTVSLVPVY PEGDGKTMSE
NGKTRPLGGV KNLRVTDPTM TSLTVDWEPA DGAVRLYKVF FVPVGGEREE MEQVPASTTS
IVLRNLMPDT PYTVSVLPVY PTREGKRQSE NGKTLPLGGV GTMRVTNPTI TTLTVTWNPA
DGNVQGYKVI YVPTDGGLEI VEQVSESTTT TVLDKLLPDT RYSVTVVPVY AEGDGPSLSD
RGKTKPLGFA RNLQVTDPTT STLNVRWDPA EGNVREYIVI WVPSAGGEQE VDQVTGTTTS
TVLKNLDPNT DYTVTVVPVY HEMEGKPQSE QGRTNPMGGV KNLEVIDPTV STLTVRWDPA
VGNVRSYKVY YVAAPSGEEH MVEVSGDTTN TVLRNLNPDT AYNVAVVPIY PDVDGIRQSA
TGNTRPVSGV KNLQVTDPTT NSLRVLWEPA EGDVRQYQII YVPAAGGSES MTQVSGMSTS
TVLRDLQPDV EYKVTLVPIY ADMEGKRVSE NGKTKPLGGV KNLQVTDPTT SSLKVRWDPA
EGNVRQYRIF YVPASGGTED MDQVSGGTTN TVLRNLLSNT PYTVTVVPVY PEGEGLRQSD
RGKTLPRTPP RNIQVYNPTP NSLNVRWEPA SGQIQQYRVV YSPLTGNRPA ETVLVPGNTN
NAFLDNLIPD TPYSVSVSAL YADGTGPPVK DTGKTLPRAG PRNMRVFDAT TSTLTIGWDH
AEGPVRQYRI AYAPMTGDPI TEFTVVPGNR NNAMLQNLLP DTPYNITVEA VYAEGPGGTL
NGNGRTVGLL SPRNLRVSDE WYTRFRVAWD PVTAPVQGYR LIYTPEGETA VDFFVGDVTS
YTLHNLQPGT TYDVKVIAQY TGGMSGPLTG QGTTLYLNVS NIETYDVGHD KFCVKWIPHR
AATSYRIKLN PADPSSKGQH EITIPAGLPQ YCFDGLSPDA LYTATVFVQT PNLEGPGVSA
KERTLVKPTP APTLPPTPTP PPTIPPGWAV CKSAKADVVF VIDGSWSIGE ESFTKVVHFI
SSVIAAFDVI GPSGMQVSFV QYSDDAKTEF KLNTYKDKGV AMSALPYIHY RGGNTKTGVA
LKHTYEKAFS VDNGMRRNVP KVVVVITDGR SQDEVKKNAA KLQHAGYSVF AIGVADVDFV
ELQEIGSKPS ERHVFVVDDF DAFNTIKENL ITFICETASS SCPLIFLNGF TSPGFRMLEA
FNLTEKTYSY VKGVSMEPGS FNSYAAYRLH KDAFLTQPTA DVHPYGIPHA YTIILMFRLL
SDSPTEAFDI WQVSSKDHKP ETGVTIDPSS QTVSFYNKDE RGEIQRVTFD NNQVKKIFHG
SFHKLHILVS STGVKLNIDC QEVAEKEIKA AGNTSSDGYQ VLGKMSKSIG SKGQSATFQL
QMFDIVCSLA WTSRDRCCDL PSMRDELKCP SLPNACTCTS ERTGLPGPQG PVGPVGPRGD
IGPPGPMGLP GPQGPSGLSI PGEAVSGDFA PQNMMRSIAR QVCEQLVNAQ MTRVNTLLNQ
IPSGMYRNNN PGPVGPPGPP GSQGPRGEPG ATGRNGFPGS PGLPGQQGER GPAGEKGDRG
ESVVGQKGPR GPPGTLQNLV EPVCCDWNSP LHLVSN
//