ID A0A3B4DSL7_PYGNA Unreviewed; 3615 AA.
AC A0A3B4DSL7;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Collagen alpha-1(XII) chain-like {ECO:0000313|Ensembl:ENSPNAP00000025949.1};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSPNAP00000025949.1};
OS Pygocentrus nattereri (Red-bellied piranha).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Pygocentrus.
OX NCBI_TaxID=42514 {ECO:0000313|Ensembl:ENSPNAP00000025949.1, ECO:0000313|Proteomes:UP000261440};
RN [1] {ECO:0000313|Ensembl:ENSPNAP00000025949.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 42514.ENSPNAP00000025949; -.
DR Ensembl; ENSPNAT00000006344.1; ENSPNAP00000025949.1; ENSPNAG00000011368.1.
DR GeneTree; ENSGT00940000154923; -.
DR OMA; WKRPPDE; -.
DR Proteomes; UP000261440; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 23.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 3.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 24.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 22.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 24.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 15.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 24.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..3615
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017310169"
FT DOMAIN 30..120
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 142..314
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 338..427
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 442..618
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 636..727
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 729..820
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 821..910
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 912..1001
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1002..1091
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1093..1183
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1204..1376
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1393..1482
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1483..1574
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1575..1664
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1665..1759
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1762..1850
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1852..1941
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1942..2031
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2032..2120
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2122..2210
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2212..2301
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2303..2391
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2393..2482
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2483..2573
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2574..2664
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2665..2753
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2754..2844
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2872..3049
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1080..1103
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2833..2856
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3294..3449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3481..3615
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1089..1103
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3329..3344
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3490..3506
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3615 AA; 391785 MW; 4AC77A2C052DF222 CRC64;
MMIKMRAKVV SAFITLTLLS SIKAQGQVTP PSDLRFKILN ESTVQMTWKL PLTRIEGFRI
QVVSSIDEPV KEFTLPASVT KTSIKDLTPD VDYVVTISSY SGSEESFPIS GQITIQSSGS
EGAPRRPQVS DVVKCSTSAL VDLVFLVDGS WSVGRENFKH IRNFISSVAG AFDIGEDKSR
VGVVQYSTDP RTEFTLSQHL RRVELLRAID SLPYKGGNTM TGDALDYLHK NIFTETTGTR
KGFPKAAVVI TVGKSQDPVE DYAKTLRDSG VEIFTLGIKD ADEEELKQMA STPYSAHVYT
VSNFDQIKSV QKSLTTQLCA GIEDQLSSLA SGEEVVEPAS NLQVTEVASK SMRLTWDASL
DEVTGYKLQM VPMLAGSKRQ DLYVGATQTY VNVRDLSPET EYEISLFALK GLTPSEPVTA
FQKTQPVKVS LECSLGVDVQ ADVVLLVDGS YSIGLANFAK VRAFLEVLVN SFDIGANKVQ
ISLVQYSRDP HTEFYLNTHH DNSAVVKAVR TFPYRGGSTN TGKAMTYVRE KIFIANRGAR
HNVPRVTILI TDGKSSDAFK DPATRLRSSD VEIFAVGVKD AVRSELEAIA NTPAETHVYT
VEDFDAFQRI SKELTQSICL RIEQELMNIN KRKLIPPKSL SFSEISSRKF RAMWATDAVN
VESYLVQYKP AADPAAGYVS VSVPGDTTTA MLVHLTPLTK YEVNVYAQYD KGESFPLTGF
ETTLEEQGTV GNLRVSEETT DSFRVTWTAA PGPVVRYRLT YRPVRGDSAA LETATEGTET
SIVLQQLFPI TTYRVSVAAE YPSGIGPQMH IDGTTKEARG SPRNLRVFDE TVSSMRVVWE
AAPGQVQQYV VSYQPTAGGE IKEVTVKGEN TETLLRNLQP DTEYQLAVKA RYSSGLGQPL
EGTGTTLEEL GSPRDLTTSD VTDSSFMLSW SAAPGRVRQY RVRWKSQFSD ESGEKMVPGD
VTSTLLEGLS PETRYQISVF ATYGLGEGEP LVGEETTDAS AAAKALLVSD ETERTMKVTW
QAAPGRVLNY RVAYSPQLGG KEVTTKVPHN TTTIVLKRLQ PMTTYDITVH PIYKRGEGKA
RQGVGTTLSP FKAPRNLQTS EPTKTSFRVT WDHAPGNVRG YKVTFHPSDN EVNPEELLVG
PYDNTVVLEE LRAGTKYSVA VFGMFDGGQS LPLAGEERTT LVDPPEPPPM DPPDAQCKTT
AKADIVLLVD GSWSIGRVNF KTIRSFIGRM VGVFDIGPDK VQIGLAQYSG DPKTEWHLNA
HPTRESLLEA VANLPYKGGN TMTGLALNYI LQNNFKTNVG LRPDSRRIGV LITDGKSQDD
IIRSSENLRG QGIELYAIGV KNADENELRS IASDPDEIHM YNVADFSFLL DIVDDLTVNL
CNSVKGPDTG PGTPTNLETS EVTHYSFRVT WLPPDEPVER FRIEYVPVIG GKTEVMYADG
EENTLVLNNL TPMTEYVVNV YSMIGDDSSE PLKGTETTLP LPAVKSMTVY DEALTSMRVK
WEQAAGATGY RLLYRAINAT VPTVEKEMVV GADVNDVQLL QLLPNTAYTL SLFALHGQAA
SDPLMDQGVT LPLPPAGKLR INEVTHSSMR LHWDAAPGKV RKYIITYKPE GGEPKEVEVG
GDVTTLPLTS LRSQTEYDVT VTPVYDEGPG NRMIGSEITD VVPAPKNLRF SEVTQTSFRA
TWEHGAPDVA LYRVGWTKKG ENNFQYVILT SEETTHVLTD LDPDTLYDVT VTAIYPDESE
SEDLMGTQRT VSKMITPAAN GPPQNLQVFN ATTTTLTVKW DHAPGPVQNY KINFQPVAGG
KNLSTQVGGK KNSVVLQKLT PDTPYSITVT SVYRTGENKD ISGQGKTKPL GGVKNLQVLN
PTMTTLNARW EPAEGKVKEY KVVYVPTAGG AESMEQVSGT TTNTVLRGLQ PDTLYTVTVY
PVYAEGDGKR MSENGKTKLL GGVKNLRVTD PTMTSLNVKW EPADGAVRQY KIFYVPSAGG
PEDMEQVPVG TTNIVLRNLQ PDTPYTVSVV PVYPATEGRR QSEKGRTLPL GGVRNLRITD
ATFTTLTATW DAADGNVQGY KIIYVPTDGG PELEEQVSES TTILTMKSLK PDTRYTVTVL
PVYAEGDGPQ LSKEGKTKPL GSVRNLQVTD PTISTLNVRW DPAEGSVREY IVTYVPAAGG
EENVEQVSGT TTSSVLKNLE PDTEYTVTVM PVYHEMEGKS LSENGKTKPL GGVQNLRVTD
PTTSTLAVRW DHADGNPRHY KVFYVPQPGT EEKMEQVSGG TTTTVLRNLN PNTVYKVTLL
PIYEKDVEGK RQSENGKTKP LGTVKNLQVT DPTVNSLRVR WDPADGDVHQ YNVFYVPAAG
GTESMTQVSG VSTNTVLRNL QPNTEYRVSV VPVYADMEGK RKSENGKTKP LGGVRNLQVT
DPTTSSLRVR WDPAEGNVRQ YRLFYVPASG GAEDMEQVSG GTTNTVLRNL LSDTVYTVTV
APVYPEGEGL RLSEKGKTLP RTPPRNIQVY NPTPNSLNVR WEPATGQVQQ YKVLYAPLSG
VRPTEFVLVP GNTNNAFLDQ LIPDMPYSVN VLAVYADGDG PQIKGNGKTL PRAGPRNMRV
FDATTSTLSI AWDHAEGPVQ QYKIAYAPIT GDPITEFTVV PGNRNNAILQ NLLPDTPYNI
TVQAIYADGA GGSLVGNGRT LGLLEPRNLR ISDEWYTRFR VAWDPAPVPV MGYKLVYQPT
DKDESLEVFV GDVTSYTLHN LLPGTTYDVK VYAQYDGGLS GALTGQGTTL YLNVTNLETY
NVDHDKFCIR WSPHRAATSY RIKLNPLDPA AKGQQEVTIT AAQSHYCFEG LSPDSLYNAT
VFVQTPNLEG PGVSKNERTL VKPTPVPTLP PTPPPPPTIP PGWAVCKGAK ADVVFLIDGS
WSIGEDSFNK VLQFVFSIIG AFDVIGPSGM QVSFVQYSDD AKTEYKLNTY DDKGTALAAT
QLIHYKGGNT KTGIALNHVH EKVFASDNGM RRNVPKVVVA VTDGRSQDEV KKNAAKLQHA
GYSVFVIGVA DVDFAELQNI GSKPSERHIF VVDDFDAFST IQENLVTFIC ETASSSCPLI
FVNGFTSPGF RMLEAFNLTE RMYAATKGVS MEPGSFNSYT AYRLHKDAFL SQPSVDIHPD
GLPAAYTIIL MFRLLPDSPK QAFDIWQVSD KNHKPEVGVT IDPSSQTVAF YNKDTRGEIQ
KVTFDNHQVK KIFHGSFHKL HILVSSKSVK LNIDCKEVAE KEIKEAGNTS PDGYQVLGKM
SKSIGSKGES ATFQIQMFDI ICSLGWTSRD RCCDLPSMRD EAKCPPLPNA CTCTSDSSGP
PGPMGPVGAP GSKGLRGERG DSGPPGPVGP RGDVGPPGPL GLPGPQGPSG LSIPGQAGRP
GPKGDPGDAG LPGQKGPPGK AGAVGPMGPS GVRGPQGKEG PAGPRGPPGQ MGPPGSPGMQ
GNAGKPGNPG DNGVPGPSGL KGDKGERGDL ASQGMMRSIA RQVCEQLVNS QMGRFNDMLN
HIPSEARSNS PGPAGPPGPP GSPGPQGEPG RIGRNGFPGT PGLPGRQGDR GPAGEKGERG
NPGIGERGQR GMPGPPGNPG ESRTGPPGPS GAPGSRGPPG RNGVPGARGP PGPPGYCDSS
QCVGIPYNGQ GYRGS
//