ID A0A2K6DG14_MACNE Unreviewed; 3063 AA.
AC A0A2K6DG14;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Collagen type XII alpha 1 chain {ECO:0000313|Ensembl:ENSMNEP00000034872.1};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSMNEP00000034872.1};
OS Macaca nemestrina (Pig-tailed macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9545 {ECO:0000313|Ensembl:ENSMNEP00000034872.1, ECO:0000313|Proteomes:UP000233120};
RN [1] {ECO:0000313|Ensembl:ENSMNEP00000034872.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_011735701.1; XM_011737399.1.
DR STRING; 9545.ENSMNEP00000034872; -.
DR Ensembl; ENSMNET00000059309.1; ENSMNEP00000034857.1; ENSMNEG00000040601.1.
DR Ensembl; ENSMNET00000059324.1; ENSMNEP00000034872.1; ENSMNEG00000040601.1.
DR GeneID; 105479442; -.
DR CTD; 1303; -.
DR GeneTree; ENSGT00940000154923; -.
DR OrthoDB; 5353225at2759; -.
DR Proteomes; UP000233120; Unplaced.
DR Bgee; ENSMNEG00000040601; Expressed in lung and 12 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0035987; P:endodermal cell differentiation; IEA:Ensembl.
DR CDD; cd00063; FN3; 18.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 18.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 18.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 18.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 11.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 17.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000233120};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..3063
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014561529"
FT DOMAIN 27..117
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 140..316
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 336..426
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 440..616
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 634..722
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 725..816
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 817..905
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 907..998
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 999..1087
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1089..1179
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1199..1371
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1387..1476
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1477..1567
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1568..1658
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1659..1754
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1755..1849
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1936..2026
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2027..2117
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2118..2206
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2207..2294
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2323..2496
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 799..830
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1076..1099
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2283..2314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2743..2896
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2931..3063
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2297..2313
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2778..2797
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2824..2838
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2940..2954
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3018..3036
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3048..3063
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3063 AA; 333116 MW; F0CEB30A80FC2D7C CRC64;
MRTRLPPVLA ALGAALLLSS IEAEVDPPSD LNFKIIDENT VHMSWAKPVD PIVGYRITVD
PTTDGPTKEF TLAASTTETL LSELVPETEY VVTITSYDEV EESVPVIGQL TIQTGSSTKP
VEKKPGKTEI QKCSVSAWTD LVFLVDGSWS VGRNNFKYIL DFIAALVSAF DIGEEKTRVG
VVQYSSDTRT EFNLNQYYQR DELLAAIKKI PYKGGNTMTG DAIDYLVKNT FTESAGARVG
FPKVAIIITD GKSQDEVEIP ARELRNVGVE VFSLGIKAAD AKELKQIAST PSLNHVFNVA
NFDAIVDIQN EIISQVCSGV DEQLGELVSG EEVVEPPSNL IAVEVSSKYV KLSWNPSASP
VTGYKVILTP MTAGSRQHAL SVGPQTTTLS VRDLSADTEY QISVSAMKGM TSSEPISIME
KTQPMKVQVE CSRGVDIKAD IVFLVDGSYS IGIANFVKVR AFLEVLVKSF EISPNRVQIS
LVQYSRDPHT EFTLKKFTKV EDIIEAINTF PYRGGSTNTG KAMTYVREKI FVPSKGSRSN
VPKVMILITD GKSSDAFRDP AIKLRNSDVE IFAVGVKDAV RSELEAIASP PAETHVFTVE
DFDAFQRISF ELTQSICLRI EQELAAIKKK AYVPPKDLSF SEVTSYGFKT NWSPAGENVF
SYHITYKEAT GDDEVTVVEP ASSTSVVLSN LKPETLYLVN VTAEYEDGFS IPLAGEETTE
EVKGAPRNLK VTDETTDSFK ITWTQAPGRV LRYRIIYRPV AGGESREVTT PPNQRRRTLE
NLIPDTKYEV TVIPEYFSGP GSPLTGNAAT EEVRGNPRDL RVSDPTTSTM KLSWSGAPGK
VKQYLVTYTP VAGGETQEVT VRGDTTNTVL RGLKEGTQYA LSVTALYASG AGDALFGEGT
TLEERGSPRD LVTKDITDTS IGAYWTSAPG MVRGYRVSWK SLYDDVDTGE KNLPEDAIHT
MIENLQPETK YRISIFATYS SGEGEPLTGD ATTELSQDSK TLKVDEETES TMRVTWKPAP
GKVVNYRVVY RPHGGGRQMV AKVPPTVTST VLKRLQPQTT YDITVLPIYK TGEGKLRQGS
GTTASRFKSP RNLKTSDPTM SSFRVTWEPA PGEVKGYKVT FHPTGDDRRL GELVVGPYDN
TVVLEELRAG TTYKVNVFGM FDGGESSPLV GQEMTTLSDT TVMPILSSGM ECLTRAEADI
VLLVDGSWSI GRANFRTVRS FISRIVEVFD IGPKRVQIAL AQYSGDPRTE WQLNAHRDKK
SLLQAVANLP YKGGNTLTGM ALNFIRQQSF RTQAGMRPRA RKIGVLITDG KSQDDVEAPS
KKLKDEGVEL FAIGIKNADE VELKMIATDP DDTHAYNVAD FESLSRIVDD LTINLCNSVK
GPGDLEAPSN LVISERTHRS FRVSWTPPSD SVDRYKVEYY PVSGGKRQEF YVSRMETSTV
LKDLKPETEY VVNVYSVVED EYSEPLKGTE KTLPVPVVSL NIYDVGPTTM HVQWQPVGGA
TGYILSYKPV KDTEPTTPKE MRLGPTVNDM QLTDLVPNTE YAVTVQAVLH DLTSEPVTVR
EVTLPLPRPQ DLKLRDVTHS TMNVFWEPVP GKVRKYIVRY KTPEEDVKEV EVDRSETSTS
LKDLFSQTLY TVSVSAVHDE GESPPVTAQE TTRPVPAPTN LRITEVTSEG FRGTWDHGAS
DVSLYRITWA PFGSSDKMET ILNGDENTLM FENLNPNTVY EVSITAIYPD ESESDDLIGS
ERTLPILTTQ APKSGPRNLQ VYNATSNSLT VKWDPASGRV QKYRITYQPS TGEGNEQTTT
IGGRQNSVVL QKLKPDTPYT ITVSSLYPDG EGGRMTGRGK TKPLNTVRNL RVYDPSTSTL
NVRWDHAEGN PRQYKLFYAP AADGPEELVP IPGNTNYAIL RNLQPDTSYT VTVVPVYTEG
DGGRTSDTGR TLMRGLARNV QVYNPTPNSL DVRWDPAPGP VLQYRVVYSP VDGTRPSESI
VVPGNTRMVH LERLIPDTLY SVNLVALYSD GEGNPSPAQG RTLPRSGPRN LRVFGETTNS
LSVAWDHADG PVQQYRIIYS PTVGDPIDEY TTVPGRRNNV ILQPLQPDTP YKITVIAVYE
DGDGGHLTGN GRTVGLLPPQ NIHISDEWYT RFRVSWDPSP SPVLGYKIVY KPVGSNEPME
AFVGEMTSYT LHNLNPSTTY DVNVYAQYDS GLSVPLTDQG TTLYLNVTDL KTHQIGWDTF
CVKWSAHRAA TSYRLKLSPA DGTRGQEITV RGSETSHCFT GLSPDTDYGV TVFVQTPNLE
GPGVSVKEHT TVKPTEAPTE PPTPPPPPTI PPARDVCKGA KADIVFLTDA SWSIGDDNFN
KVVKFIFNTV GGFDEISPAG IQVSFVQYSD EVKSEFKLNT YNDKALALGA LQNIRYRGGN
TRTGKALTFI KEKILTWESG MRKNVPKVLV VVTDGRSQDE VKKVALVIQQ SGFSVFVVGV
ADVDYNELAN IASKPSERHV FIVDDFESFE KIEDNLITFV CETATSSCPL IYLDGYTSPG
FKMLEAYNLT EKNFASVQGV SLESGSFPSY SAYRIQKNAF VNQPTADLHP NGLPPSYTII
LLFRLLPETP SDPFAIWQIT DRDYKPQVGV IADPSSKTLS FFNKDIRGEV QTVTFDTDEV
KTLFYGSFHK VHIVVTPKSV KTYIDCYEII EKDIKEAGNI TTDGYEILGK LLKGERKSAA
FQIQSFDIVC SPVWTSRDRC CDIPSRRDEA KCPAFPNSCT CTQDSVGPPG PPGPAGGPGA
KGPRGERGIS GAVGPPGPRG DIGPPGPQGP PGPQGPNGLS IPGEQGRQGT KGDAGEPGLP
GRTGTPGLPG PPGPMGPPGD RGFTGKDGAM GPRGPPGPPG SPGSPGVTGP SGKPGKPGDH
GRPGPSGLKG EKGDRGDIAS QNMMRAVARQ VCEQLISGQM NRFNQMLNQI PNDYHSSRNQ
PGPPGPPGPP GSAGARGEPG PGGRPGFPGT PGMQGPPGER GLPGEKGERG TGSPGPRGLP
GPPGPQGESR TGPPGSTGSR GPPGPPGRPG NSGIRGPPGP PGYCDSSQCA SIPYNGQGYP
GSG
//