ID A0A2K6U4J6_SAIBB Unreviewed; 3041 AA.
AC A0A2K6U4J6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE SubName: Full=Collagen type XII alpha 1 chain {ECO:0000313|Ensembl:ENSSBOP00000026825.1};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSSBOP00000026825.1};
OS Saimiri boliviensis boliviensis (Bolivian squirrel monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Platyrrhini; Cebidae;
OC Saimiriinae; Saimiri.
OX NCBI_TaxID=39432 {ECO:0000313|Ensembl:ENSSBOP00000026825.1, ECO:0000313|Proteomes:UP000233220};
RN [1] {ECO:0000313|Ensembl:ENSSBOP00000026825.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 39432.ENSSBOP00000026825; -.
DR Ensembl; ENSSBOT00000043699.1; ENSSBOP00000026825.1; ENSSBOG00000029360.1.
DR GeneTree; ENSGT00940000154923; -.
DR Proteomes; UP000233220; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0035987; P:endodermal cell differentiation; IEA:Ensembl.
DR CDD; cd00063; FN3; 18.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 18.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF20; COLLAGEN ALPHA-1(XXI) CHAIN; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00041; fn3; 18.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 18.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 10.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 17.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000233220};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 18..108
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 131..307
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 327..417
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 431..607
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 625..714
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 716..807
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 808..896
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 898..989
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 990..1078
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1080..1170
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1190..1362
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1378..1467
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1468..1558
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1559..1649
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1650..1734
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1742..1836
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1923..2013
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2014..2104
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2105..2193
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2194..2281
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2310..2483
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 792..818
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 975..997
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1067..1090
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2270..2297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2737..2872
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2906..3041
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2765..2784
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2811..2825
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2915..2929
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2993..3011
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3041 AA; 331182 MW; 52530D5DFFB7415B CRC64;
GEPRMAPLHS SLGNKIDPPS DLNFKIIDEN TVHMSWAKPV DPIVGYRITV DPTTDGPTKE
FTLAASTTET LLSELVPETE YVVTITSYDE VEESVPVIGQ LTIQTGSSTK PGEKKPGRTE
IQKCSVSAWT DLVFLVDGSW SVGRNNFKYI LDFIAALVSA FDIGEEKTRV GVVQYSSDTR
TEFNLNQYYQ RDELLAAIKT IPYKGGNTMT GDAIDYLVKN TFTESAGARV GFPKVAIIIT
DGKSQDEVEI PARELRNVGV EVFSLGIKAA DAKELKQIAS TPSLNHVFNV ANFDAIVGIQ
NEIISQVCSG VDEQLGELVS GEEVVEPPSN LIAMEVSSKY IKLSWNPSPS PVTGYKVILT
PMTAGSRQHA LSVGPQTTTL SVRDLSADTE YQISVSAMKG MTSSEPISIM EKTQPMKVQV
ECSRGVDIKA DIVFLVDGSY SIGIANFVKV RAFLEVLVKS FEISPNRVQI SLVQYSRDPH
TEFTLKKFTK VEDIIDAINN FPYRGGSTNT GKAMTYVREK IFVPSKGSRS NVPKVMILIT
DGKSSDAFRD PAIKLRNSDV EIFAVGVKDA VRSELEAIAS PPAETHVFTV EDFDAFQRIS
FELTQSICLR IEQELAAIKK KAYIPPKDLS FSEVTSYSFK TNWSPAGENV FSYHITYKEA
TGDEEVTVVE PASSTSVVLS NLKPETLYLV NVTAEYEDGF SIPLAGEETT AEVKGAPRNL
KVTDETTDSF KITWTQAPGR VLRYRIIYRP VAGGESREVT TPPNQRRRTL ENLIPDTKYE
VSVIPEYFSG PGSPLTGNAA TEEVRGNPRD LRVSDPTTST MKLSWSAAPG KVKQYLVTYT
PVAGSETQEV TVRGDKTSTV LQGLKEGTQY ALSVTALYAS GAGDALFGEG TTLEERGSPQ
DLVTKDITDT SIGAYWTSAP GMVRGYRVSW KSLYDDVDTG EKNLPEDAIH TMIENLQPET
KYRISVFATY SSGEGEPLTG DATTELSQDS KTLKVDEETE NTMRVTWKPA PGKVVNYRVV
YRPRGAGRQM VAKVPPTVTS TVLKRLQPQT TYDITVLPIY KTGEGKLRQG SGTTASRFKS
PRNLKTSDPT MSSFRVTWEP APGEVKGYKV TFHPTGDDRR LGELVVGPYD NTVVLEELRA
GTTYKVNVFG MFDGGESSPL VGQEMTTLSD TTVMPILSSG MECLTRAEAD IVLLVDGSWS
IGRANFRTVR SFISRIVEVF DIGPKRVQIA LAQYSGDPRT EWQLNTHRDK KSLLQAVANL
PYKGGNTLTG MALNFIRQQS FRTQAGMRPR ARKIGVLITD GKSQDDVEAP SKKLKDEGVE
LFAIGIKNAD EVELKMIATD PDDTHAYNVA DFESLSRIVD DLTMNLCNSV KGPGDLEAPS
NLVISERTHR SFRVSWTPPS DSVDRYKVEY YPVSGGKRQE FYVSRMETST VLKDLKPETE
YVVNVYSVVE DEYSEPLKGT EKTLPVPVVS LNIYDVGPTT MHVQWQPVGG ATGYILSYEP
VKDTDTRRPK EMRLGPTVND VELTDLVPNT EYAVTVQAVL HDLTSEPVTV REVTLPLPGP
QDLKLRDVTH STMNVFWEPV PGKVRKYVVR YKTPEEDVKE VEVDRSETST SLKDLFSQTL
YTVSVSAVHD EGESPPVTAQ ETTRPVPAPT NLRITEVTQE SFRGTWDHGA SDVSLYRITW
APPGSSDKME TILNGDENTL VFENLNPNTV YEVSITAIYP DESESDDLIG SEQTLFCLVT
LGPRNLQVYN ATSNSLTVKW DPATGRVQKY RITYQPSTGE GNEQTTTIGG RQNSVVLQKL
KPDTPYTITV SSLYPDGEGG RMTGRGKTKP LNTVRNLRVY DPSTSTLNVR WDHAEGNPRQ
YKLFYAPAAG GPEELVPIPG NTNYAILRNL QPDTPYTVTV VPVYTEGDGG RTSDTGRTLM
RGVARNVQVY NPTPNSLDVR WDPAPGPVLQ YRVVYSPADG TRPSESIVVP GNTRTVHLER
LIPDTLYSVN LVALYSDGEG NPSPAQGRTL PRSGPRNLRV FGETTNSLSV AWDHADGPVQ
QYRIIYSPTV GDPIDEYTTV PGRRNNVMLQ PLQSDTPYKI TVIAVYEDGD GGHLTGNGRT
VGLLPPQNIH ISDEWYTRFR VSWDPSPSPV LGYKIVYKPV GSSEPMEAFV GEVTSYTLHN
LNPSTTYDVN VYAQYDSGLS VPLTDQGTTL YLNVTDLKTY QIGWDTFCVK WSPHRAATSY
RLKLSPADGT RGQEITVRGS ETSHCFTGLS PDTDYGVTVF VQTPNLEGPG VSVKEHTTVK
PTEAPTEPPT PPPAPTIPPA RDVCKGAKAD IVFLTDASWS IGDDNFNKVV KFIFNTVGGF
DEISPAGIQV SFVQYSDEVK SEFKLNTYND KALALGALQN IRYRGGNTRT GKALTFIKEK
VLTWESGMRK NVPKVLVVVT DGRSQDEVKK AALVIQQSGF SVFVVGVADV DYNELANIAS
KPSERHVFIV DDFESFEKIE DNLITFVCET ATSSCPLIYL DGYTSPGFKM LEAYNLTEKN
FASVQGVSLE SGSFPSYSAY RIQKNAFVNQ PTADLHPNGL PPSYTIILLF RLLPETPSDP
FAIWQITDRD YKPQVGVIAD PSSKTLSFFN KDTRGEVQTV TFDTDEVKTL FYGSFHKVHI
VVTSKSVKIY IDCYEIIEKD IQEAGNITTD GYEILGKLLK GERKSAAFQI QSFDIVCSPV
WTSRDRCCDI PSRRDEAKCP AFPNSCTCTL DSIGPPGPPG PAGGPGAKGP RGERGISGAI
GPPGPRGDIG PPGPQGPPGP QGPNGLSIPG EQGRQGMKGD AGEPGLPGRT GTPGLPGPPG
PMGPPGDRVS FLRQGSPGSP GVTGPSGKPG KPGDHGRPGP SGLKGEKGDR GDIASQNMMR
AVARQVCEQM ISGQMNRFNQ MLNQIPNDYH SSRNQPGPPG PPGPPGSAGA RGEPGPGGRP
GFPGTPGMQG PPGERGLPGE KGERGTGSPG PRGLPGPPGP QGESRTGPPG STGSRGPPGP
PGRPGNSGIR GPPGPPGYCD SSQCASIPYN GQGYPGMLLP L
//