ID W5NJS9_LEPOC Unreviewed; 3580 AA.
AC W5NJS9;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 47.
DE SubName: Full=Collagen, type XII, alpha 1b {ECO:0000313|Ensembl:ENSLOCP00000020888.1};
GN Name=COL12A1 {ECO:0000313|Ensembl:ENSLOCP00000020888.1};
OS Lepisosteus oculatus (Spotted gar).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC Lepisosteus.
OX NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000020888.1, ECO:0000313|Proteomes:UP000018468};
RN [1] {ECO:0000313|Proteomes:UP000018468}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT "The Draft Genome of Lepisosteus oculatus.";
RL Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSLOCP00000020888.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHAT01011852; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AHAT01011853; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 7918.ENSLOCP00000020888; -.
DR Ensembl; ENSLOCT00000020924.1; ENSLOCP00000020888.1; ENSLOCG00000016895.1.
DR eggNOG; KOG3544; Eukaryota.
DR GeneTree; ENSGT00940000154923; -.
DR HOGENOM; CLU_000467_0_0_1; -.
DR InParanoid; W5NJS9; -.
DR OMA; YTQTPNM; -.
DR Proteomes; UP000018468; Linkage group LG1.
DR Bgee; ENSLOCG00000016895; Expressed in larva and 13 other cell types or tissues.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0035987; P:endodermal cell differentiation; IBA:GO_Central.
DR CDD; cd00063; FN3; 16.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 19.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF17; COLLAGEN ALPHA-1(XII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF00041; fn3; 17.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 18.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 13.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 17.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000018468};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..3580
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004867726"
FT DOMAIN 26..116
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 140..316
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 336..425
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 440..616
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 634..723
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 725..816
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 817..906
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 908..999
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1000..1087
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1089..1179
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1200..1372
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1389..1478
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1479..1570
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1571..1661
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1662..1751
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1752..1843
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2308..2397
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2398..2488
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2489..2579
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2580..2668
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2787..2960
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1077..1102
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3214..3364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3397..3580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1085..1102
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3291..3305
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3315..3330
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3407..3421
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3476..3493
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3551..3580
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3580 AA; 389945 MW; 225F9DC39CD46A22 CRC64;
MKIRLSLAAI AFLATLIASV EAQVEPPADL KFKIINENTV QMSWKRPSTR IQGYRIVVLP
TTDGPAKELN LPASATKTSI AELLPDLDYV VTISSYDGAE ESIPIFGQLT IQSNSQAPGG
PRKRPQTEDA LKCSVSAIAD LVFLVDGSWS VGRENFKHIR SFISSLASAF DIGEEKTRVA
VVQYSTDTRT EFNLNQYYKR TELLRAINSL PYKGGNTMTG EAMDYLLKNT FTEAAGARKG
FPKIAMIITD GKSQDPVEEY AKKLRNIGVE IFVLGIKGAD VEELKQMASG PFEKHVYNVA
NFDLIKDVQH DLITQVCAGV DEQLNELVSG EEIVEPPSNL QVTELTSKSM RVTWDPSSGP
ITGYKLQLIP MLAGSKRQEI HTGPSSTSMN VRDLSPETEY QIMLYALKGL TPSEPVMAME
KTQPIRVSLE CSLGVDVQAD VVLLVDGSYS IGLANFAKVR AFLEVLVNSF DIGPAKVQIS
LVQYSRDPYT EFALNTHNNL ASVLKAIRTF PYRGGSTNTG KAMTYVREKI FVPSKGSRLN
VPRVMILITD GKSSDAFKDP ATKLRNTDVE IFAVGVKDAV RAELESIANT PVETHVYTVE
DFDAFERISK ELTQSICLRI EQELLKIKTR NLTAPENLQF SEVGPRSFRV SWTSDAENVL
SFLVTYKPVA GGEYISMYVA PDTRSTVLHH LTPSTLYEVN VISQYERGNS FPLSGNETTL
EEQGAPRNLR VSEETVDSFR VTWDAAPGAV VRYHLSYSPI RGEAERKEIT TSGPETTIVL
QDLLQLTTYH VSVSAEYASG LGRKMDTSGT TKEARGSPRD LQVFDHTVSS MRLSWTAAPG
RVLQYRVTYV PTAGGDSKEI YVKGDSTAAF LKNLQPATEY EISVSAVYPS GSGDPLTGRG
TTLEELGSPR DLVTKDVTDT SFGVSWAAAP GNVRSYRIAW KSAFTEEAGE KSVRGDVTDT
VLEELTPETK YQISVYAAYG HGEGDPLVGE ETTDASAEGK TLSVSEETEK SLRVTWQAAP
GEVVNYRITY RPVAGGRQLA TKVPGSVTTT VLRRLQPMTS YDITVLPVYR RGEGKARQGV
GTTLSPFKGP RNLQTSEPTK TSFRVSWDPA PGEVRGYKVT FHPADDEVNL GELLVGPYDN
TVVLEELRAG TKYSVSVFGV FDGGESFPLA GEEKTTLSDM PEPPPIRFTD VECKTTAQAD
IVLLVDGSWS IGRLNFKTIR AFIARMVGVF EIGPNRVQIG LAQYSGDPKT EWHLNAHRTR
ESLLEAVANL PYKGGNTLTG LALNYILQNN FKPNVGLRPN ARKIGVLITD GKSQDDIIMS
SQSLRDQGIE LYAVGVKNAD ENELRSIASD PDEIHMYNVA DFSFLLDIVD DLTINLCNSV
KGKTGGLEAP TDLVTSEVTA RSFRATWTAP SGTVEQYRVE YRPAAGGRTE EIFVDGSTTT
AVLPNLNPLT EYLVQVYSVS GTESSDPLKG SETTLPLPSV RNMNVYDMTS SSMRVRWEPA
SEATGYLLLY SAINATTPES EKEMRVGSEV NDVQLEQLLS DTAYTISLYA LHGEAATDPL
TSQGVTLPLP PAGELRITDV THSTMKLNWD KAPGKVRKYI ITYKPVDGEE AKEIEVAGSI
TTKVIDSLTS QTEYDVAVTP VYDEGAGNPM IGQAVTDVVP APKNLRFSEV TQTSFRATWD
HGAPDVALYR IGWVKQGGSD IQYAILNSDE NTSVLENLDP NTIYDVQVTA IYPDESESED
LMGSQRTSPT GAPQNLQVYN ATTSSLTVKW DPAPGRVQNY RITYIPTAGG QLQTVNEAPC
NTCVLVFRMH SELFFLILVN CLVFAHQSNP VNNFVQLCSC VNAITDLRLP TPEIKSVVLK
ARRLKNEISH QRLLNYYDFA AKCDKKCLTT QMAVHVILHA ISLDFNDLIT PLRPETASLC
FSLNLTALME LTKNIVLIQN NLHCAFEKFF LLGTGKKQWQ YFFSHCTSGS VVFYCFLITK
PVIFQNNLFL DMVYFACNFI WPSNFLRCTL STYLRELKDT NNGIVINPWC ILVSGGYESL
PSLTRGSVYT ANSRGFSIVP WGTPFVVLRL LREIPDFRCH YLFLYLLMLI SIPSHQRCFV
LTVEINVAPV TVHTLHATRY CKQMAQHSHL IVVILFLYVF YEQVQVGGRR NSVVLQKLTS
DTPYTISVAS VYASGESKEI TGSGKTSKCP LGCPDLRSIT LRIRWETALL PILVTKTTKK
NILNNLTTSA GTHEGLQLVA EDLNSQGVGF NNTVIEFLNH LLLYLNGAAW SSVVQSYPLH
SVRRWVGQDQ RVKLQLVKTG GGGEPLGGVR NLQVTDPTTS TLNVRWEPAE GSVRQYRILY
VPAAGGAEDI EQVSGGTTST VLRNLLPDTV YNVAVVPVYA EGEGKRLTES GKTLERSPPR
NIQVYNPTPN SLNVRWEPAS GQVQQYRVVY APLSGTRPSE YVLVPGNTNN AVIEQLIPDT
PYSVSVLALY ADGEGSQVTG QGTTLPRSGP RNLRVFGATT DTLSVSWDHA EGPVQQYRIS
YAPTTGDPIE EFTVVPGRRN NVVLQNLQSD TPYRINVVAV YADGPGGELT GNGRTVGLLE
PRNLRVSDEW YTRFRVSWDP APLPVLGYKL VYQPTDQDET MEVFVGDVTT YTLHNLLPGT
TYDLKVYAQY DGGASGALIG QGTTLYLNVT DLTTYQVGFD TFCIKWTPHR SATSYRIKLN
PLQECVCGNN IVGVAVEETT YCFSGLSPDA LYNATVFAQT PNLEGPGVSV KERTLVKPTE
APTLPPTPPP PPTIPPAWEV CKGAKADLVF LIDGSWSIGD DNFNKVLQFV FNTIGAFDVI
SPAGMQVSFV QYSDDAKTEF KLNTYNNKGM VLSALQLVRY RGGNTRTGVA LKYIAEKVLT
PENGMRKNVP KVLVVVTDGR SQDEVKKTAP ALQHAGYSVF VIGVADVDYA ELQNIGSKPS
DRHVFVVDDF DAFEKIQDNL ITFICETATS TCPLIYLNGY TSPGFRMLEA FNLTEKTFAS
VKGVSMEPGS FNSFIAYRLH KNAHLTQPTK EVHPEGLPHA YTIILTLRLL PDSPSEAFDI
WQIADKSNKP EVGITMDPSS RTLSFYNKDT RGEIQKVTFD NDEVKKIFHG SFHKLHIAVS
PKNVKLHVDC QEVAEKEIKE ANNITLDGYE VLGKLVKSRG TRGDSATFQL QMFDIVCSLG
WTSRDRCCDL PSTRDEAKCP ALPHACTCAQ DSIGPPGPPG PTGSPGNKGP RGERGETGPA
GPIGPRGELG LPGPMGLPGP QGPNGLSIPG EPGRPGPKGD PGDSGLPGRQ GPPGSPGPIG
PVGPIGGRGP PGKEGPAGPR GPPGPMGAPG TPGMPGQTGQ PGKPGDTGQR GPAGMKGEKG
ERGDFASQNM MRSIARQVCE QLVNGQMSRI DTMLNQIPNG YYSNRNNPGP PGPPGPPGSS
GPRGEQGPTG SNGFPGSPGL PGRPGDRGPA GEKGERGSPG IGSQGQRGLT GPPGPPGDSR
TGPPGPQGSP GPRGPPGRQG SAGVRGPPGP PGYCDSSQCV GIPYNGQGFP EPYPPEHETY
VVPVIPVEQP EETELQSSGY TRNQRGKRSL SSKNPQSRSS
//