GenomeNet

Database: UniProt
Entry: W5NJS9_LEPOC
LinkDB: W5NJS9_LEPOC
Original site: W5NJS9_LEPOC 
ID   W5NJS9_LEPOC            Unreviewed;      3580 AA.
AC   W5NJS9;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 47.
DE   SubName: Full=Collagen, type XII, alpha 1b {ECO:0000313|Ensembl:ENSLOCP00000020888.1};
GN   Name=COL12A1 {ECO:0000313|Ensembl:ENSLOCP00000020888.1};
OS   Lepisosteus oculatus (Spotted gar).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Holostei; Semionotiformes; Lepisosteidae;
OC   Lepisosteus.
OX   NCBI_TaxID=7918 {ECO:0000313|Ensembl:ENSLOCP00000020888.1, ECO:0000313|Proteomes:UP000018468};
RN   [1] {ECO:0000313|Proteomes:UP000018468}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA   MacCallum I., Young S., Walker B.J., Lander E.S., Lindblad-Toh K.;
RT   "The Draft Genome of Lepisosteus oculatus.";
RL   Submitted (DEC-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSLOCP00000020888.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AHAT01011852; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AHAT01011853; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   STRING; 7918.ENSLOCP00000020888; -.
DR   Ensembl; ENSLOCT00000020924.1; ENSLOCP00000020888.1; ENSLOCG00000016895.1.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000154923; -.
DR   HOGENOM; CLU_000467_0_0_1; -.
DR   InParanoid; W5NJS9; -.
DR   OMA; YTQTPNM; -.
DR   Proteomes; UP000018468; Linkage group LG1.
DR   Bgee; ENSLOCG00000016895; Expressed in larva and 13 other cell types or tissues.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0062023; C:collagen-containing extracellular matrix; IBA:GO_Central.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   GO; GO:0035987; P:endodermal cell differentiation; IBA:GO_Central.
DR   CDD; cd00063; FN3; 16.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 19.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF17; COLLAGEN ALPHA-1(XII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF00041; fn3; 17.
DR   Pfam; PF00092; VWA; 4.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 18.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 4.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 13.
DR   SUPFAM; SSF53300; vWA-like; 4.
DR   PROSITE; PS50853; FN3; 17.
DR   PROSITE; PS50234; VWFA; 4.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018468};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..3580
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004867726"
FT   DOMAIN          26..116
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          140..316
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          336..425
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          440..616
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          634..723
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          725..816
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          817..906
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          908..999
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1000..1087
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1089..1179
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1200..1372
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1389..1478
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1479..1570
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1571..1661
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1662..1751
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1752..1843
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2308..2397
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2398..2488
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2489..2579
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2580..2668
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2787..2960
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          1077..1102
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3214..3364
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3397..3580
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1085..1102
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3291..3305
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3315..3330
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3407..3421
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3476..3493
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3551..3580
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3580 AA;  389945 MW;  225F9DC39CD46A22 CRC64;
     MKIRLSLAAI AFLATLIASV EAQVEPPADL KFKIINENTV QMSWKRPSTR IQGYRIVVLP
     TTDGPAKELN LPASATKTSI AELLPDLDYV VTISSYDGAE ESIPIFGQLT IQSNSQAPGG
     PRKRPQTEDA LKCSVSAIAD LVFLVDGSWS VGRENFKHIR SFISSLASAF DIGEEKTRVA
     VVQYSTDTRT EFNLNQYYKR TELLRAINSL PYKGGNTMTG EAMDYLLKNT FTEAAGARKG
     FPKIAMIITD GKSQDPVEEY AKKLRNIGVE IFVLGIKGAD VEELKQMASG PFEKHVYNVA
     NFDLIKDVQH DLITQVCAGV DEQLNELVSG EEIVEPPSNL QVTELTSKSM RVTWDPSSGP
     ITGYKLQLIP MLAGSKRQEI HTGPSSTSMN VRDLSPETEY QIMLYALKGL TPSEPVMAME
     KTQPIRVSLE CSLGVDVQAD VVLLVDGSYS IGLANFAKVR AFLEVLVNSF DIGPAKVQIS
     LVQYSRDPYT EFALNTHNNL ASVLKAIRTF PYRGGSTNTG KAMTYVREKI FVPSKGSRLN
     VPRVMILITD GKSSDAFKDP ATKLRNTDVE IFAVGVKDAV RAELESIANT PVETHVYTVE
     DFDAFERISK ELTQSICLRI EQELLKIKTR NLTAPENLQF SEVGPRSFRV SWTSDAENVL
     SFLVTYKPVA GGEYISMYVA PDTRSTVLHH LTPSTLYEVN VISQYERGNS FPLSGNETTL
     EEQGAPRNLR VSEETVDSFR VTWDAAPGAV VRYHLSYSPI RGEAERKEIT TSGPETTIVL
     QDLLQLTTYH VSVSAEYASG LGRKMDTSGT TKEARGSPRD LQVFDHTVSS MRLSWTAAPG
     RVLQYRVTYV PTAGGDSKEI YVKGDSTAAF LKNLQPATEY EISVSAVYPS GSGDPLTGRG
     TTLEELGSPR DLVTKDVTDT SFGVSWAAAP GNVRSYRIAW KSAFTEEAGE KSVRGDVTDT
     VLEELTPETK YQISVYAAYG HGEGDPLVGE ETTDASAEGK TLSVSEETEK SLRVTWQAAP
     GEVVNYRITY RPVAGGRQLA TKVPGSVTTT VLRRLQPMTS YDITVLPVYR RGEGKARQGV
     GTTLSPFKGP RNLQTSEPTK TSFRVSWDPA PGEVRGYKVT FHPADDEVNL GELLVGPYDN
     TVVLEELRAG TKYSVSVFGV FDGGESFPLA GEEKTTLSDM PEPPPIRFTD VECKTTAQAD
     IVLLVDGSWS IGRLNFKTIR AFIARMVGVF EIGPNRVQIG LAQYSGDPKT EWHLNAHRTR
     ESLLEAVANL PYKGGNTLTG LALNYILQNN FKPNVGLRPN ARKIGVLITD GKSQDDIIMS
     SQSLRDQGIE LYAVGVKNAD ENELRSIASD PDEIHMYNVA DFSFLLDIVD DLTINLCNSV
     KGKTGGLEAP TDLVTSEVTA RSFRATWTAP SGTVEQYRVE YRPAAGGRTE EIFVDGSTTT
     AVLPNLNPLT EYLVQVYSVS GTESSDPLKG SETTLPLPSV RNMNVYDMTS SSMRVRWEPA
     SEATGYLLLY SAINATTPES EKEMRVGSEV NDVQLEQLLS DTAYTISLYA LHGEAATDPL
     TSQGVTLPLP PAGELRITDV THSTMKLNWD KAPGKVRKYI ITYKPVDGEE AKEIEVAGSI
     TTKVIDSLTS QTEYDVAVTP VYDEGAGNPM IGQAVTDVVP APKNLRFSEV TQTSFRATWD
     HGAPDVALYR IGWVKQGGSD IQYAILNSDE NTSVLENLDP NTIYDVQVTA IYPDESESED
     LMGSQRTSPT GAPQNLQVYN ATTSSLTVKW DPAPGRVQNY RITYIPTAGG QLQTVNEAPC
     NTCVLVFRMH SELFFLILVN CLVFAHQSNP VNNFVQLCSC VNAITDLRLP TPEIKSVVLK
     ARRLKNEISH QRLLNYYDFA AKCDKKCLTT QMAVHVILHA ISLDFNDLIT PLRPETASLC
     FSLNLTALME LTKNIVLIQN NLHCAFEKFF LLGTGKKQWQ YFFSHCTSGS VVFYCFLITK
     PVIFQNNLFL DMVYFACNFI WPSNFLRCTL STYLRELKDT NNGIVINPWC ILVSGGYESL
     PSLTRGSVYT ANSRGFSIVP WGTPFVVLRL LREIPDFRCH YLFLYLLMLI SIPSHQRCFV
     LTVEINVAPV TVHTLHATRY CKQMAQHSHL IVVILFLYVF YEQVQVGGRR NSVVLQKLTS
     DTPYTISVAS VYASGESKEI TGSGKTSKCP LGCPDLRSIT LRIRWETALL PILVTKTTKK
     NILNNLTTSA GTHEGLQLVA EDLNSQGVGF NNTVIEFLNH LLLYLNGAAW SSVVQSYPLH
     SVRRWVGQDQ RVKLQLVKTG GGGEPLGGVR NLQVTDPTTS TLNVRWEPAE GSVRQYRILY
     VPAAGGAEDI EQVSGGTTST VLRNLLPDTV YNVAVVPVYA EGEGKRLTES GKTLERSPPR
     NIQVYNPTPN SLNVRWEPAS GQVQQYRVVY APLSGTRPSE YVLVPGNTNN AVIEQLIPDT
     PYSVSVLALY ADGEGSQVTG QGTTLPRSGP RNLRVFGATT DTLSVSWDHA EGPVQQYRIS
     YAPTTGDPIE EFTVVPGRRN NVVLQNLQSD TPYRINVVAV YADGPGGELT GNGRTVGLLE
     PRNLRVSDEW YTRFRVSWDP APLPVLGYKL VYQPTDQDET MEVFVGDVTT YTLHNLLPGT
     TYDLKVYAQY DGGASGALIG QGTTLYLNVT DLTTYQVGFD TFCIKWTPHR SATSYRIKLN
     PLQECVCGNN IVGVAVEETT YCFSGLSPDA LYNATVFAQT PNLEGPGVSV KERTLVKPTE
     APTLPPTPPP PPTIPPAWEV CKGAKADLVF LIDGSWSIGD DNFNKVLQFV FNTIGAFDVI
     SPAGMQVSFV QYSDDAKTEF KLNTYNNKGM VLSALQLVRY RGGNTRTGVA LKYIAEKVLT
     PENGMRKNVP KVLVVVTDGR SQDEVKKTAP ALQHAGYSVF VIGVADVDYA ELQNIGSKPS
     DRHVFVVDDF DAFEKIQDNL ITFICETATS TCPLIYLNGY TSPGFRMLEA FNLTEKTFAS
     VKGVSMEPGS FNSFIAYRLH KNAHLTQPTK EVHPEGLPHA YTIILTLRLL PDSPSEAFDI
     WQIADKSNKP EVGITMDPSS RTLSFYNKDT RGEIQKVTFD NDEVKKIFHG SFHKLHIAVS
     PKNVKLHVDC QEVAEKEIKE ANNITLDGYE VLGKLVKSRG TRGDSATFQL QMFDIVCSLG
     WTSRDRCCDL PSTRDEAKCP ALPHACTCAQ DSIGPPGPPG PTGSPGNKGP RGERGETGPA
     GPIGPRGELG LPGPMGLPGP QGPNGLSIPG EPGRPGPKGD PGDSGLPGRQ GPPGSPGPIG
     PVGPIGGRGP PGKEGPAGPR GPPGPMGAPG TPGMPGQTGQ PGKPGDTGQR GPAGMKGEKG
     ERGDFASQNM MRSIARQVCE QLVNGQMSRI DTMLNQIPNG YYSNRNNPGP PGPPGPPGSS
     GPRGEQGPTG SNGFPGSPGL PGRPGDRGPA GEKGERGSPG IGSQGQRGLT GPPGPPGDSR
     TGPPGPQGSP GPRGPPGRQG SAGVRGPPGP PGYCDSSQCV GIPYNGQGFP EPYPPEHETY
     VVPVIPVEQP EETELQSSGY TRNQRGKRSL SSKNPQSRSS
//
DBGET integrated database retrieval system