ID A0A401RJN0_CHIPU Unreviewed; 2969 AA.
AC A0A401RJN0;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=Collagen alpha-1(XII) chain {ECO:0008006|Google:ProtNLM};
GN ORFNames=chiPu_0017897 {ECO:0000313|EMBL:GCC18340.1};
OS Chiloscyllium punctatum (Brownbanded bambooshark) (Hemiscyllium punctatum).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Elasmobranchii; Galeomorphii; Galeoidea; Orectolobiformes; Hemiscylliidae;
OC Chiloscyllium.
OX NCBI_TaxID=137246 {ECO:0000313|EMBL:GCC18340.1, ECO:0000313|Proteomes:UP000287033};
RN [1] {ECO:0000313|EMBL:GCC18340.1, ECO:0000313|Proteomes:UP000287033}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30297745; DOI=10.1038/s41559-018-0673-5;
RA Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA Kuratani S, Sato K, Hyodo S, Kuraku S;
RT "Shark genomes provide insights into elasmobranch evolution and the origin
RT of vertebrates.";
RL Nat. Ecol. Evol. 2:1761-1771(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GCC18340.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BEZZ01001406; GCC18340.1; -; Genomic_DNA.
DR STRING; 137246.A0A401RJN0; -.
DR OMA; WKRPPDE; -.
DR Proteomes; UP000287033; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 16.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 17.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00041; fn3; 17.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 17.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 11.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 17.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000287033};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2969
FT /note="Collagen alpha-1(XII) chain"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019433026"
FT DOMAIN 27..117
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 139..311
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 335..427
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 439..611
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 633..722
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 724..814
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 815..906
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 907..994
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 996..1086
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1107..1279
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1295..1384
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1385..1476
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1477..1567
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1568..1664
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1667..1756
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1758..1848
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1849..1940
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1941..2030
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2031..2119
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2120..2210
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2238..2411
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 982..1014
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2666..2767
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2803..2921
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2950..2969
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 985..1010
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2719..2734
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2869..2897
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2969 AA; 323541 MW; C1E91A583D72A356 CRC64;
MKSRLCLAVA ALLAALSAPS VEAQVEPPSQ LKFKILTENS VQMMWRRSPS RIQGYRLTLA
PTTVGSAKEM ILPQGASKTT LTDLIPDVEY VVTLVAFDRS SESVPVYGQL TIQSGRTPTK
RPKKLEELSK KCSASAVADL VFLVDGSWSV GRANFKKIRE FIYSLVSAFE IGEDKTRVGI
VQYSSDTRTE FDLNRYNRKQ ELLSAITNLP YKGGNTMTGE AINYLVQNTF SETAGARKSY
PKIAVIITDG KSQDPVTESA EALRNIGVEV FTLGIKGADL DELKLIGSSP LNKHVFKVAD
FDNIQDVQNE IINLVCSGIE EQLSDIVSGE EVVEPPSNLQ VLEIASNFMK LVWDSSPGQV
TGYRVHLIPM VAGIPEQSIN VDARTRTAVV KDLTAETEYQ VNLYAMRGLA SSEPVSLMEK
TQPVKESVEC TLDETTQADV VLLVDGSYSI GLSNFAKVKD FLETLVKTFQ VGPNRIQIGL
VQYSREPHTE FTLMRHQTLE DVLSAIRTFP YRGGSTNTGR AMTYVRQKVF VPEKGARFNV
PRVMVLITDG KSSDAFKEPA LKLRDAGVEI FAVGVKDAVF SELVAIASPP ENTHVYQVDD
FDSFQRVSTK LTQTLCLRIV EEVKAIRARA FTAATDLRTS EVTSRSFRVS WTGAGPDVLS
YLLKYKVAVG GDFTSIRVPA DQTTRVLTDL LPETTYLVNV IAEYTEGSSL PLEGEETTLE
EVGSPSNLEV LDETTDSFKV RWTAAPGNVQ RYRVEYRPVL GGESKEVTVG GLETWTTLQN
LLPDTKYSVT IIPEYQTLVG QPLVGEGTTK EARGSARNLV TENVTPTTID VSWTSAPGNV
YNYRVTWKSL FDDDSGEKWV PGYSTTTTLE NLRPETKYQI RIYASYGSGE GDPLEGTETT
DATPEAKTLT ISDEMVNSFR VRWLPAPGNV LNYRLSYRPA TGGRAIGTKI PSHVTTTVLR
RLNPQTTYNI SVIPMYRKAE GKLRSGQGTT ASPYKPPQNL QTSEPARTSF RVTWDPSPGE
VTGYKVIYHP RGREEQQGEM VLGPYDTTVV LEELRAGTTY KVAVSGMFEG GESLPLFGEE
QTTLSDEALE KPLGPPGLQC TTRAAADIVL LVDGSWSIGR LNFRQIRNFI SKLVQVFDIG
PRRVQFGLAQ YSGDPRTEWN LNQHRDKKSL LDAVAGLPYK GGNTLTGMAL SHILQNNFNT
DAGARPNARK IGVLITDGKS QDDVGIPSDT LRNMGVELYA VGIKNADEAE LKQIATDPDS
IHVYTVADFA LLTEIVDDLS TNLCNSVKGP GDLNPPTNLV TSEATHHTFR VTWDHSDSNF
DRYRVEYQPV AGGKPEEVLV NGRTKTTVLR NLQPDTEYVV NVYGLLEGEV SEPLMGTETT
LPIPGVRNLN VYDITPTTMN VRWEPARGAS GYMLLYTPMN ASQPAVQKEL KVGPDNTDVR
LDNLFPNTEY TLTIHALYED VPSDPLSVQA TTLPLGGPRN IWFSDVTHSH LRAHWDPAPG
KVLKYVIKYN EVGDETVKEV EIPGNENSVP LSGLASQTEY KVAVTAVHDH GPSAPLIGRE
NTLVVPAPSN LRFTDIGERR FRVHWDHGAR DVALYRLSWV PSGGGERKEM IINGEDNSQV
LDSLTPDTLY DVSLTAIYPD EMESEDLLGS QRTLPKTIPI TPSTPTPPRN LQVYNATSHS
LTVKWDPAIG RVRGYRVIYA PMTGDPIDET VTVGARQNSV VLQNLDPDTP YSIKVVSVSR
AGDGGQIIGN GRTKPLASVR NLRVYNPTTS SLNVIWDPAE GVVRRYKIHY VPTTGVGNEE
VVTVPGNTHN AVLRSLNPDT PYKVTVVPVY SEGEGARRSS TGRTLIRGAP GNVQVFNPSP
NSLNVRWTSA PGPVQQYRVV YAPLTGTQPP QYVVVPGNTN NVLLQRLQPD TKYSVNVVAM
YADGEGDPQA SEGQTLTRSG PRNMRVYDAT TNSLTVSWDH AEGPVQQYRI IYAPTVGDPI
EEFTFVPGRR NTVTLQPLTA DTPYRISVVA MYEDGDGGQL TGDGRTVGLL EPRNLRVSDE
WYTRFRVTWD PAPSPVLGYK LAYQPTGSDE KMELFVGDVT SYTPQNLKPG TPYDVNVYAI
YDSGSSGPLA GQGTTLYLNV TGINTYNIGW DTFCMQWNSH RAATSYRIKL QPVNAYAGGH
QEVTISGAET SHCFTGLSPD SLYEAVIYTQ LPNLEGPGVQ VQERTLLRPT EVPTEPPSPP
PPPTIPPARE VCLGAKADLV FLIDGSWSIG DENFHKILQF CFDTIGALDN ISPQGMQVSL
VQFSDDAKAE FKLDTYRDKG IVLAALQLIQ YKGGNTKTGR ALKFLRDHVF VSQNGMRRTV
PKVLVVVTDG RSQDEVKKPA LLLQEAGYSV FVVGVADIDV TELKNIGSKP SERHLFIVDD
FDAFAKIQDE LITFLCETAT STCPLIYLNG FTTPGYRMLE SFNLTEKDAA SVPGVSTGPG
SFNGYTAYSL HKDAHLLQPT IDIHPDGLPP TYTIMILFRL LPTTTSEPFA IWQITDQDYK
PEVGVLLDGN SKTLSYFNKD ERGESQTITF DNEDMKKLFY GNFHKVHIMV NQNTVKLIVD
CEEVEEKTAN PPGNISTDGY EILGKLAKTR GPKGRSAPFE IQSFDIICSL GWVMRDKCCD
IPSMRDEAKC PALPHACTCA QDSIGPPGPP GPSGRPGMKG DSGEPGLPGR QGPPGRVGPP
GTAGTEGPRG FPGKQGPIGP SGPRGPPGPI GVSGPPGQPG SQGPKGEMGS AGQTGPKGEK
GDRGDFAPQN MMRAISRQVC EQLMNNHMRR VNSLINQIPN GYYSNRAVPG PPGPPGPPGS
AGQVGETGPP GPPGFPGSPG DQGRAGERGS PGEKGEQGSS GIGKPGQRGL PGPPGPPGQS
RTGPTGPPGP RGPIGPPGRP GRYGSRGPAG PPGYCDSSMC AGIPYNGYPG PYEPQPYRPE
THVVPIEREE EEDQIETEIQ SAGFSRAYS
//