GenomeNet

Database: UniProt
Entry: A0A401RJN0_CHIPU
LinkDB: A0A401RJN0_CHIPU
Original site: A0A401RJN0_CHIPU 
ID   A0A401RJN0_CHIPU        Unreviewed;      2969 AA.
AC   A0A401RJN0;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   27-MAR-2024, entry version 17.
DE   RecName: Full=Collagen alpha-1(XII) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=chiPu_0017897 {ECO:0000313|EMBL:GCC18340.1};
OS   Chiloscyllium punctatum (Brownbanded bambooshark) (Hemiscyllium punctatum).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC   Elasmobranchii; Galeomorphii; Galeoidea; Orectolobiformes; Hemiscylliidae;
OC   Chiloscyllium.
OX   NCBI_TaxID=137246 {ECO:0000313|EMBL:GCC18340.1, ECO:0000313|Proteomes:UP000287033};
RN   [1] {ECO:0000313|EMBL:GCC18340.1, ECO:0000313|Proteomes:UP000287033}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA   Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA   Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA   Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA   Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT   "Shark genomes provide insights into elasmobranch evolution and the origin
RT   of vertebrates.";
RL   Nat. Ecol. Evol. 2:1761-1771(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GCC18340.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BEZZ01001406; GCC18340.1; -; Genomic_DNA.
DR   STRING; 137246.A0A401RJN0; -.
DR   OMA; WKRPPDE; -.
DR   Proteomes; UP000287033; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 16.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 4.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 17.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF00041; fn3; 17.
DR   Pfam; PF00092; VWA; 4.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 17.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 4.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 11.
DR   SUPFAM; SSF53300; vWA-like; 4.
DR   PROSITE; PS50853; FN3; 17.
DR   PROSITE; PS50234; VWFA; 4.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000287033};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..2969
FT                   /note="Collagen alpha-1(XII) chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019433026"
FT   DOMAIN          27..117
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          139..311
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          335..427
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          439..611
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          633..722
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          724..814
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          815..906
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          907..994
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          996..1086
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1107..1279
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1295..1384
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1385..1476
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1477..1567
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1568..1664
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1667..1756
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1758..1848
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1849..1940
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1941..2030
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2031..2119
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2120..2210
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2238..2411
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          982..1014
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2666..2767
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2803..2921
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2950..2969
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        985..1010
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2719..2734
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2869..2897
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2969 AA;  323541 MW;  C1E91A583D72A356 CRC64;
     MKSRLCLAVA ALLAALSAPS VEAQVEPPSQ LKFKILTENS VQMMWRRSPS RIQGYRLTLA
     PTTVGSAKEM ILPQGASKTT LTDLIPDVEY VVTLVAFDRS SESVPVYGQL TIQSGRTPTK
     RPKKLEELSK KCSASAVADL VFLVDGSWSV GRANFKKIRE FIYSLVSAFE IGEDKTRVGI
     VQYSSDTRTE FDLNRYNRKQ ELLSAITNLP YKGGNTMTGE AINYLVQNTF SETAGARKSY
     PKIAVIITDG KSQDPVTESA EALRNIGVEV FTLGIKGADL DELKLIGSSP LNKHVFKVAD
     FDNIQDVQNE IINLVCSGIE EQLSDIVSGE EVVEPPSNLQ VLEIASNFMK LVWDSSPGQV
     TGYRVHLIPM VAGIPEQSIN VDARTRTAVV KDLTAETEYQ VNLYAMRGLA SSEPVSLMEK
     TQPVKESVEC TLDETTQADV VLLVDGSYSI GLSNFAKVKD FLETLVKTFQ VGPNRIQIGL
     VQYSREPHTE FTLMRHQTLE DVLSAIRTFP YRGGSTNTGR AMTYVRQKVF VPEKGARFNV
     PRVMVLITDG KSSDAFKEPA LKLRDAGVEI FAVGVKDAVF SELVAIASPP ENTHVYQVDD
     FDSFQRVSTK LTQTLCLRIV EEVKAIRARA FTAATDLRTS EVTSRSFRVS WTGAGPDVLS
     YLLKYKVAVG GDFTSIRVPA DQTTRVLTDL LPETTYLVNV IAEYTEGSSL PLEGEETTLE
     EVGSPSNLEV LDETTDSFKV RWTAAPGNVQ RYRVEYRPVL GGESKEVTVG GLETWTTLQN
     LLPDTKYSVT IIPEYQTLVG QPLVGEGTTK EARGSARNLV TENVTPTTID VSWTSAPGNV
     YNYRVTWKSL FDDDSGEKWV PGYSTTTTLE NLRPETKYQI RIYASYGSGE GDPLEGTETT
     DATPEAKTLT ISDEMVNSFR VRWLPAPGNV LNYRLSYRPA TGGRAIGTKI PSHVTTTVLR
     RLNPQTTYNI SVIPMYRKAE GKLRSGQGTT ASPYKPPQNL QTSEPARTSF RVTWDPSPGE
     VTGYKVIYHP RGREEQQGEM VLGPYDTTVV LEELRAGTTY KVAVSGMFEG GESLPLFGEE
     QTTLSDEALE KPLGPPGLQC TTRAAADIVL LVDGSWSIGR LNFRQIRNFI SKLVQVFDIG
     PRRVQFGLAQ YSGDPRTEWN LNQHRDKKSL LDAVAGLPYK GGNTLTGMAL SHILQNNFNT
     DAGARPNARK IGVLITDGKS QDDVGIPSDT LRNMGVELYA VGIKNADEAE LKQIATDPDS
     IHVYTVADFA LLTEIVDDLS TNLCNSVKGP GDLNPPTNLV TSEATHHTFR VTWDHSDSNF
     DRYRVEYQPV AGGKPEEVLV NGRTKTTVLR NLQPDTEYVV NVYGLLEGEV SEPLMGTETT
     LPIPGVRNLN VYDITPTTMN VRWEPARGAS GYMLLYTPMN ASQPAVQKEL KVGPDNTDVR
     LDNLFPNTEY TLTIHALYED VPSDPLSVQA TTLPLGGPRN IWFSDVTHSH LRAHWDPAPG
     KVLKYVIKYN EVGDETVKEV EIPGNENSVP LSGLASQTEY KVAVTAVHDH GPSAPLIGRE
     NTLVVPAPSN LRFTDIGERR FRVHWDHGAR DVALYRLSWV PSGGGERKEM IINGEDNSQV
     LDSLTPDTLY DVSLTAIYPD EMESEDLLGS QRTLPKTIPI TPSTPTPPRN LQVYNATSHS
     LTVKWDPAIG RVRGYRVIYA PMTGDPIDET VTVGARQNSV VLQNLDPDTP YSIKVVSVSR
     AGDGGQIIGN GRTKPLASVR NLRVYNPTTS SLNVIWDPAE GVVRRYKIHY VPTTGVGNEE
     VVTVPGNTHN AVLRSLNPDT PYKVTVVPVY SEGEGARRSS TGRTLIRGAP GNVQVFNPSP
     NSLNVRWTSA PGPVQQYRVV YAPLTGTQPP QYVVVPGNTN NVLLQRLQPD TKYSVNVVAM
     YADGEGDPQA SEGQTLTRSG PRNMRVYDAT TNSLTVSWDH AEGPVQQYRI IYAPTVGDPI
     EEFTFVPGRR NTVTLQPLTA DTPYRISVVA MYEDGDGGQL TGDGRTVGLL EPRNLRVSDE
     WYTRFRVTWD PAPSPVLGYK LAYQPTGSDE KMELFVGDVT SYTPQNLKPG TPYDVNVYAI
     YDSGSSGPLA GQGTTLYLNV TGINTYNIGW DTFCMQWNSH RAATSYRIKL QPVNAYAGGH
     QEVTISGAET SHCFTGLSPD SLYEAVIYTQ LPNLEGPGVQ VQERTLLRPT EVPTEPPSPP
     PPPTIPPARE VCLGAKADLV FLIDGSWSIG DENFHKILQF CFDTIGALDN ISPQGMQVSL
     VQFSDDAKAE FKLDTYRDKG IVLAALQLIQ YKGGNTKTGR ALKFLRDHVF VSQNGMRRTV
     PKVLVVVTDG RSQDEVKKPA LLLQEAGYSV FVVGVADIDV TELKNIGSKP SERHLFIVDD
     FDAFAKIQDE LITFLCETAT STCPLIYLNG FTTPGYRMLE SFNLTEKDAA SVPGVSTGPG
     SFNGYTAYSL HKDAHLLQPT IDIHPDGLPP TYTIMILFRL LPTTTSEPFA IWQITDQDYK
     PEVGVLLDGN SKTLSYFNKD ERGESQTITF DNEDMKKLFY GNFHKVHIMV NQNTVKLIVD
     CEEVEEKTAN PPGNISTDGY EILGKLAKTR GPKGRSAPFE IQSFDIICSL GWVMRDKCCD
     IPSMRDEAKC PALPHACTCA QDSIGPPGPP GPSGRPGMKG DSGEPGLPGR QGPPGRVGPP
     GTAGTEGPRG FPGKQGPIGP SGPRGPPGPI GVSGPPGQPG SQGPKGEMGS AGQTGPKGEK
     GDRGDFAPQN MMRAISRQVC EQLMNNHMRR VNSLINQIPN GYYSNRAVPG PPGPPGPPGS
     AGQVGETGPP GPPGFPGSPG DQGRAGERGS PGEKGEQGSS GIGKPGQRGL PGPPGPPGQS
     RTGPTGPPGP RGPIGPPGRP GRYGSRGPAG PPGYCDSSMC AGIPYNGYPG PYEPQPYRPE
     THVVPIEREE EEDQIETEIQ SAGFSRAYS
//
DBGET integrated database retrieval system