GenomeNet

Database: UniProt
Entry: A0A401PI59_SCYTO
LinkDB: A0A401PI59_SCYTO
Original site: A0A401PI59_SCYTO 
ID   A0A401PI59_SCYTO        Unreviewed;      2989 AA.
AC   A0A401PI59;
DT   08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT   08-MAY-2019, sequence version 1.
DT   27-MAR-2024, entry version 16.
DE   RecName: Full=Collagen alpha-1(XII) chain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=scyTo_0002206 {ECO:0000313|EMBL:GCB72820.1};
OS   Scyliorhinus torazame (Cloudy catshark).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC   Elasmobranchii; Galeomorphii; Galeoidea; Carcharhiniformes; Scyliorhinidae;
OC   Scyliorhinus.
OX   NCBI_TaxID=75743 {ECO:0000313|EMBL:GCB72820.1, ECO:0000313|Proteomes:UP000288216};
RN   [1] {ECO:0000313|EMBL:GCB72820.1, ECO:0000313|Proteomes:UP000288216}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA   Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA   Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA   Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA   Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT   "Shark genomes provide insights into elasmobranch evolution and the origin
RT   of vertebrates.";
RL   Nat. Ecol. Evol. 2:1761-1771(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GCB72820.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BFAA01000535; GCB72820.1; -; Genomic_DNA.
DR   STRING; 75743.A0A401PI59; -.
DR   OMA; YTQTPNM; -.
DR   Proteomes; UP000288216; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 16.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 3.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 17.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF00041; fn3; 17.
DR   Pfam; PF00092; VWA; 4.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 17.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 4.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 12.
DR   SUPFAM; SSF53300; vWA-like; 4.
DR   PROSITE; PS50853; FN3; 16.
DR   PROSITE; PS50234; VWFA; 4.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000288216};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..2989
FT                   /note="Collagen alpha-1(XII) chain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5019170477"
FT   DOMAIN          27..117
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          139..311
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          335..426
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          439..611
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          633..722
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          724..814
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          815..906
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          907..994
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          996..1086
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1107..1279
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1295..1384
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1385..1476
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1477..1567
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1568..1662
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1744..1834
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1835..1925
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1926..2016
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2017..2105
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2106..2196
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          2224..2397
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          982..1014
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2194..2213
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2654..2786
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2823..2929
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        985..1010
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2198..2213
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2749..2766
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2989 AA;  327042 MW;  AA37BA2BD425747D CRC64;
     MKSRLCLAVV AVFAALFAAS VEAQVEPPSN LKFKILTESS VQMTWRRSPS RIRGYRLTLA
     PQAAGPAKEM ILPQRASKTT LTDLIPDTEY VVTLIAFDHS SESVPVYGQL TIQTGRTPTN
     RPKKIEDLSQ RCSASAVADL VFLVDGSWSV GRANFKKIRE FIYSLSSAFE IGEDKTRVGI
     VQYSSDTRTE FDLNRYSQKL ELLNAIVNLP YKGGNTMTGE AINYLVQNTF SEAAGARKSY
     PKIAVIITDG KSQDPVTESA EALRNIGVEI FTLGIKGADL DELKLIGSSP LNKHVFKVAD
     FDKIRDVQNE IINLVCSGIE EQLSDIVSGE EVVEPPSNLQ ILESASNFLK ITWDSSPGQI
     TGYRIHLIPM VTGVQEQSIN TDSVTRTIVA KNLTPDTEYQ VNLYAMKGLA SSEPVIIMEK
     TQTVHVSVEC TLDANTQADI VLLVDGSYSI GLSNFAKVKD FLETLVKTFQ VGPDRIQIGL
     VQYSRDPYTE FTLMKHSTLD DVVRAVRTFP YRGGSTNTGR AMTYVREKIF VPEKGARFKV
     PRVMVLITDG KSSDAFKAPA LRLREAGVEI FAVGVKDAVF SELVTIATPP DNTHVYQVDD
     FDSFQRVSTK LTRTLCLRIE EEVKAIRKRD FALAKDLRTS DQTSRSFRVS WTGAGSDVIS
     YLLRYKVAEG GDFISIRVPA DQTTKVLTDL LPETTYLVSV VAEYLEGSSF PLDGEDTTLE
     EIGLASNLEV LDETTDSFKI RWTAAPGNVL RYRLEYRPVV GGERKEVTVG GLETETILNN
     LLPDTKYSVK IVAEYQTATG EPLVGQGTTK EVRGSARNLV TENVTPTTID ASWTSAPGNV
     YNYRVTWKSL YDDDSGEKWV PGYSTETTLE NLRPETKYQI MVYASYGSGE ADPLEGEETT
     DATTGAKRVT ISDETVSTFR VRWKPAPGNV VNYRLSYRPA VGGRVIATKV PPHLTSTVLR
     RLNPQTVYNV SVIPMYREGE GKLRSGQGTT ASPYKPPQNL QTSEPARSSF RVTWEPSPGE
     VRGYKVIYHP RGHEEQLGEM VLGPYDTTVV LEELRADTTY RVAVSGMFEG GESLPLLGEE
     QTTLSDEPMV IPTEQSGVQC TTKAAADIVL LVDGSWSIGR LNFRQIRSFI AKLVQVFDIS
     PRRVQFGLAQ YSGDPRTEWN LNTYRDKQSL LTAVAGLPYK GGNTLTGMAL SYILEKNFKA
     DAGARPNARK IGVLITDGKS QDGVDLPSET LREMGVELYA VGIKNADEAE LKTIASDPDS
     VHMYNVADFA LLVEIVDDLT TNLCNSVKGP NDLYPPTNLV TSEPTHQSFR VTWDHSDSNF
     DRYRVEYQPV SGGRTEEVLV SGRTKTTVLT NLQPETEYLV NVYGLLEGEI SEPLTGTETT
     MPIPGVRNLN VYDVTPTTMN VKWEPATGAS GYMLLYAPVN ASIPTVEKEL KVGPDNTDLK
     LENLFPNTEY TVTIHALYGD VPSDPLSVHE TTLPVDEPRN IWFSDITHST INAHWDPAPG
     KVRKYLIKYK EPEDENIKEV EVPGSETSVP LTGLTSQTEY EISVTAVYDY GPSNPLTGRE
     NTLVVPAPSS LRFSDIRENR FRIHWDHGAR DVALYRLSWV PSGGRDKKEM IINGDEDSQV
     LDNLNPDTLY DVSLTAIYPD EMESEDLISS QRTLSIRTTP HTPVTPTAPR NLQVYNATSH
     SLTVKWDPAA GRVRGYRVIY APMTGDPIDE MVIPIVFNHS HVFHIVLHMW KASVRDSKAE
     PLASVRNLRV YNPTMSSLNV IWDPAEGVVR RYKIHYVPTT GTGNEEVVTV PGNTHSTVLK
     SLNADTPYEV NVVPVFNEGE GARRSSTGRT LIRGAPGSVQ VFNPSTNSLN VRWTSAPGPV
     QQYRVIYTPR TGTRPAHYIT VPANTNNVLL DRLQSDTEYS VNVVPMYANG EGDPGTDVGK
     TLPRGGPRNM RVYDATTNSL SVSWDHAEGP VQQYRIVYAP TVGDPIEEFT FVPGRRNNVL
     LQPLTADTPY RISVVAMYED GDGGQLTGDG KTVGLLEPRN LRVSDEWYTR FRVTWDPAPS
     PVLGYKLVYK PTGTNEKMEL FVGDVTSYTP QNLKPGTPYD VDVYAVYDSG SSGPLAGQGT
     TLYLNVTGLR TYKVDWNTFC IEWNPHRAAT SYRIKLQPVD AYSNGHQEVT ISGAESTHCF
     TGLSPDSLYD ATVYTQLPNL EGPGMQLQER TVVRPTEAPT EPPSPPPPPT IPPAREVCMG
     AKADLVFLID GSWSIGDENF HKILQFCFDT IGALDNIGPI GMQISLVQFS DDAKAEFKLD
     TYSDKGMTLA ALQIIHYKGG NTKTGRALEY LYDHVFVYPN GMRKTVPKVL VVVTDGRSQD
     EVKKPALALQ EAGYSVFVVG VADIDVTELK NIGSKPSTRH LFLVDDFDAF EKIQDELITF
     LCETATSTCP LIYLNGFTTP GFRMLESFNL TEKDAPSVAG VSTVPGSFNS YTAYSIHKDA
     HLLQPTIEIH PNGLPITYTI MMLFRLLPTT TSEPFAIWQI TDQDYKPEVG VLLDGNSKTL
     SYFNKDERGE SQTITFDTED MKKLFYGNFH KVHIMVNQNT VKLIVDCQEI EEKMANPPGN
     ITTDGYEILG KLAKSRGPKG KSAPFEIQSF DIICSLGWVL RDKCCDLPSK RDEAKCPALP
     HACTCAQANI GPPGPPGPSG QGGSKGPRGE RGPHGQPGPP GTRGELGPPG PQGLPGPQGA
     NGLSLPGEPG RSGVKGDAGE PGLVGRTGFP GIQGPPGPLG PRGPIGTTGV SGQPGQRGSQ
     GQKGDLGSLG PTGPKGEKGD RGDFAPQNMM RSISRQVCEQ LMNNHMRRVN SLINQIPNGY
     YSNRAVAGPP GSPGSPGTNG EVGETGPPGP PGFPGTPADQ GRPGDRGAPG EKGEKGSQGN
     GKPGQRGLPG PPGPQGQSRT GPTGLPGPMG PLGSPGRPGR YGSRGPSGPP GYCDSSMCAG
     IPYNGYPGRY EPQPYRPETH VVPIERREEE VIDQTETEIQ SPGFSRLHS
//
DBGET integrated database retrieval system