ID A0A401PI59_SCYTO Unreviewed; 2989 AA.
AC A0A401PI59;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Collagen alpha-1(XII) chain {ECO:0008006|Google:ProtNLM};
GN ORFNames=scyTo_0002206 {ECO:0000313|EMBL:GCB72820.1};
OS Scyliorhinus torazame (Cloudy catshark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Elasmobranchii; Galeomorphii; Galeoidea; Carcharhiniformes; Scyliorhinidae;
OC Scyliorhinus.
OX NCBI_TaxID=75743 {ECO:0000313|EMBL:GCB72820.1, ECO:0000313|Proteomes:UP000288216};
RN [1] {ECO:0000313|EMBL:GCB72820.1, ECO:0000313|Proteomes:UP000288216}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=30297745; DOI=.1038/s41559-018-0673-5;
RA Hara Y, Yamaguchi K, Onimaru K, Kadota M, Koyanagi M, Keeley SD, Tatsumi K,
RA Tanaka K, Motone F, Kageyama Y, Nozu R, Adachi N, Nishimura O, Nakagawa R,
RA Tanegashima C, Kiyatake I, Matsumoto R, Murakumo K, Nishida K, Terakita A,
RA Kuratani S, Sato K, Hyodo S Kuraku.S.;
RT "Shark genomes provide insights into elasmobranch evolution and the origin
RT of vertebrates.";
RL Nat. Ecol. Evol. 2:1761-1771(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GCB72820.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFAA01000535; GCB72820.1; -; Genomic_DNA.
DR STRING; 75743.A0A401PI59; -.
DR OMA; YTQTPNM; -.
DR Proteomes; UP000288216; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 16.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 3.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 17.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 4.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF70; PH DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00041; fn3; 17.
DR Pfam; PF00092; VWA; 4.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 17.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 4.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 12.
DR SUPFAM; SSF53300; vWA-like; 4.
DR PROSITE; PS50853; FN3; 16.
DR PROSITE; PS50234; VWFA; 4.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000288216};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..2989
FT /note="Collagen alpha-1(XII) chain"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019170477"
FT DOMAIN 27..117
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 139..311
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 335..426
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 439..611
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 633..722
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 724..814
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 815..906
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 907..994
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 996..1086
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1107..1279
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1295..1384
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1385..1476
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1477..1567
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1568..1662
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1744..1834
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1835..1925
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1926..2016
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2017..2105
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2106..2196
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 2224..2397
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 982..1014
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2194..2213
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2654..2786
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2823..2929
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 985..1010
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2198..2213
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2749..2766
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2989 AA; 327042 MW; AA37BA2BD425747D CRC64;
MKSRLCLAVV AVFAALFAAS VEAQVEPPSN LKFKILTESS VQMTWRRSPS RIRGYRLTLA
PQAAGPAKEM ILPQRASKTT LTDLIPDTEY VVTLIAFDHS SESVPVYGQL TIQTGRTPTN
RPKKIEDLSQ RCSASAVADL VFLVDGSWSV GRANFKKIRE FIYSLSSAFE IGEDKTRVGI
VQYSSDTRTE FDLNRYSQKL ELLNAIVNLP YKGGNTMTGE AINYLVQNTF SEAAGARKSY
PKIAVIITDG KSQDPVTESA EALRNIGVEI FTLGIKGADL DELKLIGSSP LNKHVFKVAD
FDKIRDVQNE IINLVCSGIE EQLSDIVSGE EVVEPPSNLQ ILESASNFLK ITWDSSPGQI
TGYRIHLIPM VTGVQEQSIN TDSVTRTIVA KNLTPDTEYQ VNLYAMKGLA SSEPVIIMEK
TQTVHVSVEC TLDANTQADI VLLVDGSYSI GLSNFAKVKD FLETLVKTFQ VGPDRIQIGL
VQYSRDPYTE FTLMKHSTLD DVVRAVRTFP YRGGSTNTGR AMTYVREKIF VPEKGARFKV
PRVMVLITDG KSSDAFKAPA LRLREAGVEI FAVGVKDAVF SELVTIATPP DNTHVYQVDD
FDSFQRVSTK LTRTLCLRIE EEVKAIRKRD FALAKDLRTS DQTSRSFRVS WTGAGSDVIS
YLLRYKVAEG GDFISIRVPA DQTTKVLTDL LPETTYLVSV VAEYLEGSSF PLDGEDTTLE
EIGLASNLEV LDETTDSFKI RWTAAPGNVL RYRLEYRPVV GGERKEVTVG GLETETILNN
LLPDTKYSVK IVAEYQTATG EPLVGQGTTK EVRGSARNLV TENVTPTTID ASWTSAPGNV
YNYRVTWKSL YDDDSGEKWV PGYSTETTLE NLRPETKYQI MVYASYGSGE ADPLEGEETT
DATTGAKRVT ISDETVSTFR VRWKPAPGNV VNYRLSYRPA VGGRVIATKV PPHLTSTVLR
RLNPQTVYNV SVIPMYREGE GKLRSGQGTT ASPYKPPQNL QTSEPARSSF RVTWEPSPGE
VRGYKVIYHP RGHEEQLGEM VLGPYDTTVV LEELRADTTY RVAVSGMFEG GESLPLLGEE
QTTLSDEPMV IPTEQSGVQC TTKAAADIVL LVDGSWSIGR LNFRQIRSFI AKLVQVFDIS
PRRVQFGLAQ YSGDPRTEWN LNTYRDKQSL LTAVAGLPYK GGNTLTGMAL SYILEKNFKA
DAGARPNARK IGVLITDGKS QDGVDLPSET LREMGVELYA VGIKNADEAE LKTIASDPDS
VHMYNVADFA LLVEIVDDLT TNLCNSVKGP NDLYPPTNLV TSEPTHQSFR VTWDHSDSNF
DRYRVEYQPV SGGRTEEVLV SGRTKTTVLT NLQPETEYLV NVYGLLEGEI SEPLTGTETT
MPIPGVRNLN VYDVTPTTMN VKWEPATGAS GYMLLYAPVN ASIPTVEKEL KVGPDNTDLK
LENLFPNTEY TVTIHALYGD VPSDPLSVHE TTLPVDEPRN IWFSDITHST INAHWDPAPG
KVRKYLIKYK EPEDENIKEV EVPGSETSVP LTGLTSQTEY EISVTAVYDY GPSNPLTGRE
NTLVVPAPSS LRFSDIRENR FRIHWDHGAR DVALYRLSWV PSGGRDKKEM IINGDEDSQV
LDNLNPDTLY DVSLTAIYPD EMESEDLISS QRTLSIRTTP HTPVTPTAPR NLQVYNATSH
SLTVKWDPAA GRVRGYRVIY APMTGDPIDE MVIPIVFNHS HVFHIVLHMW KASVRDSKAE
PLASVRNLRV YNPTMSSLNV IWDPAEGVVR RYKIHYVPTT GTGNEEVVTV PGNTHSTVLK
SLNADTPYEV NVVPVFNEGE GARRSSTGRT LIRGAPGSVQ VFNPSTNSLN VRWTSAPGPV
QQYRVIYTPR TGTRPAHYIT VPANTNNVLL DRLQSDTEYS VNVVPMYANG EGDPGTDVGK
TLPRGGPRNM RVYDATTNSL SVSWDHAEGP VQQYRIVYAP TVGDPIEEFT FVPGRRNNVL
LQPLTADTPY RISVVAMYED GDGGQLTGDG KTVGLLEPRN LRVSDEWYTR FRVTWDPAPS
PVLGYKLVYK PTGTNEKMEL FVGDVTSYTP QNLKPGTPYD VDVYAVYDSG SSGPLAGQGT
TLYLNVTGLR TYKVDWNTFC IEWNPHRAAT SYRIKLQPVD AYSNGHQEVT ISGAESTHCF
TGLSPDSLYD ATVYTQLPNL EGPGMQLQER TVVRPTEAPT EPPSPPPPPT IPPAREVCMG
AKADLVFLID GSWSIGDENF HKILQFCFDT IGALDNIGPI GMQISLVQFS DDAKAEFKLD
TYSDKGMTLA ALQIIHYKGG NTKTGRALEY LYDHVFVYPN GMRKTVPKVL VVVTDGRSQD
EVKKPALALQ EAGYSVFVVG VADIDVTELK NIGSKPSTRH LFLVDDFDAF EKIQDELITF
LCETATSTCP LIYLNGFTTP GFRMLESFNL TEKDAPSVAG VSTVPGSFNS YTAYSIHKDA
HLLQPTIEIH PNGLPITYTI MMLFRLLPTT TSEPFAIWQI TDQDYKPEVG VLLDGNSKTL
SYFNKDERGE SQTITFDTED MKKLFYGNFH KVHIMVNQNT VKLIVDCQEI EEKMANPPGN
ITTDGYEILG KLAKSRGPKG KSAPFEIQSF DIICSLGWVL RDKCCDLPSK RDEAKCPALP
HACTCAQANI GPPGPPGPSG QGGSKGPRGE RGPHGQPGPP GTRGELGPPG PQGLPGPQGA
NGLSLPGEPG RSGVKGDAGE PGLVGRTGFP GIQGPPGPLG PRGPIGTTGV SGQPGQRGSQ
GQKGDLGSLG PTGPKGEKGD RGDFAPQNMM RSISRQVCEQ LMNNHMRRVN SLINQIPNGY
YSNRAVAGPP GSPGSPGTNG EVGETGPPGP PGFPGTPADQ GRPGDRGAPG EKGEKGSQGN
GKPGQRGLPG PPGPQGQSRT GPTGLPGPMG PLGSPGRPGR YGSRGPSGPP GYCDSSMCAG
IPYNGYPGRY EPQPYRPETH VVPIERREEE VIDQTETEIQ SPGFSRLHS
//