ID A0A1S3FHG3_DIPOR Unreviewed; 1625 AA.
AC A0A1S3FHG3;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Collagen alpha-1(XVI) chain {ECO:0000313|RefSeq:XP_012875941.1};
GN Name=Col16a1 {ECO:0000313|RefSeq:XP_012875941.1};
OS Dipodomys ordii (Ord's kangaroo rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Castorimorpha; Heteromyidae;
OC Dipodomyinae; Dipodomys.
OX NCBI_TaxID=10020 {ECO:0000313|Proteomes:UP000081671, ECO:0000313|RefSeq:XP_012875941.1};
RN [1] {ECO:0000313|RefSeq:XP_012875941.1}
RP IDENTIFICATION.
RC TISSUE=Kidney {ECO:0000313|RefSeq:XP_012875941.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_012875941.1; XM_013020487.1.
DR STRING; 10020.ENSDORP00000013538; -.
DR GeneID; 105988760; -.
DR KEGG; dord:105988760; -.
DR CTD; 1307; -.
DR InParanoid; A0A1S3FHG3; -.
DR OrthoDB; 4252592at2759; -.
DR Proteomes; UP000081671; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 7.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119,
KW ECO:0000313|RefSeq:XP_012875941.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000081671};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1625
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010248102"
FT DOMAIN 50..231
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 292..359
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 371..557
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 585..960
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1022..1451
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1490..1604
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..473
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 926..943
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1039..1053
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1063..1080
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1181..1198
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1304..1323
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1625 AA; 160097 MW; 8801627465199C68 CRC64;
MWVSWAPGLW LLSLWATFIH RINAGEQCPP SQPEGLKLEH SSDLSANVTG FNLIRRLNLM
KTSAIKKIRN PKGPLILRLG TASVTQPTRR VFPRGLPEEF ALVLTLLLKK HTYQNTWYLF
QVTDAAGYPQ ISLEVNSQER SLELRAQGQD GDFVSCIFPV PQLFDLRWHK LMLSVAGRVA
SVHVDCTSAS SQPLGPRRPT RPVGHVFLGL DAEQGKPVSF DLQQAHIYCD PELVLEEGCC
EISPGGCPLE SSKSRRDTQS NELIEINPQT EGKVYTRCFC LEEPQSSKVD AQLTGRANQK
AERRAQVHRE TAADECPPCA YGAQRSNVTL GPSEPQGGKG ERGLPGPSGP KGEKGARGND
CVRISQDAPL QCAEGPKGEK GEAGGVGPSG LPGSIGQKGQ KGEKGDGGLK GLPGKPGRDG
RPGEICVIGP KGQKGDPGFV GPEGLAGEPG PPGLPGPPGI GLPGTPGEPG GPPGPKGDKG
SSGIPGKEGP GGKPGKPGVP GTKGEKGDPC EVCPTLPEGF QNFVGLPGKP GPKGEPGDPA
SAREGLGTVG LKGDRGDPGI QGLKGEKGEP CLSCSTAVGA QHLGPSTGAN GDVGSPGLGL
PGLPGKAGVP GPRGLKGEKG NFGEPGPAGS PGPPGQVGPA GIKGAKGEPC EPCSALSRTQ
DGDSHVVHLP GPVGEKGEPG PPGFGLPGKQ GKAGERGLKG QKGDAGSPGD PGTPGTTGQP
GLSGEPGIRG PMGPKGEKGD GCTACPNLQG ALTDMAGLPG KPGPKGEQGP EGVGRPGKPG
QPGLPGVQGP PGLKGTQGEP GPPGRGVQGP QGEPGVQGLP GVQGPPGPQG PPGRTGEKGV
QGPPGMKGAT GPMGPTGASV SGPPGQDGPQ GQTGLPGARG TPGEKGSRGE KGEPGECSCP
SRGDPIFSGM PGAPGLWMGS SSQPGPQGPP GVPGPPGPPG MPGLQGVPGN NGLPGQPGLT
AELGSLPIEQ HLIKSICGDC AQGQMAQPAS VLVKGEKGDQ GIPGVPGLDN CARCFLERER
PRAEEARGDN GEGDPGCPGS PGLPGPPGVP GQRGEEGPPG MRGSPGLPGP IGPPGFPGAV
GSPGLPGLQG ERGLTGLTGD KGEPGPPGQP GYPGAMGPPG LPGIKGERGY TGPAGEKGEL
GPPGSEGLPG PAGPAGPRGE RGPQGAAGEK GDQGFQGQPG FPGLPGPPGF PGKVGAPGPP
GPQAEKGSEG IRGPSGLPGS PGPPGPPGIQ GPTGLDGLDG KDGKPGLRGD PGPAGPPGLM
GPPGFKGKTG HPGLPGPKGD CGKPGPPGSS GRPGAEGEPG AMGPQGRPGP PGHVGPAGPP
GQPGPAGISA LGLKGDRGAP GERGLAGLPG QPGTPGHPGP PGEPGTDGAA GKEGPPGKQG
LYGPPGPKGD PGPAGQKGQA GEKGRSGMPG GPGKSGSMGP VGPPGPAGER GHPGSPGPAG
SPGLPGMPGS MGDMVNYDEI KRYIRQEIIK MFDERMAYYT SRMQFPMEVA AAPGRPGPPG
KDGAPGRPGA PGSPGLPGQI GREGRQGLPG MRGLPGTKGE KGDTGVGIAG ENGLPGPPGP
QGPPGYGKMG ATGPMGQQGI PGIPGPPGPT GQPGKAGHCN PSDCFGAMPM EQQYPPMKNM
KGPFG
//