ID L9JEA6_TUPCH Unreviewed; 1773 AA.
AC L9JEA6;
DT 03-APR-2013, integrated into UniProtKB/TrEMBL.
DT 03-APR-2013, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Collagen alpha-1(XVI) chain {ECO:0000313|EMBL:ELW48639.1};
GN ORFNames=TREES_T100020963 {ECO:0000313|EMBL:ELW48639.1};
OS Tupaia chinensis (Chinese tree shrew).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Scandentia; Tupaiidae; Tupaia.
OX NCBI_TaxID=246437 {ECO:0000313|EMBL:ELW48639.1, ECO:0000313|Proteomes:UP000011518};
RN [1] {ECO:0000313|Proteomes:UP000011518}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Zhang G., Fan Y., Yao Y., Huang Z.;
RT "Genome of the Chinese tree shrew, a rising model animal genetically
RT related to primates.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000011518}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23385571; DOI=10.1038/ncomms2416;
RA Fan Y., Huang Z.Y., Cao C.C., Chen C.S., Chen Y.X., Fan D.D., He J.,
RA Hou H.L., Hu L., Hu X.T., Jiang X.T., Lai R., Lang Y.S., Liang B.,
RA Liao S.G., Mu D., Ma Y.Y., Niu Y.Y., Sun X.Q., Xia J.Q., Xiao J.,
RA Xiong Z.Q., Xu L., Yang L., Zhang Y., Zhao W., Zhao X.D., Zheng Y.T.,
RA Zhou J.M., Zhu Y.B., Zhang G.J., Wang J., Yao Y.G.;
RT "Genome of the Chinese tree shrew.";
RL Nat. Commun. 4:1426-1426(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB321034; ELW48639.1; -; Genomic_DNA.
DR STRING; 246437.L9JEA6; -.
DR InParanoid; L9JEA6; -.
DR Proteomes; UP000011518; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 6.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:ELW48639.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000011518};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 77..241
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 41..62
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 301..561
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 580..614
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 675..930
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1104..1139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1207..1602
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1642..1719
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..58
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 301..315
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 445..470
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 849..863
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1211..1233
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1351..1369
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1421..1446
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1773 AA; 175557 MW; 172EAC1FFCEF05C2 CRC64;
MWIVEDRART SGTLALTQLT LEFHFSVPQL KSCARTCDIQ PAGRDSTSMS TRGTCPPSQQ
EGLKLEHSGD LSANVTGFNL IRRLSLMKTS AIKKIRNPKG PLILRLGATP VTQPTRAAEG
LDSGRLEPER RCLLLPWFMQ ISLEVSSPER SLEFRARGQD GDFVSCIFSV PQLFDLRWHK
LALSVAGRVA SVHVDCTSTS SQPLGPRRPL RPTGHVFLGL DAEQGKPVSF DLQQVHIYCD
PDLVLEEGCC EILPGGCPPE TSKARRDTQS NELIEINPQT EGKVYTRCFC LEEPQSKVDA
QLTGRIGQKT EQGTKAQREA AADEVTLGSS GPKGGKGERG PQGPSGPKGE KGARGSDCIR
VSPDAPLQCA EGPKGEKGQS GASGPPGLPG STGQKGQKGE KGDLGVKGVP GKPGRDGRPG
EICVIGPKGQ KGDPGFVGPE GLAGEPGPPG LPGPPGLGLP GTPGDPGGPP GPKGDKGSSG
VPGKEGPSGK PGKPGIPGLK GEKGDPCEVC PTLPEGFQNF MGLPGKPGPK GEPGEPAPAG
EGLGAAGHKG DRGDPGIQGL KGEKGEPCWA CSSAVGAQHL GPSSGATEGV GSPGSGLLGL
PGKPGPPGPA GLKGEKCRRW RFRVIDCVGG QFRGSGASRQ SGECDLVAAA GGGRWQEPPA
VGLRLLSHLA PYRAGTTRAS GSGRCQGGEG ESGDAPVLQP GEPCEPCSAL SKPQDGDSHV
VALPGPPGEK GEPGPPGFGL PGKQGEAGNP GDPGTPGAVG QPGLSGEPGG RGPTGPKGEK
GEGCTACPSL QGALTDTTGL PGKPGPRGER GPEGVGRPGK PGRPGLRGAQ GPPGLKGTQG
EPGPPGTGAQ GPQGLPGPRG PPGPAGEKGV QGPPGLKGAT GPVGPPGAGL SGPPGHDGQP
GEAGLPGSRG IPGEKGSRGE KELRTRTAST VPTAACRVES LPLVTDVAAR LREAQPQAQT
VGKRAATACL ESVSGGCPSS STSNEASFFS GVPQVFLDHQ APLECLGCRE YLLGASPGTP
LMGQAQAPGM PGNNGLPGQP GLTAELGSLP IEQQLLKSIC GDCAQGQLAS PVSREKGEKG
DQGIPGVPGL DSCARCFLER ERPTAEQAQR DVSEDPGCAG SPGLPGPPGL PGQRGGEHTV
FAPSGDVLQS STEHQLSAGR STDCPCPLGH DAMRQGSLSC LQPWAAAAAR LALAVASCDE
GRCGPVPHRG PPGVRGSPGP PGPTGPPGFP GAVGAPGLPG LQGERGLRGL TGDKGEPGPP
GQPGYPGAMG PPGLPGIKGE RGYAGPAGEK GEPGIKGERG YAGPAGEKGE PGPPGSEGLP
GAPGPAGPRG ERGPQGNSGE KGDQGFQGQP GFPGPPGPPG FPGKAGAPGP PGPQAEKGSE
GLRGPAGLPG SPGPPGPPGI QGPAGLDGLD GKDGKPGLRG DPGPAGPPGL MGPPGDPGPA
GPPGLMGPPG FKGKTGHPGL PGPKGDCGNP GPPGGSGRPG AEGLKGDRGS TGERGLIGLP
GQPGPPGHPG PPGEPGADGM AGKEGPPGKQ GLYGPPGPKG DPGPAGQKGQ AGEKGRSGMP
GGPGKSGSMG PAGPPGPAGE RGHPGSPGPT GSPGLPGLPG SMGDMVNYDE IKRFIRQEVI
KMFDDRMAYY TSRVQFPMEM VAAPGRPGPP GKEGAPGRPG APGSPGLPGQ IGREGRQGLP
GMRGLPGTKG EKGDIGVGIA GENGLPGPPG PQGPPGYGKM GATGPMGQQG IPGIPGPPGP
MGQPGKAGHC SPSDCFGALP VEQQYPPVKG PFG
//