ID   G3WF63_SARHA            Unreviewed;       701 AA.
AC   G3WF63;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   27-MAR-2024, entry version 70.
DE   SubName: Full=Collagen type VIII alpha 1 chain {ECO:0000313|Ensembl:ENSSHAP00000014068.2};
GN   Name=COL8A1 {ECO:0000313|Ensembl:ENSSHAP00000014068.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000014068.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000014068.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000014068.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_012399939.1; XM_012544485.1.
DR   AlphaFoldDB; G3WF63; -.
DR   STRING; 9305.ENSSHAP00000014068; -.
DR   Ensembl; ENSSHAT00000014185.2; ENSSHAP00000014068.2; ENSSHAG00000012024.2.
DR   eggNOG; ENOG502QRFR; Eukaryota.
DR   GeneTree; ENSGT00940000158272; -.
DR   HOGENOM; CLU_001074_21_0_1; -.
DR   InParanoid; G3WF63; -.
DR   TreeFam; TF334029; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   GO; GO:0048593; P:camera-type eye morphogenesis; IEA:Ensembl.
DR   GO; GO:0035987; P:endodermal cell differentiation; IEA:Ensembl.
DR   GO; GO:0001935; P:endothelial cell proliferation; IEA:Ensembl.
DR   GO; GO:0010811; P:positive regulation of cell-substrate adhesion; IEA:Ensembl.
DR   Gene3D; 2.60.120.40; -; 1.
DR   InterPro; IPR001073; C1q_dom.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR   PANTHER; PTHR15427:SF50; C1Q DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR15427; EMILIN ELASTIN MICROFIBRIL INTERFACE-LOCATED PROTEIN ELASTIN MICROFIBRIL INTERFACER; 1.
DR   Pfam; PF00386; C1q; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   PRINTS; PR00007; COMPLEMNTC1Q.
DR   SMART; SM00110; C1Q; 1.
DR   SUPFAM; SSF49842; TNF-like; 1.
DR   PROSITE; PS50871; C1Q; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..27
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           28..701
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5029736598"
FT   DOMAIN          568..701
FT                   /note="C1q"
FT                   /evidence="ECO:0000259|PROSITE:PS50871"
FT   REGION          73..400
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          431..547
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        82..96
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        149..163
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        212..226
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        342..364
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        462..488
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        511..540
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   701 AA;  68585 MW;  F0DC5D254F5673B9 CRC64;
     MAVPLTPLQL LEVVILIHLS FIRSIHGGAY YGIKQLPPQI PAQIPQYQAL GQQVPHMALG
     KDGLPMEIPL ASLRGEQGPR GEPGPRGPPG PPGLPGHGIP GTKGKPGPQG YPGIGKPGMP
     GMPGKPGGMG MPGAKGEAGP KGEIGPMGLP GPQGPPGPHG LPGIGKPGGP GFPGQQGPKG
     EPGPKGQPGP PGLQGPKGDK GFGMPGLPGL KGPPGMHGPP GPVGLPGVGK PGVTGFPGPQ
     GPLGKPGLQG EPGPRGPIGV PGIQGPPGVP GVGKPGQDGI PGRPGFPGGK GEQGLPGVPG
     PPGLPGVGKP GFPGPKGDRG MGGLPGALGP QGEKGPIGAP GMGGPPGEPG LPGIPGPMGP
     PGAIGFPGPK GEGGAVGPQG PLGPKGEPGL QGFPGKPGFL GEVGPPGMRG LPGPIGPKGE
     AGHKGLPGIP GAPGLIGPKG EPGIPGDQGL QGPSGIPGIG GPSGPIGPPG LPGPKGEPGL
     PGPPGFPGVG KPGVAGLHGP PGKPGALGPQ GQPGLPGPPG PPGPPGPPAV MPPTPPAHGE
     YLPEMGPGID GVKTPHAYGG KKGKNGGVAY EMPAFTAELT TPFPPVGAPV KFDKLLYNGR
     QNYNPQTGIF TCEVPGVYYF AYHVHCKGGN VWVALFKNND PMMYTYDEYK KGFLDQASGS
     AVLQLRPGDR VFLQMPSEQA AGLYAGQYVH SSFSGYLLYP M
//