ID G3H7G9_CRIGR Unreviewed; 859 AA.
AC G3H7G9;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 45.
DE SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:EGW07939.1};
GN ORFNames=I79_006300 {ECO:0000313|EMBL:EGW07939.1};
OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Cricetinae; Cricetulus.
OX NCBI_TaxID=10029 {ECO:0000313|EMBL:EGW07939.1, ECO:0000313|Proteomes:UP000001075};
RN [1] {ECO:0000313|Proteomes:UP000001075}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075};
RX PubMed=21804562; DOI=10.1038/nbt.1932;
RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., Xie M.,
RA Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., Koh W.,
RA Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., Quake S.R.,
RA Famili I., Palsson B.O., Wang J.;
RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line.";
RL Nat. Biotechnol. 29:735-741(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH000194; EGW07939.1; -; Genomic_DNA.
DR AlphaFoldDB; G3H7G9; -.
DR STRING; 10029.G3H7G9; -.
DR eggNOG; KOG3546; Eukaryota.
DR InParanoid; G3H7G9; -.
DR Proteomes; UP000001075; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF13385; Laminin_G_3; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000313|EMBL:EGW07939.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000001075};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 58..246
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT DOMAIN 107..245
FT /note="Laminin G"
FT /evidence="ECO:0000259|SMART:SM00282"
FT REGION 287..348
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 401..468
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 594..620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 791..842
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..347
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 401..423
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 451..468
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..810
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 811..829
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 859 AA; 91896 MW; 25D4833301A13376 CRC64;
MCLCTLHITF HYWRLSFRKP SVRDSLGIRL SLWLLAPCSR EVESWSLLLP AGSAAQDHLD
LTELIGVPLP SSVSFVTGYG GFPAYSFGPG ANVGRPARTL IPPTFFRDFA ISVAVKPSSS
QGGVLFAVTD AFQKVIYLGL RLSRVEDGYQ RVILYYTEPG SHVSHEAAAF SVPVMTNRWN
RFAVTVQGEE VALLMDCEEH SHVLFQRSAR PLMFEPSAGI FVGNAGATGL ERFTGSIQQL
TIYSDPRTPE ELCEAQESSA SGEASGLQEM DEVAEIMEAV TYTQAPPKEV HVDPISMPPT
LSSPAEDTEL SGEPVPEGTP ETNLSIIGQS SPEQGKSCHT HESGGSGEIL NDTLEVLAVD
GDPSTDGGSG DGALLNVTDG QVLSATATEE AKVPVTTTLE AEIGSMPTGS PTLAVSTQNT
REEATLDPDS EENLATAASG DGEVPTSTAG DAEAGTMSTT EPTLSMLTQK PREEATLGPN
GEEWLTPAVS KVPLGAFEEE EASGTAIESL DAFTPTMVLE QASGTLTDIQ DALTPPVVLE
QASGTLTDIQ DALTPPVVLE QGSRSPTDTQ ATLAPTVAPE QVFTAAPTDG EGLVASTEEA
EEEGSDSTPP SGPPLPTPTV IPERQVTLVG VEAEGSGHVW GLDVGSGSGD IVDNEDLLRV
TALSDMGDML QKAHLVIEGT FIYLKDSAEF FIRVRDGWKK LQLGELIPIP ADSPPPPALS
SNLHLVALNT PVAGDIRADF QCFQQARAAG LLSTFRAFLS SHLQDLSTVV RKAERFSLPI
VNLKNLTEVR RQKRGKGGGG DRRRNNEDVV QEEMVEEEEE EQEEREEKEE DEATTYKYSD
GPETECFLRS RNLAWLSVE
//