ID G3HGY5_CRIGR Unreviewed; 1070 AA.
AC G3HGY5;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=Collagen alpha-1(XX) chain {ECO:0000313|EMBL:EGW11071.1};
DE Flags: Fragment;
GN ORFNames=I79_009879 {ECO:0000313|EMBL:EGW11071.1};
OS Cricetulus griseus (Chinese hamster) (Cricetulus barabensis griseus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Cricetinae; Cricetulus.
OX NCBI_TaxID=10029 {ECO:0000313|EMBL:EGW11071.1, ECO:0000313|Proteomes:UP000001075};
RN [1] {ECO:0000313|Proteomes:UP000001075}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CHO K1 cell line {ECO:0000313|Proteomes:UP000001075};
RX PubMed=21804562; DOI=10.1038/nbt.1932;
RA Xu X., Nagarajan H., Lewis N.E., Pan S., Cai Z., Liu X., Chen W., Xie M.,
RA Wang W., Hammond S., Andersen M.R., Neff N., Passarelli B., Koh W.,
RA Fan H.C., Wang J., Gui Y., Lee K.H., Betenbaugh M.J., Quake S.R.,
RA Famili I., Palsson B.O., Wang J.;
RT "The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line.";
RL Nat. Biotechnol. 29:735-741(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH000364; EGW11071.1; -; Genomic_DNA.
DR AlphaFoldDB; G3HGY5; -.
DR STRING; 10029.G3HGY5; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; G3HGY5; -.
DR Proteomes; UP000001075; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 5.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 6.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF39; COLLAGEN ALPHA-1(XX) CHAIN; 1.
DR Pfam; PF00041; fn3; 5.
DR Pfam; PF00092; VWA; 1.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 5.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 4.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50853; FN3; 5.
DR PROSITE; PS50234; VWFA; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EGW11071.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000001075};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 150..326
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 351..440
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 441..530
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 531..621
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 622..711
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 714..804
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT REGION 937..969
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1009..1070
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1035..1049
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1056..1070
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EGW11071.1"
SQ SEQUENCE 1070 AA; 115202 MW; EA0EA07BB7E2B370 CRC64;
SGRLRLAVLP EDQLQMKWRE AEGSSLGYLV QVTPMAGDLE QELILTTKTP KATVGGLSPS
KGYTLQIFEL TDSGPVLLAR REFVIEDLKS HSLSRGSRRL VGPTLDPTSF LLGASDPEKT
LEPSIAFTSS KDRPILGDPQ FHCTPPTPTD IIFLVDGSWS IGHRHFQKVK DFLASIITPF
EIGPDKVQVG LTQYSGDPRT EWDLNSFHTK EQVLAAVHSL HYRGGNTFTG LALTHVLGQN
LKPAAGARPE AAKVLILVTD GKSQDDVRTA ARILKDQDID VFVVAGVKNA DEAELKLLAS
QPLDITVHSV LDFPQLATLA ALLSRLICQK IQGRGPVKPA ATTLALDPLP MPTRLDLTHI
TSSSVHLSWT PALYPPLKYL IVWQPSRGGA PKEVVVEGPV SSMELRNLTS NTEYLVSVLP
VYENGFGKSL QGLATTAPLP PPGALTLASV TPRTLRLTWQ PSAGATQYLV RYLLAASPGE
EQRREVHVGQ PEALLDGLEP GRDYEVSVQS LRGPEVSEVR SINARTTALG PPRHLSFSDV
SYNSACVSWE AQRPVRLVKV SYVSSDGSHS GQAQVPGNTT SATLGPLSSS TKYTVRVTCF
YFGGGSSMLT GHVTTKKAPS PSQLSVTELP GDAVKLSWVA AALSGVLVYQ IKWMPLGEGK
AHEISVPGTL DTAVLPGLMS HVEYEITILA YYRDGTRSDP VSLRYIPSSA SRSPPSNLVL
SSETPNSLQV SWTPPSGHVL HYRVNYALAS GSGPEKWISV PGTRSHVVLP DLLSATKYRV
LVSAVYGAGE SVAVTATYRT GFDLMVAFGL VAKEYASIRG VAMEPSALGA APTFTLFKDA
QLMRRASDIY PAALPPEHTI VFLVRLLPET PREAFALWQM MAEDFQPIVG VLLDAGRKSL
TYFNHDPRAA LQEVTFDLQD AKKIFFGSFH KGFQGLAGAR GSNGERGPPG AVGPTGLPGS
KGERGEKGEP QSLATIFQLV SQACESAIQT HVLRLNSFLH ENARPPMPFM VETAKPSRPV
SIEPPGSHNE ALLPGDGGHV HHPEDQGEPE AISHISSPGL QESQTPEPLE
//