GenomeNet

Database: UniProt
Entry: G5C9Q9_HETGA
LinkDB: G5C9Q9_HETGA
Original site: G5C9Q9_HETGA 
ID   G5C9Q9_HETGA            Unreviewed;      1795 AA.
AC   G5C9Q9;
DT   14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2011, sequence version 1.
DT   27-MAR-2024, entry version 42.
DE   SubName: Full=Collagen alpha-1(XIV) chain {ECO:0000313|EMBL:EHB18270.1};
GN   ORFNames=GW7_19379 {ECO:0000313|EMBL:EHB18270.1};
OS   Heterocephalus glaber (Naked mole rat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC   Heterocephalus.
OX   NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB18270.1, ECO:0000313|Proteomes:UP000006813};
RN   [1] {ECO:0000313|EMBL:EHB18270.1, ECO:0000313|Proteomes:UP000006813}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21993625; DOI=10.1038/nature10533;
RA   Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA   Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA   Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA   Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA   Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA   Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT   "Genome sequencing reveals insights into physiology and longevity of the
RT   naked mole rat.";
RL   Nature 479:223-227(2011).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JH174064; EHB18270.1; -; Genomic_DNA.
DR   STRING; 10181.G5C9Q9; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   InParanoid; G5C9Q9; -.
DR   Proteomes; UP000006813; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:UniProt.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 7.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF00041; fn3; 8.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 8.
DR   SMART; SM00210; TSPN; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 6.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS50853; FN3; 8.
DR   PROSITE; PS50234; VWFA; 2.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EHB18270.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          32..122
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          159..331
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          356..445
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          446..537
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          538..625
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          627..716
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          738..831
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          832..922
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          923..1014
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1033..1204
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   REGION          1002..1022
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1456..1612
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1643..1795
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1560..1574
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1657..1674
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1729..1746
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1795 AA;  192169 MW;  6A3BA2C733BE56B1 CRC64;
     MKILQCKMRY WLIPACVAAA YFCSVVQGQV SAPTRLRYNV LSEDSVQISW KAPRGKFGGY
     KLLVTPASGG KTNQLNLQNT ATKAIIQGLM PEQNYTVQII AYHKDKESKP VQGQFRIKDL
     EKKKDPTKPR GKAVDKANGT SQALPEEAKF MCPSPALADI VILVDGSWSI GRFNFRLVRL
     FLENLVTAFN VGSGRTQIGL SQYSGDPRIE WHLNAFSTKD EVIDAVRNLP YKGGNTLTGL
     ALNYIFENNF KPEAGSRTGV AKVGILITDG KSQDDVVPAS RRLRESGVEL FAVGVKNADL
     NELQEIASEP DSTHVYNVAE FDLMHTVVES LTRTVCSGME EQDREIKASA LATTGPPTEL
     ITSEVTARSF MVNWTHAPGH VQKYRVVYYP TRGGKPEEVV VDRSVSSLVL KNLMSLTEYQ
     IAVFAIYAHT ASEGLRGTET TLALPVASDL ELYGVTENSL RVKWDPVPGA TGYLLLYAPL
     TEGLAGDEKE MKIGETLTDI ELTGLVPNTE YTVTVYAMFG EEASDPVTGQ ETTLPLTPPS
     NLRISNVGSN SARLSWDPSS RKISGYRIVY TSADGTEINE VEVDPITTFP LKGLTPVTEY
     SVAIFSIYDE GQSLPLAGTF TTEEVPAQQY LEIDEVKTDS FRVTWHPLSA EEGQHKLMWI
     PVYGGKTQEV VLKEEQDSYV IEGLDPGTEY EVSLLAVLDD GSESEVVTAV GTTLDSFWTE
     PATTIVPATP VTSVLQTGIR NLVVDDETPS SLRVSWDISD SNVEQFRVTY LTAQGDSVEE
     VVGTVMVPGS QNSVLLKPLL AATEYKVTVT PIYSDGEGVS VSAPGKTSPP SGPQNLRVSE
     EWYNRLRITW DPPPGPIKGY RIVYRPVSVP GPTLETFVGA DIHTILITNL LSGMDYSVKI
     FASQAKGFSD ALTGLVKTLF LGVTELQAHQ VEMTSLCAQW QVHRHATAYR IVVENIQNTQ
     KQESTVGGGT NRHCFYGLQP DSEYKISIYT KLQELEGPSV SIVEKTRSRP TQPPTSPPTI
     PPAREVCKAA RADLVFMVDG SFSIGDDNFN KIINFLYSTV GALGRIGADG TQVAIVQFTD
     DPRTEFKLNA YKTKETLLDS IKRISYKGGN TKTGKAMKHV RDTLFTAEAG XGVPKVIVVI
     TDGRSQDEVD SVCREMQVDG YSIFAVGVAD ADYSELLSIG SKPSARHVFF VDDFDAFKKI
     EDELITFVCE AASATCPMVH KDGTDVAGFK MMEMFGLVEK EFSSVEGVSM EPGTFNVYPC
     YQLHKDALVS QPTRYLHPGG LPSDYTISFL FRILPNTPQE PFALWEILNK NSDPLVGVIL
     DNGGKTLTYF NYDHTGDFQT VTFEGPEIRK IFYGSFHKLH VVVNKTLVKV VVDCKQMGEK
     AINVSANITS DGVEVLGRMV RSRGPNGSSA PFQLQMFDIL CSTSWASRDK CCELPGLRDD
     QACPNMPHSC ACSQTNEVAL GPVGPPGGPG LRGPKGQQGE QGLKGPEGPR GESGPAGPQG
     PPGPQGPSGL SIQGMPGMPG EKGEKGDTGL PGPQGVPGGV GSPGRDGSPG QRGFPGKDGS
     SGPPGPPGPI GIPGAPGVPG VTGSVGPQGA LGPPGVPGAK GERGERGDLQ SQAMVRAVAR
     QVCEQLIQNH MARYAAVLNQ IPSHSSSTRT IQGPPGEPGR PGSPGTPGEQ GPPGAPGFPG
     NTGLPGTPGE RGLTGLKGEK GNPGIGTQGP RGPPGPAGPS GESRPGSPGA PGSPGPRGPP
     GHLGVPGPQG PSGQPGYCDP SSCSAYGVGA PHPDEPEFTP VQDEQEALEL WGSGI
//
DBGET integrated database retrieval system