ID G5C9Q9_HETGA Unreviewed; 1795 AA.
AC G5C9Q9;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Collagen alpha-1(XIV) chain {ECO:0000313|EMBL:EHB18270.1};
GN ORFNames=GW7_19379 {ECO:0000313|EMBL:EHB18270.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB18270.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB18270.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH174064; EHB18270.1; -; Genomic_DNA.
DR STRING; 10181.G5C9Q9; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; G5C9Q9; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:UniProt.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 7.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 2.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 8.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF15; COLLAGEN ALPHA-1(XIV) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS50853; FN3; 8.
DR PROSITE; PS50234; VWFA; 2.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EHB18270.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 32..122
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 159..331
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 356..445
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 446..537
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 538..625
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 627..716
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 738..831
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 832..922
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 923..1014
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1033..1204
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT REGION 1002..1022
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1456..1612
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1643..1795
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1560..1574
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1657..1674
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1729..1746
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1795 AA; 192169 MW; 6A3BA2C733BE56B1 CRC64;
MKILQCKMRY WLIPACVAAA YFCSVVQGQV SAPTRLRYNV LSEDSVQISW KAPRGKFGGY
KLLVTPASGG KTNQLNLQNT ATKAIIQGLM PEQNYTVQII AYHKDKESKP VQGQFRIKDL
EKKKDPTKPR GKAVDKANGT SQALPEEAKF MCPSPALADI VILVDGSWSI GRFNFRLVRL
FLENLVTAFN VGSGRTQIGL SQYSGDPRIE WHLNAFSTKD EVIDAVRNLP YKGGNTLTGL
ALNYIFENNF KPEAGSRTGV AKVGILITDG KSQDDVVPAS RRLRESGVEL FAVGVKNADL
NELQEIASEP DSTHVYNVAE FDLMHTVVES LTRTVCSGME EQDREIKASA LATTGPPTEL
ITSEVTARSF MVNWTHAPGH VQKYRVVYYP TRGGKPEEVV VDRSVSSLVL KNLMSLTEYQ
IAVFAIYAHT ASEGLRGTET TLALPVASDL ELYGVTENSL RVKWDPVPGA TGYLLLYAPL
TEGLAGDEKE MKIGETLTDI ELTGLVPNTE YTVTVYAMFG EEASDPVTGQ ETTLPLTPPS
NLRISNVGSN SARLSWDPSS RKISGYRIVY TSADGTEINE VEVDPITTFP LKGLTPVTEY
SVAIFSIYDE GQSLPLAGTF TTEEVPAQQY LEIDEVKTDS FRVTWHPLSA EEGQHKLMWI
PVYGGKTQEV VLKEEQDSYV IEGLDPGTEY EVSLLAVLDD GSESEVVTAV GTTLDSFWTE
PATTIVPATP VTSVLQTGIR NLVVDDETPS SLRVSWDISD SNVEQFRVTY LTAQGDSVEE
VVGTVMVPGS QNSVLLKPLL AATEYKVTVT PIYSDGEGVS VSAPGKTSPP SGPQNLRVSE
EWYNRLRITW DPPPGPIKGY RIVYRPVSVP GPTLETFVGA DIHTILITNL LSGMDYSVKI
FASQAKGFSD ALTGLVKTLF LGVTELQAHQ VEMTSLCAQW QVHRHATAYR IVVENIQNTQ
KQESTVGGGT NRHCFYGLQP DSEYKISIYT KLQELEGPSV SIVEKTRSRP TQPPTSPPTI
PPAREVCKAA RADLVFMVDG SFSIGDDNFN KIINFLYSTV GALGRIGADG TQVAIVQFTD
DPRTEFKLNA YKTKETLLDS IKRISYKGGN TKTGKAMKHV RDTLFTAEAG XGVPKVIVVI
TDGRSQDEVD SVCREMQVDG YSIFAVGVAD ADYSELLSIG SKPSARHVFF VDDFDAFKKI
EDELITFVCE AASATCPMVH KDGTDVAGFK MMEMFGLVEK EFSSVEGVSM EPGTFNVYPC
YQLHKDALVS QPTRYLHPGG LPSDYTISFL FRILPNTPQE PFALWEILNK NSDPLVGVIL
DNGGKTLTYF NYDHTGDFQT VTFEGPEIRK IFYGSFHKLH VVVNKTLVKV VVDCKQMGEK
AINVSANITS DGVEVLGRMV RSRGPNGSSA PFQLQMFDIL CSTSWASRDK CCELPGLRDD
QACPNMPHSC ACSQTNEVAL GPVGPPGGPG LRGPKGQQGE QGLKGPEGPR GESGPAGPQG
PPGPQGPSGL SIQGMPGMPG EKGEKGDTGL PGPQGVPGGV GSPGRDGSPG QRGFPGKDGS
SGPPGPPGPI GIPGAPGVPG VTGSVGPQGA LGPPGVPGAK GERGERGDLQ SQAMVRAVAR
QVCEQLIQNH MARYAAVLNQ IPSHSSSTRT IQGPPGEPGR PGSPGTPGEQ GPPGAPGFPG
NTGLPGTPGE RGLTGLKGEK GNPGIGTQGP RGPPGPAGPS GESRPGSPGA PGSPGPRGPP
GHLGVPGPQG PSGQPGYCDP SSCSAYGVGA PHPDEPEFTP VQDEQEALEL WGSGI
//