GenomeNet

Database: UniProt
Entry: G5AY50_HETGA
LinkDB: G5AY50_HETGA
Original site: G5AY50_HETGA 
ID   G5AY50_HETGA            Unreviewed;      3170 AA.
AC   G5AY50;
DT   14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2011, sequence version 1.
DT   27-MAR-2024, entry version 46.
DE   SubName: Full=Collagen alpha-3(VI) chain {ECO:0000313|EMBL:EHB01961.1};
GN   ORFNames=GW7_19424 {ECO:0000313|EMBL:EHB01961.1};
OS   Heterocephalus glaber (Naked mole rat).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC   Heterocephalus.
OX   NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB01961.1, ECO:0000313|Proteomes:UP000006813};
RN   [1] {ECO:0000313|EMBL:EHB01961.1, ECO:0000313|Proteomes:UP000006813}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21993625; DOI=10.1038/nature10533;
RA   Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA   Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA   Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA   Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA   Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA   Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT   "Genome sequencing reveals insights into physiology and longevity of the
RT   naked mole rat.";
RL   Nature 479:223-227(2011).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JH167488; EHB01961.1; -; Genomic_DNA.
DR   STRING; 10181.G5AY50; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   InParanoid; G5AY50; -.
DR   Proteomes; UP000006813; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   CDD; cd00063; FN3; 1.
DR   CDD; cd01481; vWA_collagen_alpha3-VI-like; 3.
DR   CDD; cd01450; vWFA_subfamily_ECM; 2.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 11.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR041900; vWA_collagen_alpha3-VI-like.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF00014; Kunitz_BPTI; 1.
DR   Pfam; PF00092; VWA; 11.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00131; KU; 1.
DR   SMART; SM00327; VWA; 11.
DR   SUPFAM; SSF57362; BPTI-like; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 1.
DR   SUPFAM; SSF53300; vWA-like; 11.
DR   PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR   PROSITE; PS50853; FN3; 1.
DR   PROSITE; PS50234; VWFA; 11.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EHB01961.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT   DOMAIN          260..437
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          463..638
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          657..834
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          855..1027
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1047..1223
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1250..1383
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1420..1593
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1622..1795
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          1821..2007
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2265..2444
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2482..2678
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2922..3016
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          3043..3112
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   REGION          1..23
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          126..178
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2029..2183
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2750..2770
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2868..2914
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        142..156
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2091..2107
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2115..2137
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3170 AA;  344919 MW;  9644BCA466E8CBCC CRC64;
     MRPESSLVPA AAPGGDSGAR EAGTELVHPI HRVADECQAL HRVPMRPPEQ LTRRLESLLM
     VQWWLEDLEE SQPREVVALG GSKADLEEHW GVMLQVTVQQ VSGAGQDLEQ VVALLSHALR
     AASAAGPAAG SIHPGAPRPQ APQATTHVSY PSPYEAEQLS SGPADLRPSK ARGLSQEGGQ
     FTERKRLLSM ESSAKLDHNV VGSSVPWSRG RRVRCRPESR CGLRASLQHG FSSKATWITG
     ILLVALGTSG GPLCAQDSAD IIFLIDGSNY TGSSNFAVIL DFLVNLLERL SIGPQQIQVG
     VVQYSDEPRT MFSLDTYVSK AQALAAVKAL KFTGGELANI GQALDFVLKN HFTPAVGSRV
     EEGVPQVLVL MSAGPSSDEI SDKVVALKQA SVFSFALGVL AASRAELQHI ATEDSLLFMV
     PEFHSFGNLQ EELLGYIVAI AQRHILLRPP TIVTQAIEVN KRDIVFLVDG SSKLGLANFN
     AIRDFIVKVI QRLEIGQDLI QVAVAQYADT VKPEFYFNTY PSKREAVNAV RKMRPMEGPA
     LNTGSALDFI RNNLFISAVG SRAAEGVPKL MVLVTGGKSL DAVSQPAQEL KRSGIMAFTV
     GNKAADQAEL EEIAFDSSLV FIPAEFRAAP LQGILPGLLS PLRTLSGTTE VHVNKRDIIF
     LLDGSVNVGK TNFPYVRDFV MNLVNSLDVG NDNIRVGLVQ FSDTPVTEFS LDTYQTKGEL
     LAHLQRLQLQ GGAGLNMGSA LSYVHANHFT EAGGSRIREH VPQLLLLLTA GQSQDSYLQA
     ANQLTREGIL TFCVGASQAN KAELEQIAFN PSLVYLMDDF SSLPALPQQL IQPLTTYVSG
     GVEEVPLVQP ESKRDILFLF DGSANLMGQF PAVRDFLYKI IDELDVKPDG VRIAVAQYSD
     DVKLESRFNE HMTKPEILNQ VKRMKIKMGK TLNLGYALDF AQRYIFVKSM GSRIEDGVMQ
     VLVLLVAGRS SDSVEVPARN LKQSGVVPFI FQAKNADPAE LERIVLSPAF ILAAESLPKI
     GELQPQIVNL LKSVYNGGQP PASGEKDVVF LIDGSEGVRS GFGLLKDFVE RLVQGLDVGP
     DRVRVAVVQY SDRTRPEFYL NSFMDQQGVI SAIRRLTLLG GPAPSTGAAL DFVLKNILTR
     SAGSRMEDGV PQLLIVLTAD RSGDDVRGPS TVLKRGGAVP ISIGIGNADI SEMQAISFIP
     DFAVAIPTFR ELGTVQHVIS ERVTQLDREG LSRLQPIFLP PTSPGGAKRD VVFLIDGSQA
     AVLEFQHIRT LIERLVDSLD VGFETTRVAV IQFSEDPKVE FLLNAHSSKD EVQNALRQLR
     PKGGRQVNVG SALEYVSRNI FKRPLGSRIE EGVPQFLVLI SSGKSADEVD DSAVELKRTF
     RELPSLEQKL LTPITTLTAQ QIQQILASTH YPPPESDAAD IVFLIDSSDG VRSDGLAHIR
     DFVSRIVQRL NIGPNKVRIG LVQFSNEVFP EFFLKTHRSQ AAVLGAIRRL RFRGGAPLNT
     GRALEYVAKN LFVKSAGSRI EDGVPQHLVL LLGGKSQDDI SGFARIISSS GIVSLGIGSR
     NVDRTELQAI ANDPRLVFTV REFRELPSIE ERVIGSFGSS GATPATPVVT SPPSRPEKKK
     ADIVFLLDGS INFRRDNFQE VLRFVSEIVD TVYEDGDSIQ VGLVQYNSDP TDEFFLKDFS
     SKRQIIDAIN KVVYKGGRNA NTRVGIQHLQ LHHFVPEAGS RLDQRVPQIA FVITGGKSVE
     DAQDASLALT QRGVKVFAVG VRNIDSEEVG KMASNSATAF RVGNVQELSE LSEQVLETLH
     DAMHEILCPG VTDLSKACNL DVILGFDGSR DQNVFVAQKG FESKVDAILS RISQMQRISC
     GGGQLPTVHV SVVANAPSGP VEAFDFDEYQ PEQFEKFRNM RSQHPYVLTA DTLKVYQNKF
     RQSSPDSVKV VIHFTDGVDG DLADLHRASE ELREEGVHAL ILVGLEQVAN LEQLAHLEFG
     RGFMYDRPLR LNLLDLDYEL AEQLDNIAEK ACCGVPCKCS GERGDRGPIG SIGPKGMAGE
     DGYRGYPGDE GGPGERGPPG VNGTQGFQGC PGQRGVKGSR GFPGEKGELG EIGLDGLDGE
     DGDKGLPGSS GEKGSPGRRG DKGPKGAKGE RGDVGIRGDP GDSGQDSQQR GPKGETGDIG
     PMGNSGPPGT GGQKGDPGFP GPSAVVMELS TARFASALFL RPSTVLFGAV SHAFCHLLLS
     LFLLVFQGHK GIRGDSIDQC ALIQSIKDKC PCCYGPLECP VFPTELAFAL DTSEGVTQDA
     FSLMRDVVLS LVGDLAIAES NCPRGARVAV VTYNNEVTTE IRFSDSRKKS ALLDSIKNLQ
     VALTSKQQSL ETAMSFVARN TFKRVRNGFL MRKVAVFFSN RPTRATPQLR QAVLKLWDAG
     ITPLFLTSQE DRQLINALQI NNTAVGHVLV LPTRRDLTDF LKNVLTCHVC LDICNIQPSC
     GFGSWRPSFR DRRAAGSDAD LDLVFLLDSE ETTSLFQFNE MKKYIGYVVR QLELSPDPGA
     SQHLSRVAVV QHAPYESLGN ASALPVRVGL SLTDFGSKER LLEFLGRGVV QLQGGRALGR
     AVEYTVQHVF ESAPNPRDLK VLVLMLTGEV PEQELEEAHR AVLQAKCRGY FFVVLGIGRK
     VNVKEVYGFA SEPNDVFFKL MDKSSELNEE PLMRFGRLLP SFVGSENAFH LSPDVKQCDW
     LQGDQPAKNG VKFGHKQVNI PNNITSNTTT KPVTTKLVTT TTKPATVVNL PPAKPAAGNP
     AAAKPVGPAX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
     XXXXXXXXXX XXXXXXXXXX XXXXXXXXRP GVVVKPVAAA KPAAAKPAAV RPPLPPARVA
     STKPEAARTP AKPAAATKPE ALRPQAKPAA PRPAAANPAV KVVPEVQVSE VTENSARLRW
     KRPEAPGSYF YDLTVTSAQD QSLVLRQNLT VTERVIGGLR AGQTYYAAVV CYLRSQVQAV
     FRGTFSTKKT QPPPPSLARS ASSSSINLMV KTEPLAFTKT DICKLPKDDG TCREFKLKWF
     YDAKTESCAR FWYGGGGDDE NRFXXXXXXX XXXXXXXXXX XXXXXXXCEK VCASGEWWFL
     VFMAIANKIR NPGQQFTLEK YLNHNHHNTF EKDRLHLQMR KILKQNTQKR
//
DBGET integrated database retrieval system