ID G5AY50_HETGA Unreviewed; 3170 AA.
AC G5AY50;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE SubName: Full=Collagen alpha-3(VI) chain {ECO:0000313|EMBL:EHB01961.1};
GN ORFNames=GW7_19424 {ECO:0000313|EMBL:EHB01961.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB01961.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB01961.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH167488; EHB01961.1; -; Genomic_DNA.
DR STRING; 10181.G5AY50; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; G5AY50; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR CDD; cd00063; FN3; 1.
DR CDD; cd01481; vWA_collagen_alpha3-VI-like; 3.
DR CDD; cd01450; vWFA_subfamily_ECM; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 11.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR041900; vWA_collagen_alpha3-VI-like.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF00014; Kunitz_BPTI; 1.
DR Pfam; PF00092; VWA; 11.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00131; KU; 1.
DR SMART; SM00327; VWA; 11.
DR SUPFAM; SSF57362; BPTI-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF53300; vWA-like; 11.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR PROSITE; PS50853; FN3; 1.
DR PROSITE; PS50234; VWFA; 11.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EHB01961.1};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 260..437
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 463..638
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 657..834
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 855..1027
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1047..1223
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1250..1383
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1420..1593
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1622..1795
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 1821..2007
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2265..2444
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2482..2678
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2922..3016
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 3043..3112
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 126..178
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2029..2183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2750..2770
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2868..2914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2091..2107
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2115..2137
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3170 AA; 344919 MW; 9644BCA466E8CBCC CRC64;
MRPESSLVPA AAPGGDSGAR EAGTELVHPI HRVADECQAL HRVPMRPPEQ LTRRLESLLM
VQWWLEDLEE SQPREVVALG GSKADLEEHW GVMLQVTVQQ VSGAGQDLEQ VVALLSHALR
AASAAGPAAG SIHPGAPRPQ APQATTHVSY PSPYEAEQLS SGPADLRPSK ARGLSQEGGQ
FTERKRLLSM ESSAKLDHNV VGSSVPWSRG RRVRCRPESR CGLRASLQHG FSSKATWITG
ILLVALGTSG GPLCAQDSAD IIFLIDGSNY TGSSNFAVIL DFLVNLLERL SIGPQQIQVG
VVQYSDEPRT MFSLDTYVSK AQALAAVKAL KFTGGELANI GQALDFVLKN HFTPAVGSRV
EEGVPQVLVL MSAGPSSDEI SDKVVALKQA SVFSFALGVL AASRAELQHI ATEDSLLFMV
PEFHSFGNLQ EELLGYIVAI AQRHILLRPP TIVTQAIEVN KRDIVFLVDG SSKLGLANFN
AIRDFIVKVI QRLEIGQDLI QVAVAQYADT VKPEFYFNTY PSKREAVNAV RKMRPMEGPA
LNTGSALDFI RNNLFISAVG SRAAEGVPKL MVLVTGGKSL DAVSQPAQEL KRSGIMAFTV
GNKAADQAEL EEIAFDSSLV FIPAEFRAAP LQGILPGLLS PLRTLSGTTE VHVNKRDIIF
LLDGSVNVGK TNFPYVRDFV MNLVNSLDVG NDNIRVGLVQ FSDTPVTEFS LDTYQTKGEL
LAHLQRLQLQ GGAGLNMGSA LSYVHANHFT EAGGSRIREH VPQLLLLLTA GQSQDSYLQA
ANQLTREGIL TFCVGASQAN KAELEQIAFN PSLVYLMDDF SSLPALPQQL IQPLTTYVSG
GVEEVPLVQP ESKRDILFLF DGSANLMGQF PAVRDFLYKI IDELDVKPDG VRIAVAQYSD
DVKLESRFNE HMTKPEILNQ VKRMKIKMGK TLNLGYALDF AQRYIFVKSM GSRIEDGVMQ
VLVLLVAGRS SDSVEVPARN LKQSGVVPFI FQAKNADPAE LERIVLSPAF ILAAESLPKI
GELQPQIVNL LKSVYNGGQP PASGEKDVVF LIDGSEGVRS GFGLLKDFVE RLVQGLDVGP
DRVRVAVVQY SDRTRPEFYL NSFMDQQGVI SAIRRLTLLG GPAPSTGAAL DFVLKNILTR
SAGSRMEDGV PQLLIVLTAD RSGDDVRGPS TVLKRGGAVP ISIGIGNADI SEMQAISFIP
DFAVAIPTFR ELGTVQHVIS ERVTQLDREG LSRLQPIFLP PTSPGGAKRD VVFLIDGSQA
AVLEFQHIRT LIERLVDSLD VGFETTRVAV IQFSEDPKVE FLLNAHSSKD EVQNALRQLR
PKGGRQVNVG SALEYVSRNI FKRPLGSRIE EGVPQFLVLI SSGKSADEVD DSAVELKRTF
RELPSLEQKL LTPITTLTAQ QIQQILASTH YPPPESDAAD IVFLIDSSDG VRSDGLAHIR
DFVSRIVQRL NIGPNKVRIG LVQFSNEVFP EFFLKTHRSQ AAVLGAIRRL RFRGGAPLNT
GRALEYVAKN LFVKSAGSRI EDGVPQHLVL LLGGKSQDDI SGFARIISSS GIVSLGIGSR
NVDRTELQAI ANDPRLVFTV REFRELPSIE ERVIGSFGSS GATPATPVVT SPPSRPEKKK
ADIVFLLDGS INFRRDNFQE VLRFVSEIVD TVYEDGDSIQ VGLVQYNSDP TDEFFLKDFS
SKRQIIDAIN KVVYKGGRNA NTRVGIQHLQ LHHFVPEAGS RLDQRVPQIA FVITGGKSVE
DAQDASLALT QRGVKVFAVG VRNIDSEEVG KMASNSATAF RVGNVQELSE LSEQVLETLH
DAMHEILCPG VTDLSKACNL DVILGFDGSR DQNVFVAQKG FESKVDAILS RISQMQRISC
GGGQLPTVHV SVVANAPSGP VEAFDFDEYQ PEQFEKFRNM RSQHPYVLTA DTLKVYQNKF
RQSSPDSVKV VIHFTDGVDG DLADLHRASE ELREEGVHAL ILVGLEQVAN LEQLAHLEFG
RGFMYDRPLR LNLLDLDYEL AEQLDNIAEK ACCGVPCKCS GERGDRGPIG SIGPKGMAGE
DGYRGYPGDE GGPGERGPPG VNGTQGFQGC PGQRGVKGSR GFPGEKGELG EIGLDGLDGE
DGDKGLPGSS GEKGSPGRRG DKGPKGAKGE RGDVGIRGDP GDSGQDSQQR GPKGETGDIG
PMGNSGPPGT GGQKGDPGFP GPSAVVMELS TARFASALFL RPSTVLFGAV SHAFCHLLLS
LFLLVFQGHK GIRGDSIDQC ALIQSIKDKC PCCYGPLECP VFPTELAFAL DTSEGVTQDA
FSLMRDVVLS LVGDLAIAES NCPRGARVAV VTYNNEVTTE IRFSDSRKKS ALLDSIKNLQ
VALTSKQQSL ETAMSFVARN TFKRVRNGFL MRKVAVFFSN RPTRATPQLR QAVLKLWDAG
ITPLFLTSQE DRQLINALQI NNTAVGHVLV LPTRRDLTDF LKNVLTCHVC LDICNIQPSC
GFGSWRPSFR DRRAAGSDAD LDLVFLLDSE ETTSLFQFNE MKKYIGYVVR QLELSPDPGA
SQHLSRVAVV QHAPYESLGN ASALPVRVGL SLTDFGSKER LLEFLGRGVV QLQGGRALGR
AVEYTVQHVF ESAPNPRDLK VLVLMLTGEV PEQELEEAHR AVLQAKCRGY FFVVLGIGRK
VNVKEVYGFA SEPNDVFFKL MDKSSELNEE PLMRFGRLLP SFVGSENAFH LSPDVKQCDW
LQGDQPAKNG VKFGHKQVNI PNNITSNTTT KPVTTKLVTT TTKPATVVNL PPAKPAAGNP
AAAKPVGPAX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXRP GVVVKPVAAA KPAAAKPAAV RPPLPPARVA
STKPEAARTP AKPAAATKPE ALRPQAKPAA PRPAAANPAV KVVPEVQVSE VTENSARLRW
KRPEAPGSYF YDLTVTSAQD QSLVLRQNLT VTERVIGGLR AGQTYYAAVV CYLRSQVQAV
FRGTFSTKKT QPPPPSLARS ASSSSINLMV KTEPLAFTKT DICKLPKDDG TCREFKLKWF
YDAKTESCAR FWYGGGGDDE NRFXXXXXXX XXXXXXXXXX XXXXXXXCEK VCASGEWWFL
VFMAIANKIR NPGQQFTLEK YLNHNHHNTF EKDRLHLQMR KILKQNTQKR
//