GenomeNet

Database: UniProt
Entry: T0MCC0_CAMFR
ID   T0MCC0_CAMFR            Unreviewed;      2695 AA.
AC   T0MCC0;
DT   16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT   16-OCT-2013, sequence version 1.
DT   27-MAR-2024, entry version 44.
DE   SubName: Full=Collagen alpha-1(VII) chain {ECO:0000313|EMBL:EQB77096.1};
GN   ORFNames=CB1_000145077 {ECO:0000313|EMBL:EQB77096.1};
OS   Camelus ferus (Wild bactrian camel) (Camelus bactrianus ferus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX   NCBI_TaxID=419612 {ECO:0000313|EMBL:EQB77096.1, ECO:0000313|Proteomes:UP000030684};
RN   [1] {ECO:0000313|EMBL:EQB77096.1, ECO:0000313|Proteomes:UP000030684}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=bactrian camel {ECO:0000313|Proteomes:UP000030684};
RX   PubMed=23149746;
RG   Bactrian Camels Genome Sequencing and Analysis Consortium;
RA   Jirimutu, Wang Z., Ding G., Chen G., Sun Y., Sun Z., Zhang H., Wang L.,
RA   Hasi S., Zhang Y., Li J., Shi Y., Xu Z., He C., Yu S., Li S., Zhang W.,
RA   Batmunkh M., Ts B., Narenbatu, Unierhu, Bat-Ireedui S., Gao H.,
RA   Baysgalan B., Li Q., Jia Z., Turigenbayila, Subudenggerile, Narenmanduhu,
RA   Wang Z., Wang J., Pan L., Chen Y., Ganerdene Y., Dabxilt, Erdemt, Altansha,
RA   Altansukh, Liu T., Cao M., Aruuntsever, Bayart, Hosblig, He F., Zha-ti A.,
RA   Zheng G., Qiu F., Sun Z., Zhao L., Zhao W., Liu B., Li C., Chen Y.,
RA   Tang X., Guo C., Liu W., Ming L., Temuulen, Cui A., Li Y., Gao J., Li J.,
RA   Wurentaodi, Niu S., Sun T., Zhai Z., Zhang M., Chen C., Baldan T.,
RA   Bayaer T., Li Y., Meng H.;
RT   "Genome sequences of wild and domestic bactrian camels.";
RL   Nat. Commun. 3:1202-1202(2012).
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   -!- SIMILARITY: Belongs to the sauvagine/corticotropin-releasing
CC       factor/urotensin I family. {ECO:0000256|ARBA:ARBA00009287}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KB016442; EQB77096.1; -; Genomic_DNA.
DR   Proteomes; UP000030684; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0005179; F:hormone activity; IEA:InterPro.
DR   GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00063; FN3; 8.
DR   CDD; cd22627; Kunitz_collagen_alpha1_VII; 1.
DR   CDD; cd01482; vWA_collagen_alphaI-XII-like; 1.
DR   CDD; cd01450; vWFA_subfamily_ECM; 1.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR   Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000187; CRF.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR002223; Kunitz_BPTI.
DR   InterPro; IPR036880; Kunitz_BPTI_sf.
DR   InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24020:SF86; COLLAGEN ALPHA-1(VII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF00473; CRF; 1.
DR   Pfam; PF00041; fn3; 8.
DR   Pfam; PF00014; Kunitz_BPTI; 1.
DR   Pfam; PF00092; VWA; 2.
DR   PRINTS; PR00759; BASICPTASE.
DR   PRINTS; PR00453; VWFADOMAIN.
DR   SMART; SM00060; FN3; 8.
DR   SMART; SM00131; KU; 1.
DR   SMART; SM00327; VWA; 2.
DR   SUPFAM; SSF57362; BPTI-like; 1.
DR   SUPFAM; SSF49265; Fibronectin type III; 6.
DR   SUPFAM; SSF53300; vWA-like; 2.
DR   PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR   PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR   PROSITE; PS50853; FN3; 8.
DR   PROSITE; PS50234; VWFA; 2.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EQB77096.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000030684};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..2695
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5004567670"
FT   DOMAIN          41..214
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          237..332
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          333..419
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          420..510
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          513..600
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          603..690
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          741..829
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          832..919
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          921..1014
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          1015..1171
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   DOMAIN          2500..2553
FT                   /note="BPTI/Kunitz inhibitor"
FT                   /evidence="ECO:0000259|PROSITE:PS50279"
FT   REGION          1788..1812
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2099..2412
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2622..2653
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2116..2143
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2202..2225
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2273..2287
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2622..2641
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2695 AA;  288195 MW;  30B39B6DEAF6F972 CRC64;
     MGRMRLRLLV AALCAGILAR APRVRAQHRE RVTCTRLYAA DIVFLLDGSS SIGRSNFREV
     RAFLEGLVLP FSGAAGAQGV RFAAVQYSDD PRTEFGLDAL GSGSDVIRAI RELSYKGGNT
     RTGAAILHVA DHVFQPQLAR PGVPKVCILI TDGKSQDLVD PAAQRLKGQG IKLFAVGIKN
     ADPEELKRVA SQPTSDFFFF VNDFSILRTL LPLVSRRVCM TAGGVPVTLP PEVLTSGPRD
     LVLSEPGSQS LRVQWTAASG PVTGYKVQYA PLTGLGQPVS SERREVSVSA GETNVRLQGL
     RPLTEYQVTV VALYANSIGE AVSGTARTTA LEGPELTIQN TTAHSLLVTW RTVPGATGYR
     VTWRVLSGGA TQQQELGPGQ GSVLLRDLEP GTDYEVTVRA LLGRSMGPAT SLTARTDTSV
     EQTLRAVILG PTSILLSWNL VPEARGYRLE WRRESGLEVP QKVVLPSDVT RYQLDGLQPG
     TEYRLTLYTL LEGREVATPA TVVPTGSEMP VGPVMDLQAT ELPGQRMRVS WSPVPSATEY
     RITVRSTQGV ERTLVLPGSQ TAFDLDDVRA GLSYMVRVLA RVGTREGAAS VLTVHREPET
     PLAIPELRVV ASDATRVRVA WGPVPGASGF RISWRTDNGL ESSQTLPPES TATDITGLRP
     GTSYQVAVSA LRGREEGPAA VIVARTGPEK SQLVSGEATL AEVDGLEPDM EYTVRVWAHA
     AGVDGTPASV VVRTDPQPVG SVSKLQIINA SSDVLRITWV GVTGATAYRL AWGRSEGGPM
     REQMLPGNTD SAEIRGLEGG VSYSVRVTAL VGDREGAPVS IVVTTPPAAP PALETLRVVQ
     RGEHSLRLGW QPVPGARGFR LRWRPEGGQE QSRVLGPELS SYELDRLEPA THYRIWLSVL
     GPAGEGAPSE VTAYTELPRV PSTELRVVDT SIDSVTLAWT PVSGVSSYIL SWRPLRGPGQ
     ELPVASQTLP GVSNSQRVTG LEPGTSYIFS LTPVREGVQG HEASVTHNPV CPQGLMDVVF
     LLHTTRDNAH RAEAVGLLSY SHRPSPLFSL NSSHDLGVIL QKIRNIPYMD PSGNNLGTAV
     VTAHRYLLAP DAPGRRQHVP GVMVLLVDEP LRGDIFSPIR EAQAAGLKVM MLGLAGADQE
     QLRRLVPGMD PVQTFFAVDD GLSLDRAVSS LATALCQTAL TTQGQKGEPG EMDKLGLPDP
     LACRAGLVFL APRGLLEEPL QRARGVSLGQ MGLQAALAAL GLLEPLAQRA PQGGRALVEN
     LESLDKSLEA RDQGFLGGKG TLDYRVTKVI VGRGVFLEVL DPKAQLAPLE KKEKRATVRM
     EPQASQGNLG PRVSGAYGDF LEMLAPKANA DPLAQWDPRD HRELLDVLEL RVLKGHQDPL
     AAEERRGSLV ALETLQWEMW GRLGPKELLE SKESGAHLAW FFLETLAPRE TLETGVPSAS
     LAEQDPQVTR GLLERRETLG GLVPQVLLAP EDEMVKLERK VTRVPRVTQV CLEKLASVAF
     GGHLELGGLW VRKETREILE RMDEMGVLDY LDPRATVGSQ VPQDPLDGWE SLGTVDRRVL
     EDPRVTPAPL EPLGRGASVD FGGPQAHRGT QVFEAQQEKR VTEVPLAWMA AVGWMGNQEP
     LAPLGCRENL GTLAKTGGRV VMAPRVSVEL LAALDSRAPQ AFRGRSALLA RAFLAFREIQ
     APRVTVERLD PKENSLHRGE PGPWGHAKEG LPVDLSSFNW GLKGPLGKRP GEEGMEAVLG
     GQCWVSFFAG SPSPSLCPQN VERLLENIGI KTSALREIVE TWDESSGSFL PVPERRRGPK
     GDPGERGPPG KEVAPLAFLE TAGRREIVET LALRGHLACP LGRGVPLDLL ALPGSLGSLV
     FLGSQAGLGV WERQEDQERG ENGERKENVE NRAETALLAS LDPPGPLAPR GSQAVKVTKA
     PKETGVYQES VVWLGLKGSR VATETLDRLV PRAFLDQLDL WDFLAPLVLQ ALWVLKGHQV
     CLDKWGRQGS RECQVVTVSV EKMETEEPLV CRGPQVCLAL SDLKESLDPW GPLDRLWSGP
     LEQRERRELL EALLETWWES REPKVTEDCQ GLGARRVKPA VWESLETLVK MVRKGLQDPK
     VKRVWQASQG EREPQAPQGH LDHQGQREHL APLDSKETRE TLEQGFPGPE ASVGSQVSGC
     GPPGSRGERG EKGDAGPPGL KGEKGDSAVI MGPPGPRGAK GDMGERGPRG IDGDKGPRGD
     SGDPGEKGEP GAAGIPGDPG SPGKDGAPGV RGDKGDVGFM GPRGLKGERG IKGACGLDGE
     KGDKGEAGPL GRPGLAGRKG DMGEPGVPGQ SGAPGKEGLI GPKGDRGFDG QPGPKGDQGE
     KGERGPPGIG GFPGPRGNDG SSGPPGPPGS VGPKGPEGLQ GQKGERGPPG ESVVGAPGAP
     GTPGERGEQD PSQVMLQTLL APSSMLCLCS VSLMPRRKVR TAGAVGQHEP GPTVAPLTAH
     PTVRAGRLPP EDDEYEYSEY SVEEYQDPEA PWDGDAPDPC SLPLDEGSCT AYTLRWYHRA
     MPGGMEACHP FVYGGCGGNA NRFGTREACE RRCPPRMAQS QGTAPHPSSA PCFSVPTCWP
     DHTMTSWALL VLMVLTLGRT LPVPATPIPA FQLLPENFPQ ATPCPVTSES PSASTTGLSA
     AWGHPSPGPR PGPHITLSLD VPLGLLQILL EQAQARAARE QAAANARILA HIGRR
//