ID T0MCC0_CAMFR Unreviewed; 2695 AA.
AC T0MCC0;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Collagen alpha-1(VII) chain {ECO:0000313|EMBL:EQB77096.1};
GN ORFNames=CB1_000145077 {ECO:0000313|EMBL:EQB77096.1};
OS Camelus ferus (Wild bactrian camel) (Camelus bactrianus ferus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX NCBI_TaxID=419612 {ECO:0000313|EMBL:EQB77096.1, ECO:0000313|Proteomes:UP000030684};
RN [1] {ECO:0000313|EMBL:EQB77096.1, ECO:0000313|Proteomes:UP000030684}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=bactrian camel {ECO:0000313|Proteomes:UP000030684};
RX PubMed=23149746;
RG Bactrian Camels Genome Sequencing and Analysis Consortium;
RA Jirimutu, Wang Z., Ding G., Chen G., Sun Y., Sun Z., Zhang H., Wang L.,
RA Hasi S., Zhang Y., Li J., Shi Y., Xu Z., He C., Yu S., Li S., Zhang W.,
RA Batmunkh M., Ts B., Narenbatu, Unierhu, Bat-Ireedui S., Gao H.,
RA Baysgalan B., Li Q., Jia Z., Turigenbayila, Subudenggerile, Narenmanduhu,
RA Wang Z., Wang J., Pan L., Chen Y., Ganerdene Y., Dabxilt, Erdemt, Altansha,
RA Altansukh, Liu T., Cao M., Aruuntsever, Bayart, Hosblig, He F., Zha-ti A.,
RA Zheng G., Qiu F., Sun Z., Zhao L., Zhao W., Liu B., Li C., Chen Y.,
RA Tang X., Guo C., Liu W., Ming L., Temuulen, Cui A., Li Y., Gao J., Li J.,
RA Wurentaodi, Niu S., Sun T., Zhai Z., Zhang M., Chen C., Baldan T.,
RA Bayaer T., Li Y., Meng H.;
RT "Genome sequences of wild and domestic bactrian camels.";
RL Nat. Commun. 3:1202-1202(2012).
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the sauvagine/corticotropin-releasing
CC factor/urotensin I family. {ECO:0000256|ARBA:ARBA00009287}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB016442; EQB77096.1; -; Genomic_DNA.
DR Proteomes; UP000030684; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0005179; F:hormone activity; IEA:InterPro.
DR GO; GO:0004867; F:serine-type endopeptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00063; FN3; 8.
DR CDD; cd22627; Kunitz_collagen_alpha1_VII; 1.
DR CDD; cd01482; vWA_collagen_alphaI-XII-like; 1.
DR CDD; cd01450; vWFA_subfamily_ECM; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 9.
DR Gene3D; 4.10.410.10; Pancreatic trypsin inhibitor Kunitz domain; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 2.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000187; CRF.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR002223; Kunitz_BPTI.
DR InterPro; IPR036880; Kunitz_BPTI_sf.
DR InterPro; IPR020901; Prtase_inh_Kunz-CS.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR PANTHER; PTHR24020; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24020:SF86; COLLAGEN ALPHA-1(VII) CHAIN; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF00473; CRF; 1.
DR Pfam; PF00041; fn3; 8.
DR Pfam; PF00014; Kunitz_BPTI; 1.
DR Pfam; PF00092; VWA; 2.
DR PRINTS; PR00759; BASICPTASE.
DR PRINTS; PR00453; VWFADOMAIN.
DR SMART; SM00060; FN3; 8.
DR SMART; SM00131; KU; 1.
DR SMART; SM00327; VWA; 2.
DR SUPFAM; SSF57362; BPTI-like; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 6.
DR SUPFAM; SSF53300; vWA-like; 2.
DR PROSITE; PS00280; BPTI_KUNITZ_1; 1.
DR PROSITE; PS50279; BPTI_KUNITZ_2; 1.
DR PROSITE; PS50853; FN3; 8.
DR PROSITE; PS50234; VWFA; 2.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:EQB77096.1};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000030684};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..2695
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004567670"
FT DOMAIN 41..214
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 237..332
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 333..419
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 420..510
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 513..600
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 603..690
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 741..829
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 832..919
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 921..1014
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1015..1171
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT DOMAIN 2500..2553
FT /note="BPTI/Kunitz inhibitor"
FT /evidence="ECO:0000259|PROSITE:PS50279"
FT REGION 1788..1812
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2099..2412
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2622..2653
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2116..2143
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2202..2225
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2273..2287
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2622..2641
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2695 AA; 288195 MW; 30B39B6DEAF6F972 CRC64;
MGRMRLRLLV AALCAGILAR APRVRAQHRE RVTCTRLYAA DIVFLLDGSS SIGRSNFREV
RAFLEGLVLP FSGAAGAQGV RFAAVQYSDD PRTEFGLDAL GSGSDVIRAI RELSYKGGNT
RTGAAILHVA DHVFQPQLAR PGVPKVCILI TDGKSQDLVD PAAQRLKGQG IKLFAVGIKN
ADPEELKRVA SQPTSDFFFF VNDFSILRTL LPLVSRRVCM TAGGVPVTLP PEVLTSGPRD
LVLSEPGSQS LRVQWTAASG PVTGYKVQYA PLTGLGQPVS SERREVSVSA GETNVRLQGL
RPLTEYQVTV VALYANSIGE AVSGTARTTA LEGPELTIQN TTAHSLLVTW RTVPGATGYR
VTWRVLSGGA TQQQELGPGQ GSVLLRDLEP GTDYEVTVRA LLGRSMGPAT SLTARTDTSV
EQTLRAVILG PTSILLSWNL VPEARGYRLE WRRESGLEVP QKVVLPSDVT RYQLDGLQPG
TEYRLTLYTL LEGREVATPA TVVPTGSEMP VGPVMDLQAT ELPGQRMRVS WSPVPSATEY
RITVRSTQGV ERTLVLPGSQ TAFDLDDVRA GLSYMVRVLA RVGTREGAAS VLTVHREPET
PLAIPELRVV ASDATRVRVA WGPVPGASGF RISWRTDNGL ESSQTLPPES TATDITGLRP
GTSYQVAVSA LRGREEGPAA VIVARTGPEK SQLVSGEATL AEVDGLEPDM EYTVRVWAHA
AGVDGTPASV VVRTDPQPVG SVSKLQIINA SSDVLRITWV GVTGATAYRL AWGRSEGGPM
REQMLPGNTD SAEIRGLEGG VSYSVRVTAL VGDREGAPVS IVVTTPPAAP PALETLRVVQ
RGEHSLRLGW QPVPGARGFR LRWRPEGGQE QSRVLGPELS SYELDRLEPA THYRIWLSVL
GPAGEGAPSE VTAYTELPRV PSTELRVVDT SIDSVTLAWT PVSGVSSYIL SWRPLRGPGQ
ELPVASQTLP GVSNSQRVTG LEPGTSYIFS LTPVREGVQG HEASVTHNPV CPQGLMDVVF
LLHTTRDNAH RAEAVGLLSY SHRPSPLFSL NSSHDLGVIL QKIRNIPYMD PSGNNLGTAV
VTAHRYLLAP DAPGRRQHVP GVMVLLVDEP LRGDIFSPIR EAQAAGLKVM MLGLAGADQE
QLRRLVPGMD PVQTFFAVDD GLSLDRAVSS LATALCQTAL TTQGQKGEPG EMDKLGLPDP
LACRAGLVFL APRGLLEEPL QRARGVSLGQ MGLQAALAAL GLLEPLAQRA PQGGRALVEN
LESLDKSLEA RDQGFLGGKG TLDYRVTKVI VGRGVFLEVL DPKAQLAPLE KKEKRATVRM
EPQASQGNLG PRVSGAYGDF LEMLAPKANA DPLAQWDPRD HRELLDVLEL RVLKGHQDPL
AAEERRGSLV ALETLQWEMW GRLGPKELLE SKESGAHLAW FFLETLAPRE TLETGVPSAS
LAEQDPQVTR GLLERRETLG GLVPQVLLAP EDEMVKLERK VTRVPRVTQV CLEKLASVAF
GGHLELGGLW VRKETREILE RMDEMGVLDY LDPRATVGSQ VPQDPLDGWE SLGTVDRRVL
EDPRVTPAPL EPLGRGASVD FGGPQAHRGT QVFEAQQEKR VTEVPLAWMA AVGWMGNQEP
LAPLGCRENL GTLAKTGGRV VMAPRVSVEL LAALDSRAPQ AFRGRSALLA RAFLAFREIQ
APRVTVERLD PKENSLHRGE PGPWGHAKEG LPVDLSSFNW GLKGPLGKRP GEEGMEAVLG
GQCWVSFFAG SPSPSLCPQN VERLLENIGI KTSALREIVE TWDESSGSFL PVPERRRGPK
GDPGERGPPG KEVAPLAFLE TAGRREIVET LALRGHLACP LGRGVPLDLL ALPGSLGSLV
FLGSQAGLGV WERQEDQERG ENGERKENVE NRAETALLAS LDPPGPLAPR GSQAVKVTKA
PKETGVYQES VVWLGLKGSR VATETLDRLV PRAFLDQLDL WDFLAPLVLQ ALWVLKGHQV
CLDKWGRQGS RECQVVTVSV EKMETEEPLV CRGPQVCLAL SDLKESLDPW GPLDRLWSGP
LEQRERRELL EALLETWWES REPKVTEDCQ GLGARRVKPA VWESLETLVK MVRKGLQDPK
VKRVWQASQG EREPQAPQGH LDHQGQREHL APLDSKETRE TLEQGFPGPE ASVGSQVSGC
GPPGSRGERG EKGDAGPPGL KGEKGDSAVI MGPPGPRGAK GDMGERGPRG IDGDKGPRGD
SGDPGEKGEP GAAGIPGDPG SPGKDGAPGV RGDKGDVGFM GPRGLKGERG IKGACGLDGE
KGDKGEAGPL GRPGLAGRKG DMGEPGVPGQ SGAPGKEGLI GPKGDRGFDG QPGPKGDQGE
KGERGPPGIG GFPGPRGNDG SSGPPGPPGS VGPKGPEGLQ GQKGERGPPG ESVVGAPGAP
GTPGERGEQD PSQVMLQTLL APSSMLCLCS VSLMPRRKVR TAGAVGQHEP GPTVAPLTAH
PTVRAGRLPP EDDEYEYSEY SVEEYQDPEA PWDGDAPDPC SLPLDEGSCT AYTLRWYHRA
MPGGMEACHP FVYGGCGGNA NRFGTREACE RRCPPRMAQS QGTAPHPSSA PCFSVPTCWP
DHTMTSWALL VLMVLTLGRT LPVPATPIPA FQLLPENFPQ ATPCPVTSES PSASTTGLSA
AWGHPSPGPR PGPHITLSLD VPLGLLQILL EQAQARAARE QAAANARILA HIGRR
//