ID   S9XFI9_CAMFR            Unreviewed;       704 AA.
AC   S9XFI9;
DT   16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT   16-OCT-2013, sequence version 1.
DT   27-MAR-2024, entry version 31.
DE   RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
GN   ORFNames=CB1_000311001 {ECO:0000313|EMBL:EPY86469.1};
OS   Camelus ferus (Wild bactrian camel) (Camelus bactrianus ferus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Tylopoda; Camelidae; Camelus.
OX   NCBI_TaxID=419612 {ECO:0000313|EMBL:EPY86469.1, ECO:0000313|Proteomes:UP000030684};
RN   [1] {ECO:0000313|EMBL:EPY86469.1, ECO:0000313|Proteomes:UP000030684}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=bactrian camel {ECO:0000313|Proteomes:UP000030684};
RX   PubMed=23149746;
RG   Bactrian Camels Genome Sequencing and Analysis Consortium;
RA   Jirimutu, Wang Z., Ding G., Chen G., Sun Y., Sun Z., Zhang H., Wang L.,
RA   Hasi S., Zhang Y., Li J., Shi Y., Xu Z., He C., Yu S., Li S., Zhang W.,
RA   Batmunkh M., Ts B., Narenbatu, Unierhu, Bat-Ireedui S., Gao H.,
RA   Baysgalan B., Li Q., Jia Z., Turigenbayila, Subudenggerile, Narenmanduhu,
RA   Wang Z., Wang J., Pan L., Chen Y., Ganerdene Y., Dabxilt, Erdemt, Altansha,
RA   Altansukh, Liu T., Cao M., Aruuntsever, Bayart, Hosblig, He F., Zha-ti A.,
RA   Zheng G., Qiu F., Sun Z., Zhao L., Zhao W., Liu B., Li C., Chen Y.,
RA   Tang X., Guo C., Liu W., Ming L., Temuulen, Cui A., Li Y., Gao J., Li J.,
RA   Wurentaodi, Niu S., Sun T., Zhai Z., Zhang M., Chen C., Baldan T.,
RA   Bayaer T., Li Y., Meng H.;
RT   "Genome sequences of wild and domestic bactrian camels.";
RL   Nat. Commun. 3:1202-1202(2012).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KB016603; EPY86469.1; -; Genomic_DNA.
DR   AlphaFoldDB; S9XFI9; -.
DR   Proteomes; UP000030684; Unassembled WGS sequence.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1090; EMI DOMAIN-CONTAINING PROTEIN 1 ISOFORM X1; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000030684};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT   DOMAIN          529..704
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          55..148
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          370..503
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          596..615
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        77..103
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        402..422
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   704 AA;  74872 MW;  F79F936096EFFA91 CRC64;
     MMAVLVGADG IRGLKGTKGE KPLTGQLENR GRLAHLVPEG KTAPKAPRAV EVRTVTRALW
     GPPGRRENSA SQGYRVTQED KGRRAHLGSR DHGDSEAQRA HEGKEAPAAS PGSLAPRATL
     EVTARPALRA NGDPTDPKDP QGSLDRRAPR VIQAPPAFLG KTDPQGYVAS LGTEGFLAQW
     GDGSEAREHA HPHWSRVGPG KAVVHEGEIG EPGQKGSKGD KGEQSRACNT AQLALGVGLP
     SLDGRLARGA ALMTSDARPL RSPAPLPGPQ LGFGKQLSLS SSHRVPLGLR DLKAPSDSQA
     PLELMASQVL VASRAFLGRK VTKVREVFLG PPDPWGCRVN LARLENLAFL EKGAPRPQVR
     RQRRLVDRVL LGFPLPPPPE DRGGPKGERG EKGESGPSGA AGPPGPKGPP GDDGPKGSPG
     PPGLPGLKGD SGPKGEKGHP GLIGLIGPPG EQGEKGDRGL PGPQGSSGPK GEQGPPGPKG
     AKGSSGPPGE VIQPLPIQAS RTRRHIDASQ LMDDGEGVES YMDYADGMEE VFGSLNSLKL
     EIEQMKRPLG TQQNPARTCK DLQLCHPDFP DVDPEPKCLT LQELARITSG FPSTRLQAEP
     RFPVPQSTCP RSKMARWPKE QPSTWYSQYK RGSLLSYVDA EGNPVGVVQM TFLRLLSASA
     HQNITYHCYQ SVAWQDAATG SYDKAMRLLG SNDEEMSYDN SRRR
//