ID A0A1X7U7U7_AMPQE Unreviewed; 1330 AA.
AC A0A1X7U7U7;
DT 05-JUL-2017, integrated into UniProtKB/TrEMBL.
DT 05-JUL-2017, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
GN Name=100634948 {ECO:0000313|EnsemblMetazoa:Aqu2.1.23728_001};
OS Amphimedon queenslandica (Sponge).
OC Eukaryota; Metazoa; Porifera; Demospongiae; Heteroscleromorpha;
OC Haplosclerida; Niphatidae; Amphimedon.
OX NCBI_TaxID=400682 {ECO:0000313|EnsemblMetazoa:Aqu2.1.23728_001, ECO:0000313|Proteomes:UP000007879};
RN [1] {ECO:0000313|Proteomes:UP000007879}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20686567; DOI=10.1038/nature09201;
RA Srivastava M., Simakov O., Chapman J., Fahey B., Gauthier M.E., Mitros T.,
RA Richards G.S., Conaco C., Dacre M., Hellsten U., Larroux C., Putnam N.H.,
RA Stanke M., Adamska M., Darling A., Degnan S.M., Oakley T.H.,
RA Plachetzki D.C., Zhai Y., Adamski M., Calcino A., Cummins S.F.,
RA Goodstein D.M., Harris C., Jackson D.J., Leys S.P., Shu S., Woodcroft B.J.,
RA Vervoort M., Kosik K.S., Manning G., Degnan B.M., Rokhsar D.S.;
RT "The Amphimedon queenslandica genome and the evolution of animal
RT complexity.";
RL Nature 466:720-726(2010).
RN [2] {ECO:0000313|Proteomes:UP000007879}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lucas S., Shapiro H., Lindquist E., Tice H., Dalin E., Glavina del Rio T.,
RA Bruce D., Barry K., Pitluck S., Srivastava M., Simakov O., Chapman J.,
RA Mitros T., Hellsten U., Putnam N.H., Fahey B., Gauthier M., Larroux C.,
RA Richards G.S., Stanke M., Adamska M., Darling A., Dacre M., Degnan S.M.,
RA Zhai Y., Adamski M., Calcino A., Cummins S.F., Goodstein D.M., Harris C.,
RA Shu S., Woodcroft B., Leys S.P., Manning G., Degnan B.M., Rokhsar D.S.;
RT "The genome of the haplosclerid demosponge Amphimedon queenslandica and the
RT evolution of animal complexity.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblMetazoa:Aqu2.1.23728_001}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2017) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003388783.2; XM_003388735.3.
DR STRING; 400682.A0A1X7U7U7; -.
DR EnsemblMetazoa; Aqu2.1.23728_001; Aqu2.1.23728_001; Aqu2.1.23728.
DR GeneID; 100634948; -.
DR KEGG; aqu:100634948; -.
DR eggNOG; KOG3544; Eukaryota.
DR InParanoid; A0A1X7U7U7; -.
DR OrthoDB; 2970887at2759; -.
DR Proteomes; UP000007879; Unassembled WGS sequence.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR Gene3D; 2.60.120.1000; -; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR000885; Fib_collagen_C.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR Pfam; PF01410; COLFI; 1.
DR Pfam; PF01391; Collagen; 3.
DR SMART; SM00038; COLFI; 1.
DR PROSITE; PS51461; NC1_FIB; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007879};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1330
FT /note="Fibrillar collagen NC1 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010862329"
FT DOMAIN 1153..1330
FT /note="Fibrillar collagen NC1"
FT /evidence="ECO:0000259|PROSITE:PS51461"
FT REGION 32..1141
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..64
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..116
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..169
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 284..301
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..331
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 441..455
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1330 AA; 126085 MW; CEDC3B18ACFCE8BE CRC64;
MIMAAQALIL ALIGVSLCTL AYGAAPFTKR QDGVTDTDPF GDPGPPGPPG PNLAPVPGPP
GPPGSRGSDG FDGLDGSAGS DGATGPPGDP GVPGPIGPPG DPGAPAFIPP PEQGIKGPII
IPIVGDRGDR GQSGVPGRKG VPGPPGHIGP KGTRGHRGMD GEKGPAGDQG EVGKEGVIGA
SGPNGYPGDP GPTGQIGLPG IPGFPGLQGA KGNRGADGAA GEPGDQGPNG PPGEVGNIGP
EGPRGPDGDV GAKGGDGETG ADGPAGPPGP DGFPGQQGAP GAPGPQGPQG PNGPTGPPGA
PGKRGPAGNQ GPPGSDGSPG SPGSPGTPGQ NGATGPPGPA GAKGSMGSPG RTGLAGIRGQ
TGEPGDTGPR GQPGRKGNNG ADGDKGNTGP QGKAGERGEK GFRGANGAPG DQGEDGKQGN
AGPAGPRGAK GPPGHGGVPG ENGKDGDKGE PGEKGEKGAQ GIPGLPGYDG VPGGAGVPGR
PGARGAKGAR GDRGQPGVNG PDGTDGIQGP PGPSGIPGPH GPAGPVGDTG APGQKGSRGA
DGQPGAAGPP GQPGETGSAG DPGNKGDKGV PGVTGKAGDQ GATGASGVHG PAGDQGTKGA
RGGAGPTGAK GATGAKGRAG ETGQIGAAGE PGQPGKQGPQ GGVGLPGTTG AHGERGLQGK
TGSPGPAGAR GGPGGRGKKG PQGRPGFQGV QGDVGQQGPD GAPGRAGPLG AAGNPGPNGF
QGATGGPGPQ GPAGGAGATG DKGTVGPQGV AGGPGPAGRP GDTGAAGERG KKGPTGSQGS
RGPSGAPGKP GLPSSQGSQG PAGASGAAGV PGLPGKTGPT GPKGPQGLPG PAGSKGARGV
PGQAGPKGPR GPAGSRGEQG KQGDAGPTGK AGADGEKGRA GAPGEPGEAG VPGPQGSKGA
QGKSGIPGRP GAKGEVGYPG LIGAKGFMGE NGAKGPKGDK GPKGERGPAG PKGANGDRGG
QGPDGPTGSP GAQGEPGQKG PQGYTGLPGE DGKRGPPGTD GRPGDPGKEG PAGAQGDPGP
AGGPGAPGAR GDQGPKGVPG DKGDQGDRGL PGHKGDPGLQ GGVGAPGQAG PQGATGPQGP
QGPEGPKGEQ GDRGNRGGFG PQGEKGSPGE QGLHGSVGPV GPAGPPGPAG APGEPYKGPA
FYRSKFDEKQ DISQLKKQFL FVELKSIEEE MNKMGINEKL PRTCEDLNMQ HPEYSNGEYT
IDPNMGSTKD SFKSFCEFKS STIRTCVNNE TSTSQLGYLH LLHTHVSQTI QLPCGAKGPF
YLQPYDSDTA IEVPLKDRKD LKITLSGCSP FSMLREVEFV SDNVNQQMLP FTHSINVNNQ
DYYFKDICFY
//