Database: UniProt
Entry: A0A1X7U7U7_AMPQE
ID   A0A1X7U7U7_AMPQE        Unreviewed;      1330 AA.
AC   A0A1X7U7U7;
DT   05-JUL-2017, integrated into UniProtKB/TrEMBL.
DT   05-JUL-2017, sequence version 1.
DT   27-MAR-2024, entry version 32.
DE   RecName: Full=Fibrillar collagen NC1 domain-containing protein {ECO:0000259|PROSITE:PS51461};
GN   Name=100634948 {ECO:0000313|EnsemblMetazoa:Aqu2.1.23728_001};
OS   Amphimedon queenslandica (Sponge).
OC   Eukaryota; Metazoa; Porifera; Demospongiae; Heteroscleromorpha;
OC   Haplosclerida; Niphatidae; Amphimedon.
OX   NCBI_TaxID=400682 {ECO:0000313|EnsemblMetazoa:Aqu2.1.23728_001, ECO:0000313|Proteomes:UP000007879};
RN   [1] {ECO:0000313|Proteomes:UP000007879}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=20686567; DOI=10.1038/nature09201;
RA   Srivastava M., Simakov O., Chapman J., Fahey B., Gauthier M.E., Mitros T.,
RA   Richards G.S., Conaco C., Dacre M., Hellsten U., Larroux C., Putnam N.H.,
RA   Stanke M., Adamska M., Darling A., Degnan S.M., Oakley T.H.,
RA   Plachetzki D.C., Zhai Y., Adamski M., Calcino A., Cummins S.F.,
RA   Goodstein D.M., Harris C., Jackson D.J., Leys S.P., Shu S., Woodcroft B.J.,
RA   Vervoort M., Kosik K.S., Manning G., Degnan B.M., Rokhsar D.S.;
RT   "The Amphimedon queenslandica genome and the evolution of animal
RT   complexity.";
RL   Nature 466:720-726(2010).
RN   [2] {ECO:0000313|Proteomes:UP000007879}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Lucas S., Shapiro H., Lindquist E., Tice H., Dalin E., Glavina del Rio T.,
RA   Bruce D., Barry K., Pitluck S., Srivastava M., Simakov O., Chapman J.,
RA   Mitros T., Hellsten U., Putnam N.H., Fahey B., Gauthier M., Larroux C.,
RA   Richards G.S., Stanke M., Adamska M., Darling A., Dacre M., Degnan S.M.,
RA   Zhai Y., Adamski M., Calcino A., Cummins S.F., Goodstein D.M., Harris C.,
RA   Shu S., Woodcroft B., Leys S.P., Manning G., Degnan B.M., Rokhsar D.S.;
RT   "The genome of the haplosclerid demosponge Amphimedon queenslandica and the
RT   evolution of animal complexity.";
RL   Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EnsemblMetazoa:Aqu2.1.23728_001}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (MAY-2017) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_003388783.2; XM_003388735.3.
DR   STRING; 400682.A0A1X7U7U7; -.
DR   EnsemblMetazoa; Aqu2.1.23728_001; Aqu2.1.23728_001; Aqu2.1.23728.
DR   GeneID; 100634948; -.
DR   KEGG; aqu:100634948; -.
DR   eggNOG; KOG3544; Eukaryota.
DR   InParanoid; A0A1X7U7U7; -.
DR   OrthoDB; 2970887at2759; -.
DR   Proteomes; UP000007879; Unassembled WGS sequence.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   Gene3D; 2.60.120.1000; -; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR000885; Fib_collagen_C.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN ALPHA-1(X) CHAIN; 1.
DR   Pfam; PF01410; COLFI; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   SMART; SM00038; COLFI; 1.
DR   PROSITE; PS51461; NC1_FIB; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007879};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1330
FT                   /note="Fibrillar collagen NC1 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5010862329"
FT   DOMAIN          1153..1330
FT                   /note="Fibrillar collagen NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51461"
FT   REGION          32..1141
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        41..64
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        88..116
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        153..169
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        284..301
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        314..331
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        441..455
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1330 AA;  126085 MW;  CEDC3B18ACFCE8BE CRC64;
     MIMAAQALIL ALIGVSLCTL AYGAAPFTKR QDGVTDTDPF GDPGPPGPPG PNLAPVPGPP
     GPPGSRGSDG FDGLDGSAGS DGATGPPGDP GVPGPIGPPG DPGAPAFIPP PEQGIKGPII
     IPIVGDRGDR GQSGVPGRKG VPGPPGHIGP KGTRGHRGMD GEKGPAGDQG EVGKEGVIGA
     SGPNGYPGDP GPTGQIGLPG IPGFPGLQGA KGNRGADGAA GEPGDQGPNG PPGEVGNIGP
     EGPRGPDGDV GAKGGDGETG ADGPAGPPGP DGFPGQQGAP GAPGPQGPQG PNGPTGPPGA
     PGKRGPAGNQ GPPGSDGSPG SPGSPGTPGQ NGATGPPGPA GAKGSMGSPG RTGLAGIRGQ
     TGEPGDTGPR GQPGRKGNNG ADGDKGNTGP QGKAGERGEK GFRGANGAPG DQGEDGKQGN
     AGPAGPRGAK GPPGHGGVPG ENGKDGDKGE PGEKGEKGAQ GIPGLPGYDG VPGGAGVPGR
     PGARGAKGAR GDRGQPGVNG PDGTDGIQGP PGPSGIPGPH GPAGPVGDTG APGQKGSRGA
     DGQPGAAGPP GQPGETGSAG DPGNKGDKGV PGVTGKAGDQ GATGASGVHG PAGDQGTKGA
     RGGAGPTGAK GATGAKGRAG ETGQIGAAGE PGQPGKQGPQ GGVGLPGTTG AHGERGLQGK
     TGSPGPAGAR GGPGGRGKKG PQGRPGFQGV QGDVGQQGPD GAPGRAGPLG AAGNPGPNGF
     QGATGGPGPQ GPAGGAGATG DKGTVGPQGV AGGPGPAGRP GDTGAAGERG KKGPTGSQGS
     RGPSGAPGKP GLPSSQGSQG PAGASGAAGV PGLPGKTGPT GPKGPQGLPG PAGSKGARGV
     PGQAGPKGPR GPAGSRGEQG KQGDAGPTGK AGADGEKGRA GAPGEPGEAG VPGPQGSKGA
     QGKSGIPGRP GAKGEVGYPG LIGAKGFMGE NGAKGPKGDK GPKGERGPAG PKGANGDRGG
     QGPDGPTGSP GAQGEPGQKG PQGYTGLPGE DGKRGPPGTD GRPGDPGKEG PAGAQGDPGP
     AGGPGAPGAR GDQGPKGVPG DKGDQGDRGL PGHKGDPGLQ GGVGAPGQAG PQGATGPQGP
     QGPEGPKGEQ GDRGNRGGFG PQGEKGSPGE QGLHGSVGPV GPAGPPGPAG APGEPYKGPA
     FYRSKFDEKQ DISQLKKQFL FVELKSIEEE MNKMGINEKL PRTCEDLNMQ HPEYSNGEYT
     IDPNMGSTKD SFKSFCEFKS STIRTCVNNE TSTSQLGYLH LLHTHVSQTI QLPCGAKGPF
     YLQPYDSDTA IEVPLKDRKD LKITLSGCSP FSMLREVEFV SDNVNQQMLP FTHSINVNNQ
     DYYFKDICFY
//