GenomeNet

Database: UniProt
Entry: A0A165AGI0_9CRUS
LinkDB: A0A165AGI0_9CRUS
Original site: A0A165AGI0_9CRUS 
ID   A0A165AGI0_9CRUS        Unreviewed;       550 AA.
AC   A0A165AGI0;
DT   06-JUL-2016, integrated into UniProtKB/TrEMBL.
DT   06-JUL-2016, sequence version 1.
DT   27-MAR-2024, entry version 13.
DE   SubName: Full=Putative Pulmonary surfactant-associated protein D {ECO:0000313|EMBL:KZS17619.1};
GN   ORFNames=APZ42_016507 {ECO:0000313|EMBL:KZS17619.1};
OS   Daphnia magna.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda;
OC   Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia.
OX   NCBI_TaxID=35525 {ECO:0000313|EMBL:KZS17619.1, ECO:0000313|Proteomes:UP000076858};
RN   [1] {ECO:0000313|EMBL:KZS17619.1, ECO:0000313|Proteomes:UP000076858}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Xinb3 {ECO:0000313|EMBL:KZS17619.1,
RC   ECO:0000313|Proteomes:UP000076858};
RC   TISSUE=Complete organism {ECO:0000313|EMBL:KZS17619.1};
RA   Gilbert D.G., Choi J.-H., Mockaitis K., Colbourne J., Pfrender M.;
RT   "EvidentialGene: Evidence-directed Construction of Genes on Genomes.";
RL   Submitted (MAR-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KZS17619.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LRGB01000626; KZS17619.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A165AGI0; -.
DR   STRING; 35525.A0A165AGI0; -.
DR   EnsemblMetazoa; XM_045172314.1; XP_045028249.1; LOC116921678.
DR   Proteomes; UP000076858; Unassembled WGS sequence.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   InterPro; IPR008160; Collagen.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF01391; Collagen; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000076858};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..16
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           17..550
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5007855304"
FT   REGION          66..85
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          108..550
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        67..85
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        246..260
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        409..426
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        441..474
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        483..527
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   550 AA;  50741 MW;  091453B32A6AE594 CRC64;
     MKAVVILPFL VATAFAAPQY GATPQYGPAP QTVEVSPQVS LEQWGTQTSG AFASAHIEPA
     QHHVQAAAHA STGSWGGSPS SSNLQIRPRP QMEEIQRQWE QFIEYLPWLK GPAGPPGPPG
     HAGADGAQGG GYGGGQSQQT VVAGPPGPPG APGPAGPKGD SGNPGTPGGP GSQGPAGLNG
     APGAPGPAGE RGSNGAPGAP GGPGFPGAKG APGNNGAPGL NGAPGTPGRD GNNGAPGAPG
     PKGETGAPGQ TSTSNSAGPA GPPGSPGRDG APGTPGRPGP QGPVGPAGAI GPAGSPGTNG
     FPGTPGPKGE AGSPGTPGGP GNDGRPGAPG TSGPAGPAGP VGPPGGNGGP GKDGLSGRPG
     APGKDGFPGG PGLPGGPGQP GQPGKDGFNG APGAPGSPGS LGPAGPQGKP GAPGSPGGPG
     GPGPAGPAGG PGSQGQPGTP GFPGGPGPAG PQGAPGPQGP TGPEGRPGSP GTPGPAGPAG
     ATGSAGISTS IQSPAYEVPA HQTGHQQAPA AQPSHQETAQ PPFWAAPTQP SWSTPPQPSV
     QAPQSAYGRR
//
DBGET integrated database retrieval system