ID A0A165AGI0_9CRUS Unreviewed; 550 AA.
AC A0A165AGI0;
DT 06-JUL-2016, integrated into UniProtKB/TrEMBL.
DT 06-JUL-2016, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=Putative Pulmonary surfactant-associated protein D {ECO:0000313|EMBL:KZS17619.1};
GN ORFNames=APZ42_016507 {ECO:0000313|EMBL:KZS17619.1};
OS Daphnia magna.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Crustacea; Branchiopoda;
OC Diplostraca; Cladocera; Anomopoda; Daphniidae; Daphnia.
OX NCBI_TaxID=35525 {ECO:0000313|EMBL:KZS17619.1, ECO:0000313|Proteomes:UP000076858};
RN [1] {ECO:0000313|EMBL:KZS17619.1, ECO:0000313|Proteomes:UP000076858}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Xinb3 {ECO:0000313|EMBL:KZS17619.1,
RC ECO:0000313|Proteomes:UP000076858};
RC TISSUE=Complete organism {ECO:0000313|EMBL:KZS17619.1};
RA Gilbert D.G., Choi J.-H., Mockaitis K., Colbourne J., Pfrender M.;
RT "EvidentialGene: Evidence-directed Construction of Genes on Genomes.";
RL Submitted (MAR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KZS17619.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LRGB01000626; KZS17619.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A165AGI0; -.
DR STRING; 35525.A0A165AGI0; -.
DR EnsemblMetazoa; XM_045172314.1; XP_045028249.1; LOC116921678.
DR Proteomes; UP000076858; Unassembled WGS sequence.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR InterPro; IPR008160; Collagen.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000076858};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..550
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007855304"
FT REGION 66..85
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 108..550
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..85
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..260
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 409..426
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 441..474
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 483..527
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 550 AA; 50741 MW; 091453B32A6AE594 CRC64;
MKAVVILPFL VATAFAAPQY GATPQYGPAP QTVEVSPQVS LEQWGTQTSG AFASAHIEPA
QHHVQAAAHA STGSWGGSPS SSNLQIRPRP QMEEIQRQWE QFIEYLPWLK GPAGPPGPPG
HAGADGAQGG GYGGGQSQQT VVAGPPGPPG APGPAGPKGD SGNPGTPGGP GSQGPAGLNG
APGAPGPAGE RGSNGAPGAP GGPGFPGAKG APGNNGAPGL NGAPGTPGRD GNNGAPGAPG
PKGETGAPGQ TSTSNSAGPA GPPGSPGRDG APGTPGRPGP QGPVGPAGAI GPAGSPGTNG
FPGTPGPKGE AGSPGTPGGP GNDGRPGAPG TSGPAGPAGP VGPPGGNGGP GKDGLSGRPG
APGKDGFPGG PGLPGGPGQP GQPGKDGFNG APGAPGSPGS LGPAGPQGKP GAPGSPGGPG
GPGPAGPAGG PGSQGQPGTP GFPGGPGPAG PQGAPGPQGP TGPEGRPGSP GTPGPAGPAG
ATGSAGISTS IQSPAYEVPA HQTGHQQAPA AQPSHQETAQ PPFWAAPTQP SWSTPPQPSV
QAPQSAYGRR
//