GenomeNet

Database: UniProt
Entry: A0A226EI35_FOLCA
ID   A0A226EI35_FOLCA        Unreviewed;       972 AA.
AC   A0A226EI35;
DT   25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT   25-OCT-2017, sequence version 1.
DT   24-JAN-2024, entry version 15.
DE   SubName: Full=Saxiphilin {ECO:0000313|EMBL:OXA56246.1};
GN   ORFNames=Fcan01_09120 {ECO:0000313|EMBL:OXA56246.1};
OS   Folsomia candida (Springtail).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC   Entomobryomorpha; Isotomoidea; Isotomidae; Proisotominae; Folsomia.
OX   NCBI_TaxID=158441 {ECO:0000313|EMBL:OXA56246.1, ECO:0000313|Proteomes:UP000198287};
RN   [1] {ECO:0000313|EMBL:OXA56246.1, ECO:0000313|Proteomes:UP000198287}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=VU population {ECO:0000313|EMBL:OXA56246.1,
RC   ECO:0000313|Proteomes:UP000198287};
RC   TISSUE=Whole body {ECO:0000313|EMBL:OXA56246.1};
RA   Faddeeva A., Derks M.F., Anvar Y., Smit S., Van Straalen N., Roelofs D.;
RT   "The genome of Folsomia candida.";
RL   Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OXA56246.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LNIX01000004; OXA56246.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A226EI35; -.
DR   OMA; DSALCYC; -.
DR   Proteomes; UP000198287; Unassembled WGS sequence.
DR   CDD; cd00191; TY; 3.
DR   Gene3D; 4.10.800.10; Thyroglobulin type-1; 4.
DR   InterPro; IPR000716; Thyroglobulin_1.
DR   InterPro; IPR036857; Thyroglobulin_1_sf.
DR   PANTHER; PTHR12352:SF3; NIDOGEN-2; 1.
DR   PANTHER; PTHR12352; SECRETED MODULAR CALCIUM-BINDING PROTEIN; 1.
DR   Pfam; PF00086; Thyroglobulin_1; 4.
DR   SMART; SM00211; TY; 4.
DR   SUPFAM; SSF57610; Thyroglobulin type-1 domain; 4.
DR   PROSITE; PS51162; THYROGLOBULIN_1_2; 4.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00500}; Reference proteome {ECO:0000313|Proteomes:UP000198287};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..972
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012691656"
FT   DOMAIN          165..230
FT                   /note="Thyroglobulin type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS51162"
FT   DOMAIN          232..293
FT                   /note="Thyroglobulin type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS51162"
FT   DOMAIN          633..698
FT                   /note="Thyroglobulin type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS51162"
FT   DOMAIN          700..762
FT                   /note="Thyroglobulin type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS51162"
FT   DISULFID        261..268
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT   DISULFID        729..736
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ   SEQUENCE   972 AA;  108018 MW;  587EEAF5DFDA4E5C CRC64;
     MSFAKFKNFS KSEIFFIILI TVSIKSLASS NLNPEIDNDC PGFSCPKILC DLKRKNVQDF
     CENVQCFGIE VTSCANITHA SDKGMLIYEE RLTMCGCCPG CAKYQGLDEK CEQYPLPPTP
     DPACTGIVCP ILMNTCAPGL ICPLVGPNAK NCSYNSTSPE ESAKLTSCGD KRALALHDDL
     HWTPQCEKDG SYSPKQCKGL RANGICFCVS KEGDRIFGTE NRRQAENQTC ACSRQVHELR
     ESGHIASFHC TPDGNYEPLQ CDTDSALCYC VEPKTGKMTG AVLPENEWKK LPCFSLNLTN
     SNPKFGYYRV CESEWAPAQK LSLEASLHGL DVWSGRRLNC DFDGSYAPVQ NIDKITKCVN
     KDSTLIAAYH DTEGKFDTDC QCARDAIKSS QEGITFGKRC AATGNYDMIS PIDQTQPRRW
     RCVDRDGIFF GEDTPVDDKY KCCLYGNLVT YPPRNCDKEP IEVQEQCYDP TKDDEWNNFA
     KMRDKLLYVK YYSNNLLVLA GLILFVVIIP SSIVNCSALC PKALSSIPNV CDRFECKRNM
     TEADCPLDTI FVKGLTLCGC CPGCAKYIGF TKASSNATQR NCTNSKNDIV KSPEPLCAGP
     NCPILINQCF PGLVCNAFNG SCSLPTSPRN SCTEECMYKK SLKRLGLFHW DPICEEDGSY
     APKQCKGDHV TGICFCVDPC GRRIFGQEFR ADSKNQTCAC SRLANNLRTS GNVATLHCTP
     DGNFEPLQCD TDSALCYCVH ERTGKLLHGA VVPETRWKSL PCLTPNVTNF LQNGHYLRKC
     ESNFAANEKF INYWKIHGLE NVTISLPHFN CDYDGSFGLV QESESRYDCV FKNGSKIGDF
     SSITDQVDCS KDKNKKKKEI IFSKIINYLL AFYLSDCARD QINYQIKGRP FSIECVRNVG
     NYPNSVDFGS HASCIDRDGI PYGDAVPSKY ACCLSNDCVL ATAMECQQNG YMSCCSWPYG
     PECTYETTDA TT
//