ID A0A226EI35_FOLCA Unreviewed; 972 AA.
AC A0A226EI35;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE SubName: Full=Saxiphilin {ECO:0000313|EMBL:OXA56246.1};
GN ORFNames=Fcan01_09120 {ECO:0000313|EMBL:OXA56246.1};
OS Folsomia candida (Springtail).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Isotomoidea; Isotomidae; Proisotominae; Folsomia.
OX NCBI_TaxID=158441 {ECO:0000313|EMBL:OXA56246.1, ECO:0000313|Proteomes:UP000198287};
RN [1] {ECO:0000313|EMBL:OXA56246.1, ECO:0000313|Proteomes:UP000198287}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=VU population {ECO:0000313|EMBL:OXA56246.1,
RC ECO:0000313|Proteomes:UP000198287};
RC TISSUE=Whole body {ECO:0000313|EMBL:OXA56246.1};
RA Faddeeva A., Derks M.F., Anvar Y., Smit S., Van Straalen N., Roelofs D.;
RT "The genome of Folsomia candida.";
RL Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OXA56246.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LNIX01000004; OXA56246.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A226EI35; -.
DR OMA; DSALCYC; -.
DR Proteomes; UP000198287; Unassembled WGS sequence.
DR CDD; cd00191; TY; 3.
DR Gene3D; 4.10.800.10; Thyroglobulin type-1; 4.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR PANTHER; PTHR12352:SF3; NIDOGEN-2; 1.
DR PANTHER; PTHR12352; SECRETED MODULAR CALCIUM-BINDING PROTEIN; 1.
DR Pfam; PF00086; Thyroglobulin_1; 4.
DR SMART; SM00211; TY; 4.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 4.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 4.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00500}; Reference proteome {ECO:0000313|Proteomes:UP000198287};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..972
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012691656"
FT DOMAIN 165..230
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 232..293
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 633..698
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DOMAIN 700..762
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT DISULFID 261..268
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
FT DISULFID 729..736
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ SEQUENCE 972 AA; 108018 MW; 587EEAF5DFDA4E5C CRC64;
MSFAKFKNFS KSEIFFIILI TVSIKSLASS NLNPEIDNDC PGFSCPKILC DLKRKNVQDF
CENVQCFGIE VTSCANITHA SDKGMLIYEE RLTMCGCCPG CAKYQGLDEK CEQYPLPPTP
DPACTGIVCP ILMNTCAPGL ICPLVGPNAK NCSYNSTSPE ESAKLTSCGD KRALALHDDL
HWTPQCEKDG SYSPKQCKGL RANGICFCVS KEGDRIFGTE NRRQAENQTC ACSRQVHELR
ESGHIASFHC TPDGNYEPLQ CDTDSALCYC VEPKTGKMTG AVLPENEWKK LPCFSLNLTN
SNPKFGYYRV CESEWAPAQK LSLEASLHGL DVWSGRRLNC DFDGSYAPVQ NIDKITKCVN
KDSTLIAAYH DTEGKFDTDC QCARDAIKSS QEGITFGKRC AATGNYDMIS PIDQTQPRRW
RCVDRDGIFF GEDTPVDDKY KCCLYGNLVT YPPRNCDKEP IEVQEQCYDP TKDDEWNNFA
KMRDKLLYVK YYSNNLLVLA GLILFVVIIP SSIVNCSALC PKALSSIPNV CDRFECKRNM
TEADCPLDTI FVKGLTLCGC CPGCAKYIGF TKASSNATQR NCTNSKNDIV KSPEPLCAGP
NCPILINQCF PGLVCNAFNG SCSLPTSPRN SCTEECMYKK SLKRLGLFHW DPICEEDGSY
APKQCKGDHV TGICFCVDPC GRRIFGQEFR ADSKNQTCAC SRLANNLRTS GNVATLHCTP
DGNFEPLQCD TDSALCYCVH ERTGKLLHGA VVPETRWKSL PCLTPNVTNF LQNGHYLRKC
ESNFAANEKF INYWKIHGLE NVTISLPHFN CDYDGSFGLV QESESRYDCV FKNGSKIGDF
SSITDQVDCS KDKNKKKKEI IFSKIINYLL AFYLSDCARD QINYQIKGRP FSIECVRNVG
NYPNSVDFGS HASCIDRDGI PYGDAVPSKY ACCLSNDCVL ATAMECQQNG YMSCCSWPYG
PECTYETTDA TT
//