ID F2U0G2_SALR5 Unreviewed; 1971 AA.
AC F2U0G2;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EGD80890.1};
GN ORFNames=PTSG_11741 {ECO:0000313|EMBL:EGD80890.1};
OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN [1] {ECO:0000313|Proteomes:UP000007799}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "Annotation of Salpingoeca rosetta.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL832958; EGD80890.1; -; Genomic_DNA.
DR RefSeq; XP_004997451.1; XM_004997394.1.
DR STRING; 946362.F2U0G2; -.
DR EnsemblProtists; EGD80890; EGD80890; PTSG_11741.
DR GeneID; 16078048; -.
DR KEGG; sre:PTSG_11741; -.
DR eggNOG; KOG1218; Eukaryota.
DR eggNOG; KOG3627; Eukaryota.
DR InParanoid; F2U0G2; -.
DR OrthoDB; 5404432at2759; -.
DR Proteomes; UP000007799; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd06263; MAM; 2.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 2.30.180.10; FAS1 domain; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR036378; FAS1_dom_sf.
DR InterPro; IPR000782; FAS1_domain.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR000998; MAM_dom.
DR PANTHER; PTHR23282; APICAL ENDOSOMAL GLYCOPROTEIN PRECURSOR; 1.
DR PANTHER; PTHR23282:SF101; RT07201P-RELATED; 1.
DR Pfam; PF02469; Fasciclin; 1.
DR Pfam; PF00629; MAM; 2.
DR SMART; SM00554; FAS1; 1.
DR SMART; SM00137; MAM; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF82153; FAS1 domain; 1.
DR PROSITE; PS50213; FAS1; 1.
DR PROSITE; PS51465; KAZAL_2; 1.
DR PROSITE; PS50060; MAM_2; 3.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000007799};
KW Signal {ECO:0000256|SAM:SignalP}; Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1971
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003290148"
FT TRANSMEM 1896..1916
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 215..270
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 294..513
FT /note="MAM"
FT /evidence="ECO:0000259|PROSITE:PS50060"
FT DOMAIN 891..1071
FT /note="MAM"
FT /evidence="ECO:0000259|PROSITE:PS50060"
FT DOMAIN 1265..1469
FT /note="MAM"
FT /evidence="ECO:0000259|PROSITE:PS50060"
FT DOMAIN 1481..1616
FT /note="FAS1"
FT /evidence="ECO:0000259|PROSITE:PS50213"
FT REGION 422..445
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1084..1116
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1287..1327
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1732..1759
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1092..1116
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1313..1327
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1735..1759
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1971 AA; 206046 MW; 3DF87E7E442C408D CRC64;
MARAFCATAL TAVFLLLLST LSVAQQGTNT QDEQSDLHNL PIATCPMNTT TALNRSCVTG
GSDIVCSLDG TEYRNAACAV CHGVVGIVPG KCPEVVATCT SVADMQAVRA CVEGTQDLVC
GTSMITYRNI HCLDCSGDAY LRTGVCANPA PPIVDTCSSN TTCGLSFLDA CQTQDASYVC
HNGTTYKNAA CAMCNDVPRS AAGTGRCYSP PAQPSPFFED CPESAPPALK SACSASSVDP
VCGWDGVTYR SHACAVCNNV RAPTAGTCVS SNVSDVHCRD GDCGVNAHCV RDVVDCDFET
PCAWGSVHDL GSSSSVAGDG TSDTESAVSF HVVDVSASNN IDGVPSVDHT YMNRTRGHYI
VASAHTRARL QSLLFSTRDP VTHHACHVAF AYAFSGVGAH LDVFAATASE LASPTLVLSL
SSSSSSNNNN NNSSSTSPND NSTATAAQVW STAHVPAEAL VGVGEEVVVR FALSYDEGGA
GGSNSSSVAS GVYVALDDIA LVCDGVQEHV CRCDDGFHNS AAGMDSDVLD CRPIEGDTNT
TTATATAESN AVSVCRAGAT CEEVLECEQG APDLVCAAGT HFKNRACAKC NYEMPSAVYS
GVCRAVPDDG AFIDACPATA VELFQSACEN QPVDVVCSDG RNYKNTACAQ CNGFVSFATS
SAVSSVPSVF FGPCSTVISA RTRDIVTTCP DTASAAEVSS CSSGATFVIC GSDGNQYRNL
ACMRCNSASF SGLGLCGAHA QVEDGRAVIS TCPASSNPFF ATLCNTREPR IVCANGFNFK
NTFCARCNGY SSEVTTSPCL TLQYPVVETC SGSSSSTTTT TTSEDQCHDF TPDPVCGSDG
NSYINRPCAA CNGLNPNVDV VDGACYGQEM VYEPCANVTC DINASCAKTL LDCDFETDMC
GWRNSKGDDL QFQRHSGSTS TAQTGPTVDH TLNTTEGVYV FMDASDGASG HEAHLLSPLL
TALQFGHSCV LEFYYHMWGA DVHSLSVEVI RKQEDNTWKE LWRVDAGNTP RAEHRNTWRA
AHVDLYALRD VGVVQLRFVA RRGDGPQGDV ALDDVRMVCG DESSGTCVCN PGWIGDGHTC
VRDPDEVVPT TAPPPSNSSS TNSTPATLSP STTTQAATDT TFATTTSAIV ENVVESCNLP
GGIIDPCEYT AESFVCGDDD VTYKNAVCAL CNGMTQVVSG PCTKSGDAAT VHTCTDLQSP
LFLQCNFQSE ELVCADGSLY RNRPCALCAG YAEDKIEDGP CDGSIFDPCS AVQCVDNAVC
THTAADCDFE DAAMPLCGWT IDSDDGDGGN SGGDGGGDDG GGGGASGRFS WVRWSGPTPS
SQTGPSVDHT LGNATGHYMY AEATGGSQGS RTFMQSPVFA PSSLGSGCFL EFWFHMFGQH
VEPLTVQVRE IRNATAWEDV WRIYPPANTA DAEHDVWRRH ATISLAAYTG PIQVRIVAER
GSGIEGDVGI DDIRVLCSAE TVGQCVCKEG YAMSSDGTAC VPNILAYLLS RPDTQLALQL
IKASGLGDFV ASAQGLTLFA PSDAAVLSAS VNPDVDLTDR DTLRSFVLHH LVGAPLLPRQ
LVSGQRLNTL FVTDAGFPQT LLVQRQNSQL TVNGATVTTP NLHASNGVIH IVDDIIAFDE
DLCGGTECTD TQECTLAYSS NFTRSDSCDC PSAFSSVCAI SPSGPQLFFS PCVAECLDAP
IVGTGTCGAP QCQDLESCDG VLCDLHATCT PTGCQCALRY VPANPALGGQ RGNCVLPSER
TTTSTTTTTT TTVESMSTTT AVTTGAVTPA ATTHSDTSST SSTSSAATTT PVMVAFVSSA
RFDAELATED RADFVQALRE SVVDAGVARY DILGVVFESD ENGVVVLINL RTEQDKALVD
AAIADGDIRV LFSSQTFIAA PASSPDSDQN GSPKTAAIAG SVVAVAIVIG VVYIMYRRGL
CARKSSVGYD FTAPSVAFDN PMYDAHEGIV TKLSGFDDGM HGEDGYMTVE A
//