ID A0A0L0DT47_THETB Unreviewed; 2325 AA.
AC A0A0L0DT47;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=Fibronectin type-III domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=AMSG_01789 {ECO:0000313|EMBL:KNC55524.1};
OS Thecamonas trahens ATCC 50062.
OC Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC55524.1, ECO:0000313|Proteomes:UP000054408};
RN [1] {ECO:0000313|EMBL:KNC55524.1, ECO:0000313|Proteomes:UP000054408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC55524.1,
RC ECO:0000313|Proteomes:UP000054408};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL349439; KNC55524.1; -; Genomic_DNA.
DR RefSeq; XP_013761302.1; XM_013905848.1.
DR EnsemblProtists; KNC55524; KNC55524; AMSG_01789.
DR GeneID; 25561520; -.
DR OrthoDB; 5474719at2759; -.
DR Proteomes; UP000054408; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00185; TNFRSF; 1.
DR Gene3D; 2.130.10.130; Integrin alpha, N-terminal; 2.
DR InterPro; IPR013517; FG-GAP.
DR InterPro; IPR009030; Growth_fac_rcpt_cys_sf.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR PANTHER; PTHR44103; PROPROTEIN CONVERTASE P; 1.
DR PANTHER; PTHR44103:SF1; PROPROTEIN CONVERTASE P; 1.
DR Pfam; PF13517; FG-GAP_3; 4.
DR SUPFAM; SSF57184; Growth factor receptor domain; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 4.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000054408};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..2325
FT /note="Fibronectin type-III domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5005537758"
FT TRANSMEM 1871..1892
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1963..1989
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2054..2081
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2108..2131
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2163..2182
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2189..2213
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT REGION 891..972
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2288..2325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 891..959
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2325 AA; 238394 MW; 79804098E6BA84E0 CRC64;
MKAALHFLLL CLSAAVLVTQ PHLAVAASAC VADLPARINF PAFPNLGAAA LAVGDLDGDS
HDDVVVVTGT KLAWSRYNAT AGWFDRQHIV WLPGDDGLGA IRNVVPTDVD LDGDLDLVVH
ADVDAAAWFE NLDGSGSFGN FVPLFGTLST TSTAVRVVDL DSEDGPDVIT VYGDSVGWFA
NRGNGRFADQ EQLISSSDQL DHVAAGNLDS DEFVDLVIVA ETANYVELYT NLGYEEFTLR
QSVSLVRPYF VEVTDVDNDG DNDVLVASRE LLDYRIGLYI NDGTSAPFTW TSNIYFSEQV
MSSFTLADVD SDGFIDILLT AEERPFGVGL LLNQGSRTFA PVRGISPSVE SAGPIATGDF
DNDGALDFVA SLSVKLRSTD VVAFRSADGG AGYQYDNATV AASQDGADDM AVADIDNDGQ
FDAIVASFKL GTISWLPFVP GTGDFGQLQL VGNLADVVSV LALDFTGDGA PDIAAISPST
LVVFPNAVGL EAFGPALVVT VDGGASQDTL AGGDIDGDGD NDIVFSNATG VVIAPSDTPT
PLAFGPLVLL PMPHSPVSLV VADINSDGHL DIISSNGFEP PSWLENSGSS PDPSWVEHTL
PGVNWAVSLV VAMDVDGDST PDLVVARNKQ VAAGSFATLH SVAALERSVT DIAGIDYDGD
GDVDLAVASM ATAVMLVENM DGLGTFGAVH TLTNVGMDSR RVRVAVNDWN GDGRADIALA
RQDSISLAWL ANTGAPVVFG VPTNPNLKFL DSRNVEVVVA DFDGDMDLDI AVGLESTVTT
STVIWFENSD GAASYGNQRH VSDLDRSGAL VAADWDGDGD VDMASWSSFP SFSIIVHENI
DGSGNFVDVT VASGVENADA ILVADVDGDG DSDMLICMAK PAAADMAYSF SASGSGSGSG
SGSSSSSGGG TASSGTASSG TSGSGGSSTS GGSSTSGGSS TGGSASTTGS TTGTTTGSTT
GPPPPSPPPL DRVWAWLENR DALDGWAQPV TVSGLIYEAA DIDALAIDMT GDGAADLVWK
GPGADADLVF MVPRLGGMMP LYGARQVLID PFDLIGAVGG GDVDGDSNAD IVLATRQGME
LYLNTDGTGA MGPKIMLAGG AAASTTAVLV VDIDGDGDVD IIASQKTFTS ITHGAVFWYR
NAGGGDLGTR LRVALNADAE ALAVGDLDSD GDLDVVWISA AVESPSPLPR LFFSRQLSRT
GWYDYQPREY GLDMSASECT SPTSLACVSR SLARMPSCVR NVLVLPQGRY GCRVSDHLVA
RRVFDCESTG GGVLFRVEPP AHVELEGVEI RHTTAGVHSQ TGVPGLRVAG AGAQLSIINA
TVVGGQSLMT STSSASNTGY GGAILALESA DLELRDTSLE GCKANYGGAI CAIGANVSMA
QVSISGSSAT ELGGAVAVLD GGSLAASASS ISGCRADSSG GGGVALVSGA SMEAESLLVT
GNTAVLGIGG GMLVRADAGV ASCSACAFSG NRAVVGGGIA AASAAFGTGA SAAAAVTNVP
TASVAVAAAP HVLDLNECVV SANTATLYGG GVAVCDATVA LSGAGTRWSD NKAQLGIEGT
SGDGFVCAMA GNDAAQSSRA TALPWISSSP DSSAALEAAA IHGPIATIAW ASAPQSTTLQ
SGESLLGTIT TLDSLAQSAA YYQARMVYAF GGGANGVLAA PLSREIAPLS RNIELPDLAA
VVSATETAPA SIDFSIAMSG SGLAVAPLRG QVTVTACGPG FGGVTDGSGT LCVACSTGTS
SDETSFGECT SLPPCPANTI RTAGGAGSNT TEGACVCKPS FWIASGASNV ACEACPVGAV
CDGGAAQPRA APGFFPDTAD TTTFHPCPNA KACSGDGRCK PGYGSRLCAE CTSGYYKLRG
MCYKCADGQN AAVVTVVVLV VVGVLAALLW FNLSESVRYK FAAAMIGLNA LQISAMYGKL
ELDWGPVAAM YFDLASFLNL NFELTSPECS LAAGVDAWVL KWALALVLPL LLGGVLLGVA
LVYGVLIQVR AGWFASKTHN QLVGAYGRTM FQCLVLMYLP LTDAALAPFG CRRDSSGRWV
LDADPARSCY TREWWMGLFG PGMAAVMVYA VTVPVAVVGV LNRAARQLDE LTFVVRFNFL
VGRFARSAWW FEAAIMVRKL MVAICMTFFF SEENKANAAV FALVASLGQL LLAQPYASVT
NNVMAVVVLA STATVLYAGT FDDYLMRRLL VNVGIVVNLL AIVVGNAIDA WLMTRTEKRT
EAEEYYVPGV VQMDCLDTDV DAADTRTMTT TELGINGVDE DGDLPRLHVA ELGSSRWSAA
SGELATETML PGSGSTACST GSLPPPAPPA HPSLDSGAMA ESRPA
//