ID A0A0L0D6J7_THETB Unreviewed; 1755 AA.
AC A0A0L0D6J7;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=FH2 domain-containing protein {ECO:0000259|PROSITE:PS51444};
GN ORFNames=AMSG_04208 {ECO:0000313|EMBL:KNC47974.1};
OS Thecamonas trahens ATCC 50062.
OC Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC47974.1, ECO:0000313|Proteomes:UP000054408};
RN [1] {ECO:0000313|EMBL:KNC47974.1, ECO:0000313|Proteomes:UP000054408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC47974.1,
RC ECO:0000313|Proteomes:UP000054408};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL349449; KNC47974.1; -; Genomic_DNA.
DR RefSeq; XP_013758991.1; XM_013903537.1.
DR EnsemblProtists; KNC47974; KNC47974; AMSG_04208.
DR GeneID; 25563762; -.
DR eggNOG; KOG0621; Eukaryota.
DR eggNOG; KOG1924; Eukaryota.
DR OMA; ISHYASR; -.
DR OrthoDB; 1118745at2759; -.
DR Proteomes; UP000054408; Unassembled WGS sequence.
DR Gene3D; 1.20.58.2220; Formin, FH2 domain; 1.
DR InterPro; IPR015425; FH2_Formin.
DR InterPro; IPR042201; FH2_Formin_sf.
DR PANTHER; PTHR45691; PROTEIN DIAPHANOUS; 1.
DR PANTHER; PTHR45691:SF6; PROTEIN DIAPHANOUS; 1.
DR Pfam; PF02181; FH2; 1.
DR PRINTS; PR01217; PRICHEXTENSN.
DR SMART; SM00498; FH2; 1.
DR SUPFAM; SSF101447; Formin homology 2 domain (FH2 domain); 1.
DR PROSITE; PS51444; FH2; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000054408}.
FT DOMAIN 1229..1627
FT /note="FH2"
FT /evidence="ECO:0000259|PROSITE:PS51444"
FT REGION 348..371
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 573..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1088..1237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1619..1707
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 452..537
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 622..1019
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1054..1088
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 352..367
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1109..1211
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1619..1649
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1755 AA; 186292 MW; B7586D9020B652C2 CRC64;
MLPNELRSVV RHVAVALSSG SAAHGVVVSG LVGDVLAAII DARDMLALPD GIDALLRAMV
AAVQRALPGG SRYTLVEEDS LRARARPLMA QLKHLHSTEL SSFVDAEIPL VRRPSATRSG
APVLTKHDLE LARAAIVDAG VELLSNAQAM MAEVDAAVAS ERGRAALLLK SSSEVISTLE
AREQASRSLV ALVEPLVEAL TTTIDAFSAL VVFVEAWKAA TPSSVLDSLW TSSAALDALI
PAFGPALGST TTSRSLDALL RTSSSVIDEA RTAVANSTRV VDDATFNVAT GLDAHSVGHL
KHAYHLASDE CYSRPQLEEL EINYHARLAA AANEIKSLQA QLRRASLGVG GNNEDRRSSS
EGLAPSGSST EKLLAPYVER EHALRAQLEY LKRVVEAGTN GTDPLAAEEA VAFSARVASA
ARDMAQHMMA DSGVRYTEDD LMELYNHFQG RQAALETRVS ELEEAERELA EHKRRAAGME
GVSAARLETA LRQLLDKYNR YRAKSKAKLA SMKRLSLEAY SKLHRKHKKE LARAKATALM
REDELRAQLT FAYNERDAIL TAHDEAQQRA ADERAARQAE RQKRKTKEAT MASLKDQLKS
TAAAASQSAE LAAAVKNFTA LSAEYESQIA EADATIDALS EEKAALQAAL QQATAAIDRE
RATRAQLSRH ANESAASVAA LAEVEDALQA KTNELKRAEA SKRELAGKLH AVAEAMQDLE
TQVTDAVAEA EARTHERDDI AKKLRTAMEQ LVVASKRVDA LEELLAQQGS ADDAGLAAAL
AKAQSARDEL EIELTQRNHE LAIARDDAAR AQADADARPT ADAIAELKKS VRELEAKVEE
SRQRAQDADE AHVARVGELE AQLAAAEAAA VNADRVDGEV VPASEMEALK AQLAAAEAAA
ANADGVDGEV VPASEMEALK AQLAAAEAAA ADADGVDGEV VPASEMEALK AQLAAAEAAA
ADADGVDGEV AEALQARLDE QTQECTRLEA EVAALASERD ELKARIVELE KELTTKGEND
VEAVNDAELV AEMEKVKKSM AMKNKIIVDR DKKLAEATKE LDKVKGMLEA AQQQAAAARE
DAAKAKKALA EAPSAATGDI SASANDNASG PPPPAPPGGP PPPPPPPPGG APPPPPPPPP
GGAPPPPPPM PGGGPPPPPP MPGGGPPPPP PMPGGGPPPP PPPPGGGPPP PPPPPGGGPP
PPPPPGGMPR VPAAPQIGAE FRTGGPLPKR PKKKPRVKMK GFNVAKINTN KIKPTSFWYK
ADDTKIKVDA DEIENLFAAK VSAPAASGAS ASASKKPTVV TLLDPQRSQN LGIALARFKM
KPQQICDAIL SMDTEALSLN QINSLLRNVP TSEEMGELEG YDGDRALLAP AEAFLIETLS
ISHYASRLEA FQFRLKFKER LDELLPDLNA ITTASEAIKD SAAFKRVLEH TLALSNYLNG
GGFRGELYGF QLSSLPKLRD TRSSARPGYT LLHYLVDRMT ELDSADEVSK AIDDLKSCEA
AAKISLETTM TETKKLVAKF EEVAGYVATF EEEGVKDSDK DNFIESMGAF RDVAQAELDE
AVAEMAQAEM LFKETLEWYS ESPKTSNEEF FGMLSEFGKD MEEARKEIAS LKRKAELAKK
REQEAKEKAE RGETEPAAAK EKAKPRFGKP PGEAPKVSQA ALMGDLAGAL SFGRGRGRGR
GRGRGGGRGG GRGRGGPASD EEKPGSCHAD GCDCDMFRVS PFGRKKDACV DCGHTKSDHL
EKKAEVAEAR AGAFR
//