ID K1WHG3_TRIAC Unreviewed; 729 AA.
AC K1WHG3;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=CBS and PB1 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=A1Q2_04787 {ECO:0000313|EMBL:EKD00914.1};
OS Trichosporon asahii var. asahii (strain CBS 8904) (Yeast).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Tremellomycetes;
OC Trichosporonales; Trichosporonaceae; Trichosporon.
OX NCBI_TaxID=1220162 {ECO:0000313|EMBL:EKD00914.1, ECO:0000313|Proteomes:UP000006757};
RN [1] {ECO:0000313|EMBL:EKD00914.1, ECO:0000313|Proteomes:UP000006757}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 8904 {ECO:0000313|EMBL:EKD00914.1,
RC ECO:0000313|Proteomes:UP000006757};
RX PubMed=23193141; DOI=10.1128/EC.00264-12;
RA Yang R.Y., Li H.T., Zhu H., Zhou G.P., Wang M., Wang L.;
RT "Genome sequence of the Trichosporon asahii environmental strain CBS
RT 8904.";
RL Eukaryot. Cell 11:1586-1587(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKD00914.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMBO01000331; EKD00914.1; -; Genomic_DNA.
DR AlphaFoldDB; K1WHG3; -.
DR STRING; 1220162.K1WHG3; -.
DR eggNOG; ENOG502QVK2; Eukaryota.
DR HOGENOM; CLU_009026_1_0_1; -.
DR InParanoid; K1WHG3; -.
DR OMA; RERKQFF; -.
DR OrthoDB; 1328303at2759; -.
DR Proteomes; UP000006757; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd17782; CBS_pair_MUG70_2; 1.
DR Gene3D; 3.10.580.10; CBS-domain; 2.
DR InterPro; IPR000644; CBS_dom.
DR InterPro; IPR046342; CBS_dom_sf.
DR InterPro; IPR000270; PB1_dom.
DR PANTHER; PTHR48108:SF34; CBS AND PB1 DOMAIN PROTEIN (AFU_ORTHOLOGUE AFUA_1G06780); 1.
DR PANTHER; PTHR48108; CBS DOMAIN-CONTAINING PROTEIN CBSX2, CHLOROPLASTIC; 1.
DR Pfam; PF00571; CBS; 4.
DR Pfam; PF00564; PB1; 1.
DR SMART; SM00116; CBS; 3.
DR SUPFAM; SSF54277; CAD & PB1 domains; 1.
DR SUPFAM; SSF54631; CBS-domain pair; 2.
DR PROSITE; PS51371; CBS; 3.
DR PROSITE; PS51745; PB1; 1.
PE 4: Predicted;
KW CBS domain {ECO:0000256|PROSITE-ProRule:PRU00703};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000006757};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}.
FT TRANSMEM 705..724
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 147..205
FT /note="CBS"
FT /evidence="ECO:0000259|PROSITE:PS51371"
FT DOMAIN 306..378
FT /note="CBS"
FT /evidence="ECO:0000259|PROSITE:PS51371"
FT DOMAIN 387..443
FT /note="CBS"
FT /evidence="ECO:0000259|PROSITE:PS51371"
FT DOMAIN 537..633
FT /note="PB1"
FT /evidence="ECO:0000259|PROSITE:PS51745"
FT REGION 1..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 108..151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..21
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 729 AA; 77733 MW; 2EAD291F4594ED72 CRC64;
MHNAASNYEN NNKRNYLAPP PTTTRPLTPP DSDTSLSPRV NKSELPPPVS PKRTKQRPLP
AGVAFPANGD DYTSPSDADL AQRQRYRSTV SVGPPSPSQS LRKKIENELS RKRPGGGRGP
SEPQHRQESS FLKRPSRRHQ KGTVAGLRPS PALTVPEGMS IADASQLCAA KRTDCVLVVD
DEEGLSGIFT AKDLAFRVTA DGLDPRTTTV AQIMTRNPMV TRDTTSATEA LQVMVSRHFR
HLVFHDALAK VERSSSATSQ LSMALAGVQT ELGPNLTANP QAAAMMAYVD ALRDRMAQPD
LTSVLDTSLP PPTVTPRTSV REAARLMKER RTTAVCVLEP NGTSVMSGVS NNGVPPKIAG
IFTSKDIVLR VIAAGLDSSR CSVVRVMTPH PDTAPPDMTV QDALKKMHTG HYLNLPVVES
DGRLLGIVDV LKLTYATLEQ IDTMNDDRSD AGPMWSKFFD TLNTAGGDDE SHSAISGAIP
DTPSKVGHQR ALSNATSPMS EVMPGDSASV VNEVNDGISD LGKGNTSSLA PALPVDDGTY
VFKFKTPSGR VHRFQARHDS YDLLRDIIHG KLQSDPFFDE PTDGRPAADS SVFTIAYTDD
EGDLVQITAD GDVLDAVNTA RGQKTDRVTL LINGGKNWEE AARSAGGEDA VEKLKVAETV
PAAKGESHLL DEGADPAHAA AYGAKGVHAK PGSEELIGGV LPKDLALPAA IGFLGVVILG
VFIASRKSN
//