ID W9W910_9EURO Unreviewed; 1299 AA.
AC W9W910;
DT 14-MAY-2014, integrated into UniProtKB/TrEMBL.
DT 14-MAY-2014, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Cleavage/polyadenylation specificity factor A subunit N-terminal domain-containing protein {ECO:0000259|Pfam:PF10433};
GN ORFNames=A1O5_11786 {ECO:0000313|EMBL:EXJ61470.1};
OS Cladophialophora psammophila CBS 110553.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Chaetothyriomycetidae; Chaetothyriales; Herpotrichiellaceae;
OC Cladophialophora.
OX NCBI_TaxID=1182543 {ECO:0000313|EMBL:EXJ61470.1, ECO:0000313|Proteomes:UP000019471};
RN [1] {ECO:0000313|EMBL:EXJ61470.1, ECO:0000313|Proteomes:UP000019471}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 110553 {ECO:0000313|EMBL:EXJ61470.1,
RC ECO:0000313|Proteomes:UP000019471};
RG The Broad Institute Genomics Platform;
RA Cuomo C., de Hoog S., Gorbushina A., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Cladophialophora psammophila CBS 110553.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EXJ61470.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGX01000028; EXJ61470.1; -; Genomic_DNA.
DR RefSeq; XP_007750546.1; XM_007752356.1.
DR STRING; 1182543.W9W910; -.
DR GeneID; 19196473; -.
DR eggNOG; ENOG502QVPZ; Eukaryota.
DR HOGENOM; CLU_003539_0_0_1; -.
DR OrthoDB; 2087067at2759; -.
DR Proteomes; UP000019471; Unassembled WGS sequence.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 3.
DR InterPro; IPR018846; Cleavage/polyA-sp_fac_asu_N.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR PANTHER; PTHR10644:SF21; CLEAVAGE_POLYADENYLATION SPECIFICITY FACTOR A SUBUNIT N-TERMINAL DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR10644; DNA REPAIR/RNA PROCESSING CPSF FAMILY; 1.
DR Pfam; PF10433; MMS1_N; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019471}.
FT DOMAIN 131..632
FT /note="Cleavage/polyadenylation specificity factor A
FT subunit N-terminal"
FT /evidence="ECO:0000259|Pfam:PF10433"
SQ SEQUENCE 1299 AA; 144442 MW; B42EE112F3AFBC66 CRC64;
MAYRAPPHGR ELNARGDQRR LAIQDRVGIL SRTLSSSSSI RWILPARIRS PSDNDVVFVG
PTFVQLHEFK DSGQLEVVTA KLDFGTTITD AKIISAELKS VPIVDAILKQ ERDQEQFSIR
GQPVSDTLPP QILVLITKDN ELIYVYARKD AGGEVRLVFA KRPFLGGVDL PSRQCRDIAV
DPESRALAVA SPSGYLGVFK LHHVDEIKSE IDAWDPIDPA SFRPSEEQRF IQVAGKILMM
TFLRSPEDDP TKVILLLLVY SGEADQGTHQ YLYRWDTRQP LQTIQAMTCS GRKLREDQLP
LMLIPSTRPF SYIVVMAAGI SYYENIQSSQ MKRVYCRCMD NMNTSLEWVQ WVRPRRHNQY
LERCDDIVIL REDGLLKNFL IDKASSTKFS TNNTIGHLGF SVDTAFCMLS GPPRTGGDII
IVGGSLTDGG VFHVSARGSP ERIQRIETLA PLNDIALGPP IAVDSQDSAV WQGTPGRLYA
CSGRCDRHGQ VSEIRYGLEA QIGWEMSFPE AALVERLFSL EIPGTNELLL LASHTTNSSM
VVFELENQEI SLTDAESHPG FDFGHPTLAA TVIDQDTVIQ VTTRGVHAIL IEADGEVGEL
NHLGSRIAHA AFFEGDRAIA TTSRVTSGYE LCLIDIETSE DGALRFMPGQ PLRLKHIPNA
ICCVQVQNTQ LVLIGTSTGE VLGYTDTLKL AFEFRVQDLN TKVENAAVAS VVALKQNTAG
PTLLLCGLRS GTVMCLELRV DCQDGLKTVL RCDDVYHIGA TPVQIVQEKS QSGSLENPSA
LVLCEYTLHR VTLHPNSALV DYTMSPLWVT DRTKPSAQPL INAVHRIPKL ESGHEHPGGF
LICAVAEDLL FCSVISQEHG IARHLNLNGV PKRMLYSRFL QRFVVAFSRE GDPTEVPVRM
HTSVPLDGEK DISDQPSNKI QRVGLQLITP NLRYPTSRDS EGTTFATIVT GDDSDVVHDL
IDWAPTDGNN HYEWLVLAIE QLNPIPGLRG PGRVVCVNAK SLGKGKPDAS PKLAFSSREP
ITAICAYKMS SLLIAAGREI VLHHLDWATR RWKTLARHAL PSYANAISCQ GSLICVATQQ
HSFFVLVEKD NKLHHRECDT KMRYAKDIVV LDGSTAVFAS ADVGVTNIIG FLGLNKETTE
TAPAFHAAVP GHIHRFRLDT SQGLRKTDRT RFYGCTIDGT LFHFALLKHS EWKLLHFIEE
MSYMIRKPIK AVPIEKKDAD NTRYLWMPPA FRPKDMHVQG DRLLMMIEQG PYNLRSVLRG
SDRMDSFKAL VEEILGEAEE PVEAVIAWIR KLMRYPPRS
//