ID A0A091HMA1_CALAN Unreviewed; 870 AA.
AC A0A091HMA1;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=Pre-mRNA-splicing factor CWC22 homolog {ECO:0000256|ARBA:ARBA00040488};
DE AltName: Full=Nucampholin homolog {ECO:0000256|ARBA:ARBA00042174};
DE Flags: Fragment;
GN ORFNames=N300_04496 {ECO:0000313|EMBL:KFO97016.1};
OS Calypte anna (Anna's hummingbird) (Archilochus anna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Caprimulgimorphae; Apodiformes;
OC Trochilidae; Calypte.
OX NCBI_TaxID=9244 {ECO:0000313|EMBL:KFO97016.1, ECO:0000313|Proteomes:UP000054308};
RN [1] {ECO:0000313|EMBL:KFO97016.1, ECO:0000313|Proteomes:UP000054308}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N300 {ECO:0000313|EMBL:KFO97016.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL217611; KFO97016.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091HMA1; -.
DR STRING; 9244.A0A091HMA1; -.
DR Proteomes; UP000054308; Unassembled WGS sequence.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR Gene3D; 1.25.40.180; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF3; PRE-MRNA-SPLICING FACTOR CWC22 HOMOLOG; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR PROSITE; PS51366; MI; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000054308}.
FT DOMAIN 446..562
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 401..435
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 647..870
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..92
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 405..431
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 648..695
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 731..820
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 833..870
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFO97016.1"
FT NON_TER 870
FT /evidence="ECO:0000313|EMBL:KFO97016.1"
SQ SEQUENCE 870 AA; 101917 MW; 8A47D3AD35887B6E CRC64;
SGYERKENYS SHQRSLSPED RDTDRDKSPS PRRRRYSDDS RYDQEYSRRE YYDDRTSEGR
RMEWGRERHH ERWEERESDR RRQRRYSSPD RRSPERSTVQ SSLAHDETTS KKKKEEVDPI
LTRTGGAYIP PAKLRMMQEQ ITDKSSLAYQ RMSWEALKKS INGLVNKVNV SNIENIIHEL
LQENIVRGRG LLSRSILQAQ SASPIFTHVY AALVAIINSK FPNIGELILK RLILNFRKGY
RRNDKQLCLT SSKFVAHLMN QNVAHEVLCL EMLTLLLERP TDDSIEVAIG FIKESGLKLT
EVSPRAINAI FDRLRHILHE SKIDMRVQYM IEVMFAVRKD GFKDHPIIPE GLDLVEEEDQ
FTHMLPLEDD YNPEDVLNVF KMDPNFLENE EKYKMLKKEI LDEGDSESEA DQEAGSSEED
EEEDEEEDED GQKVTVHDKT EINLVSFRRT IYLAIQSSLD FEECAHKLLK MDFPESQTKE
LCNMILDCCA QQRTYEKFFG LLAGRFCMLK KEYMESFEAI FKEQYDTIHR LETNKLRNVA
KMFAHLLYTD SIPWSVLECI ILSEETTTSS SRIFVKIFFQ ELSEYMGLPN LNARLKDVTL
QPFFEGLLPR DNPRNTRFAI NFFTSIGLGG LTDELREHLK NAPKMIMTQK QDVESSDSSS
SSETDSSSDS DSSSSSSESS SSSDSSSSSG SSSDTDVSKA KRRRLQKKNR ESDKVSRKKQ
ERRRKSLEKK VRRRQQEDRS DTESKSERNH RNARESHRRD DASKYHHRDE SNDRDGYHSG
KDRNHERNKD LENKHSNLKP KKAERRASFS DDENYRHWSK DNGHRSRKRE RSKSVERAHN
HSSPREKEQE DRYRNGSEKH REKSSRHSEQ
//