ID L5LU75_MYODS Unreviewed; 2238 AA.
AC L5LU75;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE SubName: Full=Integrator complex subunit 1 {ECO:0000313|EMBL:ELK29003.1};
GN ORFNames=MDA_GLEAN10021509 {ECO:0000313|EMBL:ELK29003.1};
OS Myotis davidii (David's myotis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=225400 {ECO:0000313|EMBL:ELK29003.1, ECO:0000313|Proteomes:UP000010556};
RN [1] {ECO:0000313|Proteomes:UP000010556}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23258410; DOI=10.1126/science.1230835;
RA Zhang G., Cowled C., Shi Z., Huang Z., Bishop-Lilly K.A., Fang X.,
RA Wynne J.W., Xiong Z., Baker M.L., Zhao W., Tachedjian M., Zhu Y., Zhou P.,
RA Jiang X., Ng J., Yang L., Wu L., Xiao J., Feng Y., Chen Y., Sun X.,
RA Zhang Y., Marsh G.A., Crameri G., Broder C.C., Frey K.G., Wang L.F.,
RA Wang J.;
RT "Comparative analysis of bat genomes provides insight into the evolution of
RT flight and immunity.";
RL Science 339:456-460(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB108645; ELK29003.1; -; Genomic_DNA.
DR eggNOG; KOG4596; Eukaryota.
DR Proteomes; UP000010556; Unassembled WGS sequence.
DR GO; GO:0032039; C:integrator complex; IEA:InterPro.
DR GO; GO:0034474; P:U2 snRNA 3'-end processing; IEA:InterPro.
DR Gene3D; 1.10.418.10; Calponin-like domain; 1.
DR InterPro; IPR001715; CH_dom.
DR InterPro; IPR036872; CH_dom_sf.
DR InterPro; IPR022145; DUF3677.
DR InterPro; IPR038902; INTS1.
DR PANTHER; PTHR21224:SF1; INTEGRATOR COMPLEX SUBUNIT 1; 1.
DR PANTHER; PTHR21224; UNCHARACTERIZED; 1.
DR Pfam; PF00307; CH; 1.
DR Pfam; PF12432; DUF3677; 1.
DR SUPFAM; SSF47576; Calponin-homology domain, CH-domain; 1.
DR PROSITE; PS50021; CH; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000010556}.
FT DOMAIN 1854..1972
FT /note="Calponin-homology (CH)"
FT /evidence="ECO:0000259|PROSITE:PS50021"
FT REGION 326..360
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 924..943
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1226..1264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1979..2213
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2087..2104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2136..2151
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2238 AA; 247811 MW; EE1A0C490DBAE56B CRC64;
MPRXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX EAAVAEKQNA
PPSIKEPSVV PIEVPPPALL DEIEAAELEG NDDRIEGVLC GAVKQLRATR AKPDSALCLS
LMYLAKTKPN IFATEGVIEA KGNSLVSVLA CSLLMAAYEE DENWPEIFVK VYIEDSLGER
IWVDSPHCKA FVDNIQTAFN TRMPPKSMLL QGEAGRSGGD LSAGSSPHPS LTEEEDSQTE
LLIAEEKLSP EQEGQLMPRY EDLAESVEEY VLDMLRDQLN RRQPIDNVSR NLLRLLTSTC
GYKEVRLMTV QKLEMWLQNP KILYTVLQHS AELAPKFLAM VFQDLLTNKD DYLRASRALL
REIIKQTKHE INFQAFCLGL MQERKEPQYL DMEFKERFVV HITDVLAVSM MLGITAQVKE
AGIAWDKGEK KNLEVLRSFQ NQIAAIQRDA VWWLHTVVPS ISKMAPKDYV HCLHKVLFTE
QPETYYKWDN WPPESDRNFF LRLCSEVPIL EDTLMRILVI GLSRELPLGP ADAMELADHL
VKRAAAVQAX XXXXXXXXXX XXRPPGALTQ VSSPCPLCCY PQQPCPLARP AQAPPNVEVL
KVERVQLLDA VLNLCTYHHP ENIQLPPGQQ PPNLAISTLY WKAWPLLLVV AAFNPENIGL
AAWEEYPTLK MLMEMVMTNN YSYPPCTLTD EETRTEMLSR ELQTSQREKQ EILAFEGHLA
AASTKQTITE SSSLLLSQLT SLDPQYKQQQ LLGRLQDLLL GPKADEQTTC EVLDYFLRRL
GSSQVASRVL AMKGLSLVLS EGSLRDRDGE EKEPPAEEDS GDAETLQGYQ WLLRDLPRLP
LFHSVSASTA LALQQAVHME TDPQTVSAYL VYLSQHTPVE EQGPHSDLAL DVARLIVERS
TIMSHLFSKR SCSAESDAVL AALLSIFSRY VRRMRKSKEG EEVYSWSESQ DQVFLRWSSG
ETATMHILVV HAMVILLTLG PPRAGDEEFL ALLDIWFPDK KPLPTAFLVD TSEEALLLPD
WLKLRMIRSE VPRLVDAALQ DLEPQQLLLF VQSFGIPVSS MSKLLLHLDQ AVAHDPQTLE
QNIMDKNYMA HLVEVQHERG ASGGQTFHSL LTASLPPRRD SAEAPKPKSS PEPPPGQGRI
RASTQVRVLG PEDDLASVFL QVFPLSPGPR WQLSSARPVA LALQQALGQE LARAGTWGAL
TGAGTVSPAT AWAALSGVAM CPVVPQRCMP QDSGFSSLFL QVLMQMLQWL DSPGVEAGPL
QAQLKLFATQ YSAQRRISDV RSGFLHLAEA LAFRGDMEVV SSTVRALVAT LKAGEKCGVE
PELISKVLRG LIRVRSPHLE ELLTALFSAA TALPASRPVA VVSSLLLQEE EPLAAGQQDT
DGSSCPDLQQ RLLFSRSKGK GQPGPQVPSF RPYLLALLTH QSNWSTLHQC IRVLLGKNRE
HRFDPSASLD FLWACIHVPR IWQGRDQRTP QAGHGEHPPA GWGDTCASPL VHPGWLTGCP
LQKRREELAL QLDVLTHRFI TLLADTSDSR ASESRVADAN MACRKLAVAH PLLLLRHLPM
IAALLHGRTH LNFQEFRQQN HLTFFLHVLG VLELLQPQVF QSEHQGALWD CLRSFIRLLL
NYRKSSRHLA PFLNKFVQFT HKYITCNAPA AAAFLQKHAD ALHDLSFDSS DLVMLKSLLA
GLSLPSRDGR ADHGLDEEGE DESSAGSLPL VSVSLFTPLT AAEMAPYMKR LSRGQTVEDL
LEVLGDIDEM SRRRPEILGF FSTNLQRLMS SEEEACRSLA FSLALRSIQN NPRDFDALRK
ENVYENNQLA FRVAEEQLGI PALLEAEDMV ALKVPDRLSI LTYVSQYYNY FHGRSPIGGM
AGVKRSPSEA EEEPSGKKAS PTPARGLPLS PVSTNTMVQR KDRGAEGPPL KAASGAPSTG
LASRAPAAAD APSEADRREQ ALSCLRKALP GFGGAQAPGR AAHGSRVATD QEGERGRPGR
ATALHRLGRE GQGPSAGSQK DQVDRPAAWR SHLKPVAKKQ PAERAPELME PRVLGEPRAG
AVPQKVPGSS EGGVCITLTP VRPNRTPGPA GPQSSLSEAL KSPQDRQREQ DLLTQYVSTV
NDRSDIIDFL VQKLPQSY
//