ID G3VHB2_SARHA Unreviewed; 740 AA.
AC G3VHB2;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=DGCR8 microprocessor complex subunit {ECO:0000313|Ensembl:ENSSHAP00000002566.2};
GN Name=DGCR8 {ECO:0000313|Ensembl:ENSSHAP00000002566.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000002566.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000002566.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000002566.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3VHB2; -.
DR STRING; 9305.ENSSHAP00000002566; -.
DR Ensembl; ENSSHAT00000002594.2; ENSSHAP00000002566.2; ENSSHAG00000002274.2.
DR eggNOG; KOG4334; Eukaryota.
DR GeneTree; ENSGT00390000015977; -.
DR HOGENOM; CLU_017211_3_0_1; -.
DR OrthoDB; 5404886at2759; -.
DR TreeFam; TF324256; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0070877; C:microprocessor complex; IEA:InterPro.
DR GO; GO:0020037; F:heme binding; IEA:InterPro.
DR GO; GO:0042802; F:identical protein binding; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0031053; P:primary miRNA processing; IEA:InterPro.
DR CDD; cd19868; DSRM_DGCR8_rpt2; 1.
DR CDD; cd00201; WW; 1.
DR Gene3D; 2.20.70.10; -; 1.
DR Gene3D; 3.30.160.20; -; 1.
DR Gene3D; 3.30.160.590; -; 1.
DR InterPro; IPR040375; DGCR8.
DR InterPro; IPR014720; dsRBD_dom.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR13482:SF3; MICROPROCESSOR COMPLEX SUBUNIT DGCR8; 1.
DR PANTHER; PTHR13482; MICRORNA PROCESSOR COMPLEX SUBUNIT DGCR8; 1.
DR Pfam; PF00035; dsrm; 1.
DR SMART; SM00456; WW; 1.
DR SUPFAM; SSF54768; dsRNA-binding domain-like; 1.
DR SUPFAM; SSF51045; WW domain; 1.
DR PROSITE; PS50020; WW_DOMAIN_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007648}.
FT DOMAIN 301..334
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT REGION 25..63
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 363..417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 710..740
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 28..42
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 740 AA; 83842 MW; 524F11D23A32DB7C CRC64;
METLESIAPL QQEVTRELIV ENHLYLPPLP PSEEPPPPPL QTSSDADVMD VGSGGAGQSD
TPAGDVDAHF GPQLLTKGSA PYKSRLCIDP DNCDLSPRTA RHAPLVRKFI PDLRLLKDVK
ISVSFTESCK SKDRKVLYTG AETDDKAEAA FSINDVKGDF HVCPFDGSHG NIVGVVGESV
DKRDEENEID QEKRVEYAVL DELEDFTDNM EIEEETGRFT AKAISQRDKV DEDTLNFSYE
DDFDNDVDAL LEEGLCAPKK RKIEEKYGGE SDHLSDGETT VQPMMTKIKT VLKSRGRPPT
EPLPDGWIMT FHNSGIPVYL HRESRVVTWS RPYFLGTGSI RKHDPPLSSI PCLHYKKMKD
NEERELSNDI TPIGDASPIK SMEKSSELDS QTEEPDSTAV DSGSLDEKEP LGGDTAQGAL
GQVKAKVEVC KDESVDLEDF RNYLEKRFDF EQVTVKKFRT WAERRQFNRE MKRKQAESER
PILPANQKLI TLSVHDAPTK KEFVINPNGK SEVCILHEYM QRVLKVRPVY TFFECARATL
EILIPDFVKQ TSDEKPKDSE ELEYFNHISI EDSRVYELTS KAGLLSPYQI LHECLKRNHG
MGDTTIKFEV IPGKNQKSEY VMTCGKHTVR GWCKNKRVGK QLASQKILQL LHPHVKNWGS
LLRMYGRESN KMVKQEMSDK SVIELQQYAK KNKPNLHILN KLQEEMKKLA QEREETRKKP
KMTIVESAQP GSEPLCTVDV
//