ID A0A151MVU8_ALLMI Unreviewed; 2436 AA.
AC A0A151MVU8;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:KYO28671.1};
GN ORFNames=Y1Q_0000834 {ECO:0000313|EMBL:KYO28671.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO28671.1};
RN [1] {ECO:0000313|EMBL:KYO28671.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO28671.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO28671.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03004838; KYO28671.1; -; Genomic_DNA.
DR eggNOG; KOG1721; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd07765; KRAB_A-box; 1.
DR Gene3D; 6.10.140.140; -; 1.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 25.
DR InterPro; IPR003655; aKRAB.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR001909; KRAB.
DR InterPro; IPR036051; KRAB_dom_sf.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR011017; TRASH_dom.
DR InterPro; IPR003656; Znf_BED.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR23235:SF120; KRUEPPEL-LIKE FACTOR 17; 1.
DR PANTHER; PTHR23235; KRUEPPEL-LIKE TRANSCRIPTION FACTOR; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR Pfam; PF01352; KRAB; 1.
DR Pfam; PF02892; zf-BED; 1.
DR Pfam; PF00096; zf-C2H2; 25.
DR SMART; SM00349; KRAB; 1.
DR SMART; SM00746; TRASH; 6.
DR SMART; SM00614; ZnF_BED; 8.
DR SMART; SM00355; ZnF_C2H2; 27.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 14.
DR SUPFAM; SSF140996; Hermes dimerisation domain; 1.
DR SUPFAM; SSF109640; KRAB domain (Kruppel-associated box); 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50805; KRAB; 1.
DR PROSITE; PS50806; KRAB_RELATED; 1.
DR PROSITE; PS50808; ZF_BED; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 25.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 25.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00042}.
FT DOMAIN 189..253
FT /note="KRAB-related"
FT /evidence="ECO:0000259|PROSITE:PS50806"
FT DOMAIN 192..265
FT /note="KRAB"
FT /evidence="ECO:0000259|PROSITE:PS50805"
FT DOMAIN 243..270
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 271..298
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 299..326
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 327..354
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 355..382
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 383..410
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 411..438
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 439..466
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 467..494
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 495..522
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 523..550
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 551..578
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 579..606
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 607..634
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 635..662
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 663..690
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 691..718
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 719..746
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 747..774
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 775..802
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 803..830
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 831..858
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 859..886
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 887..914
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 915..942
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1847..1897
FT /note="BED-type"
FT /evidence="ECO:0000259|PROSITE:PS50808"
FT REGION 1..51
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 96..183
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 941..966
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 980..1012
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1042..1121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1270..1320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1417..1542
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1582..1673
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1686..1841
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1893..1913
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..132
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 136..158
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1082..1101
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1511..1534
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1582..1599
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1710..1724
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1735..1749
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1808..1838
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1897..1913
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2436 AA; 269975 MW; BC16DB9B1B5AFE76 CRC64;
MQPPARMEEQ DPAGPKAGAG AEGSRRAPRV GTGGVSPRWV TPQRVKQELP EKPIRCWQVA
THRQLGDVAL DLEGPGALLE PLRSWLEWPQ PRPEHVPLEQ AGWGETPGSQ EELSCVCKQE
SPPQQEPAGA ETLSTAEKQR PEEGPVKLEL RRPSPRKSGE RGFLTPEPGQ VHEEQNRCPQ
QRESMAVSKL REAFDDVAVY FTLEEWELLE AGDKGLYRDQ MLRTYQALVS LVHQVTHAGE
QLYRCAECKK SFTKWDHLQS HVCVHAGEKL FSCQQCGESF AKSSMLVQHL HVHTGEKPFS
CAQCGKSFSK SSTLTKHLRV HTGEKPFSCS HCGKSFTQSS HLTNHLRLHT GEKPFCCSQC
GKSFRQSSNL IYHLRMHTGE KPFSCRHCGK SFMQSSHLTN HLRVHTGEKP FCCCQCGKSF
TQSSKLANHL RVHTGEKPFS CSQCGKSFRQ SSTLTTHLRV HTGEKPFSCI QCEKSFSDSS
TLAYHLRVHT GEKPFSCTQC GKSFTKGYKL KNHLRVHVGE KPFSCTQCEK SFTDSSMLTS
HLRMHTGEKM FLCTQCGKTF TKSYTLTQHL RVHTGEKPFC CSQCGKRFRQ SSNLTYHLRV
HTGEKPFSCS QCGKSFTQSS KLTSHLRVHT GEKPFSCTQC GKSFTQSSNL TNHLRVHTGE
KPFSCSQCGK SFTQSYKLTN HLRVHTGEKP FSCSQCGKSF TLSSTLTTHL RVHTGEKPFS
CTQCEKSFSD SSTLTYHLRV HTGEKPFACT QCGKSFTKSS KLKSHLRVHM EDKPFSCTQC
EKSFTDSSRL ASHMRVHMGE KPFSCTQCGK TFTKRSTLMQ HQRVHTGEML FHCSQCGKSF
TESSKLTRHL RVHTGEKPFS CSQCGKSFTE SSTLRQHLRV HTGEKPFSCT HCGKSFVERS
KLNKHLRVHT GEKPFSCPQC GKSFTDSSTL TKHRRVHTGE KPWAGLVGGG AGAAGSQEPG
HPEEMAAELE PAPAAGFPFP VPVTTEEQDP AGPEPGAGAE GAGKASPRCP AAPVVKQEPV
EEPIQGWQVT VHVKLEDVAL NVEGPGASPE PLSSGLEQPP PQPGHVAKEG AGWGETPRPQ
KELPCAPKEE PPPRQEPDSP DTEATWDSSA DESSLDYFPR GGSLLGTGSP SLWAEANDLR
PRATRPLSRT PGAIRMRRLR WRRRALRDSM QEEAFRAEWV RHREAQDFRA QVLRELRRFR
RNQALALEEA RAGRVALQQL VGEIQEKRQL ARDGFEELRQ LWASIEAAVT QKRRVAEQLL
AAVPALAAPG AAPQPAPAPQ LLGLGAQPMP PAPGAGPSWE QDPFSSPLVH TRQPRHPQEM
AAELEPASAA GFPFPSPVQP VVKMEEQDPT DPEPWVGAER AGNAPCVVRA SPRCPAAPVV
KQEPVEEPIL CWRVTGNGQL KEGPEALAGF LCSGLEQPQP HPGHVPKEEA EWGETPGPQE
ELPCVPKEEP LFQQEPDSPD TDETWDSADE SSLGCFPRRG SWRGAEPSTW EAAVTNPKLE
QGAADRTRRK TVNSPAVQGL RKSPRTSHSS LEKMPKSARV TPKERVAQFG KDKFHTDGTV
LFCTACSKPI DHVRKQTIVE HMESAKHKRN EKRQREDAEM GSSTAQNRGS REPGHPEDMA
AELEPAPAAG FPFPVPEQPV VKTEEQDPAD TGRASSKCRA APVVKQEPLE EPVQCWQVTV
HGQLEDMALD TEGPEAVTDP LSRLDQPPPH PGHMPKEEAG RGETPGPQDE LIIILKEEPL
PHQEPDSPDT EETWDSSADE SSLGCFPRRG PFPGAGAGTL STAEEQSAED GSANLELLRP
SPGQSGEKGP LTPESGQGQE KQGRCQQQRD TMAVNKTSTH RLKIERSKRS LAWNHFTKIK
EDAVRCKLCT TELRYGSSTG AMLNHLKLKH PAISSERQDQ KQPTVPTFVP GSSRKCDTQR
AEKLTELLCS MITEDMLPIS VIEGAGFRAL MSFVEPEYHV PVRQTVTAQL EKWYENCVNS
LREKLGKADK VAFTTDCWTA VNAESYMTIT CHYIENWELR AAVLQTESMP EPHTADNIAE
KLNSIVERWG LENRVLACVH DNVSNVVLAD MPRYVAWDSV NCFAHTLQLA INDGFSGSVA
HVIAAASRLV AHFQPGSVAA KTLERKQTQY QVKHNPLIQS CKTHWNSAYE MFERLSEQRR
PIMAVLSDRS FTKLYDVQTF ELQDEQWQLI EDILPILSTL KCATTAMSAE HSASISNIYP
ICNSLLNTHL KTETAVESGK IADFKSAVRC SLEHRMKPND PAIINRAALI ASILDPRHKH
LQFLLPDIQI AAKSKLMQLA SFLKSEAAPS AAAEPAEPTA PAAKKPRETK KKSAIAVLLG
EDYSKKTDGN AVENELESYF REPCPSLECN PLEWWKVNAP RFPRLAKLAR SYLCIPGSSV
PAERVFSTAG LTVNQLRSRL TSEHVNMLIF LNKNHQ
//