Database: UniProt
Entry: A0A151P4C8_ALLMI
ID   A0A151P4C8_ALLMI        Unreviewed;      2477 AA.
AC   A0A151P4C8;
DT   08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT   08-JUN-2016, sequence version 1.
DT   07-OCT-2020, entry version 12.
DE   SubName: Full=Centromere protein F {ECO:0000313|EMBL:KYO43605.1};
GN   Name=CENPF {ECO:0000313|EMBL:KYO43605.1};
GN   ORFNames=Y1Q_0013620 {ECO:0000313|EMBL:KYO43605.1};
OS   Alligator mississippiensis (American alligator).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC   Alligator.
OX   NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO43605.1};
RN   [1] {ECO:0000313|EMBL:KYO43605.1, ECO:0000313|Proteomes:UP000050525}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO43605.1};
RX   PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA   St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA   Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA   Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA   Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA   Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA   McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA   Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA   Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA   Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA   Ray D.A.;
RT   "Sequencing three crocodilian genomes to illuminate the evolution of
RT   archosaurs and amniotes.";
RL   Genome Biol. 13:415-415(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KYO43605.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AKHW03001146; KYO43605.1; -; Genomic_DNA.
DR   STRING; 8496.XP_006260426.1; -.
DR   eggNOG; ENOG502QVMD; Eukaryota.
DR   Proteomes; UP000050525; Unassembled WGS sequence.
DR   GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR   GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR   GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR   GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR   GO; GO:0008134; F:transcription factor binding; IEA:InterPro.
DR   InterPro; IPR043513; Cenp-F.
DR   InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR   InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR   InterPro; IPR018463; Centromere_CenpF_N.
DR   PANTHER; PTHR18874; PTHR18874; 2.
DR   Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR   Pfam; PF10473; CENP-F_leu_zip; 2.
DR   Pfam; PF10481; CENP-F_N; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT   DOMAIN          51..355
FT                   /note="CENP-F_N"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   DOMAIN          1443..1581
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          1678..1813
FT                   /note="CENP-F_leu_zip"
FT                   /evidence="ECO:0000259|Pfam:PF10473"
FT   DOMAIN          2329..2371
FT                   /note="CENP-F_C_Rb_bdg"
FT                   /evidence="ECO:0000259|Pfam:PF10490"
FT   REGION          184..209
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          241..327
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1259..1283
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2262..2308
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2369..2411
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2443..2477
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          63..146
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          154..181
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          212..239
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          328..443
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          455..563
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          571..591
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          601..649
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          682..765
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          831..851
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1093..1113
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1125..1145
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1170..1190
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1370..1397
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1419..1544
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1552..1572
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1584..1614
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1640..1779
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1857..1898
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1902..1922
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          1936..2027
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2031..2076
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2099..2133
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2155..2175
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2196..2223
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          2231..2251
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        258..303
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1259..1276
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2262..2281
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2282..2302
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2446..2460
FT                   /note="Polar"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2462..2477
FT                   /note="Polyampholyte"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2477 AA;  285715 MW;  ADDBC261AE82DAF0 CRC64;
     MVHTLVHETL ALGELCGWSS KSDHLKESLP RISLETFNNF SFGIKEQGDK MSWAVEEWKE
     GLSTRALQKI QELESQLDKL KKERQQRQFQ LDSLEAALQK QKQKVENEKN EGATLKRENQ
     SLMELCDNLE KAKQKISHEL QVKESQVNYQ AGQLNSGKKQ IEKLEQELKR CKSELERSQQ
     TLITGDLSFS GTPQKSFTAS LTPTQNQNDS KFAELQEKYN KEVEERKRLE AELRASQIKK
     INQSHLQSTK SHREIARHQA SSSVFSWQQE KAPSCPSSSN QETPLKRSFS TSNFPWEQET
     TPSHSGLRSE KKDFNRSFSN NSNNSPVIDR LKAQNQELCS RIKELEHILQ VQEKEKKSHM
     NKLQETQLQL DKMKQKLNEK DNHLNKGKDE LTRMTAQLDQ AAAQYETVEQ KVKKLSEELK
     CQRQNAESAR HSYEQKVKEK EKEYLEELSR QQRSLHTLDQ QCNQIKSKLN QELQQAKNDY
     NALQAELDKV TAAKQLLEHD FSELTQKLSR AEQALLATQS KENELRKNFE EVKKEKKILN
     CQFDQKLREI HQLEEELKTA KQFLKQSQNF AEEMKNKNLS QEAELKLLQE KLDKQDSSVT
     LEKLKLVIAD MEKQQESVQD LLIQRENHIK ELNNKIGKME KETGDLQKVL GVKKRECEEI
     RKEIITFSQW KTENEQLVNH LESEREANYE RLKESAEEKE RDLNKCQVKL ELLQMDLEDK
     EVSVENYKTQ VMQLEAALKS SEIKLEESEK EKEGMMQELE IIKEKLETPD AKLIVMNTNE
     HSEDFNGDVV SQYHYKKGLD ENCFSGLHEL TSSQNDDVQL VSSWQMTVNR INELEKMCEK
     VQIEKLALTT EHNESKTESV ATTAKMAEVG QLMNEVKILK EEKAIFPDEF MDQNDEDRSE
     IQFNEPVSCK SLECNVGLNY DYEFLKLSEK EVKIHFVEIK EKLFSLQNQH KILHEQHCKT
     ISQISELQSC IETLKAENSA LSTSLNKVNT DLVQVTPLQN SGEFKSVGSK HIFSPLGLNE
     ISHFAEESFV SSSFDNLMYK KSEDITHLNS SEESVLGGTT EISLVEEPYD HALERIAQQD
     SITSTKSHLN SKIEELQTLC QTYKKSIKML EDQFCSQENM KNEEIQELKQ IILSERKEID
     DLKKQNISDN DQWQQKLNSV TMEMEYKLAA EKKQTENLFL QLEVARLQLQ GLDLSSRSLL
     CVDVEDVPPE EENGLQQLKV DSLPTENVTH ESDTPDIRHC EQIAIEDIAE CGKVTEITET
     RSTEKHSEKL PSERDYSYMS DKNTNLSDKT SDLSFSRHGL SETAVDFLEN EVAIETLQQQ
     VTQKSEENLK LLHGIQGSNE KADVLLFEIK ELNSRLDLKE TALTAKISVC TELEKTVLDL
     EKEQRDLKEK LESAAFDKQQ LSCRGTTLEK ELEKVRSDIE MYKVRLSDVT DMLDDLEKTK
     GEWQEKFLET ENELKRTKSE KANVESHALA LEDDIEVLQT KYQQLERDSE NKLKTMSGLQ
     EQLAVITAER NQLSEELSIL RENKEELDQV YQKLQEKTKE LESNKIDSTE FIRILEAEVK
     TLTKLLQTEK SNVSHLTKEK DCLLQQLEKN TEVLALEEQK LQSLTGHLNE EKELILKESE
     MLQTQLSASE MEKSKLSKSL EGLLTEKHEL AARLNSAQEE VDQMRCGIEK LKIKIESDEK
     KKRHVAEKLR ECEWKRDSLL DKVEKLEREL QISEDNLEDA ILRAETAKVE ADTLNAEMGE
     RDQKLKTLEL EITDIQAEKE GLVKELKEKQ EKIFELESSN STVVKLLEIK EKENIKMREL
     QNVVLLVTSQ LKDARLSYNE QEVCEAKGVD IINEVGCLEY DDKTQLLEEL QEMDTFSAKL
     EQSVKALVQK LATYKQKLTE KIQENVTLQN QIKDTEQLSV QLLHLESEHE HWKEEKEGLQ
     NLMAELIPKV QNLSNTETWQ SALENLKISY KDLEKELEST RSEKTAFLEK VNELTENSIL
     LEDKLKKGEE KIMKLQEELT TERNILVEQV QHLQEQAENN LIQLNLNTLE KAELSNSLDK
     VQKELEEKGR EMKRELSEYQ RRLHQMEKNH QVVLAETNKK NELEIMACQD KLKSLEQCIS
     AQKLEMELLK SSKEELNNSV KEANQMLEGL TKNKVDNLKT IVQLKKEIEL AHSKLQLCIE
     SCKQVEQEKE VLQKQIVERD ALLKKQNQTV ADGASTEEMR LKLEELQESI EVKTKEADEN
     LEKYCSLIVN FHKLEEANEM LKTQVSLLNA QLKPTTDVVV SNSPLLSLDN PVTVNDQPVT
     ERSQEDSTRL SGKRRRSQEI KENGVPRSPI PEILAKKLKK GAVYQDSLSE NREYQPEGLP
     EVVKKGFGDI PTGKISPYIL RRTSLNLRTS PRLAAQSQRS SPSTQSFQKG RSDNLAELSK
     PTAGGSKSQK VNDALQCQAG TPILPMEPTS RSLCVNKHSM KAVAESSRES LETPQNKYSL
     RKQALPDKDE EENCRVQ
//
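
The FT (feature table) lines in the record above follow a regular layout: a feature line gives a type and a `start..end` range, and the continuation lines that follow carry `/note` and `/evidence` qualifiers. As a minimal sketch of how those lines could be read programmatically, the Python below parses a short excerpt copied from this entry and sums the residues covered by one feature type. The names `parse_features` and `covered_residues` are hypothetical helpers introduced for illustration, not part of any UniProt tool, and the regular expressions assume only the simple `start..end` form seen in this entry (no fuzzy or single-position locations).

```python
import re

# Excerpt of feature-table lines copied from the entry above.
EXCERPT = """\
FT   DOMAIN          51..355
FT                   /note="CENP-F_N"
FT                   /evidence="ECO:0000259|Pfam:PF10481"
FT   COILED          63..146
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          154..181
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   REGION          2443..2477
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
"""

# A feature line: "FT", three spaces, the feature type, then "start..end".
FEATURE_RE = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# A qualifier line: "FT", padding, then /key="value".
QUALIFIER_RE = re.compile(r'^FT\s+/(\w+)="(.*)"\s*$')


def parse_features(text):
    """Return a list of dicts, one per FT feature, with its qualifiers."""
    features = []
    for line in text.splitlines():
        match = FEATURE_RE.match(line)
        if match:
            features.append({
                "type": match.group(1),
                "start": int(match.group(2)),
                "end": int(match.group(3)),
                "qualifiers": {},
            })
            continue
        qualifier = QUALIFIER_RE.match(line)
        if qualifier and features:
            # Attach the qualifier to the most recently seen feature.
            features[-1]["qualifiers"][qualifier.group(1)] = qualifier.group(2)
    return features


def covered_residues(features, feature_type):
    """Sum inclusive range lengths; assumes ranges of one type do not overlap."""
    return sum(
        f["end"] - f["start"] + 1
        for f in features
        if f["type"] == feature_type
    )


if __name__ == "__main__":
    for feature in parse_features(EXCERPT):
        print(feature["type"], feature["start"], feature["end"],
              feature["qualifiers"].get("note", ""))
```

Running the same functions over the full set of FT lines, rather than this excerpt, would give per-type coverage for the whole 2477-residue sequence. For production use an established flat-file parser is likely the safer choice.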