ID A0A151P4C8_ALLMI Unreviewed; 2477 AA.
AC A0A151P4C8;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 07-OCT-2020, entry version 12.
DE SubName: Full=Centromere protein F {ECO:0000313|EMBL:KYO43605.1};
GN Name=CENPF {ECO:0000313|EMBL:KYO43605.1};
GN ORFNames=Y1Q_0013620 {ECO:0000313|EMBL:KYO43605.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO43605.1};
RN [1] {ECO:0000313|EMBL:KYO43605.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO43605.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO43605.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03001146; KYO43605.1; -; Genomic_DNA.
DR STRING; 8496.XP_006260426.1; -.
DR eggNOG; ENOG502QVMD; Eukaryota.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:InterPro.
DR GO; GO:0070840; F:dynein complex binding; IEA:InterPro.
DR GO; GO:0008017; F:microtubule binding; IEA:InterPro.
DR GO; GO:0042803; F:protein homodimerization activity; IEA:InterPro.
DR GO; GO:0008134; F:transcription factor binding; IEA:InterPro.
DR InterPro; IPR043513; Cenp-F.
DR InterPro; IPR018302; CenpF/LEK1_Rb-prot-bd.
DR InterPro; IPR019513; Centromere_CenpF_leu-rich_rpt.
DR InterPro; IPR018463; Centromere_CenpF_N.
DR PANTHER; PTHR18874; PTHR18874; 2.
DR Pfam; PF10490; CENP-F_C_Rb_bdg; 1.
DR Pfam; PF10473; CENP-F_leu_zip; 2.
DR Pfam; PF10481; CENP-F_N; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525}.
FT DOMAIN 51..355
FT /note="CENP-F_N"
FT /evidence="ECO:0000259|Pfam:PF10481"
FT DOMAIN 1443..1581
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 1678..1813
FT /note="CENP-F_leu_zip"
FT /evidence="ECO:0000259|Pfam:PF10473"
FT DOMAIN 2329..2371
FT /note="CENP-F_C_Rb_bdg"
FT /evidence="ECO:0000259|Pfam:PF10490"
FT REGION 184..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 241..327
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1259..1283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2262..2308
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2369..2411
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2443..2477
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 63..146
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 154..181
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 212..239
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 328..443
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 455..563
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 571..591
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 601..649
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 682..765
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 831..851
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1093..1113
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1125..1145
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1170..1190
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1370..1397
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1419..1544
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1552..1572
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1584..1614
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1640..1779
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1857..1898
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1902..1922
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1936..2027
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2031..2076
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2099..2133
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2155..2175
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2196..2223
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 2231..2251
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 258..303
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1259..1276
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2262..2281
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2282..2302
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2446..2460
FT /note="Polar"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2462..2477
FT /note="Polyampholyte"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2477 AA; 285715 MW; ADDBC261AE82DAF0 CRC64;
MVHTLVHETL ALGELCGWSS KSDHLKESLP RISLETFNNF SFGIKEQGDK MSWAVEEWKE
GLSTRALQKI QELESQLDKL KKERQQRQFQ LDSLEAALQK QKQKVENEKN EGATLKRENQ
SLMELCDNLE KAKQKISHEL QVKESQVNYQ AGQLNSGKKQ IEKLEQELKR CKSELERSQQ
TLITGDLSFS GTPQKSFTAS LTPTQNQNDS KFAELQEKYN KEVEERKRLE AELRASQIKK
INQSHLQSTK SHREIARHQA SSSVFSWQQE KAPSCPSSSN QETPLKRSFS TSNFPWEQET
TPSHSGLRSE KKDFNRSFSN NSNNSPVIDR LKAQNQELCS RIKELEHILQ VQEKEKKSHM
NKLQETQLQL DKMKQKLNEK DNHLNKGKDE LTRMTAQLDQ AAAQYETVEQ KVKKLSEELK
CQRQNAESAR HSYEQKVKEK EKEYLEELSR QQRSLHTLDQ QCNQIKSKLN QELQQAKNDY
NALQAELDKV TAAKQLLEHD FSELTQKLSR AEQALLATQS KENELRKNFE EVKKEKKILN
CQFDQKLREI HQLEEELKTA KQFLKQSQNF AEEMKNKNLS QEAELKLLQE KLDKQDSSVT
LEKLKLVIAD MEKQQESVQD LLIQRENHIK ELNNKIGKME KETGDLQKVL GVKKRECEEI
RKEIITFSQW KTENEQLVNH LESEREANYE RLKESAEEKE RDLNKCQVKL ELLQMDLEDK
EVSVENYKTQ VMQLEAALKS SEIKLEESEK EKEGMMQELE IIKEKLETPD AKLIVMNTNE
HSEDFNGDVV SQYHYKKGLD ENCFSGLHEL TSSQNDDVQL VSSWQMTVNR INELEKMCEK
VQIEKLALTT EHNESKTESV ATTAKMAEVG QLMNEVKILK EEKAIFPDEF MDQNDEDRSE
IQFNEPVSCK SLECNVGLNY DYEFLKLSEK EVKIHFVEIK EKLFSLQNQH KILHEQHCKT
ISQISELQSC IETLKAENSA LSTSLNKVNT DLVQVTPLQN SGEFKSVGSK HIFSPLGLNE
ISHFAEESFV SSSFDNLMYK KSEDITHLNS SEESVLGGTT EISLVEEPYD HALERIAQQD
SITSTKSHLN SKIEELQTLC QTYKKSIKML EDQFCSQENM KNEEIQELKQ IILSERKEID
DLKKQNISDN DQWQQKLNSV TMEMEYKLAA EKKQTENLFL QLEVARLQLQ GLDLSSRSLL
CVDVEDVPPE EENGLQQLKV DSLPTENVTH ESDTPDIRHC EQIAIEDIAE CGKVTEITET
RSTEKHSEKL PSERDYSYMS DKNTNLSDKT SDLSFSRHGL SETAVDFLEN EVAIETLQQQ
VTQKSEENLK LLHGIQGSNE KADVLLFEIK ELNSRLDLKE TALTAKISVC TELEKTVLDL
EKEQRDLKEK LESAAFDKQQ LSCRGTTLEK ELEKVRSDIE MYKVRLSDVT DMLDDLEKTK
GEWQEKFLET ENELKRTKSE KANVESHALA LEDDIEVLQT KYQQLERDSE NKLKTMSGLQ
EQLAVITAER NQLSEELSIL RENKEELDQV YQKLQEKTKE LESNKIDSTE FIRILEAEVK
TLTKLLQTEK SNVSHLTKEK DCLLQQLEKN TEVLALEEQK LQSLTGHLNE EKELILKESE
MLQTQLSASE MEKSKLSKSL EGLLTEKHEL AARLNSAQEE VDQMRCGIEK LKIKIESDEK
KKRHVAEKLR ECEWKRDSLL DKVEKLEREL QISEDNLEDA ILRAETAKVE ADTLNAEMGE
RDQKLKTLEL EITDIQAEKE GLVKELKEKQ EKIFELESSN STVVKLLEIK EKENIKMREL
QNVVLLVTSQ LKDARLSYNE QEVCEAKGVD IINEVGCLEY DDKTQLLEEL QEMDTFSAKL
EQSVKALVQK LATYKQKLTE KIQENVTLQN QIKDTEQLSV QLLHLESEHE HWKEEKEGLQ
NLMAELIPKV QNLSNTETWQ SALENLKISY KDLEKELEST RSEKTAFLEK VNELTENSIL
LEDKLKKGEE KIMKLQEELT TERNILVEQV QHLQEQAENN LIQLNLNTLE KAELSNSLDK
VQKELEEKGR EMKRELSEYQ RRLHQMEKNH QVVLAETNKK NELEIMACQD KLKSLEQCIS
AQKLEMELLK SSKEELNNSV KEANQMLEGL TKNKVDNLKT IVQLKKEIEL AHSKLQLCIE
SCKQVEQEKE VLQKQIVERD ALLKKQNQTV ADGASTEEMR LKLEELQESI EVKTKEADEN
LEKYCSLIVN FHKLEEANEM LKTQVSLLNA QLKPTTDVVV SNSPLLSLDN PVTVNDQPVT
ERSQEDSTRL SGKRRRSQEI KENGVPRSPI PEILAKKLKK GAVYQDSLSE NREYQPEGLP
EVVKKGFGDI PTGKISPYIL RRTSLNLRTS PRLAAQSQRS SPSTQSFQKG RSDNLAELSK
PTAGGSKSQK VNDALQCQAG TPILPMEPTS RSLCVNKHSM KAVAESSRES LETPQNKYSL
RKQALPDKDE EENCRVQ
//