ID A0A0L0G2D7_9EUKA Unreviewed; 1687 AA.
AC A0A0L0G2D7;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Clathrin heavy chain {ECO:0000256|PIRNR:PIRNR002290};
GN ORFNames=SARC_04513 {ECO:0000313|EMBL:KNC83235.1};
OS Sphaeroforma arctica JP610.
OC Eukaryota; Ichthyosporea; Ichthyophonida; Sphaeroforma.
OX NCBI_TaxID=667725 {ECO:0000313|EMBL:KNC83235.1, ECO:0000313|Proteomes:UP000054560};
RN [1] {ECO:0000313|EMBL:KNC83235.1, ECO:0000313|Proteomes:UP000054560}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP610 {ECO:0000313|EMBL:KNC83235.1,
RC ECO:0000313|Proteomes:UP000054560};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Young S.K., Zeng Q., Gargeya S., Alvarado L., Berlin A.,
RA Chapman S.B., Chen Z., Freedman E., Gellesch M., Goldberg J., Griggs A.,
RA Gujja S., Heilman E., Heiman D., Howarth C., Mehta T., Neiman D.,
RA Pearson M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C.,
RA Sykes S., White J., Yandava C., Burger G., Gray M.W., Holland P.W.H.,
RA King N., Lang F.B.F., Roger A.J., Ruiz-Trillo I., Haas B., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Sphaeroforma arctica JP610.";
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Clathrin is the major protein of the polyhedral coat of
CC coated pits and vesicles. {ECO:0000256|PIRNR:PIRNR002290}.
CC -!- SUBCELLULAR LOCATION: Cytoplasmic vesicle membrane
CC {ECO:0000256|PIRNR:PIRNR002290}; Peripheral membrane protein
CC {ECO:0000256|PIRNR:PIRNR002290}; Cytoplasmic side
CC {ECO:0000256|PIRNR:PIRNR002290}. Membrane, coated pit
CC {ECO:0000256|PIRNR:PIRNR002290}; Peripheral membrane protein
CC {ECO:0000256|PIRNR:PIRNR002290}; Cytoplasmic side
CC {ECO:0000256|PIRNR:PIRNR002290}. Membrane
CC {ECO:0000256|ARBA:ARBA00004287}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004287}; Cytoplasmic side
CC {ECO:0000256|ARBA:ARBA00004287}.
CC -!- SIMILARITY: Belongs to the clathrin heavy chain family.
CC {ECO:0000256|ARBA:ARBA00009535, ECO:0000256|PIRNR:PIRNR002290}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KQ241852; KNC83235.1; -; Genomic_DNA.
DR RefSeq; XP_014157137.1; XM_014301662.1.
DR STRING; 667725.A0A0L0G2D7; -.
DR EnsemblProtists; KNC83235; KNC83235; SARC_04513.
DR GeneID; 25905017; -.
DR eggNOG; KOG0985; Eukaryota.
DR OrthoDB; 5474327at2759; -.
DR Proteomes; UP000054560; Unassembled WGS sequence.
DR GO; GO:0030132; C:clathrin coat of coated pit; IEA:InterPro.
DR GO; GO:0030130; C:clathrin coat of trans-Golgi network vesicle; IEA:InterPro.
DR GO; GO:0071439; C:clathrin complex; IEA:InterPro.
DR GO; GO:0032051; F:clathrin light chain binding; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0006886; P:intracellular protein transport; IEA:UniProtKB-UniRule.
DR GO; GO:0016192; P:vesicle-mediated transport; IEA:InterPro.
DR Gene3D; 1.25.40.730; -; 1.
DR Gene3D; 2.130.10.110; Clathrin heavy-chain terminal domain; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000547; Clathrin_H-chain/VPS_repeat.
DR InterPro; IPR015348; Clathrin_H-chain_linker_core.
DR InterPro; IPR016025; Clathrin_H-chain_N.
DR InterPro; IPR022365; Clathrin_H-chain_propeller_rpt.
DR InterPro; IPR016341; Clathrin_heavy_chain.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR10292:SF1; CLATHRIN HEAVY CHAIN; 1.
DR PANTHER; PTHR10292; CLATHRIN HEAVY CHAIN RELATED; 1.
DR Pfam; PF00637; Clathrin; 7.
DR Pfam; PF09268; Clathrin-link; 1.
DR Pfam; PF13838; Clathrin_H_link; 1.
DR Pfam; PF01394; Clathrin_propel; 4.
DR PIRSF; PIRSF002290; Clathrin_H_chain; 1.
DR SMART; SM00299; CLH; 7.
DR SUPFAM; SSF48371; ARM repeat; 6.
DR SUPFAM; SSF50989; Clathrin heavy-chain terminal domain; 1.
DR PROSITE; PS50236; CHCR; 7.
PE 3: Inferred from homology;
KW Coated pit {ECO:0000256|ARBA:ARBA00023176, ECO:0000256|PIRNR:PIRNR002290};
KW Cytoplasmic vesicle {ECO:0000256|ARBA:ARBA00023329,
KW ECO:0000256|PIRNR:PIRNR002290};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|PIRNR:PIRNR002290};
KW Reference proteome {ECO:0000313|Proteomes:UP000054560};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 334..357
FT /note="Clathrin heavy chain linker core motif"
FT /evidence="ECO:0000259|Pfam:PF09268"
SQ SEQUENCE 1687 AA; 191009 MW; 051B9F0FADFFD46C CRC64;
MAQTLPIKFK EHLNLQNVGI QAANIGFTTL TMESDKFITI REKVGEQQQV VIVDMANPNT
PVRRPIGADS AIMHPSSKVI ALKAGRTLQI FNLELKSKMK SHLMDEDVVY WTWVDVRTVG
LVTGTAVFHW SMDGDSQPAK MFDRHANLAN SQIIKYRADV DKKWCCLVGI AASNQGGQQR
VVGACQLFSV ERGVSQAIEA HACTFAEFQM DGNAQKSKLF CFAVRGPQGG KLHIIEPAAP
AGNQPFPKKL ADVFFPPEFQ NDFPVAMEMS DRYGVLYMIT KYGYVHMYDV ESGACIYMNR
ISSETIFVTT KQDSTGGCMG INRKGQVLSV TVDEANIIPY IQNTMNNSQL SVRFAIRNNL
GGADDLFVQR FNQLFSAGQY SEAAKVAANA PKGALRTPET ISKFQSVQVP PGQTSPLLQY
FGILLEKGHL NKEESIELAR PVLQQGRQQL VEKWLKEDKL ECSEELGDIV KQYVPMLALS
VYLRSEVPDK VIQCFAETGQ FQKIVLYARK VDYTPDWIFL LRQVMRVNPD SGAQFANMLC
DGGECLAPID QVVDTFMEMN MVQQCTAFLL EALKGDKEEE AALQTRLLEM NLMAAPQVAD
AILGNNMFTH YDRNYVATLC ENAGLFQRAL EHYTDIFDIK RAIVHTHLLN PEFMINYFGT
LSVEDSMECL KAMLTSNMRQ NLQLVVQIAS KYHEQLGTDQ LIDMFESFKS SEGLFYFLGS
IVNFSQESEV HFKYIQAATK TGQIKEVERI CRESNAYDPE RVKNFLKEAK LTDQLPLIIV
CDRFDFVHDL VLHLYKNNLQ KYIEIYVQKV NTGRLPQVVG GLLDVDCNEE ITKNLILSVK
GPFDVNELAE EVSKRNRLKL IQPWLEQQLA AGSEDPGVHN ALAKIYIDAN QSPERFLKEN
PYYDSLVVGQ YCEKRDPHMA FVAYERGQCD DELIEVCNAN SLFKSEARYL VARREADLWA
KVLDPENEFR RQIIDQVVQT ALPESGDPED VSMTVKAFMT ADLPNELIEL LEKIVLDNSA
FSDNRNLQNL LILTAIKADS SRVMDYVNRL DNYDAPDIAN IAIGSGLYEE AFTIYKKFEV
NTSGIQVLIE HIASLDRAYE FAERCNEPEV YSILAKAQLD NNMVKEAIDS YIKADDPSNY
IEVCEVGARN EKFEDLVRFL NMARLKAREP FIETELVFAY AKCDRLADLE EFINGSNVLA
KVQDVGDRCY DADMFEAAKL CFSNVNNFAR LSQTLVKLKQ YQAAIDSARK ANSTKTWKEV
CFACVGDQEF RLAQVAALHI IVHADELQDL IQHYSGRGHF DELISCLEAG LGLERAHMGM
FTELAILYSQ HKPKKMMEHL KLFWSRLNIP KVLRAAEDAH LWKELVFLYT HYDEYDNAAN
AMMKHPADAW DHGQFKDIIT KCGNSENFYK SMTFYLNFSP LQLSELLTVL IPRIDHTRTV
GMFKKNGHLP LIKAYLQQVQ ELNITAVNEN LNDLLIAEED YTSLRESLDS YDNVDAVALA
IRLEKHELIE FRRIAAYLFK KNGRWEQAIA LVKEDSLYQD AIEYAAESKD AETAENLLNF
FVGLQNYPCF TATLYTCYDL LRPDVVLETS WKNNLLDYAM PYVIQVVREY LTKVDDLKKF
QVTKIEQDEE AEARPQPLMY GNQETLMLTM GPSNPMMAPG MGGPGPQGGG MNMMGAGPPT
YGGFNGY
//