ID V4Z8E2_TOXGV Unreviewed; 5565 AA.
AC V4Z8E2;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=Down-regulated in metastasis protein {ECO:0000313|EMBL:ESS28727.1};
GN ORFNames=TGVEG_216210 {ECO:0000313|EMBL:ESS28727.1};
OS Toxoplasma gondii (strain ATCC 50861 / VEG).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Conoidasida; Coccidia;
OC Eucoccidiorida; Eimeriorina; Sarcocystidae; Toxoplasma.
OX NCBI_TaxID=432359 {ECO:0000313|EMBL:ESS28727.1, ECO:0000313|Proteomes:UP000002226};
RN [1] {ECO:0000313|Proteomes:UP000002226}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50861 / VEG {ECO:0000313|Proteomes:UP000002226};
RA Lorenzi H., Inman J., Amedeo P., Brunk B., Roos D., Caler E.;
RT "Annotation of Toxoplasma gondii VEG.";
RL Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESS28727.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAYL02000332; ESS28727.1; -; Genomic_DNA.
DR STRING; 432359.V4Z8E2; -.
DR PaxDb; 5811-TGME49_016210; -.
DR EnsemblProtists; ESS28727; ESS28727; TGVEG_216210.
DR VEuPathDB; ToxoDB:TGVEG_216210; -.
DR eggNOG; KOG1075; Eukaryota.
DR OMA; GEDICTS; -.
DR Proteomes; UP000002226; Partially assembled WGS sequence.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR046523; UTP20_C.
DR InterPro; IPR011430; UTP20_N.
DR PANTHER; PTHR17695:SF11; SMALL SUBUNIT PROCESSOME COMPONENT 20 HOMOLOG; 1.
DR PANTHER; PTHR17695; UNCHARACTERIZED; 1.
DR Pfam; PF20416; UTP20_C; 1.
DR Pfam; PF07539; UTP20_N; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000002226}.
FT DOMAIN 2135..2223
FT /note="U3 small nucleolar RNA-associated protein 20 N-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF07539"
FT DOMAIN 3641..3719
FT /note="U3 small nucleolar RNA-associated protein 20 C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20416"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 265..330
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1578..1599
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1794..1883
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1928..1947
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2001..2032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2269..2291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2389..2416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2603..2646
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2707..2772
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2799..2826
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3086..3105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3507..3580
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3613..3643
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3764..3783
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4038..4067
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4306..4383
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4468..4492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4965..5142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5192..5254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5360..5381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 5528..5547
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 2647..2674
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 275..293
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 301..321
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1794..1839
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2621..2638
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2723..2747
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2799..2819
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3507..3529
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3530..3568
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3617..3642
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4039..4064
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4306..4330
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4337..4351
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4355..4383
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4472..4492
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 4976..5014
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5015..5029
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5060..5078
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5082..5096
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5105..5127
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 5196..5236
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 5565 AA; 603698 MW; AE445959CF718DB4 CRC64;
MGNPREGGRG PAVPGGSLPA CDERRKQQLE ERRRVNLQTK CFDPTTTRFR FLSFAERLQI
QLKAQTSPLR STGLLDLLEE QKALLERTVG SVGWGKRSQP DTLFGDAEAE EERRKRRKTE
ELQFSSLREV IRHLATVQHP PAVAALLSEL APLCDSLPLL LFHRERILQS LLRLLLEDEA
ITVVLNLLQA LAKDLRSEFV AFLPSVFSSL LLLSRQLELT VDAERLRLLF SCIGSIFRYL
ARPLLREFDF AVALFMPWLC GDDASPPSGG ERSEAGEARG GRPEKRDGPG RAGRAENRQG
APAAKAKREG HKKETEETPE QMPSVSRKGQ TVTEEIRLLA ARAFACLLRK AGGEEGDSGD
LLGCIETLFR HMSRCSLSAR EAYGESVAMI LFESVKSVQL SFKRPFDCVV RFLFATVLFQ
KPYVEAGERS WAPSLSAAFP SGNETDTQLE GETDIREADV AARGVEEGGQ MAEDRESATE
GKGIPKVEDA FAVYRAQVKC LVSFLLHARQ HTKALESPGA KKIEALLLEF FSVAVKAAPS
ALLASLAKLG RSAADEETDV KHPANSMNLF SGSFASVLAS PFHAYFYNFP APHLFTDHMY
LSGVDRTSPL SSCLSLALAA SGKTGGSLPS SPASPLFLAY GLLTLIELSG AWVFELPRLE
GSFDAFLCRL FTVLSPALLK SGGDKQKQKT KQDSRKKKSW EGSLWRVTAT EEEAQEIERA
PLVLQPLLAT CVCLRRTPVA KEAQQGERSP RQGLSSTSLV ASECLFLSSA LQLEAVSFFS
LFLPCAVLRL LSLLWRAVPL SFCSAVNAVW SSTSSDLFGG LLSVPLKFLQ FSTARALPLA
SPAQPATLLS SSVFPVKSSP TCLLEAYVNC ALAYTSMLHA HLAPFKEETL RRQNCVHGFG
LYGLRSTCGF LLRLLASQPR QVALSSALPL QLRSHSPSLS GDMLEWERSA ASLLLAEEMS
EFVEASSSRL DASGVDSAQT QEKKGESNEG RLLLLQAKQS LPQLLLAVMR RLQKTRSAFE
GSRDSAPPHS GRPTRFEVDL ASAAQRRTVD DLACTYSMLS SFLPLLSATE IQAYVAGEGE
DAGVTKSDQR REELQKLLAL VHTETSHLIA VLLLWLAGKL LALPVDVHLA AQNVDTVLSR
DCARPCACET ADSDGALPTT FASVRSRGKV TREAETDVTA VLHILRICFR CLDTATPLIR
GSSEAAEAAK DLEDTQGRRL RTRAGRLPAP PIELLQVVQE VEEKVKHTAA LTREPRPSFC
VFSLAACSAF SSSEKWLAVI LSLFSSGSFL YELHGPSGSS AEPFVSLFSP LLRSAAFCIS
SVLGASFEEV DGEEEKTEVL ESRSSSPQGS AYLRDVVGPH LASILSCCSS SCRRARQDAI
FFLHLCSPFL CSLQSVFSSS CSVSAPSTHA ALEEAISAPS LLPLHRAFAD ACGAQLRSLL
SLEKLPVGLE TERKLINAYG ALVRGATSVF SQGLLLLSQP LDAAEESMAT PEEARMCLDV
SFTRTVAPIL EATVRVLLSQ LFVKFSPLWQ PAVDAVLQLL EDIHALQGEA LQAEASALTS
AAFAAAVSSS SDSAVSLGEQ RSSARSVTAS REPTEGERET CASLEHEGCV ARKTPSEAAC
ASSSGEADSV ASASVLRTEG AELSRMHARL SSALLSTTWE ILTAQVARAV SDLGACRPGR
RAAVPSCRKP EASLGRPEGE TVEPDARGRA AEANCDVESQ RYSAGVTEAG ERQEGSEREA
NAFSKWFVRT EWKRALGDDA DEVSDPLSRH TRLLRLVEGI MGSYGKDAFV TSKREEAAEP
GVQESEEPGT EHACREENRS GTVEKTGGRE SERRLVPGAE SGDAKPVSGP GSEPLAHANK
LKKAKKRVQE EARMSGSRAG GLSKRREALT RFNWLLRQTL CVVETLTEGG ADNASGEEEG
ELREIREAPH EAAERSETPC VSSAPRGEGE RLVTCEEAIF RTNAVSLVPR ACDCLRAVVA
LSRVDSNCFL SAAEKRALAL KSGEKSQQSP GGALGLAAGG KAGGKAGEGP DETDRLLKEA
AESERLWTRL PAVCAQRLVT IADPSLQQLV IQVLQLSPVL RPQLAPYAPL LAQLISDKSS
SNALLRLRLA EPEEEGEDDL ATDDAQATTK RKNEQGKATV AKLVVVQEQH RHLVVPLVVR
ILMAKVQHRA AGRSATAVFA ERRNVFAFLS SVAAQELPLL LSILLYPLVD VWLGPEASKA
EEAFPAHDSS AVLPSLSVGA ASSVPLSGFL KEQYLCAFVA AQLNASAAAS DSGLSPPEGR
KGELLSRRAS APASPAVLGE CTVDGRSAGE TLEMSAHAEK KAQRVLVRLA PCCLPPNVSS
RLPGCCKTLQ QLQDMLKRLL FPATPFLILF YRALLQNIVS TSVASSSSRE IQGATREQDA
GDLGSPGPSS AGRLGAGRDG RAYVKQTVHL CLLLLRQLLD LFPEASNVWR DILKPAGPAL
QHLLEAAVAT AAASPAGGRG PHEEGKKQKK NIPAVVALVC SWAAQPAYFS FFSSVVPEAL
PTLFAVPALP AVLRRVAPSF GSEGPGRRAF GSVPFGRGRG GLGASCGVLE AVLDAALLLS
VGGMDREREE ALAASFHGEI KDLRRRRRKT RARGRHASQG SAKQIRRSAV SSSSSSSEEE
KEMWVEEDEI AQRRQALLRE QRELEAERQQ HEREGMATLL PHVGGLLASL EILFQHRRLA
TLGGSQGATA PAFGSSQGAS GDRKLEAARE DGDAGKAGDQ VAQEEGPDHD EGGEEEPRER
EEDDASAGPQ RSLFGVVRVK ELQLLTRIAR YAVADVNASS EQLGETPLSP TSPSKGRPKI
EASASSLEAS EAASTFAPVS CSAREQRFSV VLRLIRLLVQ SLPSGSVAPT LNARLGSPSS
TTRLLLSLEA LLQLTFPLAA QVRALQEEEG ATPGAASAEI GAQAGPFAGV VHEDAGRPGE
ASKASPSSQL LELKGVLEAM TRHCSILLQS TVSSACRAAV AKLFLAVEFA ACGVMLFSSD
TELDEKVGAQ IERLLSRPQT GAQEEDEVGV EAKGQQLLAA LVPGGSLLRA LLTEREDQTD
GGDARMEKKL LCSLLYRRCS IEERGSAYGE TQPGSPRASP AGREATAPEA PVAFVDNPET
LFQWRLSVAL IVFSLNYRRG QTGLSRGQAL TLLADDEDQP DSSMQLIVLH ALVASATIRG
LPSPNASPAS NEIGQVPPGL LEPLIRHCLF LLGAAGIEWS VQQAAAAVLK AVVDSVALLS
VAPASRVSSS TSSSCMSRQS PQVRMQLLAR LVMPFVRAQL KAPDDAQLRQ GLHLLAYVVR
TLAPVALRLP SLAGPSGPET ETKRLRQLDK VFHLDLASLL AAPPPGAVAE ALLEPGTQKP
GTEDAGGDFF RDFLHMQRHR QGRALQRLAQ AAREKGVGAT TLRLVALPLC FASLLQRASL
RPRSEEERKK KHSFLRKEVF HQGLAEQAVS CLEACSYRLK WPMTLNAVRL LSRFLSDFPE
REGFLFRAIC GVVRAFEAHV KQAVSEALHR DSRVDGGDEG AQRERQTEGE ESETEEEHEE
EGQDEEEEEE RGEGEEHEER AEEVSGSEDE DEGSRRREAA EGRLRARHRL ELMHARIKAD
LLPLLYRLAF GSSSGKKPSE GGDQMLTSPP GTSQSRGAQK GKEQNVRPSI VAVLLLLIRL
LPAADFQQNL PKLVRSVAFN LRSRDRDLRR KARGALVSMA VSLGPSFFSV LVAEVGVLLQ
DRKPSVKKGG AQEEDVAGSS AQAFYRPVFL FTVHAMLKGL VDEEEKKRGD APGMEVEGDK
TGHEEAGLDK ALPLVLPLIA EELSRVADPD RLSQDPSTRQ HSKIDEARHL KGPSLVFLLA
SCSSATSCAS TILPFLYSLL GGRRRVGDTV SSAYSPAYLE RVKDLFLHFV KGANKNRSLS
PFFFFLLSRR LLTLTVALLQ SQLLHPLQLE QAGRRREHLE FLLAKQELRR QQVGGGRRRK
ARANCLPLVR HAQLAEPLED EAGACAEQAE SQILQERMHA PQDDDREPQR HRTREALLRE
DAGFFHCMLF NSAVGLPGEA DKEHSRRPSG EGDKEAHSEG EGEEGQAEAN FLQAPDSLDE
AMALARERRL DGLRAAVAAK KDRAMVTQPG AAKGISLEKA LHKKSLNAYT RQGFDSTSRA
TTVGFTGLRL LLSALKRWKF LLKKAKPEES SDAKVGSKVE MKTKVTQRQL RGELEKTTFA
LLICFCSNIT DLISWTMRCL PHLLPLRLSA LESRGAVVAS MTLRLFHSMG GGWSSSRREE
DHQLFLACSR LLPLLLLHPQ GNMWFAAALN PYVRVPQTHR QLLLTEMERT RSRGRAAKHV
GEERGENQGE ESAASDGSSE VGEEDEQEQT DDAAGAGGSR RERCPAEAAE AVLGREKASE
EEKCKAVEEP LVLSDTQMRL LREEEEKSWR GETDVETRLE ENFKEALLAQ ILHSLESPQL
RLSGLMLFKT LVLPHYRDVA AAVLASDAGE GKPEGEGGKR KKDKDAGGKK ARKNIGDSTL
LCDVYRGVDT IVRLMIQEAG EDAEGRRLAA VCGDIYSHFL LNFPMTEKTQ QYRILFLIKN
LHYSAPDGRR AVLNCLHKVL IRFPSELVVS RYSEVFLLAL SGRLVLEEDR TAHEMLRMLL
HLLLDIAQDS DDPEHSAASL FQTAVTVFLQ PQIPKIMALQ EFVLVFLDSQ RERPMALLPR
VLPLLHRMLL VAADRESRER FRASGVLPSC GDWRVSYKAL LAFEKLMLFV HPSHLDGLFS
RAARAAATRG ETLFKVEGGE AGEEAEAAGV NSVDLELETQ VAAQILGSCM LPSTGAVGGG
DQDTSVHSRW RQQQGLDEQF DAHTLSKLPK KDLLVGAAVG RLWLEAVGEG LRHEHPWLRA
VSLRCVGAYI AAKSPRSYRP STPCLFYLHR SRVSKQFGLI VSDGAPARPG ADVVLAALSP
FCKDLFLEKN PAAVLSARAL VLPLLQLALS RPDVVVVPPR RRLSRAQRGE ENNEGAGDSE
EAAEEPVSDT GDAASDDNEE ETREQDEEIG EEIKDDGETR DGKLAPTDEV GEELDEEEAL
GKTRETLEDD DDEGGEEEGG EESAKVGEGK RRSSSGSDCE QSQDEAGDEQ DNAEDLHSDD
ECSASGSDEE LEEESKDSDE DDSEEEASRS EALAMQVLLR PPCDLAASTE NALASSSSSA
SGFREDSSLD LLAFENKAAE LACSADTIAF SGKRDLQPPS LSTQQEATSK RQRTGVSSSS
SAALRSPASE TGNSPAAFSE GLVVGPEGDR ETQEQIEVTN GGTRPFVVQD EASHPLLFLV
KRLNFWLRRH LGTLGAYRRA RFKKGSHGGR EGENGQKGED ICTSIVRVGA ILSVFHALVY
QLPLHPSRGL HFSPEFPLGA SSKKRKRDRK DGRKGAETEA DESSVHGGSL FFSEAHLERL
LFFVIDAAYR CSTVIRGGEE GGHKKEARTD SLLLQLLSTE RGEGAFAADG GSSLAALWQG
LASLSPKEQQ REVLTGGMFL LGEIQEVLKD GGREELALGL LAKVRTSVLS CRLKRKVDSQ
QQALLQPQKY AEKRRRQQCK KKLSKKRKVA LLIAKNRGGK RREKS
//