ID G7PS97_MACFA Unreviewed; 1801 AA.
AC G7PS97;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EHH57363.1};
GN ORFNames=EGM_06971 {ECO:0000313|EMBL:EHH57363.1};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541;
RN [1] {ECO:0000313|EMBL:EHH57363.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CE-4 {ECO:0000313|EMBL:EHH57363.1};
RX PubMed=22002653; DOI=10.1038/nbt.1992;
RA Yan G., Zhang G., Fang X., Zhang Y., Li C., Ling F., Cooper D.N., Li Q.,
RA Li Y., van Gool A.J., Du H., Chen J., Chen R., Zhang P., Huang Z.,
RA Thompson J.R., Meng Y., Bai Y., Wang J., Zhuo M., Wang T., Huang Y.,
RA Wei L., Li J., Wang Z., Hu H., Yang P., Le L., Stenson P.D., Li B., Liu X.,
RA Ball E.V., An N., Huang Q., Zhang Y., Fan W., Zhang X., Li Y., Wang W.,
RA Katze M.G., Su B., Nielsen R., Yang H., Wang J., Wang X., Wang J.;
RT "Genome sequencing and comparison of two nonhuman primate animal models,
RT the cynomolgus and Chinese rhesus macaques.";
RL Nat. Biotechnol. 29:1019-1023(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001290; EHH57363.1; -; Genomic_DNA.
DR eggNOG; ENOG502QQKG; Eukaryota.
DR Proteomes; UP000009130; Chromosome 15.
DR GO; GO:0060147; P:regulation of post-transcriptional gene silencing; IEA:InterPro.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR022542; FOCAD/RST1_DUF3730.
DR InterPro; IPR045163; Focadhesin/RST1.
DR InterPro; IPR021392; Focadhesin_C.
DR PANTHER; PTHR16212:SF4; FOCADHESIN; 1.
DR PANTHER; PTHR16212; FOCADHESIN FAMILY MEMBER; 1.
DR Pfam; PF12530; DUF3730; 1.
DR Pfam; PF11229; Focadhesin; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 4: Predicted;
FT DOMAIN 490..714
FT /note="DUF3730"
FT /evidence="ECO:0000259|Pfam:PF12530"
FT DOMAIN 1213..1801
FT /note="Focadhesin C-terminal"
FT /evidence="ECO:0000259|Pfam:PF11229"
SQ SEQUENCE 1801 AA; 200447 MW; 7618F06C071A339D CRC64;
MSDDIRKRFE FPNSLIQSQA VGHLIAAVLK ENGFSEKIHQ STNQTPALNL LWEKCCSDNV
VVRTACCEGL VALVAQDHAE FSYVLNGILN LIPSTRNTHG LIKAIMKLLQ MQALKEGQGG
ERNIQNIYTI RNHPHPLITV LEHRPDCWPV FLQQLTAFFQ QCPERSEVSC IQIMAPFLWY
LYCEPSQLQE YAKLRLALLK VLLQPQVLCD KDQPSILEQQ ILQLCCDMVP CLQIKDLIQT
TEAMMFIEEV YLSLLRHPVF WKIQLTQMTL QLLCVCEVSL KITGECSSLI HLLEHCVELL
KEDFPVELII IGIALLLLQT PASQQKPILN LALKLLSVTE DQKIPKSSLL LVMPILQILS
STALEDCISM DEEGPSRQQL ALNLLEMIQQ ECYRDDHQKL SYKLACPVTS MYGSIFTAWT
ILEVMRDSSA ANDWLASVES LLPITTVIPV PAFLLLAHLL VEDKGQNLHQ ILKVTTELAQ
ADSSQVPNLI PVFMFKLGRP LEPILYNDIL YSLPKLGVHK VCVGQILRVI QLLGTTPRLR
AVTLRLLTSL WEKQDRVYPE LQRFMAMSDV PSLSVGKEVQ WEKLIAKAAS IRDICKQRPY
QHGADMLAAI SQVLNECTKP DQATPAALVL QGLHALCQAE VVCIRSTWNA LSPKLSCDTR
PLILKTLSEL FSLVPSLTVN TTEYENFKVQ VLSFLWTHTQ NKDPIVANAA YRSLSNFSAG
EHTILHLPEK IRPEIPIPEE LDDDEDDEDV DLSVPGSCYL KLLSLTPSLV LPALEEFFTS
LVKQEMVNMP RGIYHSALKG GARSDQGKTV AGIPNFILKM YETNKQPGLK PGLAGGMLFC
YDVSMYQSKD GKPLNRLMAS RGRSFKQTSL ALVHEVHIQL SEWHRAIFLP QAWLAYMSRA
YHAILQGRLG ELELQLKHGK EEPEEVQYKK STAWLWVRDM LTDEITKAAA KESPVVKGNA
LLALSSLAVV VSRHEASLSS DSDGLLEVQP NFLSMKEWIS MVLDTLLVIV DSHYQPRGQL
LSWFYYKSYS GENTASAIAR SAAATALSLL VPVFIISCKE KVEEILNMLT ARLPGKPSAD
ESQAVQIHMG LALGMFLSRL CEEKLSDISG QEMNLLLMKS LDALENCCFD TSLEYNTGCI
LGVGLVLSLM SHSSQMQSRV HVAASLRKLS AYLDDSGSQS RTFQEVLAYT LSCVCTSAFS
AGIIEAAEAE DVMNKLRLLV ENSQQTSGFA LALGNIVHGL SVCGHGKAED LGSKLLPAWI
RIVLTEGTPT MLCLAALHGM VALVGSEGDV MQLKSEAIQT SHFQGRLNEV IRTLTQVISV
SGVIGLQSNA IWLLGHLHLS TLSSSQSRAS VPTDYSYLPE SSFIGAAIGF FITGGKKGPE
SVPPSLLKLV MKPIATVGES YQYPPVNWAA LLSPLMRLNF GEEIQQLCLE IMVTQAQSSQ
NAAALLGLWV TPPLIHSLSL NTKRYLLISV PLWIKHISDE QILGFVENLM VAVFKAASPL
GNPELCPSAL QGLSQAMKLP SPAHHLWSLL SEATGKIFDL LPNKIRRKDL ELYISIAKCL
LEMTDDDANR IAQVTKSNIE KAAFVKLYLV SQGRFPLMNL TDMLSVAVQH REKEVLAWMI
LHSLYQARIV SHANTGVLKR MEWLLELMGY IRNVAYQSTS FQNMALDEAL DFFLLIFATA
VVAWADHAAP LLLGLSASWL PWHQENGPAG PVPSFLGRSP MHRVTLQEVL TLLPNSMALL
LQKEPWKEQT QKFIDWLFSI MESPKEALSA KSRDLLKATL LSLRVLPEFK KKAVWTRAYG
W
//