ID A0A2B4REM7_STYPI Unreviewed; 2524 AA.
AC A0A2B4REM7;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 22-FEB-2023, entry version 13.
DE SubName: Full=Complement C1q and tumor necrosis factor-related protein 9 {ECO:0000313|EMBL:PFX16081.1};
GN Name=C1QTNF9 {ECO:0000313|EMBL:PFX16081.1};
GN ORFNames=AWC38_SpisGene19681 {ECO:0000313|EMBL:PFX16081.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX16081.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX16081.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000579; PFX16081.1; -; Genomic_DNA.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR Gene3D; 3.40.1800.10; His-Me finger endonucleases; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR038563; Endonuclease_7_sf.
DR InterPro; IPR044925; His-Me_finger_sf.
DR PANTHER; PTHR31511:SF12; DNA-DIRECTED DNA POLYMERASE; 1.
DR PANTHER; PTHR31511; PROTEIN CBG23764; 1.
DR Pfam; PF01391; Collagen; 4.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF54060; His-Me finger endonucleases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000225706}.
FT REGION 1301..1412
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1657..1774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1361..1375
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1723..1737
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2524 AA; 282224 MW; 5E3E15D125D93E9E CRC64;
MNSIARRHLN TLQRNLRLRG YSHLSEASHT DEPKKSENNL RKKIEKVVEW GKKKVGNWGK
WLKGLVAPRA VVDDKLKDFK THINKLNEQR DQHQFNLVES KSSLKKFANQ YIIDGVEGYG
PIAFLQAVKP TVIKFLLTQK NIKMTLVLRC NMSKSNIATG ETVLQLAYFK SFVEIVFQET
DREELYQKCV DKMMESLAKF TREGSGWVVH SIAGLDLHTV EYTPLEGSSY IKLSPYLAKK
KAIVNMKNED DECFKWCVTR ALNPVKKDQE RITKILRKQA EKLKWGGIEF PMEVKDIHRF
ETLNPGIAVN VYLYEGGLQP LRVSKVESSK IHIDLLLISD GEKKHYCLIK DLSRLISSEL
SKKKNKKFFC RRCLNYFGSQ KLLDTHIELC GEHEAVRERM PKKGTFLSFK NHHKKMDKPF
VIYADFESII KPLHSVQPNP KECYTEKKQQ HIPVSFCYYI KCSFDDRHSK LVEYTAASED
EDVAQIFVNM LEKEVKAIYK NHPSKEMIFT DSDVEIFEKA TCCWLCEEDF KEGEEKVRDH
CHYTGKFRGA AHNSCNLRFR RPKFTPVVFH NLAGYDAHLF VRNLGVSEGD INCIPNNEEK
YISFTKNIVV DTFFDKKKEK VVEVKRELRF IDSFKFMASS LDKLVNNLVK KDDTLVNTGK
YYDGEKLELL KRKGVYPYEW MDSIFKMNET QLPQIEAFFS VLSGRGISEE DYCHALKVWK
TFGMKTMRDY HNLYNKSDVL LLCDVFENFR KVCKKNYDLD PCWYYTAPGL AWDACLKMTE
IKLELLSDVN MLHMFEKGIR GGISMIPTRH SKANNKYMGE KFDSTQPSKF ITYLDANNLY
GWAMSKSLPT GGFEWVDEKD FGGWENFPCI LEVDLLPIEK DLYDYFDHYP LAPENLLIGK
VKKLVCTLNE KKKYIIHHET LKLYRSLGIK IGKIHRVIRF NESPWMKKYI DLNTSLRTKA
DNDFEKDFFK LMNNSVYGKT MENIRNRVDV RLVNSEDKAK KLANKVNFKH CTIFSENLCA
IEMRKTQVTF NKPLYLGMCI LDISKTLMYD FHYNYIKIKY EDKAKLLFTD TDSLCYEIQT
EDFYKDIIND VDRLFDTSNI SKEHPSGIPS GVNKKVIGMF KDEAGGKIER LTPRKICKRK
RKVVFFKKFF CYVVFIIKMS NYYSYGVTLS DNQKRKLAKA FNEKSAITIR LSHDELVGSD
EMMLTQTQIK KIKKAINSGK GVDLKISKTQ MTKVAQKGGS LFSSLLALGT KLLPKAMNLA
TKALPGLATG ALSSLGNFAT DKILGAGQSG EAKWWLEVAR SQGPKGEKGD TGSQGPAGSK
GDVGPKGEKG DTGSQGPKGD TGSQGPRGPR GLQGLRGSAG PKGDKGDKGD KGDKGDVGPQ
GSQGPKGDTG ARGPRGPTGP GGSNIDLSNY LDKTKGGNLH KALKFVSSHG ADRQVSGLSD
QPLNGTAAIN LNKLNTELAK KADSSTVING LSGKADTSSV LLLDGSKKMT SNIDLNGNDV
INSKRENYLG MTTAQQAAYE NSNTLVSRYE AGSIKRHLKG LLDITTLSNA NQQYVDNVKK
TVPSFKNIDD KQILDARKRK IVNLPDTFSD NDEAVSKKYS DTKLSKAGGT MTGNLAMGAN
KVTSSHTAAA DSDLVNKKFV EDRLAHNLTP AQLSNDLSYI MSRNGPKGEK GDTGSQGPAG
SKGDVGPKGE KGDTGSQGPK GDTGSQGPRG PRGLQGLRGS AGPKGDKGDK GDKGDKGDVG
PQGSQGPKGD TGARGPRGPT GPGGSNIDLS NYLDKTKGSN LHKALKFVSS HGADRQVSGL
SDQPLNGTAA INLNKLNTEL AKKADSSTVI NGLSGKADTS SVLLLDGSKK MTSNIDLNGN
DVINSKRENY LGMTTAQQAA YENSNTLVSR YEAGSIKRHL KGLLDITTLS NANQQYVDNV
KKTVPSFKNI DDKQILDARK RKIVNLPDTF SDNDEAVSKK YSDTKLSKAG GTMTGNLAMG
ANKVTSSHTA AADSDLVNKK FVEDRLAHNL TPAQLSNDLS YIMSRNGQFS DEDDITGKPI
TDQLVLYPKN PRTKPFDLSL DTSKGYYSSR FGVNMYPADR AEYTVVCELC WQSSKVDSSS
VTLTATSSVE TISTQRSNRF ENHIVALIHM TKWSNATPNY LMFDVVIKNK SGQSYDQKLP
IWVIVYGSKG YHNSVPKSVW TSWYSFVSGG VQINSALTLA KQPSVATSAV TKKYVDDIQT
SLQTKIDLKA TKTALTNATK KIWYRGNCAH NNASQVTFYV NGTSDHTTNV SQSDNADFTV
NRSDNTKLII NNAGAYLITY IDGVKSQHLS HLKFILSNSF VNTANGDEKI DELKALYKHY
HKKWWCSRKI FKNFKRKSLL CNLGSTLLIV IGGIAGGVTM NPIPLATISG AGLLLKTFTD
VKKFDGKIDM SKFAFTTYEK ILTELRSYLR GRPFDNASFL NEVRLIDETI IDLCPILQST
KVLKQYETRF SREEINKVHK PYETRFSRKE IIKVVLSADD DKRIVLPDKI NTHAIGYYQD
LTEI
//