ID A0A370DRS1_9GAMM Unreviewed; 2179 AA.
AC A0A370DRS1;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE RecName: Full=RapA2 cadherin-like domain-containing protein {ECO:0000259|Pfam:PF17803};
GN ORFNames=DIZ78_06295 {ECO:0000313|EMBL:RDH87094.1};
OS endosymbiont of Escarpia spicata.
OC Bacteria; Pseudomonadota; Gammaproteobacteria; sulfur-oxidizing symbionts.
OX NCBI_TaxID=2200908 {ECO:0000313|EMBL:RDH87094.1, ECO:0000313|Proteomes:UP000254771};
RN [1] {ECO:0000313|EMBL:RDH87094.1, ECO:0000313|Proteomes:UP000254771}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=A1462 {ECO:0000313|EMBL:RDH87094.1};
RX PubMed=30022157;
RA Li Y., Liles M.R., Halanych K.M.;
RT "Endosymbiont genomes yield clues of tubeworm success.";
RL ISME J. 0:0-0(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RDH87094.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QFXE01000007; RDH87094.1; -; Genomic_DNA.
DR Proteomes; UP000254771; Unassembled WGS sequence.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProt.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR Gene3D; 2.60.40.2810; -; 5.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 4.10.1080.10; TSP type-3 repeat; 2.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR018247; EF_Hand_1_Ca_BS.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR040853; RapA2_cadherin-like.
DR InterPro; IPR028974; TSP_type-3_rpt.
DR NCBIfam; NF012211; tand_rpt_95; 6.
DR PANTHER; PTHR45739:SF12; CHONDROITIN SULFATE PROTEOGLYCAN 4-LIKE ISOFORM X2; 1.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR Pfam; PF17963; Big_9; 5.
DR Pfam; PF17803; Cadherin_4; 1.
DR Pfam; PF05345; He_PIG; 1.
DR SUPFAM; SSF49313; Cadherin-like; 1.
DR SUPFAM; SSF103647; TSP type-3 repeat; 1.
DR PROSITE; PS00018; EF_HAND_1; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000254771};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..2179
FT /note="RapA2 cadherin-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5016662000"
FT DOMAIN 685..752
FT /note="RapA2 cadherin-like"
FT /evidence="ECO:0000259|Pfam:PF17803"
FT REGION 330..605
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 354..389
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 399..414
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..432
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 463..493
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 499..514
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 531..566
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 568..582
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2179 AA; 224807 MW; 9EB0EE6C32EC2589 CRC64;
MFRAFLLVAS MALVLSGCGG GGGDEADPAP APRGIGGGGV KGPLANAVAT ATLFDASQPG
FKGTVVDTGT TDSAAAIQGL ALPFPLTPPY ILEFTSNAGT TDITTGAFPV ISIMRTVITQ
AMLDKGEQIY ATPLTTMATD LAMMNADSSV APFTGDGDGT TTAAEFIAAA QVASTVGFGM
SGDIDIFDTP PLVDDTTDTT DEQADVASYR AAVEALTAVV AQIDDQLPDA DANAVLEELA
ADLSTGQIDG QVPDETGAPI PSSVLTETSA TVLQQDPATL PIPGTDDGTG TSLTVDQVQQ
ILIDETATTG ATTNTDDLDP TTGTIELVLE PAETDPDRDG DNVLNTLDAF PDDATESVDS
DGDGTGDNAD TDNDGVADVD DDFPFDPAEQ VDTDGDGTGN NADTDDDGDG VADTSDDYPL
DNTKSSINDQ DNDGWDALYD PDDNDDTNPG VSFASQDPDG DGIPNDVDTD DDNDGVPDVQ
DAFDTDPNEF LDTDGDGTGD NADTNDDDDA ELDASDAFPK DASETVDTDG DGIGDNADAD
ADADDDNDND NDGLSDVDEG DGAVDTDGDG TPDSRDIDSD GDGYLDSVDL DPTDPSVTVN
SAPAASNGTL TTAEATVGTG TLIGSDVNAG DVLAFSVDSQ GTNGTVVITD AVAGTYTYTP
TDNDFNGTDA FTFRVSDGTA SSNVATVAVT ITPVNDASAA DDDAGATDEG GSVTVNVLDG
DTDVEGDALS VTNLSVPANG TATLNADGTV TYTHDGGETI SDSFTYTAND GTDDSVPATV
NITVTPVNDA PVGVADSGAT LEGGTVTIDL LANDSDAESD PLTISDMTLP ANGQLANNGD
GTVTYIHNGG ETVSDSFTYR ADDGNLTSDL TTVTITITPV NDTPVANDDS FSVGFNSSNN
PLDVLVNDTD AEGSALTINT PLGVTSHGGT VSTDGGTITY TPTDGFNGTE TFTYSVTDGT
ANSADATVTV IVSDNQAPVI TEGVSTAVTM DEDSTPTPFS LTLNATDAES DLLTWSVDTQ
ATNGVATASG TGNSMVVTYV PIADFNGPDS FVVAVTDGIN SDTITVNVTV TAQPDAPEIT
EGATTTVNMD EDGAPTAFAL TLNATDADGD VLTWSISSAA TNGLATASGT GASKVIGYTP
NADANGTDTF TVQVDDGTGN TDTIDVTVNI APVNDAPTIG GTAPDGMVGI AYSFTPTSAD
TEGDTLTFTA TNLPSWASIN GTTGEIFGTP DAEGEQANIT ITVTDDGVPS ESADLGPFSI
TINPATAVDD LSGIYRINST VTGVVPPSSV PISCYSELVG DTDWGYIGMS QSGVSLTSTA
PNGESGLLIS GTFNPADNSF ILSLNLSWED AGYTVNVTFT VNGTYDPATK TFTATIQEDG
DIRDGVGNSV DSCSLSTEAA GTFVYRHNGT EDYNGQYAIE YRGENDVNPD RGNAPVEIEI
SGSSFNLHLP PGAGSSTVGA SNFDASTGFF TFTLEDTWID DWNGDGVDDL VCEADNFGGL
FVRAPDDVTG KPILVMASEG EERVFYGTTD PTVCTNSALT PNDSDGWSDR LYGKPLTAET
YTRRITNAVS GGGVEELHIM GVRNPWLRKA TAASILRVEV YAGATKLCSR RYDQDGFKIM
SRLPYPDFDS EAFQTGFYGF ASCDTYRDGA GTALIHGESY VVKIFDDMGD GEGGNDLEVA
SYGTDAELAD LVTERISRRD IDLNGVASSR TESGQVIGLY GFFNPYQAMT AEFPGVAGAV
GYNFDFDNTE EPIRMRVSSA SASVTVPAGA INEWDGPSEL RVWSIHDDAN GRANSRSRRL
GVSAGMNGYF TVETDNPDLP LGKFSLQVIS DGNNVTCVVP TFTGISCHPW LSGASQFTPN
AIDWASNRVS LTLYDLANGG VAMPTTLTFT DSGNATVTMG TSTGVARAVN PELMVRSQLT
EGGLPKTVIS FGNAPAAFLN ADVSLLAGTF DGGGNSFALW DDVPTGGGND YMDVVDAYFQ
FPFDDGKAQR VGSYASTRDH SILVADTVTA AFTNDRLGLG IPPMTFSLDY TVPVPTAVAA
PPRSAITVNL SVGPPFTGDT ATSLTTAHDI GTAAVTSVSW TSALPADTLW LVRGRLSDPA
TGNAIQYGEV RSEPMNHATS ADLAYDGTNT NTWTWTAPVG LDPSLEPGEL MRLDLRTMDP
ARTMQGDTRS IFIMRSAVP
//