GenomeNet

Database: UniProt
Entry: F2UN34_SALR5
LinkDB: F2UN34_SALR5
Original site: F2UN34_SALR5 
ID   F2UN34_SALR5            Unreviewed;       983 AA.
AC   F2UN34;
DT   31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT   31-MAY-2011, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   RecName: Full=Integrase catalytic domain-containing protein {ECO:0000259|PROSITE:PS50994};
GN   ORFNames=PTSG_12850 {ECO:0000313|EMBL:EGD78533.1};
OS   Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC   Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX   NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN   [1] {ECO:0000313|Proteomes:UP000007799}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA   Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA   Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA   Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA   Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA   Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA   Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA   Birren B.;
RT   "Annotation of Salpingoeca rosetta.";
RL   Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL832983; EGD78533.1; -; Genomic_DNA.
DR   RefSeq; XP_004989482.1; XM_004989425.1.
DR   AlphaFoldDB; F2UN34; -.
DR   STRING; 946362.F2UN34; -.
DR   EnsemblProtists; EGD78533; EGD78533; PTSG_12850.
DR   GeneID; 16070027; -.
DR   KEGG; sre:PTSG_12850; -.
DR   eggNOG; KOG0017; Eukaryota.
DR   InParanoid; F2UN34; -.
DR   OrthoDB; 2470635at2759; -.
DR   Proteomes; UP000007799; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR   PANTHER; PTHR42648:SF11; TRANSPOSON TY4-P GAG-POL POLYPROTEIN; 1.
DR   Pfam; PF00665; rve; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007799}.
FT   DOMAIN          252..418
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          487..537
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          961..983
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        487..522
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   983 AA;  106827 MW;  587D62EF0FD3797A CRC64;
     MFTTHTHTSL QQQHHHHELT TVETFVIHHD ALMLQPEGEE GHAIGPVPPQ LWLADSGATA
     HITSDASLLT NYIAVTQHVT VANSQAVTAI GKGDAVLRVT GTDGAPGTIT LKNVLHVPDI
     KYNLLALNLV TRARRGASVV IAADDSRIQF ATWSVPLLPS KDTGHLFLYA STTTSQQATA
     LAGELVHSSD GEAEDADEGA SAHDVLGHRS SAAVDAIRQQ IAQASSKHIV TCGPCALGKS
     TRASIPKQSA PRSAGPCQLL YADIAGPLEE TSLRGARYAL TFIDDCTRFA WTFLIRHKND
     VGAALEQLRK DRHVVNALRG ATLQTDSDAV FKSQDFTDLC LAFDIKQRFS PPHSQAKNGS
     AERTFRTLFE TARTMLFAAD LDKPFWGFAV QHATLLHNIA PRRSLKDKSP FEALTGHAFD
     VSLLRVFGSP AYVHVEKSST RHKLDPRSRV GVYVGFAEED QAHVVWMPDT RRVVHTIHAR
     FGAIKNKDVA SSQQQQEQDN SGASASHANK TAQRSTEDSA TGAPTAPRAG EPQLQPRALG
     GSVWDQLLLS ETLGDKDTPG GSTTTCHVAT SAAGTGEKTS AVGGHLPDIA AGSVATSAEP
     ATVKEALASP DSEEWRQAIL DEFAAMQAND VYDVVPRADL PKGHKLLRSM VIFRRSAMTR
     QMKDLGSPDI CLGIKVQHDR KARTVTMSQE HYLRNVLETF GMADCKPVGT PLCTGYVDKP
     ADKEEPLPDV PFRELLGSLL YAATMTRPDI SAAVSILSRR MQHFGMDHWK AAKHLLRYIK
     GTLSYIIKYA ATDDASNGST ADGVLSAYSD ASFASDVDTR RSRTGFVVFC STGPVSWNSK
     LQATVATASA DAEYMALAAT AQELMFLRQL QEELTGAVLC EPTLVHTDNQ PAQRVAEHAA
     SRMRHILVKY HYIRECVDTK RIVLGYLATE HMIADLLTKI LPRPKTAQFC TMLFDQRPLP
     GCQPRARPHS GRSPRPSTPR QDL
//
DBGET integrated database retrieval system