ID A0A498N2V8_LABRO Unreviewed; 936 AA.
AC A0A498N2V8;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE SubName: Full=Asteroid-like protein {ECO:0000313|EMBL:RXN26503.1};
GN ORFNames=ROHU_036463 {ECO:0000313|EMBL:RXN26503.1};
OS Labeo rohita (Indian major carp) (Cyprinus rohita).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC Cyprinidae; Labeoninae; Labeonini; Labeo.
OX NCBI_TaxID=84645 {ECO:0000313|EMBL:RXN26503.1, ECO:0000313|Proteomes:UP000290572};
RN [1] {ECO:0000313|EMBL:RXN26503.1, ECO:0000313|Proteomes:UP000290572}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DASCIFA01 {ECO:0000313|EMBL:RXN26503.1};
RC TISSUE=Testis {ECO:0000313|EMBL:RXN26503.1};
RA Das P., Kushwaha B., Joshi C.G., Kumar D., Nagpure N.S., Sahoo L.,
RA Das S.P., Bit A., Patnaik S., Meher P.K., Jayasankar P., Koringa P.G.,
RA Patel N.V., Hinsu A.T., Kumar R., Pandey M., Agarwal S., Srivastava S.,
RA Singh M., Iquebal M.A., Jaiswal S., Angadi U.B., Kumar N., Raza M.,
RA Shah T.M., Rai A., Jena J.K.;
RT "Draft genome sequence of Rohu Carp (Labeo rohita).";
RL Submitted (MAR-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the asteroid family.
CC {ECO:0000256|ARBA:ARBA00007398}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RXN26503.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QBIY01012241; RXN26503.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A498N2V8; -.
DR STRING; 84645.A0A498N2V8; -.
DR Proteomes; UP000290572; Unassembled WGS sequence.
DR GO; GO:0004518; F:nuclease activity; IEA:InterPro.
DR CDD; cd18676; PIN_asteroid-like; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR026832; Asteroid.
DR InterPro; IPR039436; Asteroid_dom.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR15665; ASTEROID PROTEIN; 1.
DR PANTHER; PTHR15665:SF1; PROTEIN ASTEROID HOMOLOG 1; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF12813; XPG_I_2; 1.
DR Pfam; PF00752; XPG_N; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000290572}.
FT DOMAIN 1..97
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00752"
FT DOMAIN 128..180
FT /note="Asteroid"
FT /evidence="ECO:0000259|Pfam:PF12813"
FT REGION 393..420
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 552..577
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 658..750
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 394..417
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 731..747
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 936 AA; 102093 MW; 821E8A95B5C12106 CRC64;
MGVHGLTSFV EGNRQFFTDM RLRDCRLVID GCSLYYRLYF NSGLDQARGG DYDTFAVLIR
QFFAALTECA VQPFVVLDGG MDQTDKKFKT LQERAQSKIR EANTLSRGFH GFVLPLLVCE
VFKQVLSELG VPFVQCISEA DFEIASLAKH WGCPVLTNDS DFYIFDLKGG YLPFSFFQWN
NVCGKATERY VPACHFSVNR FCSHFNHINK QLLPLFAVVL GNDYTPGKIT EIFFSRVELE
RVPSGRKSGR SGSPRIEGFL LWLSQFTNPV VALEEVLEIL GEQRKGNLRA QISAGMRDYQ
LPQSSSLAQY FSSPQPALPD AQGLPAALVS QPEWLLRMFA SGKLPSLVLD VLVHQKVLLL
AQVENSNLPS SHTTSLSIRK TIYSLLLEKA RHDSQTPQAV TQRGRGRGRQ SQGKGGQQCD
VPCVDEYDRQ NLILKKNTVE AQRPKSVPQL ELAAIDKDVA VLRLTDALFI YLFIYLTEWF
LSRILSDRDP NDVLKRFDPC CLLSPPPPPL FPPPPGVWKR EPTMAIRVGG TRVREEDEKQ
VKELCCVAGP PGPPGPVGPQ GPSGIPGMEG PKGDKGDIGR PGSKAGCILQ SSTLFMTRTL
GACNSFTGCQ AMCCPLAVFI GCSPLQVVLG VRVCLGNQVL QGYPAQRDLR WGEKGDPGLM
GMPGLRGPPG PKGLSGYKGE KGANGLPGML GQKGEMGPKG ELGVPGKRGP TGRPGKRGKQ
GSDGERGFPG PVGPIGPPGP RGHPGPPGIP ASGLFVVGEK GEKGLPGPPG VCDCSFPSLS
PASAPLQHRN KYDKVPAIFV VSSEEELKGL HEDNALAFRK DQRSLYFKDK NGWEPIQLMP
LQATERMRDG NGFCGDGKVQ ELNGEECDDG NKVVTDDCVG CKKAYCGDGY RNEGVEECDG
KDFGYQTCKS YLPGTFGQLR CTDSCFIDST GCKYRT
//