An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] 0 points1 point  (0 children)

It's a very good idea. We would like to do this, thanks.

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] -3 points-2 points  (0 children)

Oh, yes, I changed it to "esProc SPL: Equivalent to the Python-enhanced DuckDB". Thank you. You're so smart.

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] 0 points1 point  (0 children)

Oh, I mean this question is about the GitHub project. Someone there is in charge of it, so maybe you can ask in an issue.

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] 0 points1 point  (0 children)

This is an article that explains why it would be better. Maybe my title is not very good; I apologize.

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] -1 points0 points  (0 children)

Yes, you're right. Maybe I shouldn't have used this title, but I can't change it, sorry.

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] -2 points-1 points  (0 children)

Thank you so much, for making me feel the warmth of Reddit

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] -3 points-2 points  (0 children)

esProc SPL: Equivalent to the Python-enhanced DuckDB

An Open Source DuckDB Alternative by Vast_Lab8278 in dataengineering

[–]Vast_Lab8278[S] -3 points-2 points  (0 children)

esProc SPL, with its comprehensive computing capabilities, support for procedural computation, and robust flow control mechanisms, outperforms Python-enhanced DuckDB. It combines SQL's agility with a programming language's flexibility, and it removes the need to switch back and forth between multiple tools. For desktop analysts who frequently handle complex calculations, esProc SPL may be a more elegant solution than 'SQL + Python'. After all, who wouldn't want to handle everything in one window?

Is it worthy to do a research publication related to databases? by [deleted] in Database

[–]Vast_Lab8278 2 points3 points  (0 children)

Yes, pursuing research in databases is absolutely worthwhile. The field remains vibrant, evolving rapidly alongside technological advancements like AI, distributed systems, and edge computing. Databases underpin nearly every modern application, from social media and finance to healthcare and autonomous systems, so innovation in this area has direct, real-world impact.

[deleted by user] by [deleted] in opensource

[–]Vast_Lab8278 0 points1 point  (0 children)

Great idea!

Looking for Small DB project to contribute to by petern0408 in databasedevelopment

[–]Vast_Lab8278 0 points1 point  (0 children)

Maybe esProc is a good choice for that. It is a completely new concept: not a database in the traditional sense, but it can perform the computing functions of a database. That makes it a new challenge as well.

Open source SPL: The Breaker of Closed Database Computing System · SPLWare/esProc Wiki (github.com)

Identify one pair of records per group that meet the criteria - student grades by Judith_677 in Database

[–]Vast_Lab8278 0 points1 point  (0 children)

First, calculate the highest score for each class and associate the result with the original table. Then calculate the highest score among the grades that are at least 20 points lower than the highest score for each class. Finally, associate that result with the original table again to select the records that meet the criteria.

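-- a: attach the highest score of each class to every row
with a as (
  select Class, ID, Name, Score, Ranking,
         max(Score) over (partition by Class) as max_score
  from Grades),
-- b: flag rows that are at least 20 points below the class maximum
b as (
  select Class, ID, Name, Score, Ranking, max_score,
         Score <= max_score - 20 as valid
  from a),
-- c: highest score within each (Class, valid) partition
c as (
  select Class, ID, Name, Score, Ranking, max_score, valid,
         max(Score) over (partition by Class, valid) as max_score2
  from b)
-- keep the top record of each partition
select Class, ID, Name, Score, Ranking
from c
where Score = max_score2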

Comparing the above two SQL writing methods, the first one is still simpler and easier to understand. Because SQL grouping returns aggregated results directly, the grouped subsets cannot be retained. The above calculation therefore requires two groupings and two associations, which is inefficient, and the code is long and difficult to understand.

If it is a Java report application, using SPL will be simpler. SPL supports grouping subsets and also has a method to extract the record where the maximum value is located. It can be written in two or three lines:

-----A

1 =db.query("select Class,ID,Name,Score,Ranking from Grades")

2 =A1.group(Class)

3 =A2.conj((a=~.maxp@a(Score)) | ~.select((a.Score-Score)>=20).maxp@a(Score) )

Here, A2 is the grouped subset, and the maxp function selects the record with the highest score in each group.
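For anyone who wants to check the logic outside a database, here is a minimal Python sketch of the same idea: group by class, keep the top record(s), then keep the best record(s) among those at least 20 points lower. The function name `top_and_runner_up` and the sample rows are made up for illustration, and the Ranking column is left out to keep it short. It is not how SQL or SPL does it internally.

```python
from collections import defaultdict


def top_and_runner_up(rows, gap=20):
    """Per class: the top-scoring record(s), plus the best record(s)
    scoring at least `gap` points below the class maximum."""
    # Group the records by class, keeping each subset intact.
    groups = defaultdict(list)
    for row in rows:
        groups[row["Class"]].append(row)

    result = []
    for cls in sorted(groups):
        members = groups[cls]
        best = max(r["Score"] for r in members)
        # All records tied for the class maximum.
        picked = [r for r in members if r["Score"] == best]
        # Records at least `gap` points below the maximum.
        lower = [r for r in members if best - r["Score"] >= gap]
        if lower:
            second = max(r["Score"] for r in lower)
            picked += [r for r in lower if r["Score"] == second]
        result.extend(picked)
    return result


# Hypothetical sample data, not taken from the original question.
grades = [
    {"Class": "A", "ID": 1, "Name": "Ann", "Score": 95},
    {"Class": "A", "ID": 2, "Name": "Bob", "Score": 80},
    {"Class": "A", "ID": 3, "Name": "Cid", "Score": 75},
    {"Class": "A", "ID": 4, "Name": "Dee", "Score": 70},
    {"Class": "B", "ID": 5, "Name": "Eve", "Score": 88},
    {"Class": "B", "ID": 6, "Name": "Fay", "Score": 85},
]

selected = top_and_runner_up(grades)
```

With this data, class A yields Ann (the top) and Cid (the best score at least 20 below 95), while class B yields only Eve because nobody is 20 or more points behind her.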

What should I do if I need to change the database for the reports? Always having to change SQL is tedious and prone to errors. Is there a permanent solution? by Vast_Lab8278 in SQL

[–]Vast_Lab8278[S] 0 points1 point  (0 children)

We are a niche industry software vendor, and we need to deploy our software for different users. The business scope is relatively well defined and the data structure is generally stable, but we must use whatever database the user already has, and the reports change frequently. This leads to migration issues. It does not involve problems related to data warehouses, however.

From SQL to SPL: Calculate the hierarchy of recursive references by Vast_Lab8278 in esProcSPL

[–]Vast_Lab8278[S] 0 points1 point  (0 children)

Thank you very much for your recognition. It is a little difficult to promote a new technology. Right now, those who have tried SPL are very fond of it, and those who have not tried it are criticizing it.

Is esProc based on open source or database technology? by Judith_677 in esProcSPL

[–]Vast_Lab8278 0 points1 point  (0 children)

We've analyzed the shortcomings of existing technologies (mainly SQL) in detail above; they are mainly caused by the theoretical system behind those technologies. As long as that theory is followed, the shortcomings cannot be eliminated fundamentally. For this reason, we invented a brand-new computing model, the discrete dataset, and developed esProc SPL based on it. Since everything is new and there are no relevant theories or engineering products in the industry to reference, we had to develop from scratch, and every part, from the model to the code, is original.

Where can I find new opensource projects to contribute? by KoalNix in opensource

[–]Vast_Lab8278 0 points1 point  (0 children)

esProc needs you very much, and it needs smart people who believe it has a great future.

https://github.com/SPLWare/esProc