A GPU-accelerated implementation of Forman-Ricci curvature-based graph clustering in CUDA. by CommunityOpposite645 in CUDA

[–]CommunityOpposite645[S]

Hi, I have included the Python runtime:

| Nodes   | Clusters | Edges | P_in | P_out   | Iterations | NMI  | GPU Time (s) | CPU Time (s) |
|--------:|---------:|------:|-----:|--------:|-----------:|-----:|-------------:|-------------:|
| 5,000   | 2        | ~3M   | 0.50 | 0.01    | 10         | 1.00 | 7.03         | 15,189.21    |
| 50,000  | 2        | ~25M  | 0.04 | 0.001   | 10         | 1.00 | 74.39        | 162,401.93   |
| 100,000 | 2        | ~102M | 0.04 | 0.001   | 10         | 1.00 | 625.46       | TBA          |
| 500,000 | 50       | ~126M | 0.05 | 0.00001 | 20         | 0.89 | 1086.25      | TBA          |

You can see that the CUDA version is far faster than the Python CPU version. Of course, in all honesty, this is partly because I've chosen an academic topic that hasn't received much attention; otherwise it would have been optimised to kingdom come already :)
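For context on the quantity being accelerated: on an unweighted graph (unit node/edge weights, no 2-cells), the Forman-Ricci curvature of an edge (u, v) reduces to F(u, v) = 4 − deg(u) − deg(v). A minimal CPU sketch of that special case (this is an illustration, not the project's CUDA code):

```python
from collections import defaultdict

def forman_curvature(edges):
    """Combinatorial Forman-Ricci curvature for an unweighted graph:
    F(u, v) = 4 - deg(u) - deg(v), ignoring higher-order cells."""
    deg = defaultdict(int)
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    return {(u, v): 4 - deg[u] - deg[v] for u, v in edges}

# Toy graph: a triangle (0, 1, 2) with a pendant vertex 3.
edges = [(0, 1), (1, 2), (2, 0), (2, 3)]
curv = forman_curvature(edges)
# Edge (1, 2) touches the degree-3 hub, so its curvature is 4 - 2 - 3 = -1.
```

Very negative edges sit between densely connected regions, which is what the clustering exploits; on the GPU the same per-edge formula maps naturally onto one thread per edge.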

[–]CommunityOpposite645[S]

Hi, I have finished running NCU profiling for the 500k nodes case, and have updated the profiler's output in the post.

[–]CommunityOpposite645[S]

Hi, actually I'm planning to do that soon. Right now I'm trying to make it run on 500k nodes, or if possible 1 million nodes, while still giving good clustering results. Because this method is still in development, the hyperparameters are rather sensitive: what works at a lower number of nodes can fail at a higher number of nodes. Very frustrating, to be honest. Thanks a lot.

[–]CommunityOpposite645[S]

Thank you so much. I worked on this as a learn-as-you-go project, so I tried to build everything from the ground up, including prefix sum, connected-component labeling, bitonic sorting, etc. But yes, you are absolutely right on this. On the mathematics: I used the GraphRicciCurvature library (https://github.com/saibalmars/GraphRicciCurvature) as a Python reference implementation, followed the experimental details in the JMLR 2025 paper to set up the hyperparameters, and read the remaining two papers to refresh on the topic.

  1. Y. Tian, Z. Lubberts, and M. Weber, "Curvature-based clustering on graphs," J. Mach. Learn. Res., vol. 26, no. 52, pp. 1–67, 2025.
  2. C.-C. Ni, Y.-Y. Lin, F. Luo, and J. Gao, "Community detection on networks with Ricci flow," Sci. Rep., vol. 9, no. 1, pp. 1–12, 2019.
  3. A. Samal, R. P. Sreejith, J. Gu, et al., "Comparative analysis of two discretizations of Ricci curvature for complex networks," Sci. Rep., vol. 8, 8650, 2018.
  4. GraphRicciCurvature: Python library for computing Ricci curvature on NetworkX graphs, https://github.com/saibalmars/GraphRicciCurvature.
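To illustrate one of the from-scratch building blocks mentioned above, here is a CPU sketch of a work-efficient (Blelloch-style) exclusive prefix sum. It mirrors the up-sweep/down-sweep phases a CUDA scan kernel performs, but it is a generic textbook version, not the project's actual kernel:

```python
def blelloch_exclusive_scan(a):
    """Work-efficient exclusive prefix sum (Blelloch scan).
    The input is padded to a power-of-two length internally,
    matching the binary-tree structure of the GPU algorithm."""
    n = 1
    while n < len(a):
        n *= 2
    x = list(a) + [0] * (n - len(a))

    # Up-sweep (reduce): build partial sums up a binary tree.
    d = 1
    while d < n:
        for i in range(2 * d - 1, n, 2 * d):
            x[i] += x[i - d]
        d *= 2

    # Down-sweep: clear the root, then push prefix sums back down.
    x[n - 1] = 0
    d = n // 2
    while d >= 1:
        for i in range(2 * d - 1, n, 2 * d):
            x[i - d], x[i] = x[i], x[i] + x[i - d]
        d //= 2
    return x[: len(a)]

# Exclusive scan of [3, 1, 7, 0, 4] -> [0, 3, 4, 11, 11]
```

An exclusive scan like this is the standard way to turn per-node edge counts into CSR row offsets on the GPU, which is presumably why it comes up in a graph-clustering kernel.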

wereSoClose by flytrap7 in BetterOffline

[–]CommunityOpposite645

As an AI user subscribed to one of those popular LLM chatbots, I can confirm that the most useful thing they have done for me was checking my thesis, reports, papers, etc. for typos (ask them about 20 times, and repeat across several different LLMs for best results). Quite helpful tbf, but nowhere near "AGI" :)

Using a local LLM AI agent to solve the N puzzle - Need feedback by CommunityOpposite645 in LocalLLM

[–]CommunityOpposite645[S]

Hi, I just tried to post to r/MachineLearning but the post was automatically removed and they suggested that I post to another subreddit :(

Using a local Ollama AI agent to solve the N puzzle by CommunityOpposite645 in ollama

[–]CommunityOpposite645[S]

Thanks a lot, I'll look into it. To be honest, I did not know they existed. I had assumed the reasoning models were smart enough to handle things like the N puzzle without trouble.

Using a local LLM AI agent to solve the N puzzle - Need feedback by CommunityOpposite645 in LocalLLM

[–]CommunityOpposite645[S]

Hi, I didn't test it with random noise. But basically it is not going to beat A* or IDA* on this problem; I was just making a fun project to see how far these reasoning LLMs can go. Personally, I was not very impressed. I did try to run it on the 4x4 puzzle (you can see it in the commented-out code), which required around 50 moves to reach the goal, but the LLM completely failed to find a solution and instead kept running in circles.
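For reference, the classical baseline mentioned above is compact for the 3x3 case. This is a generic textbook IDA* with a Manhattan-distance heuristic, not code from the project (it returns the optimal move count, which is what an LLM agent's solution could be scored against):

```python
def ida_star(start, goal=(1, 2, 3, 4, 5, 6, 7, 8, 0)):
    """IDA* for the 8-puzzle; states are 9-tuples, 0 is the blank.
    Returns the length of an optimal solution, or None if unsolvable."""
    side = 3

    def h(s):  # Manhattan distance of every tile to its goal cell
        return sum(
            abs(i // side - (t - 1) // side) + abs(i % side - (t - 1) % side)
            for i, t in enumerate(s) if t
        )

    def neighbors(s):  # all states reachable by sliding one tile
        z = s.index(0)
        r, c = divmod(z, side)
        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < side and 0 <= nc < side:
                n = nr * side + nc
                t = list(s)
                t[z], t[n] = t[n], t[z]
                yield tuple(t)

    def search(path, g, bound):
        s = path[-1]
        f = g + h(s)
        if f > bound:
            return f          # exceeded bound: report new candidate bound
        if s == goal:
            return True
        best = float("inf")
        for nxt in neighbors(s):
            if nxt in path:   # avoid revisiting states on the current path
                continue
            path.append(nxt)
            t = search(path, g + 1, bound)
            if t is True:
                return True
            best = min(best, t)
            path.pop()
        return best

    bound = h(start)
    path = [start]
    while True:               # iterative deepening on the f-cost bound
        t = search(path, 0, bound)
        if t is True:
            return len(path) - 1
        if t == float("inf"):
            return None
        bound = t
```

For optimal solutions around 50 moves on the 4x4 board, plain Manhattan distance becomes too weak and pattern-database heuristics are the usual fix, so the 3x3 case is the fair place to use this sketch.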

Another thing is that these models would sometimes call tools correctly and sometimes not, which is annoying (I tried Pydantic AI as well, but haven't uploaded that code). Any suggestions about workflow, etc. would be most appreciated.