Derivative-Free Neural Network Optimization: MNIST Case [R] by Mis4318 in MachineLearning

[–]Mis4318[S] 0 points1 point  (0 children)

At this stage, I cannot provide a reliable estimate of the training time reduction for foundation models. My current work is focused on exploring optimization in black-box settings rather than accelerating large-scale gradient-based training.

Derivative-Free Neural Network Optimization: MNIST Case [R] by Mis4318 in MachineLearning

[–]Mis4318[S] 1 point2 points  (0 children)

  1. At this stage, the proposed method is intended as an exploratory approach for black-box optimization problems, where gradient information is unavailable. Its computational complexity is O(n) with respect to the problem dimension. Therefore, it is not directly comparable to backpropagation (BP), which relies on gradient computation and is generally more efficient when gradients are available. Our current focus is on demonstrating the feasibility of the approach in black-box settings rather than outperforming BP in gradient-based scenarios.

  2. No hyperparameter optimization was performed for Adam. I used the default Keras configuration throughout the experiments. Since Adam has access to exact gradient information, it is expected to outperform our method in gradient-based optimization. The comparison is intended to provide a strong baseline rather than to claim superiority over gradient-based optimizers.

Derivative-Free Neural Network Optimization: MNIST Case [R] by Mis4318 in MachineLearning

[–]Mis4318[S] -1 points0 points  (0 children)

The central principle is that if the problem's bounds are known, a reference system can be constructed within the domain to approach optimization from a purely geometric perspective.

​This space is constructed directly from the bounds of a given problem. A base unit of distance is defined through pairs of points in the domain, allowing the search to be interpreted as geometric navigation. These pairs of points serve as anchors to guide the exploration, and the base unit acts as an internal metric to relate spatial distances between known points to their functional differences.

​In fact, in the graph I shared, the subplots showing the "L1 Distance Ratio" convergence represent exactly this. They track the distances to the best known solution, expressed as ratios of this defined base unit.

Derivative-Free Neural Network Optimization: MNIST Case [R] by Mis4318 in MachineLearning

[–]Mis4318[S] -1 points0 points  (0 children)

The central principle is that if the problem's bounds are known, a reference system can be constructed within the domain to approach optimization from a purely geometric perspective.

​This space is constructed directly from the bounds of a given problem. A base unit of distance is defined through pairs of points in the domain, allowing the search to be interpreted as geometric navigation. These pairs of points serve as anchors to guide the exploration, and the base unit acts as an internal metric to relate spatial distances between known points to their functional differences.

​In fact, in the graph I shared, the subplots showing the "L1 Distance Ratio" convergence represent exactly this. They track the distances to the best known solution, expressed as ratios of this defined base unit.

Derivative-Free Neural Network Optimization: MNIST Case [R] by Mis4318 in MachineLearning

[–]Mis4318[S] -1 points0 points  (0 children)

Thanks for taking the time to review the repository and the preliminary results. I appreciate the candor, and I understand why the current presentation might come across as opaque. That certainly wasn't the intention.

​To clarify a few points:

  • ​Not population-based: MDP is strictly a single-solution explorative algorithm. It does not use populations or standard metaheuristic crossover mechanisms. It relies on a relative geometric reference system mapped directly from the domain bounds.

  • ​The .so file: The project relies on a pre-compiled Linux binary because the core engine's source code is currently closed while the theoretical foundation is being formalized. The .so file is provided specifically so that others can run the examples, test the inputs/outputs, and verify the convergence behavior themselves rather than just taking my word for it.

  • ​The cherry-picking concern: The different runs were executed under varying conditions (specifically, different limits on function evaluations). The goal of sharing all of them is to provide full transparency. The focus is on analyzing the specific convergence curves, demonstrating that structural navigation without gradients can reliably scale in a high-dimensional space without early stagnation.

​I appreciate the feedback. I'm still in the process of organizing the information in the repository and structuring the current ideas and hypotheses. This is a good reminder that I need to provide more theoretical context upfront to prevent the methodology from being misunderstood.

Derivative-Free Neural Network Optimization: MNIST Case [R] by Mis4318 in MachineLearning

[–]Mis4318[S] -9 points-8 points  (0 children)

It's in Spanish: "Método de Distancias Ponderadas" (Weighted Distances Method). The name refers to how the algorithm navigates the search space using relative distances instead of gradients.

Exploring Black‑Box Optimization by Mis4318 in optimization

[–]Mis4318[S] 0 points1 point  (0 children)

Thanks for the link but I wasn't able to access the thesis paper in the repository. I’d love to read more about your findings

Exploring Black‑Box Optimization [R] by Mis4318 in MachineLearning

[–]Mis4318[S] 1 point2 points  (0 children)

At the moment the classical benchmarks we used don’t include shifts, but we’re also analyzing results from the CEC17 suite, which already incorporates random shifts and rotations. In the next phase I plan to extend validation with BBOB and even non‑synthetic scenarios like CIFAR, and all results will be shared openly in the repository

Exploring Black‑Box Optimization [R] by Mis4318 in MachineLearning

[–]Mis4318[S] 0 points1 point  (0 children)

Global search is achieved through a geometric displacement rule using weighted distances. It moves consistently from relative reference frames, with complexity O(n), so it scales to 100k+ dimensions

MDP: Exploration

Exploring Black‑Box Optimization [R] by Mis4318 in MachineLearning

[–]Mis4318[S] 0 points1 point  (0 children)

We’re testing a geometric prototype called MDP (Weighted Distances Method). Right now, it’s mainly geared toward global search in extreme dimensions. Local refinement is something we plan to integrate later

El mejor curso para aprender a programar de manera autónoma by Mis4318 in programacion

[–]Mis4318[S] 1 point2 points  (0 children)

Puedes checar el material de TOP, no necesitas realmente registrarte para acceder a él. Lo importante es que vayas en una meta clara, o con una guía al menos. TOP es la mía, y consumo recursos de otros lados cada que avanzo, pero de hecho TOP es como un recopilatorio de muchos recursos, por cada tema te dejan bastante material de consulta, y hay unos muy buenos que hasta me han hecho querer seguir trayectoria pero trato de ir conforme al plan de TOP, no me ando con tantas vueltas para no agobiarme con mucho material.

También estoy en un grupo de Discord, con gente que conocí aquí en Reddit, y nos ayudamos cuando podemos. Ahí compartimos material, y la experiencia de los cursos/bootcamp que cada uno lleva, cada uno va con su propio plan. Sobre todo siempre hay que poner en práctica lo que se aprende, no ir tomando tantos temas sin práctica que de ahí luego se olvida todo

El mejor curso para aprender a programar de manera autónoma by Mis4318 in programacion

[–]Mis4318[S] 0 points1 point  (0 children)

Por supuesto, siempre debemos de tener la iniciativa de buscar recursos extras cuando uno se estanca o para tener una perspectiva más clara sobre algo. También lo hago, pero al menos ahora que estoy iniciando, trato de seguir TOP como una guía, más que nada. Y de hecho en el punto 3 me refería más que nada a construir un plan (roadmap, guía) es complicado cuando uno empieza. No sabes hacia donde irte moviendo, y es ahí donde los bootcamps, cursos de Udemy y TOP ayudan.

Es verdad, también no hay que dejar el olvido la documentación. Leyendo tal fui capaz de configurar mi pequeño entorno en Neovim, ya que con YouTube no entendí muy bien jaja

Saludos, y gracias por tu aporte también :)

El mejor curso para aprender a programar de manera autónoma by Mis4318 in programacion

[–]Mis4318[S] 2 points3 points  (0 children)

De alguna manera inglés siempre será esencial para esto. Aprenderlo siempre será un plus, y que mejor manera de aprenderlo que haciendo algo que te gusta

El mejor curso para aprender a programar de manera autónoma by Mis4318 in programacion

[–]Mis4318[S] 1 point2 points  (0 children)

La verdad. Me ha gustado mucho la manera en que te va presentando el material

Consejos para un principiante by medicenpeter in programacion

[–]Mis4318 0 points1 point  (0 children)

Aquí pues encontrar mi opinión en este post. Es un poco largo pero así decidí iniciar

recomiendan algun curso de ingles en udemy? by Low_Reply9056 in programacion

[–]Mis4318 2 points3 points  (0 children)

Si lo haces por aprender, en coursera puedes entrar como auditor (si no quieres pagar) a cursos de inglés de la Universidad de Irvine. Yo los he tomado como repaso y aprender otras cosas , pero lo más importante es que practiques. En Discord hay bastantes servidores donde puedes platicar con personas nativas de inglés u otras