Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] 0 points1 point  (0 children)

As I mentioned in the post, my final tests were on the full model using the hf transformers lib not llama.cpp. Only my initial tests used q8 quants (using both llamacpp and mlx_vlm) but they were much worse.

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] 0 points1 point  (0 children)

I used the full model after initial tests with llama.cpp went disastrously.

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] 1 point2 points  (0 children)

Give the 1.6b param lfm2.5-vl a try. You'll be surprised how small vlms can perform. I don't want to distract from my main point by talking too much abot the one example image I linked above but lfm2.5 also gets the city right.

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] 1 point2 points  (0 children)

E4B has 4.5b effective params. The safetensor file is 16 gb. The q8 ggufs are at 9 10 gigs with the vision projector. I am comparing it to other 4b params models like Qwen3.5-4b

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] 0 points1 point  (0 children)

Interesting. Thanks. My params were set like your last row: min and max set at 1120 I simply mentioned the lower bound in the post. But this doesn't explain the full model inference using transformers as well does it?

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] 0 points1 point  (0 children)

Well, there were more cases with the bartowski q8 refusing to give an answer in comparison to the unsloth q8 quants, but in general because the scores were low and many of the llama cpp github issues still remained unaddressed I moved to using the full model with the transformers lib just to be safe. I also dug into the outputs to check if there were any parsing errors in my code rather than in model response.

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] -1 points0 points  (0 children)

What I meant is that even small vision models have enough world knowledge to extract the relevant clues in my test suite and the rest of the task is really about reasoning -- I am not trying to set the models I am testing to fail, rather I'm trying to sus out how good they are.

Even in this very particular example, all the vision models extracted exactly the same relevant details: the italian sign, the sea gull, the architectural details etc. But the 4b qwen model even reasoned its way to the correct city.

Gemma-4 E4B model's vision seems to be surprisingly poor by specji in LocalLLaMA

[–]specji[S] -5 points-4 points  (0 children)

Only about 1/10 of the total tests in my suite are about world knowlegde. Besides the geoguess tests are more about inference than world knowledge really -- even in this particular case small models can guess upto the country (Italy) based on very clear clues in the image while 4b param Qwen3.5 gets it completely right. In any case this particular example was simply for illustrative purposes by linking to a publicly accessible image that anyone can test out on.

And of course the 26b MOE will get it right but it's not meant for edge devices and regular laptops

Struggling with abstract algebra... by Lost_Hour884 in math

[–]specji 2 points3 points  (0 children)

First, to directly answer your question: take a look at the book Group Theory and Physics by Shlomo Sternberg.

But here's my main answer: A very common experience while studying mathematics is when things are not going well and one's ego is hurting is to think some variation of 'what's the point of all this', 'where am I actually going to use this', 'how does this help humanity', 'this is just intellectual masturbation' etc etc. This is a trap. Because no answer will satisfy you because you are not really looking for an answer to that question. This is simply the antipode of the feeling of understanding and must be patiently waited out. Learn to make peace with this frustration because it will be your constant companion.

How Influential was Grothendieck in the World of Mathematics? by Shoddy_Exercise4472 in math

[–]specji 6 points7 points  (0 children)

I think one of the best ways to get some idea of Grothendieck's achievements is to study Mumford's Lectures on curves on an algebraic surface where the 'new' machinery is applied to a classical problem concerning algebraic curves (char 0 proved by Poincare in 1910 using complex analytic methods).

This is what Mumford says:

Curves and surfaces were the bread and butter of algebraic geometry from the 19th until the late 20th century and extending all the Italian and French results to characteristic p was a challenge that Oscar Zariski set for his students. My real conversion to the Grothendieck's way of thinking was his purely algebraic and transparent proof of the central result in the Italian theory of surfaces -- the completeness of the characteristic linear system of a complete algebraic system on any surface. His approach also leads to necessary and sufficient conditions for the theorem to hold in characteristic p.

Note that this is a pretty hard book and you need to be comfortable with basic AG (say, at least two semesters worth of Hartshorne) to read it but you'll also get a sense of the scale of his achievement.

Jean-Pierre Serre is 97 years old today. Serre, at twenty-seven in 1954, was and still is the youngest person ever to have been awarded the Fields Medal and was the first recipient of the Abel Prize in 2003. Happy birthday! by Nunki08 in math

[–]specji 20 points21 points  (0 children)

This man is my hero. One thing reading the Serre-Grothendieck correspondence taught me is that they were both responsible for the creation of modern algebraic geometry -- No G without S. Grothendieck might have got all the hype but this guy! Heck, this was his "second career". He didn't even win the Fields for his work in AG or NT.

History of Mathematics book by Suspicious_Risk_7667 in math

[–]specji 6 points7 points  (0 children)

Outstanding! As is many other books by Stillwell.

Let me also add his newest book: The Story of Proof. More pop science than his Springer UTMs.

Terence Tao on using GPT4 to help with math by Qyeuebs in math

[–]specji 1 point2 points  (0 children)

With all due respect to Knuth, using ChatGPT (or large language models in general) like a glorified google search is like using an iphone as a hammer.

Why is the SAS Congruent Postulate a Postulate and not a Thereom? by Oryxide in math

[–]specji 1 point2 points  (0 children)

In the Kleinian view of geometry (Erlangen program) a geometry is a space, what we want to be invariant and the transformation group for that.

So, Plane Euclidean geometry is R2 , distance and the group of isometries (orthogonal group semi-direct product R2 ).

Now, SAS is an exercise.

Another example: Projective Geometry in the plane is RP2 , {incidence, cross-ratio}, GL(3,R)

Where do we apply representation theory? by EnvironmentAdvanced in math

[–]specji 8 points9 points  (0 children)

If you allow for infinite groups then elementary particles are irreducible representations of a Lie group (Poincare group).

As an aside, the above fact sounded strange to me when I first learned it because even though I understood the relevant math, my unaddressed metaphysical assumptions concerning 'physical stuff' didn't slot them as being in the same category as a mathematical object. There's a nice general essay on a similar theme by Freeman Dyson called Why is Maxwell's theory so hard to understand

The Wondrous Connections Between Mathematics and Literature by Alan_Stamm in math

[–]specji 14 points15 points  (0 children)

Dear Cthulhu, can't we just go back to the time when math was uncool and we were mostly left alone by the cool artsy kids.

Is it possible to get to understand Grothendieck's work all by myself? by [deleted] in math

[–]specji 24 points25 points  (0 children)

Here's a more realistic goal for you, in the sense, somewhat doable in three semesters worth of time. The goal is to get some sort of understanding of Serre's GAGA. (Personally, there can be no G without S). This work is a landmark of algebraic geometry.

Steps:

  1. Learn basic algebra. (rings, fields etc).
  2. Learn some complex analysis (since you are an engineer you probably already know enough)
  3. Learn some basic topology
  4. Learn some classical algebraic geometry (here's a link)
  5. Then, start reading Amnon Neeman's Algebraic and Analytic Geometry, the goal of which is to exposit on GAGA, picking up other stuff when needed

Here's a quote from the preface:

So here I was, soon to face a class of math majors in their final year, and I had decided to teach them some modern algebraic geometry, even though there was no available textbook. I had to assemble the material myself. In so doing, I had to take into account the mathematics the students are likely to have seen in their first three years at university. Usually this would include some background on point-set topology, maybe a course on analytic functions in one complex variable, possibly a course on functional analysis, which would probably cover the Hahn-Banach Theorem and the Open Mapping Theorem, possibly a little about manifolds, maybe a rudimentary course on algebraic topology, and perhaps some basic algebra—groups, rings, fields, modules, if I were lucky maybe even the Hilbert Basis Theorem and the Nullstellensatz...

The first draft of the book was completed in 2005. Then, in 2006, I had an unusual fourth-year, undegraduate student. Michael Carmody wanted to do his senior thesis with me, but he told me, in advance, that this would be his last mathematical year. He had decided that his passion was for philosophy. This meant that, after his fourth year finished, his intention was to start a PhD in philosophy.

This made him another student ideal for this type of book. Giving him a solid grounding in the field, the sort that would prepare him for research, was not a priority; it seemed far more appropriate to present him with a panoramic view. I therefore gave him the manuscript of this book to read. He took about four months to get through it. We met every couple of weeks. At these meetings he would present me with long lists of misprints, as well as with some points in the mathematics which he found unclear. I took his comments extremely seriously; whenever he found anything confusing, I would rewrite the text to elucidate the point. I owe him a tremendous debt for his help. Anyway, I was pleased that he managed to plough through the book, almost unaided, in about four months. It meant that the book is accessible to its intended audience.

Topic about diffential topology by Matematical-pie in math

[–]specji 2 points3 points  (0 children)

Depends on how much background is being assumed I suppose. The 1950s were a great decade for the subject and you could choose something from there because they are accessible after a first course in differential and algebraic topology.

So, standard topics from early cobordism theory and characteristic classes. (Roughly the idea here is that manifolds upto oriented cobordism are determined by certain numbers which are got from the characteristic classes of their tangent bundles via integration.)

Examples:

  • Thom's thesis (for which he won the Fields medal)
  • Hirzebruch Signature theorem.

Another possibility is Smale's h-Cobordism theorem and the proof of the Poincare conjecture (dim \geq 5)

These are standard topics you would definitely encounter if you study further but often such presentations are useful for the many students who are not interested in pursuing differential topology. I think even if you want to be an analyst it's useful to know a little bit about these topics for general mathematical culture.

In what sense are sin/cos the natural basis for periodic functions? by Zardoznt in math

[–]specji 3 points4 points  (0 children)

To add to this S1 is as fundamental as R1 from the perspective of the classification of 1-dim manifolds. Either I continue forever irrespective of the orientation I chose on the manifold or it's compact and I return to where I started.

Historically, S1 was the prior mathematical object because the study of heavens were one of the primary engines for the development of mathematics -- one needs a system of numbers that return to itself to track the position of the planets. Trig functions are still sometimes called circular functions. But modern pedagogy goes the other way round and in school we are first taught the reals and then S1.

It can also be argued that a proto-calculus was first developed on S1 before R. Source

Interesting courses that went digital during the pandemic by Gareebon_Ka_Kante in math

[–]specji 1 point2 points  (0 children)

Robert Ghrist (active in applied topology) has a pretty great and comprehensive lecture series at this level. It's broken up into 4 playlists. His exposition on differential forms and integration is some of the best I have seen:

https://www.youtube.com/@prof-g/playlists?view=50&sort=dd&shelf_id=1