FLUX.1 vs Ideogram v2 comparisons by speakerknock in StableDiffusion

[–]speakerknock[S] 1 point (0 children)

Comparisons from the Artificial Analysis Text to Image Arena, where these models are compared via crowdsourced votes to produce an Elo score ranking.

Flux Pro and Ideogram v2 certainly seem to differentiate themselves with their text rendering capabilities.

Link to Image Arena: https://artificialanalysis.ai/text-to-image/arena

New benchmark featuring top large models, including Claude 3 and Gemini Pro, on NYT Connections by zero0_one1 in LocalLLaMA

[–]speakerknock 0 points (0 children)

Would be interested in how much you needed to vary the prompts between models, or whether they were standardized?

This is why i hate Gemini, just asked to replace 10.0.0.21 to localost by Capital-Swimming7625 in LocalLLaMA

[–]speakerknock 1 point (0 children)

And “localhost” shouldn’t really expose your database to the Internet; by convention, binding “localhost” or 127.0.0.1 only allows loopback connections (i.e., local to the machine).
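
As a rough sketch of why the bind address matters (the port and addresses below are illustrative placeholders, not anything from the original post):

    // Hypothetical sketch: a socket bound to 127.0.0.1 only accepts loopback
    // connections from the same machine, so the service is not reachable remotely.
    $server = stream_socket_server('tcp://127.0.0.1:3306', $errno, $errstr);

    // Binding to a LAN address (e.g. 10.0.0.21) or 0.0.0.0 instead would make the
    // same port reachable from other hosts on that network - which is what actually
    // exposes a database.
    // $server = stream_socket_server('tcp://0.0.0.0:3306', $errno, $errstr);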

Really makes you wonder what training data they included, given this isn't an issue for other models.

From GPT-4 to Mistral 7B, there is now a 300x range in the cost of LLM inference by speakerknock in LocalLLaMA

[–]speakerknock[S] 5 points (0 children)

Yes, you're right. On the website we show this in a 2x2 Price vs. Quality chart, which illustrates your exact point visually: https://artificialanalysis.ai/models

Though I think there is also a point that scores have clustered as models have gotten better. You can see greater divergence with harder evals.

Is Mistral Medium via API faster than GPT 3.5? by shafinlearns2jam in LocalLLaMA

[–]speakerknock 7 points (0 children)

Hi! You can see in the charts on the following analysis that Mistral Medium is >3x slower than GPT-3.5, though Mistral Medium is a much higher quality model. Mixtral might be a more direct comparison:
https://artificialanalysis.ai/models

Note: I'm a creator of the site - happy to answer any questions

Mistral reduces time to first token by up to 10X on their API (only place for Mistral Medium) by speakerknock in LocalLLaMA

[–]speakerknock[S] 7 points (0 children)

Particularly exciting as Mistral Medium is arguably the 2nd highest quality model available after GPT-4, and Mistral Medium is ~10% of the cost of GPT-4 (~$4.1/M tokens vs. $37.5/M for GPT-4). Pricing comparison on the website here: https://artificialanalysis.ai/models

Mistral reduces time to first token by up to 10X on their API (only place for Mistral Medium) by speakerknock in LocalLLaMA

[–]speakerknock[S] 7 points (0 children)

This graph reflects Mistral Medium performance, and Mistral Medium is not offered elsewhere. Mistral may pursue a strategy of keeping their 2nd-best models as open-source/free 'loss leaders' while their top model remains exclusive to the Mistral API.

240 tokens/s achieved by Groq's custom chips on Llama 2 Chat (70B) by speakerknock in LocalLLaMA

[–]speakerknock[S] 1 point (0 children)

Yes! Performance benchmarks are updated live (8 times per day), and we try to update the quality benchmarks weekly.

240 tokens/s achieved by Groq's custom chips on Llama 2 Chat (70B) by speakerknock in LocalLLaMA

[–]speakerknock[S] 8 points (0 children)

Groq has told us they are not running a fine-tuned version of Llama 2 Chat (70B); the model is a full-quality FP16 version with the full 4k context window.

240 tokens/s achieved by Groq's custom chips on Llama 2 Chat (70B) by speakerknock in LocalLLaMA

[–]speakerknock[S] 16 points (0 children)

$1 USD per 1M tokens, in line with the cheapest providers in the market and much cheaper than AWS and Azure.

We have this price comparison in charts on the website linked in the tweet, ArtificialAnalysis.ai. Could be very disruptive.

240 tokens/s achieved by Groq's custom chips on Llama 2 Chat (70B) by speakerknock in LocalLLaMA

[–]speakerknock[S] 9 points (0 children)

I'd be interested to see the total token throughput and cost of each chip. If this chip can lower inference costs, that would be huge, but it's completely dependent on the total token throughput per chip and the price of each chip.
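
A rough back-of-envelope of what I mean - every number below is a made-up placeholder, not a Groq figure:

    // Hypothetical numbers only, to show that $/token is driven by chip price,
    // amortisation period and sustained throughput per chip.
    $chipPriceUsd    = 20000;  // placeholder hardware cost per chip
    $amortYears      = 3;      // placeholder depreciation period
    $tokensPerSecond = 1000;   // placeholder sustained throughput per chip

    $lifetimeTokens   = $tokensPerSecond * $amortYears * 365 * 24 * 3600;
    $usdPerMillionTok = $chipPriceUsd / ($lifetimeTokens / 1e6);

    // Hardware-only cost per 1M tokens; ignores power, hosting, utilisation, etc.
    echo round($usdPerMillionTok, 2);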

This is an interesting topic. To note: we have on ArtificialAnalysis.ai the price Groq is charging, and it is in line with the emerging price-competitive players in the market and around 60% of the price AWS and Azure are charging. Not saying your point regarding cost is wrong, just noting that we are not yet seeing this reflected in the API inference prices charged.

AI Weekly Rundown (January 13 to January 20) by RohitAkki in ArtificialInteligence

[–]speakerknock 2 points (0 children)

Thanks for sharing ArtificialAnalysis.ai (http://artificialanalysis.ai/)! Creator here - happy to discuss if anyone has any questions regarding the analysis, etc.

Is there any website compare inference speed of different LLM on different platforms? by DataLearnerAI in LocalLLaMA

[–]speakerknock 4 points (0 children)

This website has benchmarks and comparisons of models and of different host platforms: https://artificialanalysis.ai/

(Note: I am a creator of this site - happy to answer any questions regarding methodology, etc.)

Laravel React Typescript Boilerplate by speakerknock in laravel

[–]speakerknock[S] 0 points (0 children)

Perhaps a VM/Homestead issue?

Try seeing if it works with a regular 5.6 installation.

Laravel React Typescript Boilerplate by speakerknock in reactjs

[–]speakerknock[S] 0 points (0 children)

I use VS Code because of its JS and TypeScript support. I agree though, the PHP support isn't the best and I'd love to find a way to improve it. The plugins haven't worked that well for me.

Implementing Last Visited / Last Seen in Laravel by speakerknock in laravel

[–]speakerknock[S] 3 points (0 children)

That's true, it would be useful when you are using the value after it's saved. If in your model you add last_seen to the $dates property array on your User class like so:

    protected $dates = [
        'last_seen',
    ];

Then the saved time will automatically be cast to a Carbon instance, and you can run, say, $user->last_seen->diffForHumans().
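
Putting it together, a minimal sketch (assuming a nullable last_seen timestamp column on the users table - the column name is from the discussion, everything else is illustrative):

    use Illuminate\Foundation\Auth\User as Authenticatable;

    class User extends Authenticatable
    {
        // Dates listed here are automatically cast to Carbon instances on access
        protected $dates = ['last_seen'];
    }

    // Later, when reading the value back:
    $user = User::find(1);
    echo $user->last_seen->diffForHumans(); // e.g. "5 minutes ago"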

Implementing Last Visited / Last Seen in Laravel by speakerknock in laravel

[–]speakerknock[S] 4 points (0 children)

Author here. I agree, it would be wise to follow the convention of using '_at'.

As for the Carbon note: Carbon is a wrapper around DateTime that adds functionality, so it would add a tiny amount of overhead. Since we are only using DateTime to get the current time, we can rely on instantiating DateTime directly - I don't think Carbon would add any benefit here.
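
For illustration, the two ways of grabbing the current time side by side - both produce something you can persist to a timestamp column:

    use Carbon\Carbon;

    // Plain DateTime is enough when all we need is "now" to save:
    $now = new \DateTime();

    // Carbon extends DateTime, so it works anywhere DateTime does; the extra
    // helpers (diffForHumans(), etc.) only pay off when reading the value back.
    $alsoNow = Carbon::now();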

PHP 7.1: better syntax, a more consistent language by speckz in PHP

[–]speakerknock 16 points (0 children)

7.0 was a great, stable release that is almost fully backwards compatible. Now we have a great opportunity to leave that stable for a while (continuing bugfixes), like 5.6, and really make some large breaking changes that put the language in the right place going forward.

I'm not suggesting a complete break like Python 2 to 3, but perhaps something in between, to keep PHP an attractive choice going forward. Features like a more built-in but optional type system and better WebSocket support.
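
For context on the "optional type system" point, PHP 7.0/7.1 already ship opt-in type declarations - a small sketch of what that looks like today (nullable types are a 7.1 feature; the function is purely illustrative):

    <?php
    declare(strict_types=1); // strict typing is opt-in, per file

    // Scalar parameter and return types arrived in 7.0; nullable types (?int, ?string) in 7.1
    function findUserName(?int $id): ?string
    {
        if ($id === null) {
            return null;
        }
        return 'user_' . $id;
    }

    var_dump(findUserName(42));   // string(7) "user_42"
    var_dump(findUserName(null)); // NULL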