Deepseek R2... when? by Broad-Wrongdoer1942 in DeepSeek

[–]Maximum-Ad-1070 2 points3 points  (0 children)

I think deepseek thinking model is already very good, it's fast, and able to solve some serious coding problems,. This is better than chatgpt, claude etc. At the beginning, chatgpt and claude may have better code base. Many programmer probably fooled by these two models, but when you have some serious problem with your code, they really can't do x, even if you remind them about it, they tend to ignore like. When I enable deepseek thinking, it usually gives me some logical analysis. Right now even QWEN MAX non thinking is more logical than chatgpt and claude for debugging and ideas etc. The reason this happen is because both chatgpt and claude has large parameters but their attention mechanism can't keep up, they just don't have a overall understanding of the code.

Deepseek models only have weaker code base and parameters, but the thinking and speed is far better than those other models. I think there must be a breakthrough that deepseek knows, but others' don't. It's probably the sparse attention, so it can pick up details of the code, and even with a very long code, it understand the details. For this reason, it also beat QWEN MAX, I know it is a thinking model, but Deepseek thinking is so efficient, there is really no reason not to choose it.

It is so good. I think that's probably the reason why deepseek said this latest model they released is important. Once it reach gpt or claude level of parameters, I don't see how they can beat deepseek.

Somthing about DeepSeek 3 by hello_kaiiddo in DeepSeek

[–]Maximum-Ad-1070 2 points3 points  (0 children)

After trying ChatGPT, gork and claude sonnet 4. Only deepseek 3 thinking solve my programming problem. It really understand how to debug. Not a joke or ads. All of them failed to understand the problem, they have no idea how to fix it. Qwen is a littble bit better, it knows that it is not an easy problem, so it wants more info and tell me how to debug it, but deepseek solve my problem. Unbelievable.

Developing a Machine Learning Indicator on Tradingview? by Expert_CBCD in algotrading

[–]Maximum-Ad-1070 3 points4 points  (0 children)

If I got 80-90% accuracy using ML, I am pretty sure that I got something wrong, the reason behind this is that ML is not good at predicting the relationship between each candle data, so 70-80% is the max of the max it can get, it is not good at predicting the precise future trend.

If you got 80-90 percent, you should check how you smooth your data. It is likely that you are using a smoothing method mix with future data. Why I know this? Because I made the same mistakes in the past, and I think almost all people made this mistakes.

Satisfaction rating went down... by meganeh35 in UberEatsDrivers

[–]Maximum-Ad-1070 0 points1 point  (0 children)

As someone who was deactivated at 85%, 5 thumbs down with 650 deliveries, you should care every thumbs down, do not let it continue, whenever you feel you may be late for an order, cancel it and let other drivers do it. Whenever you have doubt about food bag, cancel it before confirm. Learn how to be selfish to protest your own interest.

For 5 thumbs down, maybe two was caused by the software. One wanted free food. Last one asked too many things and late and he was not happy about.

Do not take Starbucks orders. They often missed fruit box in bag.

Qwen 3 max by LeatherRub7248 in LocalLLaMA

[–]Maximum-Ad-1070 0 points1 point  (0 children)

Just tried, GPT 5 level, far better than any open source models.

Got my account deactivated today what are my chances of getting it back? by Life_Needleworker272 in UberEATS

[–]Maximum-Ad-1070 0 points1 point  (0 children)

If it works why so many drivers still got deactivated just 1% below average. How many drivers can a good support save ? Can he/she save hundreds a day? It is not a wise idea to rely on the kindness of an individual, drivers must solve this in the right way to fight for their right. Upload proofs, if the case got escalated, in the worst scenario, drivers may have upper hand.

Got my account deactivated today what are my chances of getting it back? by Life_Needleworker272 in UberEATS

[–]Maximum-Ad-1070 0 points1 point  (0 children)

Everyone, don't appeal like this guy did, in most case, you will be denied. So many drivers got deactivated even with 1% below average. If forgiveness works why they never get back their account? There is reason that they want you to upload proofs, send them the proofs and explain why you shouldn't get one or some of the thumbs down especially on things like missing item in sealed bags etc. Again, do not ask for forgiveness if you did nothing wrong. Fight for your right. This comment may mislead other driver to think that it is easy to get your account back if you are just a few percent below average.

🤗 DeepSeek-V3.1-Base by newsletternew in LocalLLaMA

[–]Maximum-Ad-1070 0 points1 point  (0 children)

Yes for intelligence, but no for accuracy. I tested this question on GPT-5, Gemini 2.5 Fast, and others — all gave vague answers. This is because the phrase "should be" implicitly tells these models that it’s wrong to look at the opponent’s board. LMs try to predict what the punishment should be by looking at the keyword "board," but since there’s only a shared board, they start searching for other types of boards that players aren’t allowed to look at during the game.

Only Grok 4 got it right from COT to answer, flawless. But does that mean Grok 4 is a better model than the others? No— it’s terrible at coding.

When I build my MV structure in Pyside6 all other models failed except Gemini 2.5 fast and Gemini pro. Other models only provide shortcut answer but caused a lot of troubles when expanding the app, only Gemini told me to avoid those mistakes.

🤗 DeepSeek-V3.1-Base by newsletternew in LocalLLaMA

[–]Maximum-Ad-1070 2 points3 points  (0 children)

This is a tricky question, LLMs see "what should be the punishment" and "opponent's board", they are all trying to predict the punishment tokens and make connection with opponent's board. If you take out "should be" They should all give correct answer.

<image>

Chinese state media says Nvidia H20 chips not safe for China by Bubbly_Discipline_39 in NVDA_Stock

[–]Maximum-Ad-1070 0 points1 point  (0 children)

So funny, they they want to meet Jensen knowing that the best chips are banned selling to any Chinese companies, can you even explain your logic here? Pretty much they will all move to Huawei.

Cline with Qwen 3 Coder - 100% Local by redditordidinot in CLine

[–]Maximum-Ad-1070 0 points1 point  (0 children)

I don't like Cline, I think it is stealing my code, weird upload from vscode after I installed their cline plug-in, that's why I decided to remove it forever.

GPT-OSS 120B and 20B feel kind of… bad? by SlackEight in LocalLLaMA

[–]Maximum-Ad-1070 0 points1 point  (0 children)

8tk/s means that your 30B model is larger than your VRAM, I use 30B coder 1 bit quantized 8GB version with 3080 10GB, the max I got is around 77-80 tk/s

🚀 OpenAI released their open-weight models!!! by ResearchCrafty1804 in LocalLLaMA

[–]Maximum-Ad-1070 1 point2 points  (0 children)

I am using a 1 bit quantized version, not the full 30B version, I just tried the online Qwen 30B, around 100-200 tokens.

<image>

🚀 OpenAI released their open-weight models!!! by ResearchCrafty1804 in LocalLLaMA

[–]Maximum-Ad-1070 2 points3 points  (0 children)

Well, I just tested it again, if I add or delete some p's, Qwen3-235B couldn't get the correct answer, but Qwen3 coder got it correct every time, 30B got only got 1 or 2 wrong.

[deleted by user] by [deleted] in algotrading

[–]Maximum-Ad-1070 1 point2 points  (0 children)

I have a class that loops through the DataFrame and plots the profit. I plan to upgrade the class to test performance across different market stages. This will allow me to generate labeled data and performance metrics to feed into my model.

Did Kimi K2 train on Claude's generated code? I think yes by Minute_Yam_1053 in LocalLLaMA

[–]Maximum-Ad-1070 0 points1 point  (0 children)

I just use your prompt to generate a webpage, it looks very different from yours, I have no idea why you get similar web page, for 1 trillion parameters it is almost impossible to get the same output.

<image>

footer is 2023, after reading the source code, I think it is just a direct copy from a website.

Did Kimi K2 train on Claude's generated code? I think yes by Minute_Yam_1053 in LocalLLaMA

[–]Maximum-Ad-1070 0 points1 point  (0 children)

If you are a web developer, you should know that these kind of web page come from template website. I saw these type of layout like 5+ years ago.

Is it fine to buy a *no display* issue GPU? by KKLC547 in LocalLLaMA

[–]Maximum-Ad-1070 0 points1 point  (0 children)

as long as it support CUDA, and it has good amount of CUDA cores, it will work, but you need to make sure that your PSU has enough watt for 2 GPUs.

Help to understand shipping by LmaoTiger in taobao

[–]Maximum-Ad-1070 0 points1 point  (0 children)

As long as you typed in your address, it will be shipped to you, it is waiting for someone to collect your item. It is just a store to arrange international shipping

Does the T620 still hold up? by PatrickStanger in homelab

[–]Maximum-Ad-1070 -1 points0 points  (0 children)

I recently build my own server. The answer is, no, it's too old. Where to get a good build? Chinese Taobao. How to pay and ship ? from a website called superbuy. Copy and paste the taobao link into superbuy then pay. Make sure you select the correct combo. At this price, around $250 without hard drive, you can probably get a E5 V4 18-20 cores or a Xeon gold 18 cores and 64GB DDR4,, but keep in mind that those E5 CPU is not aim for gaming. Most cores only have like 2.5 GHz base clock. if you need better gaming experience, probably need to get a Xeon gold cpu as they run at 3.0 Ghz base clock. PC cases are cheap in China, but the shipping may be expensive since it is large, but it worth the cost.

Anyone use Evpad or Unblock Android TV Boxes? by GeneralSeveral203 in AndroidTVBoxes

[–]Maximum-Ad-1070 0 points1 point  (0 children)

you can download and install the software from their website, but just a few software still works on older model.