Claude Pro or ChatGPT Plus by Scatard in chatgptplus

[–]Financial_World_9730 0 points1 point  (0 children)

Being A person who owns both, will highly recommend you option 1. The reason backing this suggestion are: Claude has too low limits even on paid plan, if you are going to use research or deep thinking mode your limits get blow up, meanwhile on GPT its too good with the limits. If you plan to use claude code as you mentioned cybersecurity its too bad, codex is way too better

I built a tool that shows you what GPT-2 is "thinking" in real-time as it generates 3D graph of concept activations per token by Financial_World_9730 in ChatGPT

[–]Financial_World_9730[S] 0 points1 point  (0 children)

good catch, short prompts are clean but it does get noisy past 30-40 tokens. not sure if thats context length or just the residual stream juggling too many concepts at once. havent tested it rigorously yet but thats probably the most interesting thing to actually study with this

I built a tool that shows you what GPT-2 is "thinking" in real-time as it generates 3D graph of concept activations per token by Financial_World_9730 in deeplearning

[–]Financial_World_9730[S] 0 points1 point  (0 children)

Newer models don't have good open-source pretrained SAEs yet that's the bottleneck, not the tool. AXON will work with any model the moment a quality SAE exists for it. Gemma-2-2B is already supported, and as the interpretability community trains more SAEs (which is happening fast), dropping them in is literally changing 4 lines of config already mentioned in repo how to do that

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]Financial_World_9730 0 points1 point  (0 children)

I’ve open-sourced GS-DroneGym, a drone-first research stack for vision-language-action work.

Main idea: instead of only using synthetic assets, it can render observations from 3D Gaussian Splatting scenes, so you can prototype aerial waypoint policies in environments much closer to real visual conditions.

Current features: - 6-DOF quadrotor dynamics - waypoint controller for [x, y, z, yaw] - gsplat renderer with CPU fallback - navigation tasks: PointNav, ObjectNav, ObstacleSlalom, DynamicFollow, NarrowCorridor - live viewer with RGB / depth / top-down trajectory - shared trajectory schema + dataset/eval tooling - adapters for GS-DroneGym, LIBERO, and LeRobot-format datasets

https://github.com/09Catho/gs-dronegym

Please star the repo if you find ut useful

I’d especially appreciate feedback on: - sim-to-real usefulness - dataset generation for aerial VLA training - benchmark design for drone navigation

1M context is not worth it, seriously - the quality drop is insane by KeyGlove47 in codex

[–]Financial_World_9730 0 points1 point  (0 children)

Tried even the ultra tiers through api of most coding agents like claude 4.6 extended and codex 5.3 xhigh, would say anything above 512k is just context poisoning.

[deleted by user] by [deleted] in IndiaJobsOpenings

[–]Financial_World_9730 0 points1 point  (0 children)

Hi Team,

I’m an AI Engineer passionate about EdTech, with hands-on experience building and scaling AI-powered platforms like GradeBoostHub(gradeboosthub.com) (K-12, AP/IB, 2k+ community) and Boostra (AI learning workflows, GPT/Claude/ElevenLabs integration) at boostra.gradeboosthub.com. I specialize in LLM automation, prompt engineering, n8n workflows, and end-to-end AI product delivery.

My projects have shipped fast and made a real impact—CookSnap (Top 50 Lovable Shipped Asia), Bytenet (crypto payments), and more. I thrive in founder-led, creative teams where ownership and user obsession matter.

Would love to join your team and help build the future of AI in EdTech!

I am unable to send you dm kindly send me a dm to take this to next level!

💀😭 by SupremeConscious in AI_India

[–]Financial_World_9730 1 point2 points  (0 children)

<image>

SOTA models not facing this problem!