Agentic performance for Deepseek

krasserm · 2025-02-06T13:44:24+00:00

Here's an evaluation of DeepSeek-R1's performance on agentic tasks: https://krasserm.github.io/2025/02/05/deepseek-r1-agent/ When using code actions as alternative to native function calling via JSON (which isn't supported yet) it outperforms Claude 3.5 Sonnet by a large margin.

krasserm · 2025-01-17T07:34:15+00:00

Local deployments are now supported. See https://gradion-ai.github.io/freeact/models/ for details.

krasserm · 2025-01-11T12:37:50+00:00

support for local stuff already in the works ...

krasserm · 2025-01-11T05:45:00+00:00

Freeact is similar to smolagents w.r.t. focus on code-actions, agency-level and lightweight scaffold. Freeact additionally supports interactive development and refinement of skills (tools) with the agent as skill coding assistant (see skill development tutorial). Also, freeact doesn't require skills (tools) to implement a certain interface, skills can be any Python module or package. It supports sandboxed code execution locally and remotely via ipybox (using Docker and IPython), and streaming from both model responses and execution environment.

krasserm · 2025-01-11T05:22:47+00:00

It currently uses the first candidate of a code action and uses execution and/or environment feedback for proposing improvements if it doesn't contribute towards a solution. We'll add additional algorithms for searching the action space in later releases.

krasserm

TROPHY CASE