If you're looking for open source agents that can run locally, in Docker, or in the cloud, and that have a consistent track record of acing benchmarks like SWE-bench, check out https://github.com/All-Hands-AI/OpenHands
We're about to release our Agent SDK (https://github.com/All-Hands-AI/agent-sdk/), which gives devs all the nuts and bolts they need to define custom prompts, tools, security profiles, and multi-agent interfaces.
Any news about Devstral Small 1.2? Magistral 1.2 was a huge leap forward.
How about GPT 20b and 120b?
Qwen3?
Shameless plug for https://github.com/aperoc/toolkami here, which is designed to be simple enough that you can truly "own" and understand your agents.
It has a CLI that spins up a sandbox for your agents, and it comes with curated tools to make your agent useful.
Why only run inside Docker? What about devs who are already using devcontainers?
So this is a chat app? It's hard to tell what the key value prop is from the screenshots and a quick look at the repo.