Agentic web browsing can't scale with cloud LLMs

1 points by djh1995 10 hours ago

TLDR:

- Agentic web browsing can’t scale if every page action triggers an LLM.

- Small language models can power most interactive AI experiences.

- The browser is the ideal runtime: open, GPU-accelerated, and everywhere.

- Agentary.js is an open-source SDK for building agentic workflows that run locally, in the browser.

Repo URL: https://github.com/agentary-ai/agentary-js

---

Most agentic browsing today sends every click, scroll, or query to a cloud LLM. This doesn’t scale, economically or technically. Each DOM action becomes another API call that is slow and expensive, and that compromises user privacy.

Agentic behavior belongs inside the browser, not round-tripped to the cloud.

NVIDIA Research even published a paper (https://arxiv.org/pdf/2506.02153) arguing that most agentic tasks don’t need generalist LLMs. They need fast, specialized small language models (SLMs) running close to the user.

I wanted to build an open-source SDK to support a future where agentic workflows are not only built and owned by the website, but powered by SLMs. With the expansion of WebGPU, the browser is now the perfect runtime.
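To make "the browser as runtime" a bit more concrete, here is a minimal TypeScript sketch of WebGPU feature detection, which is roughly the first check any in-browser model runner would need. `navigator.gpu` and `requestAdapter()` are standard WebGPU; the `hasWebGPU` and `pickRuntime` helpers and the `Runtime` labels are hypothetical names for illustration and are not part of the Agentary.js API.

```typescript
// Minimal structural types so the sketch type-checks without WebGPU typings.
interface GpuLike {
  requestAdapter(): Promise<unknown | null>;
}
interface NavigatorLike {
  gpu?: GpuLike;
}

// Hypothetical labels for where inference could run.
type Runtime = "local-webgpu" | "fallback";

// Cheap synchronous check: does this environment expose WebGPU at all?
function hasWebGPU(nav: NavigatorLike): boolean {
  return nav.gpu !== undefined;
}

// Fuller check: WebGPU can be exposed but still return no usable adapter,
// so request one before committing to local inference.
async function pickRuntime(nav: NavigatorLike): Promise<Runtime> {
  if (!hasWebGPU(nav)) return "fallback";
  try {
    const adapter = await nav.gpu!.requestAdapter();
    return adapter ? "local-webgpu" : "fallback";
  } catch {
    return "fallback";
  }
}
```

In a page this would be called as `pickRuntime(navigator as NavigatorLike)`. What "fallback" means (a CPU/WASM path, a cloud call, or disabling the feature) is left open here and would be an application decision.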

I put together a simple demo that uses Agentary.js, so feel free to check it out and share feedback: https://demo.agentary.ai

I'd love to collaborate with other engineers in the community to make this future a reality and, importantly, to keep it open source!