
software by Stafford Williams
blog more
- 2025-06-04 AI Code Assistance: Are we talking about the same thing?
- 2025-03-16 Embrace Vibe Coding, Know Its Limits
- 2025-03-09 You Might Not Need an AI Framework
- 2025-03-03 Understanding AI-assisted Coding Workflows
- 2025-01-29 Running DeepSeek R1 Locally
- 2025-01-12 LLM Agent Assisted Coding
- 2024-10-14 Comparing Netlify and Azure Static Web Apps
links more
How we built our multi-agent research system speaks to Anthropic’s multi-agent build experiences.
We found that a multi-agent system with Claude Opus 4 as the lead agent and Claude Sonnet 4 subagents outperformed single-agent Claude Opus 4 by 90.2% on our internal research eval
However, these architectures burn through tokens fast:
In our data, agents typically use about 4× more tokens than chat interactions, and multi-agent systems use about 15× more tokens than chats.
Cognition goes further in Don’t Build Multi-Agents:
In some cases, libraries such as swarm by OpenAI and autogen by Microsoft actively push concepts which I believe to be the wrong way of building agents.
a tool that you can use to create and run Linux containers as lightweight virtual machines on your Mac. It’s written in Swift, and optimized for Apple silicon.
After installing and starting the service via container system start
the CLI gives you a similar experience to Docker. Unfortunately, in my case, after resolving dated sources and switching dependencies to arm64, I discovered my Dockerfile candidate still had an x86 binary, so I’ll need to find another candidate to test any performance improvements.
Pricing remains consistent with previous Opus and Sonnet models: Opus 4 at $15/$75 per million tokens (input/output) and Sonnet 4 at $3/$15.
Claude Opus 4 is our most powerful model yet and the best coding model in the world. GitHub says Claude Sonnet 4 soars in agentic scenarios and will introduce it as the model powering the new coding agent in GitHub Copilot
Claude Code, now generally available, brings the power of Claude to more of your development workflow - new beta extensions for VS Code and JetBrains integrate Claude Code directly into your IDE. Beyond the IDE, we’re releasing an extensible Claude Code SDK, so you can build your own agents and applications using the same core agent as Claude Code.
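To put the per-million-token pricing quoted above in concrete terms, here is a rough sketch of a cost estimate. The model keys (`opus-4`, `sonnet-4`) are just labels for this example, not real API model IDs, and the token counts are a made-up workload.

```python
# Per-million-token prices in USD as quoted above: (input, output).
# The dictionary keys are illustrative labels, not official model identifiers.
PRICES_PER_MILLION = {
    "opus-4": (15.00, 75.00),
    "sonnet-4": (3.00, 15.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for the given token counts."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 200k input tokens and 20k output tokens.
for model in PRICES_PER_MILLION:
    print(f"{model}: ${estimate_cost(model, 200_000, 20_000):.2f}")
```

For that assumed workload, the same request costs five times as much on Opus 4 as on Sonnet 4, since both the input and output prices differ by a factor of five.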
Jony Ive joins OpenAI via its $6.5 billion acquisition of io, an AI device startup that Ive co-founded.
Microsoft to open source VS Code Copilot & WSL
Over the next few months, AI-powered capabilities from the GitHub Copilot extension will migrate to the VS Code open source repository, says Microsoft. Windows Subsystem for Linux (WSL), the feature in Windows that allows users to run a Linux environment directly on their Windows system, will be open sourced.
Google releases an asynchronous coding agent called Jules. Unfortunately, it appears to be waitlist-only in Australia.
OpenAI introduces a cloud-based software engineering agent. Limited to Pro users for now.
OpenAI to buy Windsurf for $3 billion.
Firefox has integrated AI chatbots into a new sidebar within the browser, but the integration doesn’t appear to have access to the page the browser is currently on 🤦. It appears to be just an iframe/tab to the provider’s normal web interface.
devlog more
- 2024-05-29 [spacetraders-v2] v2.14 - more data browser
- 2024-05-16 [spacetraders-v2] v2.13 - data browser
- 2024-05-04 [spacetraders-v2] v2.12 - reset 2024-04-09
- 2024-04-21 [spacetraders-v2] v2.11 - monitoring markets
- 2024-04-18 [spacetraders-v2] v2.10 - trading contracts
- 2024-03-31 [spacetraders-v2] v2.9 - improved waypoint monitoring
- 2024-03-28 [spacetraders-v2] v2.8 - over supply
notes more
- 2025-03-29 [ssl] mac keychain
- 2025-03-03 [javascript, timezones, vitest] timezones
- 2025-01-14 [bookmarklet, chrome] add text to clipboard
- 2024-12-11 [javascript, timezones] datetime libraries
- 2024-03-13 [azure-b2c] limits
- 2024-03-11 [azure-b2c] phone mfa - microsoft samples
- 2024-03-06 [http] testing