home
services
projects
blog
directory
courses
resume
about
contact
meet

Mark Life Ltd

Home
Directory
Vercel Agent Eval

Mark Life Ltd

BG208147965

Home Contact Privacy LLM-friendly Blog RSS Directory RSS

Directory
agent-eval

RepoAI CodingDev Toolsevalsagents

agent-eval

Testing framework for measuring how well AI coding agents perform against your framework or configuration.

Added April 30, 2026

Related

memory-viewClaude Code / Codex skill that reads a project's auto-memory vault and generates a self-contained HTML explorer to visualize what the agent has remembered — MEMORY.md plus topic files — without editing or managing it.
session-reportClaude Code / Codex skill that generates a self-contained HTML report debugging what is in a session's context window and how every token is spent — context budget, retained thinking, the dumb-zone cutoff, loaded CLAUDE.md and skills, and full history.
It's Time To Rethink EverythingTheo Browne's CascadiaJS 2026 talk arguing that AI is a "new cloud moment" — just as the cloud removed the cost of provisioning servers, agents remove the cost of building, so the sacred rules of software (file systems, codebases, packages, git, deployment) are worth tearing down and rebuilding from first principles.
EveOpen-source agent framework from Vercel — define agents as directories of TypeScript and Markdown config and deploy them as standard Vercel projects.