Andrey Markin

Mark Life Ltd

BG208147965

Repo, AI Coding, Dev Tools, evals, agents

agent-eval

Testing framework for measuring how well AI coding agents perform against your framework or configuration.

Added April 30, 2026

Related

  • memory-view: Claude Code / Codex skill that reads a project's auto-memory vault and generates a self-contained HTML explorer to visualize what the agent has remembered (MEMORY.md plus topic files) without editing or managing it.
  • session-report: Claude Code / Codex skill that generates a self-contained HTML report debugging what is in a session's context window and how every token is spent: context budget, retained thinking, the dumb-zone cutoff, loaded CLAUDE.md and skills, and full history.
  • It's Time To Rethink Everything: Theo Browne's CascadiaJS 2026 talk arguing that AI is a "new cloud moment": just as the cloud removed the cost of provisioning servers, agents remove the cost of building, so the sacred rules of software (file systems, codebases, packages, git, deployment) are worth tearing down and rebuilding from first principles.
  • Eve: Open-source agent framework from Vercel. Define agents as directories of TypeScript and Markdown config and deploy them as standard Vercel projects.