Product Verification
Test and verify code is working
31 skills
multi-agent-shogun
by yohey-w
Samurai-inspired multi-agent system for Claude Code. Orchestrate parallel AI tasks via tmux with shogun โ karo โ ashigaru hierarchy.
claude-hud
by jarrodwatts
A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress
page-agent
by alibaba
JavaScript in-page GUI agent. Control web interfaces with natural language.
vercel-labs/agent-browser
by vercel-labs
Browser automation CLI for AI agents
CodeBoarding
by CodeBoarding
Interactive architecture diagrams for codebases
chrome-devtools-mcp
by ChromeDevTools
Chrome DevTools for coding agents
Intent-Lab/VisionClaw
by Intent-Lab
Real-time AI assistant for Meta Ray-Ban smart glasses -- voice + vision + agentic actions via Gemini Live and OpenClaw
microsandbox
by superradcompany
opensource secure local-first sandboxes for ai agents
simstudioai/sim
by simstudioai
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
backnotprop/plannotator
by backnotprop
Annotate and review coding agent plans and code diffs visually, share with your team, send feedback to agents with one click.
promptfoo/promptfoo
by promptfoo
Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.
bytedance/UI-TARS-desktop
by bytedance
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
inspect_ai
by UKGovernmentBEIS
Inspect: A framework for large language model evaluations
langwatch
by langwatch
The platform for LLM evaluations and AI agent testing
claude-code-safety-net
by kenryu42
A Claude Code plugin that acts as a safety net, catching destructive git and filesystem commands before they execute.
agent-skills
by apify
Agent Skills for Test Automation
pingcap/tidb
by pingcap
TiDB is built for agentic workloads that grow unpredictably, with ACID guarantees and native support for transactions, analytics, and vector search. No data silos. No noisy neighbors. No infrastructure ceiling.
playwright-skill
by lackeyjb
Claude Code Skill for browser automation with Playwright. Model-invoked - Claude autonomously writes and executes custom automation for testing and validation.