resource | Active | Promising | low
inspect_ai
Inspect: A framework for large language model evaluations
Quality Breakdown
105 / 200
Content Signals
Repo Health
Multi-platform bonus: +5 pts if the tool supports 2+ platforms. Score derived from 12 structural signals, not stars or popularity.
Trust & Verification
low
Standard permissions only. Safe for general use.
Active
Updated within the last 90 days. Actively maintained.
Risk Assessment
- Official UK Government repository for an LLM evaluation framework; a legitimate vendor project
- No install scripts or bootstrap automation requesting elevated privileges in file list
- No evidence of autonomous operation without human approval gates
- No multi-agent spawning or orchestration capabilities requested
- Standard Python development setup (pip, make, pre-commit) with typical GitHub workflows
- CLAUDE.md file present but typical for development projects; no evidence of persistent agent configuration