<< All versions
Skill v1.0.1
Trusted Publisher100/100openai/plugins/evaluate-plugin
[codex] Add mobile performance skills (#177)
──Details
PublishedApril 26, 2026 at 07:11 AM
Content Hashsha256:364c832f14f4f49c...
Git SHA421657af0e7e
──Files
Files (1 file, 1.8 KB)
SKILL.md1.8 KBactive
SKILL.md · 44 lines · 1.8 KB
name: evaluate-plugin description: Evaluate a local Codex plugin in engineer-friendly language. Use when the user says "evaluate this plugin", "audit this plugin", "why did this score that way", "what should I fix first", "help me benchmark this plugin", or asks for a plugin-wide report before comparing versions.
Evaluate Plugin
Use this skill when the target is a plugin root with .codex-plugin/plugin.json.
Workflow
- Treat "Evaluate this plugin." as the default entrypoint.
- If the request comes in as natural chat language, use
plugin-eval start <plugin-root> --request "<user request>" --format markdownfirst so the user sees the routed local path. - Run
plugin-eval analyze <plugin-root> --format markdown. - Read
Fix Firstbefore drilling into manifest findings, nested skill findings, and code or coverage details. - If the plugin contains multiple skills, summarize the strongest and weakest ones explicitly.
- If the user wants measured usage, switch to "Help me benchmark this plugin." and use the starter benchmark flow.
- If the user wants trend data, compare two JSON outputs with
plugin-eval compare.
Chat Requests To Recognize
Evaluate this plugin.Audit this plugin.Why did this score that way?What should I fix first?Help me benchmark this plugin.What should I run next?
Commands
bash
plugin-eval start <plugin-root> --request "Evaluate this plugin." --format markdownplugin-eval analyze <plugin-root> --format markdownplugin-eval start <plugin-root> --request "What should I run next?" --format markdownplugin-eval compare before.json after.jsonplugin-eval report result.json --format html --output ./plugin-eval-report.htmlplugin-eval init-benchmark <plugin-root>plugin-eval benchmark <plugin-root> --dry-run
Reference
../../references/chat-first-workflows.md