ZoeZOE/Orbit
SkillsPersonasDocsPricing
Sign InGet Started
Orbit/Skills/ai/Model Benchmark

Model Benchmark

Official Pro

Compare LLM models on your specific use cases with cost analysis

15 installs

Install

$orbit install model-benchmark

Pro subscription required

Sign up to access

What it does

Run standardized benchmarks against multiple LLM providers on YOUR test cases, not generic benchmarks.

Why you need it

Generic benchmarks don't tell you which model is best for your use case. This skill tests models against your actual prompts with your actual data.

Key capabilities

- Custom test suites from your real prompts - Latency, cost, accuracy, and token efficiency metrics - Side-by-side output comparison - CSV and interactive HTML reports

Category

ai

Tier

Pro

Version

v1.0.0

Installs

15

Tags

#ai#benchmarks#evaluation#cost

Latest: v1.0.0

Initial release

Apr 13, 2026

Sign in to track installs

Install

$orbit install model-benchmark

Pro subscription required

Sign up to access

Details

Versionv1.0.0
Categoryai
TierPro
Installs
15
RatingNo ratings yet
AddedApr 13, 2026
UpdatedApr 16, 2026

Tags

#ai#benchmarks#evaluation#cost

Related Skills

ai
Pro

AI Code Review

Deep code review agent — catches bugs, security issues, and anti-patterns

No ratings yet
42 installs
#ai#code-review#security+1
ai
Free

Prompt Engineer

Build, test, and optimize prompts with version control

No ratings yet
38 installs
#ai#prompts#llm+1
communication
Free

Meeting Notes Agent

Extract action items, decisions, and summaries from meeting transcripts

No ratings yet
22 installs
#meetings#notes#action-items+2
infrastructure
Free

Terraform Planner

Terraform plan analysis with cost estimation and risk scoring

No ratings yet
18 installs
#terraform#iac#cost+1