fix: auto-weight grouped rubrics shorthand by criteria count by christso · Pull Request #1099 · EntityProcess/agentv

christso · 2026-04-14T07:43:40Z

Problem

When string shorthand assertions are mixed with explicit graders, the internal grouping creates a hidden weight asymmetry. A user who writes 4 assertions expects equal weight per line:

assertions:
  - Identifies the undefined access   # user thinks: 1/4 weight
  - Suggests a null-safe fix           # user thinks: 1/4 weight
  - Explains the root cause            # user thinks: 1/4 weight
  - type: contains
    value: "null"                      # user thinks: 1/4 weight

But the framework creates 2 graders, rubrics (weight 1) and contains (weight 1), so contains gets 50% of the score instead of 25%.
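The arithmetic behind this asymmetry can be sketched as follows, assuming the overall score is a weight-normalized average of grader scores (which the 50% figure implies). `Grader` and `weightShares` are hypothetical names used only for illustration; they are not agentv identifiers.

```typescript
// Hypothetical shape, for illustration only; not agentv's real grader type.
interface Grader {
  name: string;
  weight: number;
}

// Fraction of the overall score each grader controls, assuming the
// overall score is a weight-normalized average of grader scores.
function weightShares(graders: Grader[]): Map<string, number> {
  const total = graders.reduce((sum, g) => sum + g.weight, 0);
  return new Map(graders.map((g): [string, number] => [g.name, g.weight / total]));
}

// Three string criteria collapse into one rubrics grader of weight 1,
// so the single contains check controls half of the score.
const before = weightShares([
  { name: "rubrics", weight: 1 },
  { name: "contains", weight: 1 },
]);
console.log(before.get("contains")); // 0.5
```

Under the same assumption, each of the three rubric criteria ends up with roughly 1/6 of the score, while the one contains line carries 1/2.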

Fix

One-line change in the string shorthand grouping logic (evaluator-parser.ts): when string criteria are grouped into a rubrics grader, set its weight to criteria.length. Each user-visible assertion then contributes equal weight, regardless of how many strings are grouped together.

Before: rubrics(w=1) + contains(w=1) → 50/50
After: rubrics(w=3) + contains(w=1) → 75/25 (each of 4 lines = 25%)
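A minimal sketch of the grouping rule described above. This is not the actual code in evaluator-parser.ts: the types, the `groupAssertions` name, the default weight of 1 for explicit graders, and placing the rubrics grader first are all assumptions made for illustration.

```typescript
// Hypothetical types for illustration; the real ones in
// evaluator-parser.ts may differ.
type ExplicitGrader = { type: string; weight?: number; [key: string]: unknown };
type Assertion = string | ExplicitGrader;

interface ParsedGrader {
  type: string;
  weight: number;
  criteria?: string[];
}

// Collect string shorthand into a single rubrics grader whose weight
// equals the number of grouped criteria; explicit graders default to 1.
function groupAssertions(assertions: Assertion[]): ParsedGrader[] {
  const criteria: string[] = [];
  const explicit: ParsedGrader[] = [];
  for (const a of assertions) {
    if (typeof a === "string") {
      criteria.push(a);
    } else {
      explicit.push({ type: a.type, weight: a.weight ?? 1 });
    }
  }
  if (criteria.length === 0) return explicit;
  // The fix: weight = criteria.length instead of the implicit 1.
  const rubrics: ParsedGrader = { type: "rubrics", weight: criteria.length, criteria };
  return [rubrics, ...explicit];
}

const graders = groupAssertions([
  "Identifies the undefined access",
  "Suggests a null-safe fix",
  "Explains the root cause",
  { type: "contains", value: "null" },
]);
console.log(graders.map((g) => `${g.type}(w=${g.weight})`).join(" + "));
// rubrics(w=3) + contains(w=1)
```

Deriving the weight from the criteria count keeps the grouping an internal detail: the user never has to set a weight to get equal weight per line.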

Behaviour

Mixed assertions: rubrics weight scales with criteria count ✓
All-string assertions: weight is set but has no effect (sole grader) ✓
Explicit `type: rubrics` with `weight:` set: unaffected (different code path) ✓
Explicit `weight:` on string shorthand: not possible by design; use `type: rubrics` for that
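The all-string case can be checked with a short sketch, again assuming a weight-normalized average. `weightedScore` is a hypothetical helper, not an agentv function. With a single grader, the weight cancels out of the average.

```typescript
// Hypothetical helper: weight-normalized average of grader scores.
function weightedScore(results: { score: number; weight: number }[]): number {
  const total = results.reduce((sum, r) => sum + r.weight, 0);
  return results.reduce((sum, r) => sum + r.score * r.weight, 0) / total;
}

// A lone rubrics grader scores the same whether its weight is 1 or 3.
const withOldWeight = weightedScore([{ score: 0.5, weight: 1 }]);
const withNewWeight = weightedScore([{ score: 0.5, weight: 3 }]);
console.log(withOldWeight === withNewWeight); // true
```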

🤖 Generated with Claude Code

When string shorthand assertions are mixed with other explicit graders, the rubrics grader created from the strings now gets weight = number of criteria, making each user-visible assertion contribute equal weight to the overall score.

Before: [contains, "A", "B", "C"] → contains(w=1) + rubrics(w=1) → 50/50
After: [contains, "A", "B", "C"] → contains(w=1) + rubrics(w=3) → 25/75

The shorthand abstraction is now transparent: users who write N string criteria alongside M explicit graders get equal weight per visible line, without needing to know about internal grader grouping.

Closes #1098

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

cloudflare-workers-and-pages · 2026-04-14T07:44:13Z

Deploying agentv with Cloudflare Pages

Latest commit:	`20efd94`
Status:	✅ Deploy successful!
Preview URL:	https://16a9af3e.agentv.pages.dev
Branch Preview URL:	https://fix-1098-rubrics-shorthand-a.agentv.pages.dev


christso and others added 2 commits April 14, 2026 07:40

style: fix biome formatting

62fa5e1

christso added 2 commits April 14, 2026 12:34

test: remove redundant shorthand weight tests

e37fe40

style: fix trailing blank line

20efd94

christso wants to merge 4 commits into main from fix/1098-rubrics-shorthand-auto-weight
