• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

IBM / unitxt / 15581348656 / 1
81%
main: 81%

Build:
DEFAULT BRANCH: main
Ran 11 Jun 2025 09:41AM UTC
Files 64
Run time 8s
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

11 Jun 2025 09:34AM UTC coverage: 80.241% (-0.007%) from 80.248%
15581348656.1

push

github

web-flow
Bluebench fixes (#1828)

* Fix Arena Hard template to use latest llama judge model (used in Bluebench)

Signed-off-by: Jonathan Bnayahu <bnayahu@il.ibm.com>

* Update the Arena Hard recipe in Bluebench to use llama-3-3-70b-instruct as judge.

Signed-off-by: Jonathan Bnayahu <bnayahu@il.ibm.com>

* Add a requirements section for bluebench

Signed-off-by: Jonathan Bnayahu <bnayahu@il.ibm.com>

---------

Signed-off-by: Jonathan Bnayahu <bnayahu@il.ibm.com>

1689 of 2081 branches covered (81.16%)

Branch coverage included in aggregate %.

10478 of 13082 relevant lines covered (80.09%)

0.8 hits per line

Source Files on job 15581348656.1
  • Tree
  • List 64
  • Changed 1
  • Source Changed 0
  • Coverage Changed 1
Coverage ∆ File Lines Relevant Covered Missed Hits/Line Branch Hits Branch Misses
  • Back to Build 15581348656
  • a853a9d5 on github
  • Prev Job for on main (#15557150208.1)
  • Next Job for on main (#15585855594.1)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc