• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

IBM / unitxt / 15112860133 / 1
81%
main: 81%

Build:
DEFAULT BRANCH: main
Ran 19 May 2025 12:29PM UTC
Files 64
Run time 2s
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

19 May 2025 12:22PM UTC coverage: 79.799% (+0.01%) from 79.788%
15112860133.1

push

github

web-flow
LLM judge judgebench benchmarks (#1800)

* Move Metric class into its own file

To avoid import cycle issues

Signed-off-by: Martín Santillán Cooper <msantillancooper@ibm.com>

* Add MetricInferenceEngine

Signed-off-by: Martín Santillán Cooper <msantillancooper@ibm.com>

* Add toxic chat LLM judge benchmarks

Signed-off-by: Martín Santillán Cooper <msantillancooper@ibm.com>

* Fix imports

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update cards to use LoadJsonFIle

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix empty template list

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

---------

Signed-off-by: Martín Santillán Cooper <msantillancooper@ibm.com>
Signed-off-by: elronbandel <elronbandel@gmail.com>
Co-authored-by: elronbandel <elronbandel@gmail.com>

1662 of 2068 branches covered (80.37%)

Branch coverage included in aggregate %.

10311 of 12936 relevant lines covered (79.71%)

0.8 hits per line

Source Files on job 15112860133.1
  • Tree
  • List 64
  • Changed 2
  • Source Changed 0
  • Coverage Changed 2
Coverage ∆ File Lines Relevant Covered Missed Hits/Line Branch Hits Branch Misses
  • Back to Build 15112860133
  • 585e281a on github
  • Prev Job for on main (#15107172026.1)
  • Next Job for on main (#15113858975.1)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc