• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

IBM / unitxt / 17639482433
81%

Build:
DEFAULT BRANCH: main
Ran 11 Sep 2025 09:06AM UTC
Jobs 2
Files 64
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

11 Sep 2025 08:59AM UTC coverage: 80.906% (+0.1%) from 80.81%
17639482433

push

github

web-flow
Add ReflectionToolCallingMetric and update related metrics (#1931)

* Add ReflectionToolCallingMetric and update related metrics

- Introduced ReflectionToolCallingMetric for assessing syntactic and semantic validity of tool calls.
- Updated MultiTurnToolCallingMetric description for clarity.
- Added reflection.json to catalog with appropriate descriptions.
- Enhanced test coverage for ReflectionToolCallingMetric and its reduction logic.

* removed redundant import which makes tests fail.

* Minor fix for mock provider name in llmevalkit

* Update descriptions for ReflectionToolCallingMetric and ReflectionToolCallingMetricSyntactic; enhance clarity and detail on evaluation criteria and installation instructions.

* Refactor ReflectionToolCallingMetric to use settings directly instead of unitxt.settings; update provider name format for watsonx.

* Fixed minor bugs to support different tasks.

* Fixed requirements issue, general import bug, and added some guards for the provider.

* Fixed pre-commit issues.

* made sure that we reinstall libraries from git.

* Add logging for installation URL and version info in internal pip action

* minor change

* fixed assignment of mock provider

* Update .github/actions/install-internal-pip/action.yml

Co-authored-by: Elron Bandel <elronbandel@gmail.com>

* removed two unittests that were causing problems and fixed assertEqual to assertFalse/assertTrue.

---------

Co-authored-by: Koren Lazar <koren.lazar@ibm.com>
Co-authored-by: Elron Bandel <elronbandel@gmail.com>

1607 of 2005 branches covered (80.15%)

Branch coverage included in aggregate %.

10944 of 13508 relevant lines covered (81.02%)

1.62 hits per line

Uncovered Existing Lines

Lines Coverage ∆ File
650
75.87
0.62% unitxt/metrics.py
Jobs
ID Job ID Ran Files Coverage
1 17639482433.1 11 Sep 2025 09:05AM UTC 64
80.91
GitHub Action Run
2 17639482433.2 18 Sep 2025 12:40PM UTC 64
80.91
GitHub Action Run
Source Files on build 17639482433
  • Tree
  • List 64
  • Changed 1
  • Source Changed 0
  • Coverage Changed 1
Coverage ∆ File Lines Relevant Covered Missed Hits/Line Branch Hits Branch Misses
  • Back to Repo
  • Github Actions Build #17639482433
  • 95ad743b on github
  • Prev Build on main (#17464258472)
  • Next Build on main (#17891966098)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc