• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

IBM / unitxt / 17129746456 / 1
81%
main: 81%

Build:
DEFAULT BRANCH: main
Ran 21 Aug 2025 02:36PM UTC
Files 64
Run time 2s
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

21 Aug 2025 02:21PM UTC coverage: 80.81% (+0.04%) from 80.769%
17129746456.1

push

github

web-flow
Add ReflectionToolCallingMetricSyntactic for evaluating tool call predictions referenceless (#1923)

* Add ReflectionToolCallingMetricSyntactic for evaluating tool call predictions

- Implemented ReflectionToolCallingMetricSyntactic to assess tool calls without references using static checks.
- Added corresponding JSON schema for the metric.
- Created example usage in evaluate_tool_calling_with_reflection.py.
- Updated test_metrics.py to include comprehensive tests for the new metric.

* Install llmevalkit before tests

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Another try

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix style

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Fix examples

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update the message

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Improve docs

Signed-off-by: elronbandel <elronbandel@gmail.com>

* Update

Signed-off-by: elronbandel <elronbandel@gmail.com>

---------

Signed-off-by: elronbandel <elronbandel@gmail.com>
Co-authored-by: Koren Lazar <koren.lazar@ibm.com>
Co-authored-by: elronbandel <elronbandel@gmail.com>

1595 of 1992 branches covered (80.07%)

Branch coverage included in aggregate %.

10840 of 13396 relevant lines covered (80.92%)

0.81 hits per line

Source Files on job 17129746456.1
  • Tree
  • List 64
  • Changed 7
  • Source Changed 0
  • Coverage Changed 7
Coverage ∆ File Lines Relevant Covered Missed Hits/Line Branch Hits Branch Misses
  • Back to Build 17129746456
  • d2223f87 on github
  • Prev Job for on main (#16963744293.1)
  • Next Job for on main (#17140497965.1)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc