IBM / unitxt / 12765217246 / 1
81%
main: 81%

Build:
DEFAULT BRANCH: main
Ran 14 Jan 2025 10:03AM UTC
Files 61
Run time 2s

14 Jan 2025 09:58AM UTC coverage: 79.393% (+0.02%) from 79.372%
12765217246.1

push

github

web-flow
Add Tables Understanding Benchmark (#1506)

* init commit bench

Signed-off-by: ShirApp <shirashury@gmail.com>

* merge updates

* Make tables benchmark

Signed-off-by: elronbandel <elronbandel@gmail.com>

* modify prompts (instruction once)

* modify prompts (instruction once) in generation template

* change llm as judge metric for scigen (Yifan's code)

* updated recipes

* add table augmenter

* update table benchmark files

* delete some files from branch

* fix typo of augmeter list in benchmark code + update recipes to include loader limit

* fix typos

* drop personal scripts

* create updated json cards (tab fact+turl)

* updated cards (tab fact+turl)

* add tablebench visualization json file

* delete old file

* update df serializer test

* drop table bench visualization since it is not a part of the benchmark, and we are not sure about its evaluation metric

---------

Signed-off-by: ShirApp <shirashury@gmail.com>
Signed-off-by: elronbandel <elronbandel@gmail.com>
Co-authored-by: elronbandel <elronbandel@gmail.com>

1387 of 1735 branches covered (79.94%)

Branch coverage included in aggregate %.

8742 of 11023 relevant lines covered (79.31%)

0.79 hits per line
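
The figures above can be cross-checked with a little arithmetic. The sketch below assumes that "Branch coverage included in aggregate %" means covered lines and covered branches are summed and divided by relevant lines plus total branches. That formula is an assumption, not something the report states, but it does reproduce the reported 79.393%. The helper name `pct` is hypothetical.

```python
def pct(covered: int, total: int) -> float:
    """Return covered/total as a percentage (0.0 when total is zero)."""
    return 100.0 * covered / total if total else 0.0


# Counts copied from the report for job 12765217246.1.
lines_covered, lines_relevant = 8742, 11023
branches_covered, branches_total = 1387, 1735

line_pct = pct(lines_covered, lines_relevant)
branch_pct = pct(branches_covered, branches_total)

# Assumed aggregate formula: lines and branches pooled together.
aggregate_pct = pct(
    lines_covered + branches_covered,
    lines_relevant + branches_total,
)

print(f"line coverage:      {line_pct:.2f}%")       # 79.31%
print(f"branch coverage:    {branch_pct:.2f}%")     # 79.94%
print(f"aggregate coverage: {aggregate_pct:.3f}%")  # 79.393%
```

Because the line count is much larger than the branch count, the aggregate sits close to the line figure and is pulled up only slightly by the higher branch coverage.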

  • 614bb122 on github