• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

rendezqueue / rendezllama / 21335653326 / 1
86%
trunk: 87%

Build:
Build:
LAST BUILD BRANCH: assistant_cli
DEFAULT BRANCH: trunk
Ran 25 Jan 2026 04:19PM UTC
Files 31
Run time 0s
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

25 Jan 2026 04:15PM UTC coverage: 88.704% (-0.02%) from 88.722%
21335653326.1

push

github

grencez
Infer default thread count to match physical cores

Introduces `rendezllama::infer_thread_count` to heuristically determine a suitable thread count for inference, aiming to use physical cores rather than logical threads on x86 architectures (by halving the count if > 4). This avoids oversubscription and improves performance stability.

- Updates `ChatOptions` to initialize `thread_count` using this helper.
- Updates `Inference::commit_to_context` to use `std::thread::hardware_concurrency` (all logical cores) for `batch_thread_count` default, distinct from the physical core preference for generation.
- Updates `assistant_cli` to rely on the `ChatOptions` default instead of manually setting `hardware_concurrency`.
- Shared `infer_thread_count` logic resides in `src/chat/opt.cc` and is exposed via `src/chat/opt.hh`.

2128 of 2399 relevant lines covered (88.7%)

523.93 hits per line

Source Files on job 21335653326.1
  • Tree
  • List 31
  • Changed 3
  • Source Changed 0
  • Coverage Changed 3
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Build 21335653326
  • e9fda922 on github
  • Prev Job for on tmp (#21254307595.1)
  • Next Job for on tmp (#21341807490.1)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc