• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / timdex-dataset-api / 20069786007
94%
main: 93%

Build:
Build:
LAST BUILD BRANCH: USE-306-handle-missing-metadata-or-embeddings
DEFAULT BRANCH: main
Ran 09 Dec 2025 03:59PM UTC
Jobs 1
Files 8
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

09 Dec 2025 03:53PM UTC coverage: 93.861% (+0.4%) from 93.503%
20069786007

Pull #179

github

ghukill
Join embeddings queries on record metadata

Why these changes are being introduced:

Two nice-to-have functionalities were missing from the first pass of embeddings
reading:

1. filter by record metadata columns now in embeddings schema
2. retrieve record metadata columns in embeddings read methods

There was a deliberate choice to keep embeddings read methods simple in the first
pass.  This builds on that work.

How this addresses that need:

For TIMDEXEmbeddings.read_batches_iter(), the base read method for all embeddings
read methods, perform a join to record metadata via the composite key (timdex_record_id,
run_id, run_record_offset).  Given that record metadata is very fast and memory safe, this
join is too.  By performing this join, we can expose record metadata columns that
intentionally don't exist in the embeddings schema -- e.g. 'source' or 'run_timestamp' --
for filtering and selecting.

Side effects of this change:
* Read methods for TIMDEXEmbeddings can filter and return columns only found in
records metadata tables/views.

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/USE-143
Pull Request #179: Join embeddings queries on record metadata

33 of 33 new or added lines in 2 files covered. (100.0%)

3 existing lines in 1 file now uncovered.

688 of 733 relevant lines covered (93.86%)

0.94 hits per line

Uncovered Existing Lines

Lines Coverage ∆ File
3
96.36
1.36% timdex_dataset_api/embeddings.py
Jobs
ID Job ID Ran Files Coverage
1 20069786007.1 09 Dec 2025 03:59PM UTC 8
93.86
GitHub Action Run
Source Files on build 20069786007
  • Tree
  • List 8
  • Changed 1
  • Source Changed 1
  • Coverage Changed 1
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Repo
  • Github Actions Build #20069786007
  • Pull Request #179
  • PR Base - main (#19945147360)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc