
MITLibraries / timdex-dataset-api / 15121174566
94%
main: 93%

Build:
LAST BUILD BRANCH: USE-306-handle-missing-metadata-or-embeddings
DEFAULT BRANCH: main
Ran 19 May 2025 07:09PM UTC
Jobs 1
Files 7
Run time 1min
19 May 2025 07:06PM UTC coverage: 94.136% (-1.5%) from 95.652%
15121174566

push

github

ghukill
Add TIMDEXRunManager and TIMDEXSource classes

Why these changes are being introduced:

Until now, this library has supported two known use cases:
efficiently writing the results of an ETL run to the dataset,
and efficiently reading all records from an ETL run for indexing
into OpenSearch. However, it was always expected that the library
would eventually support more nuanced access to the dataset, including a
representation of the current state of a source in TIMDEX.

How this addresses that need:

This commit introduces two new classes:
1. TIMDEXRunManager
2. TIMDEXSource

TIMDEXRunManager is a utility class that efficiently retrieves
metadata about all ETL runs in the dataset. Equally important, it is
capable of then returning specific parquet files from the dataset
that are associated with current records in TIMDEX for the source.

Building on this, TIMDEXSource aims to provide the more nuanced behavior
of yielding only the current versions of records for a source. This class
extends TIMDEXDataset, providing all the base class functionality
while adding the opinionated behavior of yielding only the "current"
version of a record.

Together, these classes provide efficient and normalized patterns
for yielding all current records for a source, selecting them
from the time-series history of runs stored in the dataset.
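
To illustrate the idea of "current" records in a time-series dataset, here
is a minimal sketch. It is not the library's implementation: the
RecordVersion class, its fields, and the current_records function are
hypothetical names, and it assumes that the current version of a record is
simply the one from the most recent run.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class RecordVersion:
    """Hypothetical stand-in for one row in the dataset (not the library's schema)."""

    record_id: str
    run_timestamp: str  # ISO 8601 string, so lexical order matches time order


def current_records(versions: Iterable[RecordVersion]) -> Iterator[RecordVersion]:
    """Yield one version per record_id: the one with the latest run_timestamp."""
    latest: dict[str, RecordVersion] = {}
    for version in versions:
        seen = latest.get(version.record_id)
        # Keep this version if it is the first seen or newer than the stored one.
        if seen is None or version.run_timestamp > seen.run_timestamp:
            latest[version.record_id] = version
    yield from latest.values()
```

A real implementation would likely narrow the set of parquet files first
(the role described for TIMDEXRunManager) rather than scan every row, but
the selection rule sketched here is the same in spirit.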

Side effects of this change:
* None at this time

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/TIMX-494

112 of 122 new or added lines in 4 files covered. (91.8%)

305 of 324 relevant lines covered (94.14%)

0.94 hits per line

New Missed Lines in Diff

Lines  Coverage  ∆  File
4      91.11        timdex_dataset_api/source.py
6      91.3         timdex_dataset_api/run.py

Jobs

ID  Job ID         Ran                      Files  Coverage
1   15121174566.1  19 May 2025 07:09PM UTC  7      94.14

Commit e7df7bcb on github