• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / transmogrifier-ab-diff / 11350173928
86%

Build:
DEFAULT BRANCH: main
Ran 15 Oct 2024 04:18PM UTC
Jobs 1
Files 10
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

15 Oct 2024 04:17PM UTC coverage: 98.457% (-0.06%) from 98.519%
11350173928

push

github

web-flow
Add 'collate_ab_transforms' command (#22)

* Add 'collate_ab_transforms' command

Why these changes are being introduced:
* The A/B diff workflow requires a function that can
combine the transformed records (stored in JSON files)
generated by A/B versions of transmogrifier into a
single data structure that can be used for analysis
downstream. It was decided for this function to
create a PyArrow dataset and create Parquet file(s)
that contain joined records from A/B transformed files.

How this addresses that need:
* Create function that reads transformed files and yields
  dictionaries describing the transformed records.
* Create generator that yields pyarrow.RecordBatch(es) of
  TIMDEX record dictionaries
* Write TIMDEX record dictionaries to a temporary Parquet
  dataset for joining.
* Create function that uses DuckDB to query the temporary
  Parquet dataset, joins A/B versions of records, and
  yields query result in pyarrow.RecordBatch(es).
* Write batches of joined records to a Parquet dataset
  ("collated") in the run directory.
* Create function that parses version and filename from transformed file.
* Add util for writing Parquet datasets.

Side effects of this change:
* None

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/TIMX-349

60 of 61 new or added lines in 3 files covered. (98.36%)

319 of 324 relevant lines covered (98.46%)

0.98 hits per line

Jobs
ID Job ID Ran Files Coverage
1 11350173928.1 15 Oct 2024 04:18PM UTC 0
98.46
GitHub Action Run
Source Files on build 11350173928
Detailed source file information is not available for this build.
  • Back to Repo
  • Github Actions Build #11350173928
  • 3c6c6fe5 on github
  • Prev Build on main (#10927280513)
  • Next Build on main (#11353124427)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc