• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / transmogrifier-ab-diff / 11181571879
98%
main: 86%

Build:
Build:
LAST BUILD BRANCH: IN-1240-pip-audit
DEFAULT BRANCH: main
Ran 04 Oct 2024 02:16PM UTC
Jobs 1
Files 11
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

04 Oct 2024 01:51PM UTC coverage: 97.885% (-0.6%) from 98.519%
11181571879

push

github

jonavellecuerdo
Add 'collate_ab_transforms' command

Why these changes are being introduced:
* The A/B diff workflow requires a function that can
combine the transformed records (stored in JSON files)
generated by A/B versions of transmogrifier into a
single data structure that can be used for analysis
downstream. It was decided for this function to
create a PyArrow dataset and create Parquet file(s)
that contain joined records from A/B transformed files.

How this addresses that need:
* Create function that reads transformed files and yields
  dictionaries describing the transformed records.
* Create generator that yields pyarrow.RecordBatch(es) of
  TIMDEX record dictionaries
* Write TIMDEX record dictionaries to a temporary Parquet
  dataset for joining.
* Create function that uses DuckDB to query the temporary
  Parquet dataset, joins A/B versions of records, and
  yields query result in pyarrow.RecordBatch(es).
* Write batches of joined records to a Parquet dataset
  ("collated") in the run directory.
* Create function that parses version and filename from transformed file.
* Add util for writing Parquet datasets.

Side effects of this change:
* None

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/TIMX-349

58 of 61 new or added lines in 2 files covered. (95.08%)

324 of 331 relevant lines covered (97.89%)

0.98 hits per line

Jobs
ID Job ID Ran Files Coverage
1 11181571879.1 04 Oct 2024 02:16PM UTC 0
97.89
GitHub Action Run
Source Files on build 11181571879
Detailed source file information is not available for this build.
  • Back to Repo
  • Github Actions Build #11181571879
  • 9747f6a0 on github
  • Prev Build on main (#10927280513)
  • Next Build on TIMX-349-collate-ab-transforms (#11181642972)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc