• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / transmogrifier-ab-diff / 11820448005
87%
main: 86%

Build:
Build:
LAST BUILD BRANCH: IN-1240-pip-audit
DEFAULT BRANCH: main
Ran 13 Nov 2024 03:29PM UTC
Jobs 1
Files 17
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

13 Nov 2024 03:27PM UTC coverage: 86.702% (+0.03%) from 86.674%
11820448005

Pull #65

github

ghukill
Use parquet globbing vs registering datasets

Why these changes are being introduced:

For very large runs, the previous code attempted to register
a pyarrow dataset python object as a DuckDB table.  This works
for smaller tables, but caused an out of memory (OOM) error
as it called to_table() which brought the entire dataset into
memory.

How this addresses that need:
* Utilize parquet globbing, like we do elsewhere, to ensure
that a full input dataset is not brought into memory.

Side effects of this change:
* No OOM errors during final records dataset creation for
large runs.

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/TIMX-389
Pull Request #65: TIMX 388, 389 - handle missing A or B records and construction of final records dataset

9 of 11 new or added lines in 3 files covered. (81.82%)

828 of 955 relevant lines covered (86.7%)

0.87 hits per line

Jobs
ID Job ID Ran Files Coverage
1 11820448005.1 13 Nov 2024 03:29PM UTC 0
86.7
GitHub Action Run
Source Files on build 11820448005
Detailed source file information is not available for this build.
  • Back to Repo
  • Github Actions Build #11820448005
  • Pull Request #65
  • PR Base - main (#11819386123)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc