• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / geo-harvester / 8438177369
98%
main: 100%

Build:
Build:
LAST BUILD BRANCH: IN-1246-pip-audit
DEFAULT BRANCH: main
Ran 26 Mar 2024 03:01PM UTC
Jobs 1
Files 29
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

26 Mar 2024 02:56PM UTC coverage: 98.369% (+0.01%) from 98.358%
8438177369

push

github

ghukill
Post normalize data quality hooks

Why these changes are being introduced:

It was discovered that some TIMDEX records had empty strings for 'subjects' field that originated
from this harvester.  While it could be addressed whack-a-mole style at the individual metadata format
normalization logic, a more holistic approach is performing some data cleanup after normalization
logic has taken place, removing values that should never end up in the final MITAardvark record.

How this addresses that need:
* adds post normalization data cleanup method _remove_none_and_blank_strings()
* adds post normalization data cleanup method _dedupe_list_fields()
* both methods are run for all field and values, for all source record normalizations

Side effects of this change:
* None values and empty strings removed from final MITAardvark record

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/GDT-241

20 of 20 new or added lines in 1 file covered. (100.0%)

1870 of 1901 relevant lines covered (98.37%)

0.98 hits per line

Jobs
ID Job ID Ran Files Coverage
1 8438177369.1 26 Mar 2024 03:01PM UTC 0
98.37
GitHub Action Run
Source Files on build 8438177369
Detailed source file information is not available for this build.
  • Back to Repo
  • 035c4255 on github
  • Prev Build on main (#8378874038)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc