• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / browsertrix-harvester
100%
main: 100%

Build:
Build:
LAST BUILD BRANCH: IN-1524-maintenance
DEFAULT BRANCH: main
Repo Added 22 Sep 2023 01:36PM UTC
Files 8
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

LAST BUILD ON BRANCH align-btrix-v0.12.0-release
branch: align-btrix-v0.12.0-release
CHANGE BRANCH
x
Reset
  • align-btrix-v0.12.0-release
  • IN-1240-pip-audit
  • IN-1524-maintenance
  • TIMX-247-initial-build
  • TIMX-542-output-jsonlines
  • TIMX-557-and-misc
  • TIMX-562-incremental-runs
  • USE-86-align-with-etl-step-function
  • USE-93-sitemap-arg-streamline-and-flexibility
  • USE-93-sitemap-parsing-and-url-file
  • USE-97-generate-delete-records
  • dependabot/pip/black-23.11.0
  • dependabot/pip/black-23.12.0
  • dependabot/pip/black-23.12.1
  • dependabot/pip/mypy-1.6.1
  • dependabot/pip/mypy-1.7.0
  • dependabot/pip/mypy-1.7.1
  • dependabot/pip/mypy-1.8.0
  • dependabot/pip/pandas-2.1.3
  • dependabot/pip/pandas-2.1.4
  • dependabot/pip/pre-commit-3.5.0
  • dependabot/pip/ruff-0.1.0
  • dependabot/pip/ruff-0.1.1
  • dependabot/pip/ruff-0.1.11
  • dependabot/pip/ruff-0.1.3
  • dependabot/pip/ruff-0.1.4
  • dependabot/pip/ruff-0.1.5
  • dependabot/pip/ruff-0.1.6
  • dependabot/pip/ruff-0.1.7
  • dependabot/pip/ruff-0.1.8
  • dependabot/pip/ruff-0.1.9
  • dependabot/pip/safety-2.3.5
  • dependabot/pip/urllib3-1.26.18
  • in-1500-new-workflows
  • main
  • maintenance
  • maintenance-09-2024
  • pr4-ci-aws
  • refs/tags/v1.0.0
  • refs/tags/v1.1.0
  • v1.1.1
  • v1.2

03 Nov 2023 01:43PM UTC coverage: 100.0%. Remained the same
6746009289

push

github

ghukill
Align btrix CLI arguments for v0.12.0 release

Why these changes are being introduced:

Browsertrix had a [minor release v0.12.0](https://github.com/webrecorder/browsertrix-crawler/releases/tag/v0.12.0) that changed the
optional fulltext CLI argument --text, now requiring where the fulltext should be stored (where previously defaulted to pages JSON
files).

In time, we may want to leverage storing fulltext in WARC which is now possible from this change, but okay to continue
looking for fulltext in pages JSON files.

How this addresses that need:

Updates the example config YAML from 'text: true' to 'text: to-pages'

Side effects of this change:

None at this time.

Relevant ticket(s):

None

416 of 416 relevant lines covered (100.0%)

1.0 hits per line

Relevant lines Covered
Build:
Build:
416 RELEVANT LINES 416 COVERED LINES
1.0 HITS PER LINE
Source Files on align-btrix-v0.12.0-release
  • Tree
  • List 8
  • Changed 0
  • Source Changed 0
  • Coverage Changed 0
Coverage ∆ File Lines Relevant Covered Missed Hits/Line

Recent builds

Builds Branch Commit Type Ran Committer Via Coverage
6746009289 align-btrix-v0.12.0-release Align btrix CLI arguments for v0.12.0 release Why these changes are being introduced: Browsertrix had a [minor release v0.12.0](https://github.com/webrecorder/browsertrix-crawler/releases/tag/v0.12.0) that changed the optional fulltext CLI argum... push 03 Nov 2023 01:48PM UTC ghukill github
100.0
See All Builds (68)
  • Repo on GitHub
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2025 Coveralls, Inc