• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

berserkhmdvhb / agentic-scraper / 17369246166
87%
main: 88%

Build:
Build:
LAST BUILD BRANCH: docs/modules-funcs
DEFAULT BRANCH: main
Ran 01 Sep 2025 06:15AM UTC
Jobs 4
Files 69
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

01 Sep 2025 06:13AM UTC coverage: 62.343% (+3.2%) from 59.159%
17369246166

push

github

berserkhmdvhb
refactor(rule_based): improve robustness of rule-based agent and add stricter validation

- Enhanced `guess_price`:
  - Added fallback regexes for formats like `19,99 €` and `€ 19.99`.
  - Implemented candidate scoring to prefer decimal-containing matches.
  - Added synthesis logic for separated integer/fractional groups.
  - Appended fractional part from trailing text when missing.
  - Cleaned extracted strings to normalize separators (comma vs dot).

- Improved `guess_description`:
  - Added `_strip_trailing_price_lines` helper to drop price-only lines from candidate paragraphs.
  - Ensured returned description excludes misleading price artifacts.

- Updated `extract_structured_data`:
  - Now returns `None` early if no informative fields are extracted (title, description, price).
  - Added warning log `MSG_WARN_RULE_BASED_NO_FIELDS` when this occurs.
  - Preserved screenshot capture and structured data logging.

- Introduced new message constants:
  - `MSG_WARN_RULE_BASED_NO_FIELDS = "Rule-based extraction produced no informative fields for {url}"`

- Adjusted tests:
  - Updated validation-failure test to expect early `None` instead of relying solely on schema exceptions.
  - Ensured log capture verifies warning emission.

57 of 67 new or added lines in 2 files covered. (85.07%)

16 existing lines in 1 file now uncovered.

2879 of 4618 relevant lines covered (62.34%)

4.99 hits per line

New Missed Lines in Diff

Lines Coverage ∆ File
10
85.44
56.59% src/agentic_scraper/backend/scraper/agents/rule_based.py

Uncovered Existing Lines

Lines Coverage ∆ File
16
84.68
63.68% src/agentic_scraper/backend/scraper/agents/llm_dynamic_adaptive.py
Jobs
ID Job ID Ran Files Coverage
1 17369246166.1 01 Sep 2025 06:15AM UTC 138
62.34
GitHub Action Run
2 17369246166.2 01 Sep 2025 06:15AM UTC 138
62.34
GitHub Action Run
3 17369246166.3 01 Sep 2025 06:15AM UTC 138
62.34
GitHub Action Run
4 17369246166.4 01 Sep 2025 06:15AM UTC 138
62.34
GitHub Action Run
Source Files on build 17369246166
  • Tree
  • List 69
  • Changed 5
  • Source Changed 3
  • Coverage Changed 5
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Repo
  • Github Actions Build #17369246166
  • 16d0ceff on github
  • Prev Build on infra/unit-tests (#17353807333)
  • Next Build on infra/unit-tests (#17373082737)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc