Ran
|
Jobs
1
|
Files
403
|
Run time
12s
|
Badge
Embed ▾
README BADGES
|
push
github
feat: make probabilistic optimizations optional and tunable in the YAML config (#1912) Probabilistic optimization sacrifices accuracy in order to reduce memory consumption. In certain parts of the pipeline, a Bloom Filter is used ([set_processor](https://github.com/getdozer/dozer/blob/<a class=hub.com/getdozer/dozer/commit/<a class="double-link" href="https://git"><a class=hub.com/getdozer/dozer/commit/2e3ba96c3f4bdf9a691747191ab15617564d8ca2">2e3ba96c3/dozer-sql/src/pipeline/product/set/set_processor.rs#L20)), while in other parts, hash tables that store only the hash of the keys instead of the full keys are used ([aggregation_processor](https://github.com/getdozer/dozer/blob/2e3ba96c3f4bdf9a691747191ab15617564d8ca2/dozer-sql/src/pipeline/aggregation/processor.rs#L59) and [join_processor](https://github.com/getdozer/dozer/blob/2e3ba96c3f4bdf9a691747191ab15617564d8ca2/dozer-sql/src/pipeline/product/join/operator.rs#L57-L58)). This commit makes these optimizations disabled by default and offers user-configurable flags to enable each of these optimizations separately. This is an example of how to turn on probabilistic optimizations for each processor in the Dozer configuration. ``` flags: enable_probabilistic_optimizations: in_sets: true # enable probabilistic optimizations in set operations (UNION, EXCEPT, INTERSECT); Default: false in_joins: true # enable probabilistic optimizations in JOIN operations; Default: false in_aggregations: true # enable probabilistic optimizations in aggregations (SUM, COUNT, MIN, etc.); Default: false ```
347 of 347 new or added lines in 25 files covered. (100.0%)
47165 of 61858 relevant lines covered (76.25%)
48442.96 hits per line
Lines | Coverage | ∆ | File |
---|---|---|---|
1 |
29.69 |
0.0% | dozer-cli/src/lib.rs |
1 |
71.38 |
13.45% | dozer-cli/src/simple/orchestrator.rs |
1 |
97.69 |
2.2% | dozer-core/src/app.rs |
1 |
62.07 |
0.0% | dozer-sql/src/pipeline/product/set/operator.rs |
1 |
35.2 |
-0.28% | dozer-sql/src/pipeline/product/set/set_processor.rs |
2 |
98.6 |
-0.46% | dozer-sql/src/pipeline/aggregation/processor.rs |
2 |
97.59 |
dozer-sql/src/pipeline/product/set/record_map.rs | |
7 |
0.0 |
0.0% | dozer-cli/src/live/state.rs |
8 |
41.61 |
-1.44% | dozer-cli/src/ui_helper.rs |
10 |
82.17 |
0.17% | dozer-sql/src/pipeline/product/join/operator.rs |
Lines | Coverage | ∆ | File |
---|---|---|---|
1 |
86.16 |
-0.63% | dozer-cache/src/cache/lmdb/indexing.rs |
ID | Job ID | Ran | Files | Coverage | |
---|---|---|---|---|---|
1 | 5972853941.1 | 403 |
76.25 |
GitHub Action Run |
Coverage | ∆ | File | Lines | Relevant | Covered | Missed | Hits/Line |
---|