• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

MITLibraries / timdex-pipeline-lambdas / 21599578295
92%
main: 92%

Build:
Build:
LAST BUILD BRANCH: v2.10
DEFAULT BRANCH: main
Ran 02 Feb 2026 05:04PM UTC
Jobs 1
Files 6
Run time 1min
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

02 Feb 2026 05:00PM UTC coverage: 91.971% (-4.2%) from 96.164%
21599578295

Pull #334

github

ghukill
Add embeddings-create and embeddings-load steps to pipeline

Why these changes are being introduced:

Now that we have AWS Batch pipelines that can generate embeddings, and TIM is prepared
to load them into Opensearch, we need the pipeline lambda to prepare commands for both
running the AWS Batch job to create embeddings, and then the TIM command to load the
embeddings.

This dovetails with work in https://mitlibraries.atlassian.net/browse/USE-215 which
has proposed updates to the StepFunction.   There are two new pipeline lambda
invocations in the StepFunction that will utilize the two new allowed 'next-step'
values introduced in this commit.

How this addresses that need:
* Add "embeddings-create" and "embeddings-load" as valid steps in config
* Add SKIP_EMBEDDINGS_SOURCES config for sources that don't need embeddings (alma, gisogm)
* Add generate_embeddings_create_command() which determines compute env (cpu vs gpu-spot)
  based on record count threshold
* Add generate_embeddings_load_command() for TIM bulk-update-embeddings command
* Add handlers for both new steps in format_input.py
* Update handle_load() to flow into embeddings-create instead of end
* Add run_id and embeddings fields to ResultPayload
* Add unit tests for new functionality

Side effects of this change:
* Pipeline will now continue to embeddings steps after load completes

Relevant ticket(s):
* https://mitlibraries.atlassian.net/browse/USE-140
Pull Request #334: USE 140 - embeddings create and index flow control

31 of 50 new or added lines in 3 files covered. (62.0%)

378 of 411 relevant lines covered (91.97%)

0.92 hits per line

New Missed Lines in Diff

Lines Coverage ∆ File
19
86.88
-11.49% lambdas/format_input.py
Jobs
ID Job ID Ran Files Coverage
1 21599578295.1 02 Feb 2026 05:04PM UTC 6
91.97
GitHub Action Run
Source Files on build 21599578295
  • Tree
  • List 6
  • Changed 3
  • Source Changed 3
  • Coverage Changed 3
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Repo
  • Github Actions Build #21599578295
  • Pull Request #334
  • PR Base - main (#21413873513)
  • Delete
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc