• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

pirl-unc / tcrsift / 25821338153 / 2
85%
main: 85%

Build:
DEFAULT BRANCH: main
Ran 13 May 2026 07:25PM UTC
Files 25
Run time 2s
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

13 May 2026 07:24PM UTC coverage: 71.339% (+0.1%) from 71.194%
25821338153.2

push

github

web-flow
Fix #45 bug 1: load_vdjdb resolves real release-zip directory correctly (#53)

* Fix #45 bug 1: load_vdjdb against the real release-zip directory

After `tcrsift data download --db vdjdb`, the cache directory expands
to ~18 files. The old loader did `path.glob("vdjdb*.txt")[0]`, which
picked `vdjdb.meta.txt` alphabetically — a metadata-about-fields file
with no CDR3 data — and downstream `match_clonotypes` then died with
KeyError: cdr3_alpha.

Two complementary changes:

1. `_pick_vdjdb_file` resolves the directory to a canonical file by
   priority: `vdjdb_full.txt` (paired αβ, the one we actually want)
   first, then `vdjdb.txt` (slim/long β-only), then a filtered glob
   that skips meta/slim/broken/motif/cluster/summary/scored sidecars.
   The fallback path also raises with a clear hint when the directory
   contains only sidecars, instead of silently picking one.

2. Column mapping fixes:
   - Add `cdr3.beta` → `cdr3_beta`. The full export uses paired-chain
     rows with `cdr3.alpha` + `cdr3.beta`; the slim file uses `cdr3`.
     The old mapping only handled `cdr3`, so the full-format file
     produced a dataframe with no `cdr3_beta` column at all.
   - Drop the donor `species` column before renaming
     `antigen.species` → `species`. The full export carries both
     fields (donor T-cell origin vs. epitope source organism); we
     use the latter, and dropping the former first guards against
     any pandas same-name collision.

Verified locally against ~/.cache/tcrsift/vdjdb/ (139,744 rows;
102K with α, 124K with β, 83K viral by antigen species).

Bumps to 0.8.12.

Closes #45 (bug 1; IEDB half was already closed by #51).

* Fix schema conflation between paired/long VDJdb formats

Code review (and a closer look at the real on-disk shapes) caught
that the original PR conflated VDJdb's two formats:

  - vdjdb_full.txt: 1 row per clonotype, paired columns
    `cdr3.alpha` / `cdr3.beta`.
  - vdjdb.txt:      1 row per **chain**, with `cdr3` holding... (continued)

4764 of 6678 relevant lines covered (71.34%)

0.71 hits per line

Source Files on job python-3.9 - 25821338153.2
  • Tree
  • List 25
  • Changed 1
  • Source Changed 0
  • Coverage Changed 1
Coverage ∆ File Lines Relevant Covered Missed Hits/Line
  • Back to Build 25821338153
  • 366dc71e on github
  • Prev Job for on main (#25818564171.4)
  • Next Job for on main (#25829560221.1)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc