• Home
  • Features
  • Pricing
  • Docs
  • Announcements
  • Sign In

ml6team / fondant / 7397728517
91%

Build:
DEFAULT BRANCH: main
Ran 03 Jan 2024 01:16PM UTC
Jobs 0
Files 0
Run time –
Badge
Embed ▾
README BADGES
x

If you need to use a raster PNG badge, change the '.svg' to '.png' in the link

Markdown

Textile

RDoc

HTML

Rst

pending completion
7397728517

push

github

web-flow
Propagate dataset schema eagerly (#748)

This PR propagates the dataset schema eagerly when an operation is
applied. This is useful for iterative development in a notebook, since
the schema of the current dataset can be inspected.

Result:

```
In [2]: from fondant.pipeline import Pipeline
In [3]: import pyarrow as pa
In [4]: pipeline = Pipeline(name="my-pipe", base_path="data")
In [5]: text_data = pipeline.read(
   ...:     "load_from_hf_hub",
   ...:     arguments={
   ...:         "dataset_name": "wikitext@~parquet",
   ...:         "n_rows_to_load": 1000,
   ...:     },
   ...:     produces={
   ...:         "text": pa.string(),
   ...:     },
   ...: )
In [6]: text_data.fields
Out[6]: {'text': Field({'name': 'text', 'type': Type(DataType(string)), 'location': '/my-pipe-20231227152908/load_from_hugging_face_hub'})}
In [7]: chunk_data = text_data.apply(
   ...:     "chunk_text",
   ...:     arguments={
   ...:         "chunk_size": 256,
   ...:         "chunk_overlap": 32,
   ...:     },
   ...: )
In [8]: chunk_data.fields
Out[8]: 
{'text': Field({'name': 'text', 'type': Type(DataType(string)), 'location': '/my-pipe-20231227152920/chunk_text'}),
 'original_document_id': Field({'name': 'original_document_id', 'type': Type(DataType(string)), 'location': '/my-pipe-20231227152920/chunk_text'})}
```
Source Files on build 7397728517
Detailed source file information is not available for this build.
  • Back to Repo
  • 665805d4 on github
  • Prev Build on main (#7397725109)
  • Next Build on main (#7447528201)
STATUS · Troubleshooting · Open an Issue · Sales · Support · CAREERS · ENTERPRISE · START FREE · SCHEDULE DEMO
ANNOUNCEMENTS · TWITTER · TOS & SLA · Supported CI Services · What's a CI service? · Automated Testing

© 2026 Coveralls, Inc