Baker

High-performance modular pipelines for the Big Data era

Blazing-fast, open-source data processor

Baker pipelines fetch, transform and store records in a flash, thanks to a high-performance, fully parallel implementation.

Baker is open source and written in Go by NextRoll, Inc.

Modular pipeline

Baker pipelines are assembled using fully modular components, including many AWS services, websocket, TCP, local disk, etc.

Check out the full list »

High performance

Baker is fully parallel and maximizes resource usage for both CPU-bound and I/O-bound pipelines.

Performance examples »

Extensible

With Baker it is super easy to write your own components and process any kind of structured data.

How to do it »

Contributions welcome!

New components, improvements to the Baker core, or documentation are all welcome. Please open a pull request »

Found a bug?

If you think you have found a problem, create a new issue »

Questions?

Get in touch with the Baker team: join our Slack channel »