
bruin
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
The Lens
Bruin is a command-line framework for building data pipelines end to end. You write transformations in SQL or Python, pull data in from many sources, and attach data-quality checks, all in one tool instead of stitching together an ingestion service, a transformation tool, and an orchestrator. It is licensed under Apache-2.0, free to use, and runs locally, on a server, or inside CI.
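To make the ingest, transform, and check flow concrete, here is a minimal sketch in plain Python with the standard-library `sqlite3` module. It is not Bruin's API or asset syntax: the table names (`raw_orders`, `orders`), the function names, and the two checks (not-null and unique on `id`) are all hypothetical, chosen only to illustrate what running ingestion, a SQL transformation, and quality checks in one place can look like.

```python
import sqlite3


def ingest(conn, rows):
    # Hypothetical ingestion step: load raw rows into a staging table.
    conn.execute("CREATE TABLE raw_orders (id INTEGER, amount REAL)")
    conn.executemany("INSERT INTO raw_orders VALUES (?, ?)", rows)


def transform(conn):
    # Hypothetical SQL transformation: keep only rows that have an amount.
    conn.execute(
        "CREATE TABLE orders AS "
        "SELECT id, amount FROM raw_orders WHERE amount IS NOT NULL"
    )


def check_quality(conn):
    """Return the names of failed checks (an empty list means all passed)."""
    failures = []
    nulls = conn.execute(
        "SELECT COUNT(*) FROM orders WHERE id IS NULL"
    ).fetchone()[0]
    if nulls:
        failures.append("id not_null")
    dupes = conn.execute(
        "SELECT COUNT(*) FROM "
        "(SELECT id FROM orders GROUP BY id HAVING COUNT(*) > 1)"
    ).fetchone()[0]
    if dupes:
        failures.append("id unique")
    return failures


def run_pipeline(rows):
    # Run all three stages against a throwaway in-memory database.
    conn = sqlite3.connect(":memory:")
    try:
        ingest(conn, rows)
        transform(conn)
        return check_quality(conn)
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_pipeline([(1, 10.0), (2, 5.5)]))
```

The sketch only shows the shape of the workflow. For Bruin's actual asset definitions and check syntax, refer to the project's own documentation.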
Because it is a CLI you own, you also run it. There is no managed cloud here: scheduling, infrastructure, and warehouse credentials are your responsibility. For teams comfortable with that, it is liberating: one tool, version-controlled, with no per-row billing. For teams that want a dashboard and a vendor to call, it is more hands-on than they may want. The operational burden is moderate.
This fits data and analytics engineers who would rather have one tool covering ingestion, transformation, and quality than assemble Fivetran plus dbt plus Airflow and pay for each. Solo and small teams: free and capable. Larger teams: still free to run, but weigh the operational load against managed options. If you want fully no-ops pipelines with a support contract, that is Fivetran and dbt Cloud, billed by usage and seats.
The catch is that you are the platform. Bruin gives you the pieces in one place; it does not give you a managed service running them. That is exactly the point if you want control and the wrong call if you want someone else holding the pager.
Free vs Self-Hosted vs Paid
Fully free
Free tier: The full Apache-2.0 CLI, free.
Self-hosted: Runs locally, on a server such as EC2, or inside CI like GitHub Actions. You own scheduling, infrastructure, and warehouse credentials.
Paid: No paid tier from the project. Fully managed pipelines with a support contract are Fivetran, dbt Cloud, or Airflow via Astronomer, billed by usage and seats.
Completely free and open source. You run the infrastructure yourself.
Similar Tools

Orchestration platform for data assets

Data transformation using software engineering practices

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
License: Apache License 2.0
Use freely. Patent grant included.
Commercial use: ✓ Yes
About
- Owner: Bruin Data (Organization)
- Stars: 1,621
- Forks: 82