dagster-io/dagster

Analysis updated 2026-06-24

★ 15,498PythonAudience · dataComplexity · 4/5Setup · moderate

Mindmap

mindmap
  root((dagster))
    Inputs
      Python asset definitions
      Schedules
      Sensors
    Outputs
      Materialized data assets
      Run logs
      Lineage graph
    Use Cases
      Replace cron-driven ETL
      Orchestrate dbt and Spark jobs
      Monitor data freshness
    Tech Stack
      Python
      GraphQL
      React

mindmap root((dagster)) Inputs Python asset definitions Schedules Sensors Outputs Materialized data assets Run logs Lineage graph Use Cases Replace cron-driven ETL Orchestrate dbt and Spark jobs Monitor data freshness Tech Stack Python GraphQL React

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Replace a tangled cron and Airflow setup with declarative Python data assets

USE CASE 2

Orchestrate a dbt project alongside Spark and Python transforms in one DAG

USE CASE 3

Add freshness checks and alerts when an upstream data asset goes stale

What is it built with?

PythonGraphQLReact

How does it compare?

	dagster-io/dagster	mindverse/second-me	fabric/fabric
Stars	15,498	15,532	15,430
Language	Python	Python	Python
Setup difficulty	moderate	moderate	easy
Complexity	4/5	3/5	3/5
Audience	data	general	ops devops

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 1h+

Local dev is pip-install easy, production needs a database, daemon, and a deployment target like K8s.

In plain English

Based on the description and topics, this appears to be an orchestration platform for data pipelines, a tool that helps teams schedule, run, monitor, and manage the flow of data through ETL (Extract, Transform, Load) processes and data engineering workflows. The topics indicate it targets analytics, data science, data integration, and data engineering use cases. The README does not provide further detail beyond a file path reference.

Copy-paste prompts

Prompt 1

Walk me through setting up a Dagster project locally with a single asset that loads a CSV into DuckDB

Prompt 2

Convert this Airflow DAG into Dagster software-defined assets with the same schedule

Prompt 3

Wire Dagster up to dbt so each model becomes a Dagster asset with lineage

Prompt 4

Show me how to deploy Dagster to a small Kubernetes cluster with the official Helm chart

Frequently asked questions

What is dagster?

Dagster is a Python orchestration platform for data pipelines. You define data assets as code and Dagster schedules, runs, and observes them across environments.

What language is dagster written in?

Mainly Python. The stack also includes Python, GraphQL, React.

How hard is dagster to set up?

Setup difficulty is rated moderate, with roughly 1h+ to a first successful run.

Who is dagster for?

Mainly data.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub dagster-io on gitmyhub

Verify against the repo before relying on details.