Discover open-source data engineering tools for building reliable data pipelines, ETL processes, and scalable data storage. Unlock the power of your data!
Data Engineering · Developer Tools
Open-source ELT platform with 600+ connectors for seamless data movement
Automation · Data Engineering · Developer Tools
Orchestrate complex data pipelines with code-driven workflows
Automation · Data Engineering · Devops
Orchestrate complex workflows on Kubernetes with containerized steps and DAGs
AI Development · Automation · Data Engineering
Run AI workloads at scale with zero infrastructure overhead
AI Assistants · Analytics · Data Engineering
Drag-and-drop BI for everyone — Tableau alternative with AI-powered analytics
Analytics · Data Engineering
Modern BI platform with drag-and-drop dashboards, SQL analytics, and role-based access control
Data Engineering · Databases · Developer Tools
Visually design database schemas and generate SQL without an account
Analytics · Data Engineering
Business Intelligence as Code: SQL + Markdown Dashboards
AI Development · Data Engineering · Search
AI-powered web data extraction for LLMs — crawl, scrape, map, and search the web with one API.
Data Engineering · Design Tools · Developer Tools
Beautiful isometric infrastructure diagrams, browser-based and offline-capable
Analytics · Data Engineering · Monitoring
Visualize metrics, logs, and traces from Prometheus, Loki, Elasticsearch, and more.
Automation · Data Engineering · Devops
Event-driven orchestration for data pipelines, microservices, and automation—built with YAML and Git.
AI Development · Data Engineering · Developer Tools
Label data for AI models with a flexible, multi-modal annotation platform
Data Engineering · Developer Tools · Devops
Self-hostable LLM observability and ML ops platform for teams building production AI apps
Analytics · Data Engineering
Easy, open-source BI for everyone in your company to explore data
Analytics · Data Engineering · Monitoring
X-Ray Vision for Your Infrastructure: Every Metric, Every Second.
Analytics · Data Engineering
Powerful, no-code data visualization and analytics for modern teams
Data Engineering · Databases · Search
AI Search Without Moving Data — Deploy in Minutes
Data Engineering · Search
SQL for stream processing — no JVM, zero dependencies, ClickHouse-powered.
Analytics · Data Engineering · Monitoring
Self-hosted, production-ready event tracking with real-time analytics and Segment API compatibility
Data Engineering · Databases · Search
Vector database with hybrid search, RAG, and production-grade scalability
Data engineering focuses on building and maintaining robust data pipelines that enable organizations to make data-driven decisions. The tools in this category help turn raw data into actionable insights, automate data workflows, and ensure data quality.
Typical features within this category include:
Data engineering solves critical problems such as siloed data, inefficient workflows, and a lack of reliable data for analytics. By streamlining how data is collected, transformed, and delivered, organizations can unlock business value faster, make more accurate decisions, and gain a competitive edge. Robust data pipelines are also foundational for machine learning initiatives, enabling teams to build and deploy predictive models with confidence.