Discover open source data engineering tools for building reliable data pipelines, ETL processes, and scalable data storage. Unlock the power of your data!
Data Engineering · Developer Tools
Open-source ELT platform for 600+ data connectors and AI agent infrastructure
Data Engineering · Automation · Developer Tools
Programmatically author, schedule, and monitor data workflows in Python
Data Engineering · Devops · Automation
Container-native workflow engine for Kubernetes DAGs and parallel jobs
Data Engineering · Automation · AI Development
Serverless GPU inference and AI workloads with zero infrastructure overhead
Analytics · Data Engineering · AI Assistants
Open-source BI tool: drag-and-drop analysis and no-code big-screen data dashboards
Analytics · Data Engineering
Open-source BI platform with SQL, dashboards, and Yandex Maps integration
Databases · Data Engineering · Developer Tools
Free browser-based ERD editor with SQL generation and migration support
Analytics · Data Engineering
Business Intelligence as Code: Build dashboards with SQL and Markdown
Data Engineering · Search · AI Development
Power AI agents with clean, real-time web data
Developer Tools · Design Tools · Data Engineering
Create beautiful isometric infrastructure diagrams in your browser
Monitoring · Analytics · Data Engineering
Open-source observability platform for metrics, logs, and traces
Devops · Automation · Data Engineering
Event-Driven Orchestration for Data, AI & Infrastructure Workflows
AI Development · Developer Tools · Data Engineering
Open source data labeling platform for images, text, audio, video, and time series
Devops · Developer Tools · Monitoring
Open source LLM observability, prompt management & evaluation platform
Analytics · Data Engineering
Open source BI with AI-powered queries and embedded analytics
Security · Monitoring · Analytics
AI-powered, zero-config observability with real-time per-second metrics
Analytics · Data Engineering
Open-source BI platform with no-code viz builder and SQL editor
Search · Databases · Data Engineering
AI Search & RAG Without Moving Your Data
Data Engineering · Search
Fastest SQL ETL pipeline in a single C++ binary for real-time analytics and streaming
Analytics · Data Engineering · Monitoring
Open-Source Analytics Infrastructure Built on Kafka & ClickHouse
Search · Databases · Data Engineering
Open-source vector database for semantic, hybrid, and image search at scale
Data engineering focuses on building and maintaining robust data pipelines that enable organizations to make data-driven decisions. These tools are essential for turning raw data into actionable insights, automating data workflows, and ensuring data quality.
Typical features within this category include:
Data engineering addresses problems such as siloed data, inefficient workflows, and a lack of reliable data for analytics. By streamlining how data is collected, transformed, and delivered, organizations can realize business value sooner and make more accurate decisions. Robust data pipelines are also foundational for machine learning initiatives, since teams need consistent, trustworthy data to build and deploy predictive models.