Top AI Tools for

📊 Data Engineer (172 AI tools)

  • Keebo
    Keebo The data team’s best friend

    Unlock seamless data warehouse and analytics optimizations for superior performance and cost savings with Keebo's fully automated solutions.

    • Contact for Pricing
  • Weld
    Weld AI-powered ETL platform for seamless data synchronization

    Weld is an AI-powered ETL platform that enables businesses to synchronize data across multiple applications, files, and databases with automated data transformation capabilities.

    • Freemium
    • From 79$
  • Securiti
    Securiti Enabling Safe Use of Data & AI

    Securiti provides a unified Data Command Center to enable safe use of data and AI, offering intelligence, controls, and orchestration across hybrid multicloud environments.

    • Contact for Pricing
  • Cloudera
    Cloudera All your data. One platform. Limitless possibilities.

    Cloudera is an enterprise-grade hybrid data platform that enables organizations to manage, analyze, and deploy AI solutions across any cloud or data center. It provides comprehensive data management and analytics capabilities built on Apache Iceberg.

    • Contact for Pricing
  • DatabaseSample
    DatabaseSample Open Source Database Designs for Collaborative Development

    A comprehensive database design platform offering sample schemas, sandbox environment, and tools for creating, testing, and exporting database structures.

    • Free
  • Kater
    Kater Data Storytelling, Reimagined

    Kater automates answers to data questions, defines next steps, and generates personalized weekly reports with actionable insights, empowering data-driven decision-making.

    • Free Trial
  • MLJAR Studio
    MLJAR Studio Desktop app for Data Science with interactive code recipes and AI assistant

    MLJAR Studio is a comprehensive desktop application designed for data scientists, featuring a Python editor with interactive code recipes, AI assistance, and one-click package installation capabilities.

    • Freemium
    • From 20$
  • Definite
    Definite Better decisions. Faster.

    Definite is a modern data platform that empowers teams to collect, store, and analyze data with AI, streamlining ETL, BI, and data warehousing.

    • Freemium
    • From 1000$
  • Buster
    Buster Open-source, AI-powered Data Workers

    Buster provides a team of AI data analysts and engineers to surface insights and improve your data stack. It's an open-source solution for data and analytics.

    • Usage Based
    • From 599$
  • MOSTLY AI
    MOSTLY AI Data Access and Data Insights for Everyone

    MOSTLY AI is an enterprise-grade synthetic data generation platform that leverages GenAI to create privacy-safe, high-quality synthetic datasets for data sharing, AI/ML development, and analytics.

    • Usage Based
    • From 3$
  • Fleak
    Fleak Real-time AI & Enrichment workflows

    Fleak is a low-code, serverless platform that enables data teams to build and deploy scalable APIs for AI workflows, offering seamless integration with existing AI and data stacks without infrastructure management.

    • Freemium
    • From 29$
  • Lilac
    Lilac Better data, better AI - Search, quantify and edit data for LLMs

    Lilac is a powerful data platform that enables efficient dataset exploration, quality control, and management for Large Language Models (LLMs). It offers fast dataset computations and advanced clustering capabilities for AI data processing.

    • Contact for Pricing
  • Ploomber
    Ploomber Enterprise features for your data apps with zero complexity

    Ploomber is a deployment platform that enables easy deployment of data applications with enterprise-grade features including authentication, custom domains, and auto-scaling capabilities.

    • Freemium
    • From 20$
  • JSON Data AI
    JSON Data AI Create AI-generated API endpoints from natural language prompts

    JSON Data AI is a tool that converts natural language prompts into API endpoints, allowing users to generate and fetch structured JSON data about any topic.

    • Freemium
  • PandaAI
    PandaAI Data analysis, effortless

    PandaAI offers a platform for data analysis with predictable pricing for all data teams. It provides a scalable solution from free exploration to enterprise-level cloud deployment.

    • Freemium
  • Secoda
    Secoda Reimagine data governance

    Secoda is an all-in-one, AI-powered data catalog, observability, lineage, access, and governance platform built for modern data teams. It helps discover, manage, and act on trusted data.

    • Freemium
  • Apache Samza
    Apache Samza A distributed stream processing framework

    Apache Samza is a distributed stream processing framework that allows you to build stateful applications for real-time data processing from multiple sources.

    • Free
  • Flexor
    Flexor Transform Unstructured Data Into Valuable Insights

    Flexor is a SQL-first platform that transforms textual data into structured, LLM-ready formats. It simplifies unstructured data preparation, ensuring accuracy, scalability, and governance.

    • Contact for Pricing
  • PublicAI
    PublicAI Web3 AI Data Infrastructure Powering Exceptional AI with Equitable Global Expertise

    PublicAI is a decentralized AI data infrastructure platform that enables global contributors to participate in AI training data creation and annotation while sharing revenue. It offers multi-modal data collection, labeling, and model evaluation services.

    • Freemium
  • airy.co
    airy.co AI Assistants & Copilots. Powered by real-time data.

    Airy is an open-source framework for building AI assistants on streaming data. It unifies data streaming, processing, and AI to deliver better context and accelerate AI development.

    • Freemium
    • From 49$
  • SingleAPI
    SingleAPI Convert the Internet into your own API in seconds

    SingleAPI is a GPT-4 powered solution that automatically transforms any website into a structured API, enabling seamless data extraction and enrichment without manual coding or selectors.

    • Freemium
    • From 75$
  • ActiveBatch
    ActiveBatch Centralized Workload Automation & Job Scheduling

    ActiveBatch is a workload automation and job scheduling platform that orchestrates your entire tech stack with no-code connectors and a low-code REST API adapter.

    • Contact for Pricing
  • Substratus
    Substratus End-to-End AI Solutions With Privacy at the Core

    Substratus provides enterprise-grade AI infrastructure solutions with a focus on privacy, security, and control, enabling organizations to run AI models on their own infrastructure.

    • Contact for Pricing
  • Hyperbrowser
    Hyperbrowser Browser Infrastructure for your AI Apps and Agents

    Hyperbrowser is a cloud platform that provides scalable headless browser infrastructure for web automation and AI-driven applications, offering secure containerized environments with sub-millisecond latency and high concurrent capacity.

    • Freemium
    • From 30$
  • PVML
    PVML Lead AI Innovation Without Data Privacy Risks

    PVML offers a privacy-first data infrastructure for building secure and scalable AI. It enables safe PII exposure to AI, ensures compliance, and prevents vendor lock-in, maximizing AI-driven innovation.

    • Contact for Pricing
  • Stitch
    Stitch Get to insights faster with fully automated cloud data pipelines

    Stitch provides fully automated cloud data pipelines, enabling businesses to rapidly move data from over 140 sources to a data warehouse without coding. It helps centralize siloed data for analysis and reduces ongoing maintenance.

    • Free Trial
    • From 100$
  • Posit
    Posit Empowering Data Scientists with Open-Source Tools

    Posit provides open-source and enterprise-ready professional software for data science, scientific research, and technical communication. It empowers data scientists with tools for centralized management, security, and collaboration.

    • Contact for Pricing
  • Metaplane
    Metaplane Trust your data platform

    Metaplane is a data observability platform that helps data teams monitor data quality, lineage, and spend, ensuring data reliability and trust.

    • Freemium
  • Bytewax
    Bytewax Python-Native Stream Processing

    Bytewax is a complete data processing solution offering a Python-native, stateful stream processor for building and deploying real-time data pipelines.

    • Free
  • Hightouch
    Hightouch Data and AI Platform for Personalization and Targeting

    Hightouch is a data and AI platform that enables businesses to personalize customer experiences and optimize marketing campaigns by leveraging their existing data warehouse.

    • Freemium
    • From 350$
  • Isima bi(OS)
    Isima bi(OS) The Real-time data+AI Cloud - The Fastest path from Data to data+AI Apps

    Isima bi(OS) is a real-time data+AI cloud platform that accelerates data-driven applications through easy development, lean architecture, and fast responsiveness, delivering outcomes in hours to weeks.

    • Freemium
  • Ocient Hyperscale Data Warehouse
    Ocient Hyperscale Data Warehouse Real-time analysis of complex, hyperscale datasets with 90% reduced energy consumption

    Ocient is a hyperscale data warehouse platform that delivers real-time analytics and OLAP workloads with integrated machine learning capabilities, designed for maximum performance while reducing costs and energy consumption.

    • Contact for Pricing
  • Baseplate
    Baseplate Connect Your Data to LLM Apps

    Baseplate is a comprehensive platform that enables teams to build AI applications with seamless data integration, embedding, and retrieval capabilities. It offers unified hybrid database management and multimodal LLM response functionality.

    • Contact for Pricing
  • Datavise
    Datavise Empower your business with tailored AI solutions and data analytics services designed to drive innovation, efficiency, and sustainable growth.

    Datavise provides tailored AI solutions and data analytics services, specializing in generative AI, data & BI consulting, cloud services, and AI/ML development to drive business growth and innovation.

    • Contact for Pricing
  • Weaviate
    Weaviate The AI-native database for a new generation of software

    Weaviate is an open-source vector database that enables developers to build AI-native applications with improved search capabilities, reduced hallucination, and enhanced data security. It supports hybrid search, RAG, and generative feedback loops.

    • Freemium
    • From 25$
  • Practicus AI
    Practicus AI The Unified Platform for Generative AI and Data Intelligence

    Practicus AI is a comprehensive platform for building and deploying generative AI models and data intelligence solutions. It offers a unified environment for data science, analytics, and observability, with deployment options across cloud, on-premises, and air-gapped networks.

    • Freemium
  • Wherobots
    Wherobots The Spatial Intelligence Cloud for Planetary-Scale Analytics

    Wherobots is a comprehensive spatial data platform that combines ETL, analytics, and AI capabilities for processing geospatial data at scale, created by the original developers of Apache Sedona.

    • Freemium
  • Hex
    Hex Bring everyone together with data

    Hex is an AI-powered collaborative workspace designed to streamline data workflows by integrating queries, scripts, and interactive reporting.

    • Freemium
    • From 36$
    • API
  • Chadview
    Chadview Real-time ChatGPT-powered meetings assistant for job interviews

    Chadview is a browser extension that provides real-time AI assistance during job interviews on Zoom, Google Meet, and Microsoft Teams by listening to conversations and generating instant answers to technical questions.

    • Freemium
    • From 20$
  • Devin
    Devin A collaborative AI teammate built to help ambitious engineering teams achieve more

    Devin is an AI-powered software engineering assistant that can handle code migrations, data engineering, bug fixes, and development tasks with proven efficiency gains of 8-12x and cost savings of up to 20x.

    • Paid
    • From 500$
  • Thunder Compute
    Thunder Compute Never pay for idle GPUs - Deploy AI models in under 60 seconds

    Thunder Compute is a cloud GPU platform that provides network-attached GPU virtualization, allowing developers to efficiently run AI and ML models without paying for idle resources.

    • Usage Based
  • Modal
    Modal Serverless Cloud for AI, ML, and Data Applications

    Modal provides high-performance, serverless cloud infrastructure optimized for AI, ML, and data applications. It offers rapid container starts, seamless autoscaling, and flexible environments for developers.

    • Usage Based
  • Hopsworks
    Hopsworks The AI Lakehouse for Your Data

    Hopsworks is an MLOps platform and feature store that enables organizations to build, deploy, and manage AI systems with reproducibility, consistency, and scalability. It offers a unified solution for GenAI, real-time applications, and traditional machine learning.

    • Freemium
  • Isomeric
    Isomeric Transform messy, unstructured text into machine readable JSON

    Isomeric is an AI-powered data extraction platform that converts unstructured text into structured JSON format, enabling efficient data gathering from websites, documents, and various text sources.

    • Paid
    • From 149$
  • Dataloop
    Dataloop AI Development Platform

    Dataloop is an AI development platform for building unstructured data pipelines and developing AI solutions with speed. It offers tools for data management, model deployment, pipeline orchestration, and human feedback integration.

    • Contact for Pricing
  • Apache Spark
    Apache Spark Unified Engine for Large-Scale Data Analytics

    Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

    • Free
  • jsonAI
    jsonAI Instantly Transform Data into Perfect JSON. Your Schema, Our API.

    jsonAI is a powerful tool that converts various data formats into structured JSON using AI, offering customizable schemas and dedicated API endpoints for seamless integration.

    • Freemium
    • From 3$
  • Dagster
    Dagster The Modern Data Orchestrator for Building Data Platforms

    Dagster is a data orchestrator designed for data engineers to build and manage data platforms. It enables rapid development, testing, and confident deployment of data pipelines.

    • Paid
    • From 10$
  • searchable.ai
    searchable.ai A Unified Data Platform for Federated Search and AI Applications

    Searchable.ai provides a unified data platform that connects to leading SaaS platforms, parses and normalizes data, and powers federated search and AI applications.

    • Contact for Pricing
  • Einblick
    Einblick Solve any data problem with just one sentence

    Einblick is an AI-native data notebook that enables seamless data science workflows through a visual canvas, code automation, and collaboration features.

    • Freemium
    • From 9$
    • API
Showing results 1 – 50 out of 172
    Elite AI Tools

    EliteAi.tools is the premier AI tools directory, exclusively featuring high-quality, useful, and thoroughly tested tools. Discover the perfect AI tool for your task using our AI-powered search engine.

    © 2025 EliteAi.tools. All Rights Reserved.