What is HPCC Systems?
HPCC Systems is a data engineering platform built to address big data challenges. It is designed for high-performance work on data lakes, large-scale dataset processing, and advanced analytics and machine learning workflows. Powered by a standards-based core and a high-productivity programming language (ECL), the platform offers efficient data processing, a secure architecture, and cloud-native deployment options.
Open source and trusted for over a decade, HPCC Systems supports containerized and bare-metal deployments, easy integration with third-party tools, and a rich set of features for analytics and machine learning. Extensive documentation, free training resources, and an active community make it suitable for organizations seeking a scalable, reliable, and cost-effective big data solution.
Features
- Kubernetes Architecture: Supports containerized, scalable deployments using Kubernetes.
- Ultra Performance: Enables high-speed and efficient data processing for large datasets.
- Productive Programming Language: Includes ECL, designed for concise and powerful data programming.
- Analytics and Machine Learning: Built-in libraries and support for analytics and machine learning workflows.
- Third Party Integrations: Seamlessly integrates with multiple languages and external data sources.
- Open Source: Completely open source with a large and active developer community.
- Reliable & Secure: Offers a proven, stable, and secure platform trusted in production environments.
- Multi-Language Support: Works with various programming languages and tools.
- Extensive Documentation: Comprehensive guides, wikis, and video resources for learning and troubleshooting.
Use Cases
- Building and managing enterprise-scale data lakes
- Processing and analyzing big data workloads in near real time
- Developing and deploying machine learning models
- Integrating heterogeneous data sources into a unified platform
- Implementing secure, scalable data processing in cloud environments
- Enhancing productivity of data engineering teams with high-level programming
FAQs
- What programming language does HPCC Systems use?
  HPCC Systems utilizes ECL (Enterprise Control Language), which is designed for high productivity and efficient data processing.
- Can HPCC Systems be deployed on the cloud?
  Yes, HPCC Systems supports cloud-native containerized deployments using platforms like Kubernetes, as well as traditional bare-metal installations.
- Is HPCC Systems suitable for real-time data processing?
  The platform is built for high-speed, parallel data processing, enabling near real-time analytics for large-scale datasets.
- Does HPCC Systems offer machine learning capabilities?
  Yes, it includes built-in libraries for analytics and machine learning, supporting advanced data science workflows.
- Is HPCC Systems really open source?
  HPCC Systems is fully open source and supported by an active community of developers and users.