Apache Gobblin Description

A framework for distributed data integration that streamlines essential functions of Big Data integration, including data ingestion, replication, organization, and lifecycle management, is designed for both streaming and batch data environments. It operates as a standalone application on a single machine and can also function in an embedded mode. Additionally, it is capable of executing as a MapReduce application across various Hadoop versions and offers compatibility with Azkaban for initiating MapReduce jobs. In standalone cluster mode, it features primary and worker nodes, providing high availability and the flexibility to run on bare metal systems. Furthermore, it can function as an elastic cluster in the public cloud, maintaining high availability in this setup. Currently, Gobblin serves as a versatile framework for creating various data integration applications, such as ingestion and replication. Each application is usually set up as an independent job and managed through a scheduler like Azkaban, allowing for organized execution and management of data workflows. This adaptability makes Gobblin an appealing choice for organizations looking to enhance their data integration processes.

Integrations

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
Apache Software Foundation
Headquarters:
United States
Website:
gobblin.apache.org

Media

Apache Gobblin Screenshot 1
Recommended Products
Auth for GenAI | Auth0 Icon
Auth for GenAI | Auth0

Enable AI agents to securely access tools, workflows, and data with fine-grained control and just a few lines of code.

Easily implement secure login experiences for AI Agents - from interactive chatbots to background workers with Auth0. Auth for GenAI is now available in Developer Preview
Try free now

Product Details

Platforms
Windows
Linux
Types of Training
Training Docs
Customer Support
Online Support

Apache Gobblin Features and Options

Big Data Platform

Collaboration
Data Blends
Data Cleansing
Data Mining
Data Visualization
Data Warehousing
High Volume Processing
No-Code Sandbox
Predictive Analytics
Templates

Apache Gobblin Lists

Apache Gobblin User Reviews

Write a Review
  • Previous
  • Next