What is Apache Atlas?

The Governance and Compliance tool

Apache Atlas is an open-source data governance and metadata management tool.

Apache Atlas integrates with popular big data tools such as Hadoop, Spark, Kafka, and Hive.

It allows data engineers to ingest, classify, discover, and govern data assets from various data sources.

Atlas supports lineage and has a simple UI for viewing the lineage of data as it moves through various processes.

Some of the capabilities of Apache Atlas are:

1. Data Classification

2. Search and Lineage

3. Centralized Metadata

4. Security and Policy Engine

It has many pre-defined types, and users can add new types based on their requirements.
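As a rough illustration of adding a new type, the sketch below builds the kind of JSON payload that can be posted to the Atlas v2 type-definition endpoint (`/api/atlas/v2/types/typedefs`). The type name `sales_report` and its attributes are hypothetical, and extending the built-in `DataSet` type is an assumption made for this example rather than a requirement.

```python
import json


def build_entity_typedef(name, attributes, super_type="DataSet"):
    """Return a typedefs payload describing one custom entity type.

    `attributes` maps attribute names to Atlas type names such as
    "string" or "long". Every attribute is declared optional and
    single-valued to keep the sketch small.
    """
    attribute_defs = [
        {
            "name": attr_name,
            "typeName": attr_type,
            "isOptional": True,
            "cardinality": "SINGLE",
            "isUnique": False,
            "isIndexable": False,
        }
        for attr_name, attr_type in attributes.items()
    ]
    return {
        "entityDefs": [
            {
                "name": name,
                "superTypes": [super_type],
                "attributeDefs": attribute_defs,
            }
        ]
    }


# Hypothetical custom type with two attributes.
payload = build_entity_typedef(
    "sales_report", {"department": "string", "row_count": "long"}
)
print(json.dumps(payload, indent=2))
```

The printed JSON is only the request body. Sending it to a running Atlas server with suitable credentials is left out so that the example runs without a cluster.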

It supports a SQL-like query language to search entities.
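To give a feel for the query language, here is a minimal sketch that builds a search URL for the Atlas v2 DSL search endpoint (`/api/atlas/v2/search/dsl`). The server address `http://localhost:21000` and the table name `customers` are assumptions for illustration. `hive_table` is one of the types Atlas defines for Hive metadata.

```python
from urllib.parse import urlencode

# Assumption: a local Atlas server on its default port.
ATLAS_URL = "http://localhost:21000"


def dsl_search_url(query, limit=10, offset=0, base_url=ATLAS_URL):
    """Build the URL for a DSL search; the query text is URL-encoded."""
    params = urlencode({"query": query, "limit": limit, "offset": offset})
    return f"{base_url}/api/atlas/v2/search/dsl?{params}"


# Find Hive tables named "customers" and return only two attributes.
query = 'hive_table where name = "customers" select name, owner'
url = dsl_search_url(query)
print(url)
```

The query reads much like SQL: a type name plays the role of the table, followed by optional `where` and `select` clauses.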

Apache Atlas also provides many REST APIs to access and update metadata, including lineage.
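The sketch below shows one way to read lineage over REST, using the v2 lineage endpoint (`/api/atlas/v2/lineage/{guid}`) with HTTP basic authentication. The server address, the `admin`/`admin` credentials, and the all-zero GUID are placeholders assumed for this example, so substitute values from your own deployment.

```python
import base64
import json
import urllib.request


def lineage_request(guid, direction="BOTH", depth=3,
                    base_url="http://localhost:21000",
                    user="admin", password="admin"):
    """Build (but do not send) a GET request for an entity's lineage."""
    url = (f"{base_url}/api/atlas/v2/lineage/{guid}"
           f"?direction={direction}&depth={depth}")
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    headers = {"Authorization": f"Basic {token}",
               "Accept": "application/json"}
    return urllib.request.Request(url, headers=headers)


def fetch_lineage(guid):
    """Send the request and decode the JSON body (needs a live server)."""
    with urllib.request.urlopen(lineage_request(guid)) as response:
        return json.load(response)


# Placeholder GUID; a real one comes from a search result or the Atlas UI.
request = lineage_request("00000000-0000-0000-0000-000000000000")
print(request.get_method(), request.full_url)
```

Calling `fetch_lineage` against a running server should return a JSON document describing the entities and relations around the given entity. Here `direction` can be `INPUT`, `OUTPUT`, or `BOTH`, and `depth` limits how many hops are followed.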