Description
This course helps customers use Cloudera Data Platform to address data governance tasks, motivated by the need for compliance with regulations such as the European Union's General Data Protection Regulation (GDPR) and the United States' Health Insurance Portability and Accountability Act (HIPAA).
PUE, a Cloudera Strategic Partner, is authorized by Cloudera to provide official training in Cloudera technologies.
PUE is also accredited and recognized to provide consulting and mentoring services for implementing Cloudera solutions in business settings, and it brings that practical, business-oriented approach to its official courses.
Audience and prerequisites
This course is best suited for data stewards and others who are responsible for, or have an interest in, implementing regulatory compliance or performing typical data governance activities using the Cloudera Data Platform.
Prerequisites
Familiarity with basic data governance concepts is helpful, but not required.
Objectives
Students who successfully complete this course will be able to:
- Identify which tools in Cloudera Data Platform (CDP) to use for key data governance activities
- Organize data objects using classifications and business glossary terms
- Find access history for data objects and policies
- Use Data Catalog Profilers in CDP to assist in organizing data objects
- Use Data Catalog to foster collaboration with colleagues
- View and interpret a data object's lineage
- Create and apply resource- and tag-based access control policies
- Create policies for data masking and row-level filtering
Topics
Module 1: Data Governance Overview
- What is Data Governance?
- Basic Concepts
- SDX: Data Governance in CDP
Module 2: Organizing Data Objects
- Searching for Objects by Type
- Classifications
- Glossary Terms
Module 3: Auditing
- Auditing Overview
- Viewing Audit Information
Module 4: Working with Data Catalog
- Data Catalog Overview
- Sensitive Data Profiler
- Defining and Monitoring Data Quality
- Preparing for Audits Using Data Catalog
- Collaborating
Module 5: Lineage
- Inspecting Lineage
- Propagation and Lineage in Atlas
- Inspecting Lineage in Atlas
Module 6: Access Controls
- Apache Ranger Basics
- Creating Users and Roles
- Resource-Based Policies
- Tag-Based Policies
- Securing Metadata Objects
- Providing Partial Access
Module 7: Managing the Data Lifecycle
- Governing the Data Lifecycle