Get in Touch

Course Outline

Advanced Transformation Building Blocks

  • Handling complex data types.
  • Managing fields, metadata, and dynamic structures.
  • Reusing transformation patterns.

Parameters, Variables, and Job-Oriented Design

  • Understanding runtime variables and scoping.
  • Parameterizing transformations.
  • Structuring parent-child jobs.

Database Integration and Lookup Strategies

  • Leveraging advanced lookup steps.
  • Implementing caching strategies.
  • Designing efficient joins.

Working with Files, APIs, and External Systems

  • Processing JSON and XML data.
  • Invoking REST and SOAP services.
  • Executing streaming and batch loads.

Error Handling and Data Quality Techniques

  • Capturing and routing errors.
  • Applying data validation patterns.
  • Conducting auditing and logging.

Performance Tuning Essentials

  • Optimizing step design.
  • Addressing memory and threading considerations.
  • Identifying bottlenecks.

Introduction to Repository-Based Development

  • Utilizing the Pentaho repository.
  • Managing versions.
  • Adopting team collaboration practices.

Deployment and Migration Practices

  • Promoting jobs across environments.
  • Managing configurations.
  • Following operational best practices.

Summary and Next Steps

Requirements

  • A solid grasp of ETL fundamentals.
  • Prior experience working with Pentaho Data Integration.
  • Foundational knowledge of data warehousing concepts.

Audience

  • ETL developers.
  • Data engineers.
  • Technical professionals looking to expand their PDI expertise.
 21 Hours

Number of participants


Price per participant

Testimonials (2)

Upcoming Courses

Related Categories