Course Outline

Introduction to Apache Iceberg

  • Overview of Apache Iceberg
  • Review of basic concepts

Deep Dive into Iceberg Architecture

  • In-depth analysis of Iceberg's table format
  • Detailed architecture overview, including metadata and file layout
  • Internals of schema and partition evolution

Advanced Installation and Configuration

  • Configuring Iceberg for optimal performance in different environments
  • Integration with various data processing engines
  • Advanced setup: security, encryption, and access controls
  • Setting up Iceberg in a distributed environment

Advanced Operations and Maintenance

  • Managing large-scale Iceberg tables
  • Implementing and managing complex schema changes
  • Handling partition evolution and hidden partitioning
  • Advanced CRUD operations with schema and partition changes

Query Optimization Techniques

  • Techniques for reducing query latency
  • Partition pruning and file pruning
  • Metadata caching and optimization strategies
  • Implementing and testing query optimization techniques

Performance Tuning for Large Datasets

  • Optimizing performance for large-scale datasets
  • Using Iceberg's built-in features for performance tuning
  • Case studies on performance tuning in real-world scenarios
  • Tuning performance for large-scale datasets

Advanced Data Migration and Integration

  • Migrating complex data structures from other systems
  • Integrating Iceberg with real-time data streams
  • Migrating complex datasets and integrating real-time data streams

Reliability and Consistency

  • Ensuring data consistency and integrity in distributed environments
  • Implementing and managing transactional guarantees
  • Handling failures and recovery mechanisms
  • Implementing reliability and consistency features

Advanced Features and Customization

  • Custom catalog implementations
  • Extending Iceberg with custom features
  • Implementing custom catalog and extending Iceberg functionalities

Data Governance and Compliance

  • Implementing data governance policies
  • Compliance with data regulations
  • Managing audit trails and data lineage
  • Implementing governance and compliance features

Summary and Next Steps

Requirements

  • Familiarity with core concepts, basic operations, and Iceberg table management

Audience

  • Data engineers
  • Data architects
  • Data analysts
  • Software developers

Duration

  • 21 hours

Delivery Options

Private Group Training

Our identity is rooted in delivering exactly what our clients need.

  • Pre-course call with your trainer
  • Customisation of the learning experience to achieve your goals:
    • Bespoke outlines
    • Practical hands-on exercises containing data / scenarios recognisable to the learners
  • Training scheduled on a date of your choice
  • Delivered online, onsite/classroom or hybrid by experts sharing real world experience

Private Group Prices: RRP from €6840 for online delivery, based on a group of 2 delegates, plus €2160 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.

Contact us for an exact quote and to hear about our latest promotions.


Public Training

Please see our public courses
