Course Outline
Introduction to Apache Iceberg
- Overview of Apache Iceberg
- Review of basic concepts
Deep Dive into Iceberg Architecture
- In-depth analysis of Iceberg's table format
- Detailed architecture overview, including metadata and file layout
- Internals of schema and partition evolution
Advanced Installation and Configuration
- Configuring Iceberg for optimal performance in different environments
- Integration with various data processing engines
- Advanced setup: security, encryption, and access controls
- Setting up Iceberg in a distributed environment
Advanced Operations and Maintenance
- Managing large-scale Iceberg tables
- Implementing and managing complex schema changes
- Handling partition evolution and hidden partitioning
- Advanced CRUD operations with schema and partition changes
Query Optimization Techniques
- Techniques for reducing query latency
- Partition pruning and file pruning
- Metadata caching and optimization strategies
- Implementing and testing query optimization techniques
Performance Tuning for Large Datasets
- Optimizing performance for large-scale datasets
- Using Iceberg's built-in features for performance tuning
- Case studies on performance tuning in real-world scenarios
- Tuning performance for large-scale datasets
Advanced Data Migration and Integration
- Migrating complex data structures from other systems
- Integrating Iceberg with real-time data streams
- Migrating complex datasets and integrating real-time data streams
Reliability and Consistency
- Ensuring data consistency and integrity in distributed environments
- Implementing and managing transactional guarantees
- Handling failures and recovery mechanisms
- Implementing reliability and consistency features
Advanced Features and Customization
- Custom catalog implementations
- Extending Iceberg with custom features
- Implementing custom catalog and extending Iceberg functionalities
Data Governance and Compliance
- Implementing data governance policies
- Compliance with data regulations
- Managing audit trails and data lineage
- Implementing governance and compliance features
Summary and Next Steps
Requirements
- Familiarity with core concepts, basic operations, and Iceberg table management
Audience
- Data engineers
- Data architects
- Data analysts
- Software developers
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from €6840 online delivery, based on a group of 2 delegates, €2160 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Testimonials (3)
Trainer had good grasp of concepts
Josheel - Verizon Connect
Course - Amazon Redshift
analytical functions
khusboo dassani - Tech Northwest Skillnet
Course - SQL Advanced
how the trainor shows his knowledge in the subject he's teachign