From the course: Hands-On with Apache Iceberg

Apache Iceberg

- Hello, everybody, my name is Alex Merced, and welcome to this course, Hands On with Apache Iceberg. Now, let's introduce the concepts that you'll be learning throughout this course. Now, in this course what we're going to do is we're going to take your already existing Apache Iceberg knowledge, so your knowledge of what is Apache Iceberg, the concepts and architecture of Apache Iceberg, and put that into practice by getting hands-on with some of our favorite data tools, such as Apache Spark and Dremio. So in the first two sections of the course, we're going to basically set up an environment where we can work with Apache Spark and Dremio, learn how to create tables with Apache Spark and Dremio, learn how to insert data into tables using Apache Spark and Dremio, and how to work with partitioning an Apache Iceberg table using Apache Spark or Dremio. We're then, in the third section, going to learn about maintaining our Iceberg tables, learning about what compaction is, what does it mean to expire snapshots, when should we do these things, why do they matter? And then also learning about the metadata tables, which are a very unique tool in Apache Iceberg tables that give us some visibility and observability into our Apache Iceberg tables to better understand how to maintain them and when we should maintain them. Now, basically, we'll be working through a couple different environments in Apache Spark and Dremio. If you want to find the instructions for the steps that I'll be taking throughout these lessons, do check out the GitHub repository for this course. And a lot of the courses surrounding Apache Iceberg are right here. So basically, you can either go to that URL or scan this QR code to get there. You'll find basically the step-by-step guides for the lessons that'll be coming right up, so, that way, you can follow them as we do them. And otherwise, I'll see you in the lessons. Have a great day, and enjoy.

Contents