Manning has just released a new (promising) book about Apache Airflow. The cover is currently under MEAP release, which means that we will have to be patient as the book chapters will be filled in little by little.
I have played with Airflow a couple if years ago, therefore this release will be a great opportunity for me to refresh my knowledge of the pipelines builder and scheduling tool.
As soon as I’m done studying the book, I will provide you with a detailed review of its content. In the meantime, if you wish to get your own, here is the link to Manning’s website
About the book
Data Pipelines with Apache Airflow is your essential guide to working with the powerful Apache Airflow pipeline manager. Expert data engineers Bas Harenslak and Julian de Ruiter take you through best practices for creating pipelines for multiple tasks, including data lakes, cloud deployments, and data science. Part desktop reference, part hands-on tutorial, this book teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. You’ll learn how to automate moving and transforming data, managing pipelines by backfilling historical tasks, developing custom components for your specific systems, and setting up Airflow in production environments. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start using Airflow for seamless data pipeline development and management.