Big data technologies such as Apache Hadoop and Apache Spark are increasingly used for genomics research. This blog introduces learners to big data pipelines and workflows as well as processing and analysis of big data using Spark.