Category: article

Hive @ Freshers.in

Hive : Hive Optimizers: A Comprehensive Guide

Hive is a data warehousing tool that provides a SQL-like interface for querying large datasets stored in Hadoop Distributed File…

Hive @ Freshers.in

Hive : Comparison between the ORC and Parquet file formats in Hive

ORC (Optimized Row Columnar) and Parquet are two popular file formats for storing and processing large datasets in Hadoop-based systems…

Apache Airflow

Airflow : Using Boto3 in Airflow

Boto3 is the Amazon Web Services (AWS) SDK for Python, which allows Python developers to write software that makes use…
