Category: article
JavaScript : How to returns the element of that index
JavaScript Arrays are a fundamental data structure that allows developers to store and manipulate collections of data efficiently. One of…
JavaScript : Removing Duplicates from Arrays in JavaScript
This article will show how to remove duplicates from an array in JavaScript using the Set object and the filter…
Airflow : Mastering Dependencies in Apache Airflow: A Comprehensive Guide to Labeling
While managing tasks and dependencies in a pipeline, labeling can be a helpful approach to improve readability and maintainability. This…
Airflow : Optimizing Airflow: Efficient resource clean-up techniques and code
Airflow is an open-source platform used to programmatically author, schedule and monitor workflows. It’s known for its rich feature set,…
PySpark : Identifying Data Skewness and Partition Row Counts in PySpark
Data skewness is a common issue in large scale data processing. It happens when data is not evenly distributed across…
Python : Automating S3 Data Movement with Python and Current Date Suffix
In this article, we will create a Python script that automates the process of moving S3 data with a current…
Hive : Understanding Array Aggregation in Apache Hive
Apache Hive offers many inbuilt functions to process data, among which collect_list() and collect_set() are commonly used to perform array aggregation….
Hive : Creating and Utilizing 64-bit Hash Values in Apache Hive
Apache Hive provides several inbuilt functions to process the data. One of these is the hash() function, which calculates a…
Hive : How can we return the average of non-NULL records in Hive ?
The function you’re need to refer in Apache Hive is the avg() function. It is an aggregate function that returns…
Hive : How to Delete Old Apache Hive Logs , increase space and boosting Cluster Performance
Apache Hive logs are a critical component for debugging and performance optimization. However, over time, these logs can occupy significant…