Tag: PySpark

PySpark @ Freshers.in

PySpark : Skipping Sundays in Date Computations

When working with data in fields such as finance or certain business operations, it’s often the case that weekends or…


PySpark : Understanding Joins in PySpark using DataFrame API

Apache Spark, a fast and general-purpose cluster computing system, provides high-level APIs in various programming languages like Java, Scala, Python,…


PySpark : Reversing the order of lists in a dataframe column using PySpark

pyspark.sql.functions.reverse is a collection function that returns a reversed string or an array with its elements in reverse order. In order to reverse the…


PySpark : Generating a 64-bit hash value in PySpark

Introduction to 64-bit Hashing: A hash function is a function that can be used to map data of arbitrary size…
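As a rough illustration of mapping arbitrary-size data to a 64-bit value, here is a minimal sketch in plain Python. The helper name `hash64` is hypothetical, and truncating a SHA-256 digest is a stand-in chosen for this example; it is not necessarily the approach the linked post takes. In PySpark itself, `pyspark.sql.functions.xxhash64` computes a 64-bit hash over columns with a different algorithm.

```python
import hashlib


def hash64(data: bytes) -> int:
    """Map bytes of any length to an unsigned 64-bit integer (hypothetical helper)."""
    # Assumption for illustration: keep the first 8 bytes (64 bits) of a
    # SHA-256 digest. This is not the algorithm behind Spark's xxhash64.
    digest = hashlib.sha256(data).digest()
    return int.from_bytes(digest[:8], byteorder="big")


# Inputs of very different sizes both map into the same 64-bit range.
short_value = hash64(b"a")
long_value = hash64(b"a" * 10_000)
print(short_value, long_value)
```

The same input always yields the same value, which is what makes a hash usable as a compact key.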


PySpark : Introduction to BASE64_ENCODE and its Applications in PySpark

BASE64 is a group of similar binary-to-text encoding schemes that represent binary…
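To make the binary-to-text idea concrete, here is a small sketch using Python's standard `base64` module rather than Spark. The sample string is an assumption made for the example. On a DataFrame column, `pyspark.sql.functions.base64` and `unbase64` perform the corresponding encode and decode steps.

```python
import base64

# Sample input chosen for illustration only.
raw = "PySpark".encode("utf-8")

# Encode the bytes as ASCII-safe Base64 text, then decode them back.
encoded = base64.b64encode(raw)
decoded = base64.b64decode(encoded)

print(encoded.decode("ascii"))
print(decoded.decode("utf-8"))
```

The encoded output uses only printable ASCII characters and is padded to a multiple of four characters, so it can travel through text-only channels and be decoded back to the original bytes.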
