For Spark please visit (1) Spark Interview Questions (2) Spark Examples (3) PySpark Blogs 1.…
Category: article
AWS Glue and what is it used for – A easy to read introduction
AWS Glue is a fully managed extract, transform, load (ETL) service provided by Amazon Web Services (AWS). It is used…
Spark : Calculate the number of unique elements in a column using PySpark
pyspark.sql.functions.countDistinct In PySpark, the countDistinct function is used to calculate the number of unique elements in a column. This is…
NLTK : Natural Language Toolkit – An introduction
Natural Language Processing (NLP) is a field of computer science and artificial intelligence concerned with the interaction between computers and…
Go : How Go handles error handling
Error handling is an important aspect of programming, and Go provides several mechanisms for handling errors. In Go, errors are…
Go : Go language data types
Go is a statically-typed language, which means that the type of a variable must be specified when it is declared,…
Go : Explain about the Go language syntax
The syntax of the Go programming language is simple and straightforward, making it easy for developers to pick up and…
Go : How Go different from other programming languages
Go is a statically-typed, concurrent, garbage-collected programming language that was created by Google in 2009. It was designed to address…
Go : What is Go language and why is it important ?
Go is a statically-typed, concurrent, garbage-collected programming language created by Google in 2009. Go is designed to be a simple,…
Spark : Advantages of Google’s Serverless Spark
Google’s Serverless Spark has several advantages compared to traditional Spark clusters: Cost-effective: Serverless Spark eliminates the need for dedicated servers…
DBT : Automate selection of different table name in development and production in DBT using macro
Macros are a powerful tool in dbt (Data Build Tool) that allow you to reuse code and make your SQL…