Category: article

Hive – What are the metastore tables in Hive ?

Metastore is the central repository of Apache Hive metadata. It stores metadata for Hive tables AUX_TABLE BUCKETING_COLS CDS COLUMNS_V2 COMPACTION_QUEUE…

Continue Reading Hive – What are the metastore tables in Hive ?

How to transfer file from SFTP server to Local using Python

There are situation you may need to programmatically transfer file from SFTP server to you local environment. Here we will…

Continue Reading How to transfer file from SFTP server to Local using Python

How to remove csv header using Spark (PySpark)

A common use case when dealing with CSV file is to remove the header from the source to do data…

Continue Reading How to remove csv header using Spark (PySpark)

How to access hive using Python (Source code )

Use case : If you want to do some scheduling or some automation , we may need to access Hive…

Continue Reading How to access hive using Python (Source code )

Matplotlib Cheat Sheet for Quick reference

Matplotlib Matplotlib Cheat Sheet for Quick reference Matplotlib is a Python 2D plotting library which produces publication-quality figures in a…

Continue Reading Matplotlib Cheat Sheet for Quick reference

NumPy Cheat Sheet for Quick reference

NumPy The NumPy library is the core library for scientific computing in Python. This Python NumPy cheat sheet is a…

Continue Reading NumPy Cheat Sheet for Quick reference

Pandas Cheat Sheet for Quick reference

Syntax for Creating DataFrames Specify values for each column. a b c 1 4 7 10 2 5 8 11…

Continue Reading Pandas Cheat Sheet for Quick reference
Snowflake

What are the Best Practices when using Snowflake Transactions?

A transaction should contain statements that are related and should succeed or fail together. Larger transactions can improve performance in…

Continue Reading What are the Best Practices when using Snowflake Transactions?