Use case: If you have multiple files, for example chapter-wise question papers, etc.…
Month: October 2021
What is the problem in having lots of small files in HDFS? What is the remediation plan?
user October 18, 2021
In the Hadoop ecosystem, we store files under folders in HDFS; most of the time the folder name we are…
Explain distributed cache in Hadoop?
Distributed cache is a facility provided by the Hadoop MapReduce framework to access small files needed by an application during its…
What is Swappiness Value? What is the role of Swappiness Value during the cluster set up?
user October 18, 2021
vm.swappiness is one of the kernel parameters in Linux; its value ranges from 0 to 100 and controls the swapping…
What are the Python Modules provided in AWS Glue?
AWS Glue version 2.0 supports the following Python modules. Note: Different Glue versions support different Python versions. boto3==1.12.4 botocore==1.15.4…
Snowflake: How to load data from Amazon S3 to a Snowflake table using COPY
user October 10, 2021
With the Snowflake COPY command, you can load data from staged files on internal/external locations to an existing table or vice…