Open in app

Sign In

Write

Sign In

Sunny Srinidhi
Sunny Srinidhi

2.6K Followers

Home

About

Jun 13, 2022

Optimising Hive Queries with Tez Query Engine

Tuning configuration parameters for a better performing Hive — Hive provides us the option of executing SQL queries with a few different query engines. It ships with the native MapReduce engine. But we can switch that to Tez which has gained popularity since its launch, or we can also use Apache Spark as well. …

Hive

4 min read

Optimising Hive Queries with Tez Query Engine
Optimising Hive Queries with Tez Query Engine
Hive

4 min read


Published in Towards Data Science

·Jan 17, 2022

Cleaning and Normalizing Data Using AWS Glue DataBrew

Automate data cleaning with AWS DataBrew without writing any code — A major part of any data pipeline is the cleaning of data. Depending on the project, cleaning data could mean a lot of things. But in most cases, it means normalizing data and bringing data into a format that is accepted within the project. …

AWS

9 min read

Cleaning and Normalizing Data Using AWS Glue DataBrew
Cleaning and Normalizing Data Using AWS Glue DataBrew
AWS

9 min read


Published in CodeX

·Nov 28, 2021

The Dunning-Kruger Effect In Tech

This is not the kind of post I usually write on my blog. This is more of a psychology lecture than a how-to tech tutorial. …

The Dunning Kruger Effect

6 min read

The Dunning-Kruger Effect In Tech
The Dunning-Kruger Effect In Tech
The Dunning Kruger Effect

6 min read


Published in Towards Data Science

·Nov 18, 2021

Understanding Apache Hive LLAP

Apache Hive is a complex system when you look at it, but once you go looking for more info, it’s more interesting than complex. There are multiple query engines available for Hive, and then there’s LLAP on top of the query engines to make real-time, interactive queries more workable. …

Hive

8 min read

Understanding Apache Hive LLAP
Understanding Apache Hive LLAP
Hive

8 min read


Published in DataSeries

·Nov 5, 2021

Installing Hadoop on the New M1 Pro and M1 Max MacBook Pro

We’ll see how to install and configure Hadoop and its components on MacOS running on the new M1 Pro and M1 Max chips by Apple — In the previous series of posts, I wrote about how to install the complete Hadoop stack on Windows 11 using WSL 2. And now that the new MacBook Pro laptops are available with the brand new M1 Pro and M1 Max SOCs, here’s a guide on how to install the…

Hadoop

8 min read

Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro
Installing Hadoop on the new M1 Pro and M1 Max MacBook Pro
Hadoop

8 min read


Published in Towards Data Science

·Nov 1, 2021

Installing Hadoop on Windows 11 with WSL2

How to install and configure Hadoop and its components on Windows 11 running a Linux distro using WSL 1 or 2. — In the previous post, we saw how to install a Linux distro on Windows 11 using WSL2 and then how to install Zsh and on-my-zsh to make the terminal more customizable. …

Big Data

8 min read

Installing Hadoop on Windows 11 with WSL2
Installing Hadoop on Windows 11 with WSL2
Big Data

8 min read


Oct 27, 2021

Installing Zsh and Oh-my-zsh on Windows 11 with WSL2

Originally published at https://blog.contactsunny.com on October 27, 2021. Before we begin, you might ask, why am I writing on something this trivial? I sold off my old MacBook Pro because I’m super excited about the new M1 Pro MacBook Pros. I have pre-ordered one of those and am waiting for…

Windows 11

5 min read

Installing Zsh and Oh-my-zsh on Windows 11 with WSL2
Installing Zsh and Oh-my-zsh on Windows 11 with WSL2
Windows 11

5 min read


Published in DataDrivenInvestor

·Oct 11, 2021

Getting Started With Apache Airflow

Apache Airflow is another awesome tool that I discovered just recently. Just a couple of months after discovering it, I can’t imagine not using it now. It’s reliable, configurable, and dynamic. Because it’s all driven by code, you can version control it too. It’s just awesome! …

Airflow

11 min read

Getting Started With Apache Airflow
Getting Started With Apache Airflow
Airflow

11 min read


Published in Towards Data Science

·Sep 30, 2021

Fake (almost) everything with Faker

I was recently tasked with creating some random customer data, with names, phone numbers, addresses, and the usual other stuff. At first, I thought I’ll just generate random strings and numbers (some gibberish) and call it a day. But then I remembered my colleagues using a package for that. …

Python

4 min read

Fake (almost) everything with Faker
Fake (almost) everything with Faker
Python

4 min read


Jun 30, 2021

Querying Hive Tables From a Spring Boot App

Originally published at https://blog.contactsunny.com on June 30, 2021. In this post, we’ll see how we can query tables that reside in Hive using a Spring Boot application. As always, I’m going to use a Spring Boot web app with a few GET APIs to show how we can query data…

Hive

4 min read

Querying Hive Tables From a Spring Boot App
Querying Hive Tables From a Spring Boot App
Hive

4 min read

Sunny Srinidhi

Sunny Srinidhi

2.6K Followers

Coding, machine learning, reading, sleeping, listening, potato. blog.contactsunny.com, linkedin.com/in/sunnysrinidhi/, and twitter.com/contactsunny

Following
  • ReadWrite

    ReadWrite

  • PCMag

    PCMag

  • Hunter Walk

    Hunter Walk

  • SF Ali (Farooq)

    SF Ali (Farooq)

  • Marek Kirejczyk

    Marek Kirejczyk

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech