Sep 25, 2021SQL Server — SQL Agent jobs not running on schedule after time zone changeToday’s post is a real incident that happened with me this week. I want to share it with you all while details are still fresh in my memory. It’s an interesting case of interrupted scheduled SQL agent jobs after time zone modifications on server running SQL Server. I would like…Sql Dba4 min readSql Dba4 min read
Sep 19, 2021Sqoop Import from MySQL to Hive & HBaseIn last post we discussed Sqoop concepts and demos on how to move data from RDBMS to HDFS and from HDFS back to RDBMS. Today we will extend the data movement from RDBMS to NoSQL database — HBase and Hadoop Data warehouse Hive. Sqoop import command can be used to…Big Data4 min readBig Data4 min read
Sep 11, 2021Apache Sqoop — SQL to HadoopToday we are going to discuss a very efficient tool — Apache Sqoop. The heading itself tells you the meaning and purpose of tool which is SQL to Hadoop. We will first build some concepts and then have some demos that would help us understand these concepts better. Alright! Let’s…Big Data Analytics6 min readBig Data Analytics6 min read
Aug 29, 2021SQL Server — Start-up ProceduresToday I want to discuss with you all a great feature that is available with SQL Server which is ‘scan for startup procs’. It’s an advanced configuration within SQL server and enables SQL server to scan for any procedures marked as start-up procs and run them. What makes it so great? You all must be…3 min read3 min read
Aug 20, 2021Twitter Sentiments AnalysisFew weeks back we had used flume to download tweets from twitter on the topic that we wanted (I had used BigData world keywords). Today, we will level up and analyze the tweets and determine sentiments of tweets. We can do our own analysis on what is trending in the…Sentimentsanalysis4 min readSentimentsanalysis4 min read
Aug 18, 2021Flume & Spark Streaming IntegrationIn this post, we will integrate Flume and Spark Streaming. We will use Spark Streaming to process data that we will get from Flume. Flume will be our source for streaming data. Flume is considered to be to most suitable to have source from log files. We will get log…Data Streaming5 min readData Streaming5 min read
Aug 15, 2021Spark Streaming: Streaming Word CountIn this blog, we are going to process streaming data but will use popular and efficient Spark Streaming for getting data from source and processing it. Streaming data from HDFS and processing to get word count for files We will look upon a simple example on Streaming Data from an HDFS folder/location on your Hadoop cluster to Spark where we will perform…3 min read3 min read
Aug 8, 2021Data & BigData – databare.comData & BigData – databare.com What is Data? In our digital world, Data is defined as facts or figures, or information that’s stored in or used by a computer. …Dada7 min readDada7 min read